Robinson Zhang

Robinson Zhang

AI/ML Engineer

AI/ML Engineer with 6+ years of experience | Python + JavaScript + SQL | Specialized in AWS, LLM, RAG, Models, API, and Full Stack Development. Passionate about building scalable ML infrastructure and deploying AI solutions that drive real business impact.

About Me

I am an AI/ML Engineer with 6+ years of hands-on experience in building machine learning infrastructure, fine-tuning models, and deploying them at scale. My expertise spans across Large Language Models (LLMs), RAG pipelines, deep learning, NLP, and full-stack development.

Throughout my career, I have successfully developed over 104 APIs, built advanced ML platforms like DIVE, and engineered enterprise RAG chatbots. I have extensive experience with AWS cloud services (Lambda, SageMaker, Bedrock, ECR, ECS, SQS), and have achieved significant business impact including 90% accuracy in automated product classification and 30% cost reduction through ML automation.

I am actively seeking opportunities as an ML/AI Engineer where I can leverage my expertise in LLMs, RAG, MLOps, and cloud infrastructure to build innovative AI solutions that solve complex business challenges.

Skills & Expertise

Programming Languages

Python (Pandas, NumPy)JavaScriptTypeScriptReactSQL

ML Frameworks & Models

PyTorchScikit-LearnLangChainLangGraph V1.0XGBoostK-meansMLPClassifierCNNBERTWhisperSenseVoiceSmallLightGBM

LLM & GenAI

GenAIDeepSeekOpenAIClaudeLLAMAHuggingFace ModelsRAGLLM Fine-tuning

AWS Cloud Services

LambdaSageMakerBatchBedrockECRECSSQSS3RDSEMRRoute53AmplifyAPI Gateway

DevOps & Infrastructure

CI/CDServerlessTerraformGitHub ActionsBitbucketAirflowDockerKubernetes/K8SFastAPI

Monitoring & MLOps

GrafanaKibanaMLflowEFK StackPrometheus

Databases & Data

PostgreSQLSnowflakeDynamoDBPineconeDatabricksETLData MiningData Cleaning

Specializations

RAG (Retrieval-Augmented Generation)ASR (Automatic Speech Recognition)NLPFull-Stack DevelopmentMLOpsLLMOps

Professional Experience

Machine Learning Engineer

Songbenco Inc., Montreal, QC

Jan 2025 – Present
  • Developed AI/ML projects using LLMs, including document data extraction and speech-to-report systems
  • Built applications integrating AI/ML features for online shopping platforms, CNN image and AI agent project

Machine Learning Engineer

Pivotree Inc. (CVE: PVT), Ottawa, ON

Mar 2023 – Dec 2024
  • Built machine learning infrastructure, fine-tuned models, and deployed them privately at scale
  • Effectively utilized deep learning techniques and neural networks for complex problem-solving
  • Developed over 104 APIs across jobs, messages, named services, models, and platform basic services
  • Successfully deployed ML solutions on AWS Lambda, ECR, leveraging Batch and SQS for efficient microservices bulk
  • Extensive experience with Large LLMs, including RAG pipelines, fine-tuning, development, and evaluation
  • Worked with GPU acceleration, distributed computing, and parallel processing to make ML workloads faster
  • Utilized tech stack including TypeScript, Python, Next.js, AWS Lambda, API Gateway, SQS, and DynamoDB
  • Designed and developed DIVE Platform - an advanced ML platform featuring web frontend, microservices architecture, serverless deployment, and CI/CD pipelines. Achieved 90% accuracy in automated product classification, cutting customer costs by 30%
  • Developed AI-powered SKU Build platform for auto-classification and automated SKU research, creating a clean, consistent, and accurate 4.5M SKU taxonomy. Enabled MRO distributor to find products 60% faster
  • Engineered enterprise RAG chatbot integrating OpenAI and AWS Bedrock for intelligent HR policy inquiries, implementing optimized vector search for rapid knowledge retrieval

Data Engineer

WayBase Inc., Toronto, ON

Oct 2021 – Dec 2022
  • Managed large datasets for data scientists. Built ETLs and scalable data CI/CD Pipelines
  • Participated in National Reports Project, utilizing Snowflake and DBT for data storage and querying, extracting Big Data
  • Improved data consistency by 80% through optimized data loading and transformation (ELT) processes
  • Utilized Snowflake Snowpark (Python and SQL) for data processing and analyzed big data of website logs
  • Successfully executed Pipeline Project, scraping data from the internet and comparing data using Python
  • Improved data accuracy by 50% through effective data comparison and insertion of new data
  • Demonstrated expertise in containerization by converting programs to containers and running them on Kubernetes
  • Implemented scalable data architecture, designing programs as subscribed services, resulting in 2x improvement in data processing time

Machine Learning Engineer

HEKA Company Inc., Montreal, QC

Feb 2021 – Oct 2021
  • Developed NLP solution using TF-IDF and deep learning, connecting 1000+ physiotherapists with patients
  • Developed natural language processing (NLP) solution for patients, leveraging TF-IDF algorithm to analyze symptoms and generate accurate treatment recommendations
  • Utilized AI technology to identify the most relevant keywords in patient symptoms and compare them to a vast database of medical knowledge
  • Improved patient understanding of their symptoms by 75%, and streamlined the diagnosis process for healthcare providers, reducing diagnosis time by 60%

IT Engineer

Xiamen International Bank, CN

Jan 2012 – Jan 2021
  • Developed ML classification models to categorize customers for anti-fraud analysis in banking systems.
  • Conducted data mining and financial risk analysis to support anti-fraud initiatives and operational risk management.

My Projects

Stock Analysis

AI

A comprehensive stock analysis platform combining multiple analyst perspectives including technical, fundamental, sentiment, and valuation analysis.

Tech Stack:
LangGraphLLM
View Project

Medical Insurance Claim Automation

AI

Medical insurance claim automation system using LLM for multi-source document extraction and conflict resolution.

Tech Stack:
Open LLMTesseract
View Project

Data Normalization (SKU Build)

AI

AI-powered SKU Build platform for auto-classification and automated SKU research, creating clean, consistent, and accurate product taxonomies.

Tech Stack:
LLMAPIs
View Project

ML Platform (DIVE)

ML

Advanced Machine Learning platform featuring web frontend, microservices architecture, serverless deployment, and CI/CD pipelines. Achieved 90% accuracy in automated product classification.

Tech Stack:
MLAWS SageMakerModels
View Project

Object Detection

ML

Computer vision project implementing YOLOv8 for real-time object detection, specifically trained to identify and detect yellow cars in images and video streams using deep learning and CNN architectures.

Tech Stack:
CNNYOLOv8
View Project

Data Classification

ML

Developed a robust ML pipeline featuring leakage-free preprocessing and feature selection. I optimized performance through RandomizedSearchCV fine-tuning and advanced Voting and Stacking ensembles, evaluated using comprehensive classification metrics for peak accuracy.

Tech Stack:
Fine-tuningEnsemble
View Project

Finance Fraud Detection

ML

Built end-to-end time-aware fraud detection system using feature engineering, and time-series validation for transaction risk scoring.

Tech Stack:
Catboost
View Project

Online Store

Full-Stack

An AI-powered e-commerce platform featuring intelligent ad copy, product descriptions, and smart search for selling high-quality images, with a shopping cart function and multilingual support.

Tech Stack:
EmbeddingRAG
View Project

City Property Tax Query

Full-Stack

A serverless, multilingual property tax web app featuring Google Maps address search, interactive data visualization, and cross-year comparisons, built on a scalable AWS microservices architecture for high performance.

Tech Stack:
GoolgeMapsTypeScript
View Project

Python GIS

Data

Python GIS tool that processes coordinates and identifies geographical areas using shapefiles. Utilizes Shapely library to map coordinates to geocodes from Statistics Canada, enabling spatial analysis and data enrichment.

Tech Stack:
Data Pipeline
View Project

Education

Master of Interdisciplinary AI

Specialized in Artificial Intelligence and Machine Learning

Master of Business Administration

Business management and strategic planning

Bachelor of Automation and Robotics Engineering

Foundation in engineering principles, automation systems, and robotics

Get In Touch

I'm actively seeking opportunities as an ML/AI Engineer. If you're looking for someone passionate about artificial intelligence and machine learning, let's connect!