
Robinson Zhang
AI/ML Engineer
AI/ML Engineer with 6+ years of experience | Python + JavaScript + SQL | Specialized in AWS, LLMs, RAG, ML models, APIs, and full-stack development. Passionate about building scalable ML infrastructure and deploying AI solutions that drive real business impact.
About Me
I am an AI/ML Engineer with 6+ years of hands-on experience building machine learning infrastructure, fine-tuning models, and deploying them at scale. My expertise spans Large Language Models (LLMs), RAG pipelines, deep learning, NLP, and full-stack development.
Throughout my career, I have successfully developed over 104 APIs, built advanced ML platforms like DIVE, and engineered enterprise RAG chatbots. I have extensive experience with AWS cloud services (Lambda, SageMaker, Bedrock, ECR, ECS, SQS), and have achieved significant business impact including 90% accuracy in automated product classification and 30% cost reduction through ML automation.
I am actively seeking opportunities as an ML/AI Engineer where I can leverage my expertise in LLMs, RAG, MLOps, and cloud infrastructure to build innovative AI solutions that solve complex business challenges.
Skills & Expertise
Programming Languages
ML Frameworks & Models
LLM & GenAI
AWS Cloud Services
DevOps & Infrastructure
Monitoring & MLOps
Databases & Data
Specializations
Professional Experience
Machine Learning Engineer
Songbenco Inc., Montreal, QC
- Developed AI/ML projects using LLMs, including document data extraction and speech-to-report systems
- Built applications integrating AI/ML features for online shopping platforms, along with CNN-based image and AI agent projects
Machine Learning Engineer
Pivotree Inc. (CVE: PVT), Ottawa, ON
- Built machine learning infrastructure, fine-tuned models, and deployed them privately at scale
- Effectively utilized deep learning techniques and neural networks for complex problem-solving
- Developed over 104 APIs across jobs, messages, named services, models, and platform basic services
- Deployed ML solutions on AWS Lambda and ECR, using AWS Batch and SQS for efficient bulk processing across microservices
- Gained extensive experience with LLMs, including RAG pipelines, fine-tuning, development, and evaluation
- Used GPU acceleration, distributed computing, and parallel processing to speed up ML workloads
- Utilized tech stack including TypeScript, Python, Next.js, AWS Lambda, API Gateway, SQS, and DynamoDB
- Designed and developed DIVE Platform - an advanced ML platform featuring web frontend, microservices architecture, serverless deployment, and CI/CD pipelines. Achieved 90% accuracy in automated product classification, cutting customer costs by 30%
- Developed the AI-powered SKU Build platform for auto-classification and automated SKU research, creating a clean, consistent, and accurate 4.5M SKU taxonomy. Enabled an MRO distributor to find products 60% faster
- Engineered enterprise RAG chatbot integrating OpenAI and AWS Bedrock for intelligent HR policy inquiries, implementing optimized vector search for rapid knowledge retrieval
Data Engineer
WayBase Inc., Toronto, ON
- Managed large datasets for data scientists. Built ETL jobs and scalable data CI/CD pipelines
- Participated in the National Reports Project, using Snowflake and DBT for storing, querying, and extracting large datasets
- Improved data consistency by 80% through optimized data loading and transformation (ELT) processes
- Used Snowflake Snowpark (Python and SQL) for data processing and analyzed large volumes of website logs
- Delivered the Pipeline Project, scraping data from the web and comparing datasets using Python
- Improved data accuracy by 50% through effective data comparison and insertion of new data
- Demonstrated expertise in containerization by converting programs to containers and running them on Kubernetes
- Implemented scalable data architecture, designing programs as subscribed services, resulting in 2x improvement in data processing time
Machine Learning Engineer
HEKA Company Inc., Montreal, QC
- Developed NLP solution using TF-IDF and deep learning, connecting 1000+ physiotherapists with patients
- Applied the TF-IDF algorithm to analyze patient-reported symptoms and generate accurate treatment recommendations
- Utilized AI technology to identify the most relevant keywords in patient symptoms and compare them to a vast database of medical knowledge
- Improved patient understanding of their symptoms by 75%, and streamlined the diagnosis process for healthcare providers, reducing diagnosis time by 60%
IT Engineer
Xiamen International Bank, CN
- Developed ML classification models to categorize customers for anti-fraud analysis in banking systems
- Conducted data mining and financial risk analysis to support anti-fraud initiatives and operational risk management
My Projects
Stock Analysis
AI
A comprehensive stock analysis platform combining multiple analyst perspectives including technical, fundamental, sentiment, and valuation analysis.
Medical Insurance Claim Automation
AI
Medical insurance claim automation system using LLM for multi-source document extraction and conflict resolution.
Data Normalization (SKU Build)
AI
AI-powered SKU Build platform for auto-classification and automated SKU research, creating clean, consistent, and accurate product taxonomies.
ML Platform (DIVE)
ML
Advanced Machine Learning platform featuring web frontend, microservices architecture, serverless deployment, and CI/CD pipelines. Achieved 90% accuracy in automated product classification.
Object Detection
ML
Computer vision project implementing YOLOv8 for real-time object detection, specifically trained to identify and detect yellow cars in images and video streams using deep learning and CNN architectures.
Data Classification
ML
A robust ML pipeline featuring leakage-free preprocessing and feature selection. Performance was optimized through RandomizedSearchCV hyperparameter tuning and Voting and Stacking ensembles, then evaluated using comprehensive classification metrics.
Finance Fraud Detection
ML
An end-to-end, time-aware fraud detection system using feature engineering and time-series validation for transaction risk scoring.
Online Store
Full-Stack
An AI-powered e-commerce platform for selling high-quality images, featuring intelligent ad copy, product descriptions, smart search, a shopping cart, and multilingual support.
City Property Tax Query
Full-Stack
A serverless, multilingual property tax web app featuring Google Maps address search, interactive data visualization, and cross-year comparisons, built on a scalable AWS microservices architecture for high performance.
Python GIS
Data
Python GIS tool that processes coordinates and identifies geographical areas using shapefiles. Utilizes Shapely library to map coordinates to geocodes from Statistics Canada, enabling spatial analysis and data enrichment.
Education
Master of Interdisciplinary AI
Specialized in Artificial Intelligence and Machine Learning
Master of Business Administration
Business management and strategic planning
Bachelor of Automation and Robotics Engineering
Foundation in engineering principles, automation systems, and robotics
Get In Touch
I'm actively seeking opportunities as an ML/AI Engineer. If you're looking for someone passionate about artificial intelligence and machine learning, let's connect!