I'm a Principal Data Scientist & AI Lead with 7+ years of experience building end-to-end ML and Generative AI solutions across enterprise environments. I hold an MSc in Data Science from the University of Surrey (UK) and a BTech in Computer Science from KL University (India).
Across my career I've shipped 10+ AI solutions — from an Agentic AI platform used daily by 200+ people and an optimization engine saving $2.4M/year, to automation tools delivering 5x productivity gains and NLP systems saving 72 hours of manual work per week. I've worked across supply chain, recruitment tech, traffic systems, and LLM training data pipelines.
Outside of work: food, movies, anime, and whatever's new in tech.
Shipped Shipment Recommendation Engine ($2.4M/year savings), Agentic AI platform consolidating 5 legacy systems for 200+ users, AI Excel Template Analyser (65%→95% accuracy, 5x productivity), Newsletter AI Platform (85% effort reduction), and multiple production RAG chatbots using LangGraph + LangFuse. Spearheaded org-wide GenAI adoption strategy. Led a cross-functional team of 12.
Led a team of 10 generating high-quality Python/SQL reasoning datasets for LLM fine-tuning. Designed dataset quality standards and end-to-end curation pipelines feeding directly into Cohere's model training.
Deployed a Naïve Bayes ticket classifier (72 hrs/week saved), real-time license plate detection on edge hardware (OpenCV + Tesseract), a Llama-based conversational agent, and a traffic demand forecasting POC using Prophet/SARIMA.
EDA on large-scale insurance datasets, Python pipelines for COBOL 3→4 migration, SQL optimisation on mainframe systems, and data quality fixes improving platform reliability.