Class-Conditional Regularization for Cross-Lingual Representation Stability
Under review at PRL
Full-stack ML engineering from research to production
Large Language Models
Retrieval Augmented Gen
Agentic AI
Prompt Engineering
LLM Framework
Agent Graphs
LlamaIndex
Tool Calling
Agent Memory
Model Context Protocol
Deep Research
LLM Routing
Machine Learning
Natural Language Proc.
Computer Vision
Deep Learning
TensorFlow
HuggingFace
Model Finetuning
Async APIs
Spring Boot
Lightweight APIs
API Design
LLM Gateway
AuthN / AuthZ
Rate Limiting
Caching
Async Workers
Cloud Platform
Cloud Services
Containers
Kubernetes
Observability
Model Deployment
Prototyping
AI Products
Automation
Version Control
Model Evaluation
Cost Optimization
SQL DB
NoSQL
Vector DBs
Query Lang
Data Pipelines
Embeddings
Vector Indexing
Primary Lang
Systems
Enterprise
Concurrency
ALWAYS AN ACTIVE LEARNER :)
Full-stack ML engineering from research to production
Production systems that drive real business impact
Built a from-scratch C++17 HNSW engine with pybind11 bindings, reaching 89% recall@10 and 11.7ms P50 latency on a 1M-vector benchmark.
Designed a Go-based workflow engine with DAG execution, retries, fan-in/fan-out routing, and exactly-once transactional state handling.
Implemented a multi-provider gateway for OpenAI, Gemini, Grok, and DeepSeek with schema standardization, dynamic routing, and graceful fallbacks.
Shipped production healthcare Q&A pipelines with hybrid retrieval, cross-encoder reranking, multi-agent orchestration, and MCP-based context isolation.
Building applied AI systems with measurable production impact
Sep 2024 - Present
Jan 2024 - Sep 2024
Strong engineering fundamentals behind the ML systems work
Bachelor of Engineering in Information Technology
2021 - 2024
CGPA
9.07
Work spanning representation learning, multimodal modeling, and data mining
Under review at PRL
DOI referenced in resume
DOI referenced in resume
Open to ML engineering, applied AI, and GenAI platform roles, including remote opportunities.
Contact Me→