Tejas Garg
Building ML Systems That Work Beyond Demos
Third-year undergrad • Seeking Summer 2026 ML/SWE internships
Building ML Systems That Work Beyond Demos
Third-year undergrad • Seeking Summer 2026 ML/SWE internships
Third-year CS undergrad focused on building ML systems that work beyond demos. I care about the gap between research and production — making models interpretable, reliable, and deployable.
Recent work: reproduced a diffusion classifier from scratch and built custom XAI for it, created a real-time PPE monitoring pipeline with temporal filtering, and experimented with GRPO to teach LLMs explicit reasoning. I like projects where the engineering is as hard as the ML.
B.Tech in Computer Science & Engineering
Specialization in AI & Machine Learning
Indian Institute of Information Technology, Nagpur
Expected 2027
Explainable AI • LLM Pipelines & Evaluation • Reinforcement Learning for Reasoning • Diffusion Models • Real-time Computer Vision • Production ML Systems
Reproduced DiffMIC-v2 from scratch — a dual-granularity conditional diffusion model for diabetic retinopathy grading. Matched the paper's 84.1% accuracy on APTOS 2019. Then built a custom XAI framework with 6 explainability techniques (temporal trajectories, attention maps, faithfulness validation) because traditional XAI doesn't work for 1000-step iterative inference.
Event-driven video processing system that turns noisy frame-level PPE detections into stable violation events. Built with YOLOv8/v11, SAM3, FastAPI, and Next.js. Handles real-world constraints: occlusion, limited GPU, long-running streams. Uses temporal filtering with EMA fusion and hysteresis thresholds to reduce alert spam.
Multi-stage LLM pipeline for reviewable SQL generation. Breaks down intent into reasoning steps, generates SQL, then validates and auto-corrects. Built-in security layer for prompt injection. Benchmarked on Spider dataset: 78% execution accuracy. Shows each decision so humans can audit before execution.
Experimented with Group Relative Policy Optimization to induce explicit reasoning in Mistral-7B via XML-structured traces. SFT warmup on GSM8K, then GRPO to encourage step-by-step thinking. Improved test accuracy from 41.2% to 52.5% (+11.3%). Learned critical lessons about evaluation consistency in RL for LLMs.
Full-stack medical analytics dashboard that extracts biomarkers from lab reports (PDF/images) using Gemini vision, visualizes them with interactive gauges, and provides an AI chatbot for questions about your results. Glassmorphic UI with health score aggregation.