Hi, I'm Gaurav Chaudhari
Software Engineer specializing in Generative AI, building high-performance inference microservices and scalable voice agent pipelines. Passionate about creating AI solutions that make a real impact.
About Me
Transforming Ideas into Intelligent Solutions
I am an AI-focused Software Engineer with 6 months of professional experience and 3 months of internship experience at Anvex AI, working on real-world AI/ML systems and production deployments.
I have hands-on experience building inference APIs using FastAPI and Docker, optimizing model performance and reducing latency for real-time AI applications. My work includes contributing to LLM fine-tuning, RAG pipelines, and Voice AI systems, where I improved extraction accuracy by up to 75% and fine-tuned transformer models achieving 92% accuracy in multi-class tasks.
I hold a B.E. in Artificial Intelligence and Data Science from New Horizon Institute of Technology & Management, Thane, and secured 1st place in an LLM Fine-Tuning Hackathon, demonstrating strong fundamentals and practical problem-solving skills.
As a fresher, I am highly motivated to grow in advanced AI systems, LLM engineering, and scalable backend architectures, while delivering measurable impact through efficient and well-engineered solutions.
Professional Impact
Reduced agent response time by building moderate-grade RAG systems and optimized inference pipelines. Designed 20+ Chain-of-Thought prompts ensuring deterministic AI behavior.
Achievements
🏆 1st Place - LLM Fine-Tuning Hackathon
🎓 Head Technical Secretary, Student Association
Continuous Learning
📚 DeepLearning.AI - Generative AI with Large Language Models
📚 Stanford/Coursera - Machine Learning Specialization
Experience
Software Engineer ML
Vashi, India
- Architected high-performance inference microservices using FastAPI on Docker, reducing model latency by <2300ms for real-time interaction.
- Engineered BoundVoice: a scalable voice agent pipeline integrating VAPI SDK and SIP Trunking to manage concurrent telephony streams with dynamic context injection.
- Implemented server-grade RAG system with referencing for LLM outputs, increasing structured data extraction accuracy by 75% and eliminating hallucinated fields.
- Built Anvex Voice: a moderate-grade RAG with context in real-time, reducing agent response time during live calls.
- Fine-tuned SLMs and NVIDIA Parakeet TTS models on proprietary datasets, optimizing inference cost and reducing Word Error Rate (WER) for domain vocabulary.
AI Engineer Intern
Vashi, India
- Designed and evaluated 20+ Chain-of-Thought (CoT) prompt templates, ensuring deterministic behavior for enterprise-facing AI agents.
- Optimized conversational workflows using user-level analytics, achieving a 30% uplift in engagement and a 15% reduction in call drop-offs.
Featured Projects
Showcasing my expertise in AI/ML, from computer vision to RAG systems
CavScan: Deep Learning Diagnostic System
End-to-end dental cavity detection pipeline using Vision Transformers (ViT) trained on 90K X-rays
Key Achievements:
- •Developed pipeline achieving 0.04 IoU on test set
- •Optimized inference for edge deployment with <200ms latency
- •Added Grad-CAM for explainability
Jun 2024 - Mar 2025
Mira: Knowledge-Based RAG Assistant
Domain-semantic RAG pipeline utilizing hybrid search (Dense + Keyword) to retrieve context from unstructured PDFs and handwritten notes
Key Achievements:
- •Engineered Dense + Keyword hybrid search
- •Integrated Cross-Encoder Reranking to filter context
- •Reduced hallucination by ensuring high context relevance
- •Deployed retrieval microservice using FastAPI with <100ms latency
Jan 2025 - Feb 2025
News Recommender System
Machine learning-based application that suggests relevant news articles using transformer embeddings and FAISS similarity search
Key Achievements:
- •Uses Sentence Transformers for text embeddings
- •Implements FAISS for efficient similarity search
- •Provides Flask-based user interface
- •Supports diverse and expanding news topics
2024
Technical Skills
A comprehensive toolkit for building cutting-edge AI solutions
Generative AI
Machine Learning
Multimodal AI
Infrastructure & DevOps
Data & Databases
Also Proficient In
Certificates
Professional certifications and course completions across AI, ML, and data disciplines
Fundamentals of MCP
Anthropic
Data Analytics Job Simulation
Deloitte Australia · Forage
Data Analyst Certification
OneRoadmap
Postman API Fundamentals Student Expert
Postman
Introduction to Statistics
Stanford University · Coursera
Generative AI with Large Language Models
Coursera · DeepLearning.AI · July 2024
Building Deep Learning Models with TensorFlow
IBM · Coursera
Introduction to Deep Learning & Neural Networks with Keras
IBM · Coursera
Machine Learning with Python
IBM · Coursera
Python and Artificial Intelligence
AWS Community · DevTown
Get In Touch
I'm always open to discussing new projects, creative ideas, or opportunities to be part of your vision.
Let's Connect
Whether you're looking to hire an AI/ML engineer, collaborate on a project, or just want to say hi, I'd love to hear from you!