Jio Platforms Ltd | Data Scientist | Enterprise AI Platforms (Aug’23 - Present)
- Real-Time TTS Engine: Delivered cross-lingual TTS APIs with efficient GPU utilization & ultra-fast response times
- Orchestrated distributed training on 4 NVIDIA A100 GPUs using PyTorch DDP, reducing training time by 40%
- Employed CUDA Graphs to eliminate repetitive execution overhead, achieving 0.8s inference for 15-token inputs
- Enabled 50+ concurrent TTS streams via NVIDIA MIG instances, maintaining 0.2 RTF at 90% GPU utilization
- Developed a Python API wrapper to orchestrate LLaMA-driven semantic tokenization & VQGAN-based vocoder
- Deployed Gunicorn/Nginx with round-robin distribution, cutting API errors to 0.5% during peak traffic spikes
- Multilingual OCR System: Advanced Regional Multi-Language OCR with High-Accuracy Language classifier model
- Fine-tuned PaddleClas PULC based language classifier model for 13 languages, achieving a 93.54% F1 score
- Designed multi-stage OCR pipeline with adaptive routing using confidence scores, improving the accuracy by 25%
- Validated the OCR pipeline at enterprise scale, delivering 85% accuracy for 13 languages in production workflows
- Object & Landmark Detection System: Advanced Multi-Object and Landmark Recognition with Scalable Deployment
- Fine-tuned RT-DETR (AP = 0.884 for object detection) & DINOv2 (93.51% F1 score for landmark classification)
- Improved the model performance through grid search optimization of learning rate, momentum, & weight decay
- Designed high-performance pipeline with Flask API, JWT, batch processing, and optimized for real-time response
- Devised scalable Docker/Kubernetes deployments with HPA auto-scaling & Logstash, handling 4x peak loads
- HR Assistant Platform: Implemented AI-driven HR solutions at Jio Platforms Ltd, streamlining employee support
- Engineered multi-format document extraction pipeline, processing 1200+ files, with OCR & semantic analysis
- Architected Graph RAG pipeline with dynamic agentic chunking, improving the retrieval accuracy by 30%
- Devised modular retrieval pipelines leveraging Qdrant, Graph RAG connectivity & Cross-Encoder refinement
- Leveraged LangChain for multi-turn chats, improving coherence, response clarity, and personalization by 20%
Blockchain-Based Voting Solution
- Developed an end-to-end decentralized voting platform to organize secure elections using the Azure Blockchain
- Applied the concepts of cryptography to ensure the anonymity of voters on a public blockchain during the election
- Constructed the front-end using Angular and implemented the service to encrypt and decrypt votes using Python
Automatic Speech Recognition System
- Built a Speech Recognition system using vector quantized Codebook Matching and MFCC feature extraction
- Achieved 75.89% accuracy on clean set and 47.65% accuracy on noisy set of Google Speech Commands dataset
IITB-RISC Processor | Processor Design
- Designed 16-Bit, 6-Stage Pipeline processor, based on Turing-Complete Instruction Set in VHDL from scratch
- Programmed 14 Instructions including branch, arithmetic, & memory interface; tested it on Altera Nano FPGA
- Implemented a 6-stage pipelined processor with Branch Predictors, Priority Encoders and the Hazard Detection
Social Networking Website | ITSP IIT Bombay
- Developed a Social Networking Website with features of a Chat, News Feed, and Posts aimed at college students
- Implemented a feature of Auto Delete of the posts in the News Feed after all the targeted users have viewed it
- Designed the front-end of the website using CSS & HTML and the back-end using MySQL Connector & PHP