Jio Platforms Ltd | Data Scientist | Enterprise AI Platforms (Aug’23 - Present)

  • Real-Time TTS Engine: Delivered cross-lingual TTS APIs with efficient GPU utilization & ultra-fast response times
    • Orchestrated distributed training on 4 NVIDIA A100 GPUs using PyTorch DDP, reducing training time by 40%
    • Employed CUDA Graphs to eliminate repetitive execution overhead, achieving 0.8s inference for 15-token inputs
    • Enabled 50+ concurrent TTS streams via NVIDIA MIG instances, maintaining 0.2 RTF at 90% GPU utilization
    • Developed a Python API wrapper to orchestrate LLaMA-driven semantic tokenization & VQGAN-based vocoder
    • Deployed Gunicorn/Nginx with round-robin distribution, cutting API errors to 0.5% during peak traffic spikes
  • Multilingual OCR System: Advanced Regional Multi-Language OCR with High-Accuracy Language classifier model
    • Fine-tuned PaddleClas PULC based language classifier model for 13 languages, achieving a 93.54% F1 score
    • Designed multi-stage OCR pipeline with adaptive routing using confidence scores, improving the accuracy by 25%
    • Validated the OCR pipeline at enterprise scale, delivering 85% accuracy for 13 languages in production workflows
  • Object & Landmark Detection System: Advanced Multi-Object and Landmark Recognition with Scalable Deployment
    • Fine-tuned RT-DETR (AP = 0.884 for object detection) & DINOv2 (93.51% F1 score for landmark classification)
    • Improved the model performance through grid search optimization of learning rate, momentum, & weight decay
    • Designed high-performance pipeline with Flask API, JWT, batch processing, and optimized for real-time response
    • Devised scalable Docker/Kubernetes deployments with HPA auto-scaling & Logstash, handling 4x peak loads
  • HR Assistant Platform: Implemented AI-driven HR solutions at Jio Platforms Ltd, streamlining employee support
    • Engineered multi-format document extraction pipeline, processing 1200+ files, with OCR & semantic analysis
    • Architected Graph RAG pipeline with dynamic agentic chunking, improving the retrieval accuracy by 30%
    • Devised modular retrieval pipelines leveraging Qdrant, Graph RAG connectivity & Cross-Encoder refinement
    • Leveraged LangChain for multi-turn chats, improving coherence, response clarity, and personalization by 20%

Blockchain-Based Voting Solution

  • Developed an end-to-end decentralized voting platform to organize secure elections using the Azure Blockchain
  • Applied the concepts of cryptography to ensure the anonymity of voters on a public blockchain during the election
  • Constructed the front-end using Angular and implemented the service to encrypt and decrypt votes using Python

Automatic Speech Recognition System

  • Built a Speech Recognition system using vector quantized Codebook Matching and MFCC feature extraction
  • Achieved 75.89% accuracy on clean set and 47.65% accuracy on noisy set of Google Speech Commands dataset

IITB-RISC Processor | Processor Design

  • Designed 16-Bit, 6-Stage Pipeline processor, based on Turing-Complete Instruction Set in VHDL from scratch
  • Programmed 14 Instructions including branch, arithmetic, & memory interface; tested it on Altera Nano FPGA
  • Implemented a 6-stage pipelined processor with Branch Predictors, Priority Encoders and the Hazard Detection

Social Networking Website | ITSP IIT Bombay

  • Developed a Social Networking Website with features of a Chat, News Feed, and Posts aimed at college students
  • Implemented a feature of Auto Delete of the posts in the News Feed after all the targeted users have viewed it
  • Designed the front-end of the website using CSS & HTML and the back-end using MySQL Connector & PHP