Experience
Technical Lead - R&D
Reverie Language Technologies Limited
July 2025 - Present
- Led and mentored a team of three engineers in the design, development, and delivery of end-to-end speech solutions.
- Trained a multilingual STT model on a dataset of 100,000+ hours, achieving high accuracy across diverse Indian languages and dialects.
- Developed code-mixed STT models that handle seamless switching between English and Indian languages without compromising quality, delivering accuracy comparable to monolingual benchmarks.
- Trained Proof-of-Concept (POC) models for Speech-to-Speech (S2S) and Speech-to-Text Translation (S2TT) systems.
- Optimized distributed training pipelines via DeepSpeed and PyTorch, orchestrating efficient GPU utilization for faster iteration on massive-scale datasets.
Senior Software Engineer - R&D
Reverie Language Technologies Limited
October 2023 - June 2025
- Designed and implemented scalable data preparation pipelines for large-scale speech and text corpora, including data cleaning, normalization, deduplication, tokenization, and efficient sharding for distributed training.
- Led the transition from Hybrid to Transformer-based STT by benchmarking toolkits like NeMo, K2, and PyTorch, resulting in a 10% average relative improvement in accuracy across multiple languages.
- Improved domain-specific accuracy by 15% through Parameter-Efficient Fine-Tuning (PEFT), enabling rapid model adaptation for specialized client requirements without the overhead of full fine-tuning.
- Adapted pre-trained LLMs for Indic languages, enhancing linguistic coverage and model performance in multilingual and low-resource environments.
- Worked extensively on tokenization strategies and language modeling workflows to improve decoding and overall STT performance.