Ayush Maheshwari

Sr. Solutions Architect at NVIDIA
PhD in NLP/ML from CSE, IITB

Biography

🎉 Update: PhD completed (July 2024) • Now at NVIDIA.


I am Ayush Maheshwari (आयुष माहेश्वरी), a Senior Solutions Architect at NVIDIA on the NVAITC India team.

At NVIDIA, I focus on:

  • Foundation Models: Building multilingual and domain-specific foundation models for Indian languages and scientific domains, with an additional focus on AI for Science models such as geospatial foundation models and air pollution models.
  • Research Collaborations: Collaborating with academic institutions and research organizations to advance AI applications in science and technology.

My work involves architecting AI solutions, conducting applied research, and enabling the broader research community through technical workshops and collaborations.

Previously, I completed my PhD at CSE, IITB (India) with Prof. Ganesh Ramakrishnan. I was fortunate to be funded by the Ekal Fellowship from the Ekal Foundation during my PhD. My research interests lie in Natural Language Processing and graphs from a machine learning perspective. I have worked on constrained neural machine translation and on semi-supervised and unsupervised machine learning problems with data programming.

During my PhD, I was a key member of the neural machine translation project UDAAN, which helps publishers quickly translate technical content into Indian languages. The project is open-source and is used by several Indian government technical education agencies and official languages departments. In my spare time, I enjoy playing table tennis and cricket, and reading about Indian culture, Ramáyaṇa and Mahábhárat.

Download my resumé (Last updated: May 2025)

Interests
  • Large Language Models
  • Natural Language Processing
  • Human-in-the-loop AI
  • Neural Machine Translation
  • Machine Learning
  • Information Retrieval
Education
  • PhD in Computer Science, Jan 2019 - Aug 2023 (Defended July 2024)

    Indian Institute of Technology Bombay

Updates

  • [Dec 25] Paper on benchmarking LLMs on extremely low-resource Indic languages! [IndicParam: Benchmark to evaluate LLMs on low-resource Indic languages]

  • [May 25] Our paper on domain-aware lexicon generation has been accepted at the ACL 2025 Main Conference ❤️ 😊 [LexGen: Domain aware multilingual lexicon generation]

  • [Jan 25] Our paper on synthetic data generation, ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification is accepted at NAACL 2025 🎆🎇 [Paper]

  • [Nov 24] Delighted to announce that my PhD work has been awarded the Impactful Research Award 2023 by IIT Bombay ❤️ 🎇!

  • [Oct 24] Joined NVIDIA 😌! Excited to be a part of the future of AI 😇

  • [Sep 24] Our paper on Dictionary Constrained Disambiguation for Improved NMT is accepted at EMNLP 2024 [Paper] ❤️ 😌

Click here for updates archive

Experience

NVIDIA
Senior Solutions Architect
Oct 2024 – Present Gurugram

Part of NVIDIA AI Technology Center (NVAITC) India, driving research engagements and strategic collaborations.

Responsibilities include:

  • Leading enterprise AI solution architecture and deployment strategies
  • Collaborating with academic and research institutions on cutting-edge AI projects
  • Providing technical expertise on NVIDIA GPU acceleration, large language models, and AI infrastructure
  • Architecting scalable AI solutions for diverse industry verticals
  • Mentoring teams on best practices for AI/ML model development and deployment

Research Scientist
Sep 2023 – Sep 2024 Bengaluru
  1. Led a team of 5 people to build Indic large language models from scratch.
  2. Developed data collection and processing pipelines for training and evaluation.
  3. Trained the tokenizer and designed the model training architecture.
  4. Trained the model on a large Intel Gaudi-2 cluster.
  5. Performed instruction tuning and preference training of the pre-trained models.

Adobe Research
Research Intern
May 2021 – Aug 2021 Bengaluru

Worked on prototyping a new service for Adobe PDF in the legal domain. Responsibilities included:

  • Modeling of the problem
  • Designing, developing and prototyping using ML
  • Deployment and demonstration

IIT Bombay
Project Engineer
Jan 2016 – Dec 2018 Mumbai
Developed software solutions for security agencies.

Tata Consultancy Services
System Engineer
Oct 2011 – Jul 2013 Mumbai

Projects

UDAAN - An NMT pipeline + Post-editing tool to translate documents (Best Paper Award at CODS-COMAD 2023)
An end-to-end Machine Translation and post-editing platform that has translated >50 books across 10 Indian languages. 100+ professional translators actively use UDAAN to upload documents, obtain MT output, and edit translations. Received Presidential recognition and Best Paper Award at CODS-COMAD 2023. Includes >100 digitized dictionaries freely available from CSTT.
SPEAR - Programmatically label and quickly build training data
A widely-adopted open-source Python library for programmatic data labeling with 100+ GitHub stars. SPEAR reduces data labeling efforts by implementing cutting-edge data programming approaches (Snorkel, ImplyLoss, Learning to Reweight). Integrates semi-supervised learning for efficient training and inference. Featured at EMNLP 2021.
Temples of India
Temples of India is a not-for-profit knowledge platform that aims to document and store as many details as possible about temples across the Indian subcontinent. We aim to present the details of each temple, such as its location, images, videos, and opening and closing times.

Awards & Honors

Impactful Research Award 2023
Awarded for impactful research contributions during PhD at IIT Bombay
Best Paper Award
Our paper on translation post-editing tool (UDAAN) won the best paper award at CODS-COMAD 2023
Ekal Fellowship for PhD Research
Funded by Ekal Fellowship throughout PhD research at IIT Bombay

1:1 Mentoring & Consultations

Connect with me on Topmate

💡 Book a 1:1 Session with Me

I offer mentoring sessions for PhD students, researchers, and professionals working in:

  • 🎓 PhD Applications & Research
  • 🤖 NLP & Machine Learning
  • 💼 Career Guidance
  • 📝 Paper Reviews & Feedback
📅 Book a Session on Topmate →

📧 For collaborations or speaking opportunities, reach out via the contact section below

Recent & Upcoming Talks