I am Ayush Maheshwari (आयुष माहेश्वरी), working as Senior Solutions Architect at NVIDIA in the NVAITC India Team. Currently working on AI for Science problems in multiple domains including materials discovery, scientific reasoning, geospatial FM, etc. Additionally, working on FM development for languages and scientific domain.
Previously, I have completed my PhD from CSE, IITB (India) with Prof. Ganesh Ramakrishnan. I was fortunate to be funded by Ekal fellowship from Ekal foundation during my PhD. My research interests lie in the area of Natural Language Processing, Graphs from machine learning perspective. I have worked on constrained neural machine translation and semi- and un-supervised machine learning problems with data-programming.
During my PhD, I was a key member of neural machine translation project, UDAAN, which helps publishers to quickly translate technical content in Indian languages. The project is open-source and used by several Indian government technical education agencies and official languages departments.
In my spare time, I enjoy playing tabletennis, cricket and reading about Indian culture, Ramáyaṇa and Mahábhárat.
Download my resumé (Last updated: May 2025)
PhD in Computer Science, Jan 2019 - Aug 2023 (Defended July 2024)
Indian Institute of Technology Bombay
[Dec 25] Paper on benchmarking LLMs on extremely-low resource Indic languages ! [IndicParam: Benchmark to evaluate LLMs on low-resource Indic langauges]
[May 25] Our paper on domain aware lexicon generation is accepted at ACL Main Conference 2025 ❤️ 😊 [LexGen: Domain aware multilingual lexicon generation]
[Jan 25] Our paper on synthetic data generation, ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification is accepted at NAACL 2025 🎆🎇 [Paper]
[Nov 24] Delighted to announce that my PhD work is awarded with Impactful Research Award 2023 by IIT Bombay ❤️ 🎇!
[Oct 24] Joined NVIDIA 😌! Excited to be a part of the future of AI 😇
[Sep 24] Our paper on Dictionary Constrained Disambiguation for Improved NMT is accepted at EMNLP 2024 [Paper] ❤️ 😌
[July 24] I have successfully defended my PhD thesis on Knowledge Integration in Language Processing Models using Constraint Ingestion and Generation 😇😊🎆 {Arxiv soon!}
[July 24] Our paper on ‘Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation’ will be presented at LoResMT workshop at ACL 2024 🎉.
[Feb 24] Our paper on development of English - Sanskrit parallel corpus is accepted at LREC-COLING 2024. [Pre-print]
📝 Serving in the PC for ARR Industry Track 2023 - Present.
📝 Serving in the PC for ARR, 2022 - Present.
Worked on prototyping new service for Adobe PDF in the legal domain Responsibilities include: