We release Sāmayik, a dataset of around 53,000 parallel English-Sanskrit sentences, written in contemporary prose. Sanskrit is a classical language still in sustenance and has a rich documented heritage. However, due to the limited availability of digitized content, it still remains a low-resource language.
Ayush Maheshwari,
Ashim Gupta,
Amrith Krishna,
Atul Kumar Singh,
Ganesh Ramakrishnan,
G. Anil Kumar,
Jitin Singla