Exactly after 25 years (1991-2015) of the first pan-Indian effort for generating digital text corpora in all the Indian languages included in the 8th Schedule of the Constitution of India it is.
Project:Indian Language Corpora Initiative (ILCI) Department of Information Technology (DIT), Government of India created parallel annotated corpora in the tourism & health domains in 11 Indianlanguages with Hindi as the source language.
The Central Institute of Indian Languages in the past co-ordinated the development of 45 plus million word corpora in Scheduled Languages under the scheme of Technology Development for Indian Languages of the Ministry of Communication and Information Technology.
Similarly, Gupta and Kaur (2016) test their extractive methods on 150 random documents from two corpora – Punjabi text obtained by translating Hindi corpora by CILT, IIT Bombay and a Pun-.
The 27 th annual meeting of the Indian Ocean Tuna Commission (IOTC), that took place from 8 to 12 May 2023 in Mauritius, delivered some important results for
The 27 th annual meeting of the Indian Ocean Tuna Commission (IOTC), that took place from 8 to 12 May 2023 in Mauritius, delivered some important results for
A reference to parallel translation corpus is mandatory in the discussion of corpus generation, which the authors thoroughly address here, with a focus on Indian language corpora and English