This repository provides download and cleaning scripts for various datasets of NLP in Indic languages and converting them into a standard format.