Unicode normalization and grapheme parsing of Indic languages

Published in LREC-COLING, 2023

This paper presents Unicode normalization techniques and grapheme parsing for Indic scripts.

Recommended citation: Ansary, M. N., Adib, Q. A. R., …, Sushmit, A. (2023). "Unicode normalization and grapheme parsing of Indic languages." LREC-COLING.
Download Paper