posted on 2021-06-29, 01:16authored byDiptesh Kanojia
This thesis investigates distributional semantics for cognate detection, false friends' detection and computational phylogenetics to present the insights drawn from our research, for 14 Indian languages pairs. Shared vocabulary facilitates second language learning and enables the computational models to perform cross-lingual learning for natural language processing (NLP) tasks. Distributional semantics aids NLP as it allows these models to understand natural languages. Our investigations use cross-lingual features to help detect cognates and false friends across languages. We also generate typological trees for Indian languages and, additionally, propose the division of the text into meaningful functional units which aid phylogenetic tree generation.
History
Campus location
Australia
Principal supervisor
Gholamreza Haffari
Additional supervisor 1
Pushpak Bhattacharyya
Additional supervisor 2
Malhar Kulkarni
Year of Award
2021
Department, School or Centre
Information Technology (Monash University Clayton)
Additional Institution or Organisation
Indian Institute of Technology Bombay, India (IITB)