posted on 2018-05-28, 22:26authored byRamakrishna Balakrishna Bairi
This thesis studies the problem of automatically evolving a hierarchy of categories to organize the documents in a collection, considering user preferences (e.g., categories biased to a particular field). It makes use of a massive knowledge graph to guide the machine learning models to evolve the category structure and organizes the documents accordingly. The categorization also adapts to the growing document collection. It also presents a novel technique for categorizing “short texts” having very few words. This work has applications in machine learning tasks such as automatic creation of “Wikipedia Disambiguation” like pages, automatic generation of Table of Contents, drill-down search, etc.
History
Campus location
Australia
Principal supervisor
Mark Carman
Additional supervisor 1
Ganesh Ramakrishnan
Year of Award
2018
Department, School or Centre
Monash Research Academy
Additional Institution or Organisation
Indian Institute of Technology Bombay, India (IITB)