Monash University

Entity Matching using Language Models

thesis
posted on 2021-12-06, 09:53 authored by KAI SHENG TEONG
Linking records that refer to the same entity, also known as entity matching, is an important part of data integration and management. Many existing entity matching solutions require the user to have a deep understanding of the data schema and the available techniques in order to match records correctly. This requirement is rarely met in real-world use cases, where data is highly varied and "dirty" and database schemas differ from one another. This thesis presents a framework for entity matching using fine-tuned language models that does not need an aligned schema to perform matching.
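To illustrate what matching without an aligned schema could look like in practice, here is a minimal sketch of one common way to prepare input for a sequence-pair language model: each record is flattened into plain text, and the two texts are joined with a separator. This is an assumption for illustration, not the thesis's actual framework; the function names `serialize` and `make_pair`, the `[SEP]` separator string, and the sample records are all hypothetical.

```python
def serialize(record: dict) -> str:
    """Flatten a record into text without assuming any particular schema."""
    # Keep attribute names as plain-text hints and skip empty values.
    return " ".join(
        f"{key}: {value}"
        for key, value in record.items()
        if value not in (None, "")
    )


def make_pair(left: dict, right: dict, sep: str = " [SEP] ") -> str:
    """Join two serialized records into one sequence-pair input string."""
    # The separator is a placeholder; a real tokenizer would insert its own.
    return serialize(left) + sep + serialize(right)


# Two hypothetical records describing the same product under different schemas.
record_a = {"title": "Acme Laptop 15", "brand": "Acme", "price": "999"}
record_b = {"name": "Acme 15-inch laptop", "cost_usd": 999, "notes": ""}

pair_text = make_pair(record_a, record_b)
print(pair_text)
```

Because the attribute names travel with the values as ordinary text, neither record has to be mapped onto the other's columns first. A fine-tuned classifier would then take a string like `pair_text` and predict match or non-match; that model is deliberately left out of this sketch.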

Campus location

Malaysia

Principal supervisor

Soon Lay Ki

Additional supervisor 1

Tin Tin Su

Year of Award

2021

Department, School or Centre

School of Information Technology (Monash University Malaysia)

Course

Master of Philosophy

Degree Type

MASTERS

Faculty

Faculty of Information Technology
