Towards Open-World Vision Applications by Learning Image-Region Representation

DAO, DUY SON

doi:10.26180/25751211.v1

thesis

posted on 2024-05-04, 19:39 authored by DUY SON DAO

Open-world vision applications use computer vision techniques to analyze and understand visual data in dynamic environments. Researchers are developing novel approaches for learning image-region representations, i.e., representations that focus on regions within an image. Such representations offer a more comprehensive understanding of visual content, enhanced adaptability, and the integration of contextual information. The main challenges in learning them for open-world vision applications are model generalization and data availability. This research proposes learning frameworks for two tasks: Open-Vocabulary Multi-Label Classification (OVML) and Open-Vocabulary Semantic Segmentation (OVS).

History

Campus location

Australia

Principal supervisor

Jianfei Cai

Additional supervisor 1

Dinh Phung

Year of Award

2024

Department, School or Centre

Data Science & Artificial Intelligence

Additional Institution or Organisation

Data Science and Artificial Intelligence

Course

Doctor of Philosophy

Degree Type

DOCTORATE

Faculty

Faculty of Information Technology

Keywords

Open-Vocabulary Multi-Label Classification, Open-Vocabulary Semantic Segmentation

Licence

In Copyright
