Monash University
Browse

Towards End-to-End Deep Learning for Visual Detection and Tracking

Download (32.23 MB)
thesis
posted on 2023-08-07, 09:49 authored by TIANYU ZHU
Visual object detection and tracking is a complex computer vision problem that often requires multiple components and can be decomposed in various ways. Traditional deep learning methods divide the high-level problem into separate components optimized for surrogate tasks and combine them heuristically during inference. In this thesis, we explore a different methodology that decomposes the problem in a way that as many aspects of the problem can be addressed by an integrated machine learning process. As a result, the model is learnt to directly optimize the high-level problem end-to-end. This enables knowledge sharing between sub-task modules and streamlines the model.

History

Campus location

Australia

Principal supervisor

Tom Drummond

Year of Award

2023

Department, School or Centre

Electrical and Computer Systems Engineering

Additional Institution or Organisation

Australian centre for robotic vision

Course

Doctor of Philosophy

Degree Type

DOCTORATE

Faculty

Faculty of Engineering

Usage metrics

    Faculty of Engineering Theses

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC