Viewpoint invariant human activity recognition using pose series

Htike, Zaw Zaw

doi:10.4225/03/58d1d23ac8ca0

Restricted Access

Reason: Access restricted by the author. A copy can be requested for private research and study by contacting your institution's library service. This copy cannot be republished

Viewpoint invariant human activity recognition using pose series

thesis

posted on 2017-03-22, 01:24 authored by Htike, Zaw Zaw

There is a growing interest in the problem of vision-based human activity recognition, motivated by its numerous promising applications in many domains. Because the camera position is arbitrary in many applications, practical human activity recognition systems should be viewpoint invariant. Nevertheless, the viewpoint issue has been neglected by the vast majority of computer vision researchers because of the inherent difficulty of training their systems across all possible viewpoints. Several state-of-the-art activity recognition systems claim to be viewpoint invariant. These can be broadly categorized by their sensory requirements: those requiring multiple synchronized cameras and those requiring only a single uncalibrated camera. While multi-camera systems work well, they are often not feasible, or practical, or both, in many domains and applications. Current single-camera systems are either too complex, hence not real-time capable, or require activity training data from multiple views, which again is not always feasible or practical, or both, in many domains and applications. Therefore, this thesis proposes a novel generic framework to recognize and classify human activities from a monocular video source from arbitrary viewpoints without requiring training activities using multiple views. The proposed framework comprises two stages: human pose recognition and human activity recognition. In the pose recognition stage, an ensemble of invariant pose models performs inference on each video frame. Each pose model estimates the probability that the given frame contains the corresponding pose. Over a sequence of frames, invariant pose models collectively concoct a multivariate time series. The activity recognition stage employs time series analysis to classify activities. The system has been rigorously tested on a number of standard benchmark datasets and has been found to outperform current state-of-the-art systems in terms of both its processing speed and classification accuracy. The framework developed in this thesis, as supported by the results, lays the foundation for monocular viewpoint invariant human activity recognition. Moreover, this framework can be extended and tailored to multiple domains and applications with diverse requirements.

History

Campus location

Australia

Principal supervisor

Simon Egerton

Additional supervisor 1

Kuang Ye Chow

Year of Award

2011

Department, School or Centre

Information Technology (Monash University Malaysia)

Course

Doctor of Philosophy

Degree Type

DOCTORATE

Faculty

Faculty of Information Technology

Usage metrics

Keywords

Restricted access and full embargo thesis(doctorate)ethesis-20111018-005051 Human activity recognition monash:64599 1959.1/488426 2011

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Restricted Access

Viewpoint invariant human activity recognition using pose series

History

Campus location

Principal supervisor

Additional supervisor 1

Year of Award

Department, School or Centre

Course

Degree Type

Faculty

Usage metrics

Categories

Keywords

Licence

Exports