copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Parsing Videos of Actions with Segmental Grammars

H. Pirsiavash, and D. Ramanan. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, page 612-619. (2014)
DOI: 10.1109/CVPR.2014.85

Abstract

Real-world videos of human activities exhibit temporal structure at various scales, long videos are typically composed out of multiple action instances, where each instance is itself composed of sub-actions with variable durations and orderings. Temporal grammars can presumably model such hierarchical structure, but are computationally difficult to apply for long video streams. We describe simple grammars that capture hierarchical temporal structure while admitting inference with a finite-state-machine. This makes parsing linear time, constant storage, and naturally online. We train grammar parameters using a latent structural SVM, where latent subactions are learned automatically. We illustrate the effectiveness of our approach over common baselines on a new half-million frame dataset of continuous YouTube videos.

Links and resources

BibTeX key: PirsiavashRamanan14CVPR
entry type: inproceedings
booktitle: Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA
year: 2014
pages: 612-619
file: IEEE Digital Library:2014/PirsiavashRamanan14CVPR.pdf:PDF;Related MIT News:http\://newsoffice.mit.edu/2014/techniques-from-natural-language-processing-enable-computers-to-search-video-0514:URL
groups: public
intrahash: 2a7d54c472dcb065b78eb681e939b8e2
DOI: 10.1109/CVPR.2014.85
timestamp: 2014.10.17
username: flint63

@flint63's tags highlighted

Cite this publication

search on

Meta data

Last update 6 years ago
Created 10 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Parsing Videos of Actions with Segmental Grammars

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Parsing Videos of Actions with Segmental Grammars

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Parsing Videos of Actions with Segmental Grammars

Comments and Reviews
(0)