Multimedia event detection (MED) is a challenging problem because of the heterogeneous content and variable quality found in large collections of Internet videos. To study the value of multimedia features and fusion for representing and learning events from a set of example video clips, we created SESAME, a system for video SEarch with Speed and Accuracy for Multimedia Events. SESAME includes multiple bag-of-words event classifiers based on single data types: low-level visual, motion, and audio features; high-level semantic visual concepts; and automatic speech recognition. Event detection performance was evaluated for each event classifier. The performance of low-level visual and motion features was improved by the use of difference coding. The accuracy of the visual concepts was nearly as strong as that of the low-level visual features. Experiments with a number of fusion methods for combining the event detection scores from these classifiers revealed that simple fusion methods, such as arithmetic mean, perform as well as or better than other, more complex fusion methods. SESAME's performance in the 2012 TRECVID MED evaluation was one of the best reported.
%0 Journal Article
%1 MyersNallapatiEtAl14mva
%A Myers, Gregory K.
%A Nallapati, Ramesh
%A Hout, Julien van
%A Pancoast, Stephanie
%A Nevatia, Ramakant
%A Sun, Chen
%A Habibian, Amirhossein
%A Koelma, Dennis C.
%A Sande, Koen E. A. van de
%A Smeulders, Arnold W. M.
%A Snoek, Cees G. M.
%D 2014
%J Machine Vision and Applications
%K v1205 springer paper ai multimedia speech video image semantic analysis recognition knowledge processing zzz.vitra
%N 1
%P 17-32
%R 10.1007/s00138-013-0527-8
%T Evaluating Multimedia Features and Fusion for Example-based Event Detection
%V 25
%X Multimedia event detection (MED) is a challenging problem because of the heterogeneous content and variable quality found in large collections of Internet videos. To study the value of multimedia features and fusion for representing and learning events from a set of example video clips, we created SESAME, a system for video SEarch with Speed and Accuracy for Multimedia Events. SESAME includes multiple bag-of-words event classifiers based on single data types: low-level visual, motion, and audio features; high-level semantic visual concepts; and automatic speech recognition. Event detection performance was evaluated for each event classifier. The performance of low-level visual and motion features was improved by the use of difference coding. The accuracy of the visual concepts was nearly as strong as that of the low-level visual features. Experiments with a number of fusion methods for combining the event detection scores from these classifiers revealed that simple fusion methods, such as arithmetic mean, perform as well as or better than other, more complex fusion methods. SESAME's performance in the 2012 TRECVID MED evaluation was one of the best reported.
@article{MyersNallapatiEtAl14mva,
  abstract  = {Multimedia event detection (MED) is a challenging problem because of the heterogeneous content and variable quality found in large collections of Internet videos. To study the value of multimedia features and fusion for representing and learning events from a set of example video clips, we created SESAME, a system for video SEarch with Speed and Accuracy for Multimedia Events. SESAME includes multiple bag-of-words event classifiers based on single data types: low-level visual, motion, and audio features; high-level semantic visual concepts; and automatic speech recognition. Event detection performance was evaluated for each event classifier. The performance of low-level visual and motion features was improved by the use of difference coding. The accuracy of the visual concepts was nearly as strong as that of the low-level visual features. Experiments with a number of fusion methods for combining the event detection scores from these classifiers revealed that simple fusion methods, such as arithmetic mean, perform as well as or better than other, more complex fusion methods. SESAME's performance in the 2012 TRECVID MED evaluation was one of the best reported.},
  added-at  = {2014-09-20T17:25:45.000+0200},
  author    = {Myers, Gregory K. and Nallapati, Ramesh and van Hout, Julien and Pancoast, Stephanie and Nevatia, Ramakant and Sun, Chen and Habibian, Amirhossein and Koelma, Dennis C. and van de Sande, Koen E. A. and Smeulders, Arnold W. M. and Snoek, Cees G. M.},
  biburl    = {https://www.bibsonomy.org/bibtex/20c2d5fb81f7ac429819c33bc826e8acc/flint63},
  doi       = {10.1007/s00138-013-0527-8},
  file      = {SpringerLink:2014/MyersNallapatiEtAl14mva.pdf:PDF},
  groups    = {public},
  interhash = {71678fb9a930c5bc31fa7f7c8ed96ac4},
  intrahash = {0c2d5fb81f7ac429819c33bc826e8acc},
  issn      = {0932-8092},
  journal   = {Machine Vision and Applications},
  keywords  = {v1205 springer paper ai multimedia speech video image semantic analysis recognition knowledge processing zzz.vitra},
  month     = jan,
  number    = 1,
  pages     = {17--32},
  timestamp = {2018-04-16T12:06:16.000+0200},
  title     = {Evaluating Multimedia Features and Fusion for Example-based Event Detection},
  username  = {flint63},
  volume    = 25,
  year      = 2014
}