Multimedia White Papers

Audio-Visual Feature Extraction for Semi-Automatic Annotation of Meetings

Overview This paper presents the building blocks of the semi-automatic annotation tool which supports multi-modal and multi-level annotation of meetings. The main focus is on the proper design and functionality of the modules for recognizing meeting actions. The key features, identity and position of the speakers, are provided by different modalities (audio and video). Three audio algorithms (Voice Activity Detection, Speaker Identification and Direction of Arrival) and three video algorithms (Detection, Tracking and Identification) form the low-level feature extraction components. Low-level features are automatically merged and the recognized actions are proposed to the user by visualizing them. The annotation labels are related but not limited to events during meetings.

Further White Paper Details
PublisherGraz University of Technology File FormatPDF
Date PublishedJuly 2006 Downloads133
FormatWhite Papers   
Topics
Thin clients switch on digitally excluded

Thin clients switch on digitally excluded

Case study: Digital inclusion project tackles social exclusion in Liverpool more

Renault goes multilingual

Renault goes multilingual

Case study: Translation tech turns docs into 23 languages… more


Quick Sitemap Links: