This paper presents the building blocks of a semi-automatic annotation tool that supports multi-modal and multi-level annotation of meetings. The main focus is on the design and functionality of the modules for recognizing meeting actions. The key features, the identity and position of the speakers, are provided by two modalities (audio and video). Three audio algorithms (Voice Activity Detection, Speaker Identification and Direction of Arrival) and three video algorithms (Detection, Tracking and Identification) form the low-level feature extraction components. The low-level features are merged automatically, and the recognized actions are proposed to the user through visualization. The annotation labels are related, but not limited, to events during meetings.
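To make the merging step concrete, the following is a minimal sketch of how per-frame audio and video features might be combined into proposed "speaking" actions. It is not the tool's actual implementation: the data structures (`AudioFrame`, `VideoFrame`, `ProposedAction`), the function `propose_actions`, the fixed frame length, and the rule of preferring the video track position over the Direction of Arrival estimate are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class AudioFrame:
    t: float                     # frame start time in seconds
    voice_active: bool           # hypothetical Voice Activity Detection output
    speaker_id: Optional[str]    # hypothetical Speaker Identification output
    doa_deg: Optional[float]     # hypothetical Direction of Arrival estimate


@dataclass
class VideoFrame:
    t: float                     # frame start time in seconds
    positions: Dict[str, float]  # identified person -> tracked azimuth in degrees


@dataclass
class ProposedAction:
    label: str
    speaker: str
    start: float
    end: float
    position_deg: Optional[float]


def propose_actions(audio: List[AudioFrame],
                    video: List[VideoFrame],
                    frame_len: float = 1.0) -> List[ProposedAction]:
    """Merge time-aligned audio and video frames into proposed actions.

    Assumes both lists are sampled on the same time grid. Consecutive
    frames with the same active speaker are joined into one segment.
    """
    actions: List[ProposedAction] = []
    for a, v in zip(audio, video):
        if not a.voice_active or a.speaker_id is None:
            continue  # silence or unknown speaker: nothing to propose
        # Assumed rule: prefer the video track position, fall back to audio DOA.
        position = v.positions.get(a.speaker_id, a.doa_deg)
        last = actions[-1] if actions else None
        if last and last.speaker == a.speaker_id and abs(last.end - a.t) < 1e-9:
            last.end = a.t + frame_len  # extend the running segment
        else:
            actions.append(ProposedAction("speaking", a.speaker_id,
                                          a.t, a.t + frame_len, position))
    return actions
```

In a semi-automatic workflow, segments like these would only be suggestions: the user would confirm, correct, or discard each one in the visualization.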