ACADI
Automatic Character (in Audiovisual Document) Indexing (ACADI)
Leader: Julien Pinquier, UPS-IRIT
Description: We propose a system that describes and structures audiovisual documents without training or prior knowledge of the corpus, and that displays the principal interventions in an interface. Our goal is to fuse three segmentation systems (face, costume and speaker detectors) to obtain the best association between voices and the persons appearing in an audio/video sequence. The interface serves as a tool for verifying the segmentation results.
Current state of the art of the showcase:
The first step in building our interface is to parse the results of the segmentation systems. Because we use the XML file format for data exchange, this was straightforward. The application builds a sequence object from the XML result files; this object contains the segmentation of the sequence produced by the three detectors (see figure 1). Using audio/video decoders, we can retrieve images from the video to illustrate the segmentation and also play the video segments. The interface developed to visualize these results already meets the primary requirements:
Video of the Automatic Character (in Audiovisual Document) Indexing (ACADI)
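
As an illustration of the parsing step described above, here is a minimal Python sketch of how XML segmentation results might be loaded into a sequence object. The actual ACADI XML schema is not given in this description, so the element and attribute names (`sequence`, `detector`, `segment`, `label`, `start`, `end`) and the `Segment`, `Sequence`, `parse_sequence` and `cooccurrence` names are all assumptions made for the example. The co-occurrence helper is a simple temporal-overlap count, shown only to suggest how voices and faces could be compared; it is not the fusion method used by the project.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field


@dataclass
class Segment:
    label: str    # identifier assigned by a detector (hypothetical)
    start: float  # start time in seconds
    end: float    # end time in seconds


@dataclass
class Sequence:
    # Maps a detector name ("face", "costume", "speaker") to its segments.
    tracks: dict = field(default_factory=dict)


def parse_sequence(xml_text: str) -> Sequence:
    """Build a Sequence from XML text using the assumed schema."""
    root = ET.fromstring(xml_text)
    seq = Sequence()
    for det in root.findall("detector"):
        segs = [
            Segment(s.get("label"), float(s.get("start")), float(s.get("end")))
            for s in det.findall("segment")
        ]
        seq.tracks[det.get("name")] = sorted(segs, key=lambda s: s.start)
    return seq


def overlap(a: Segment, b: Segment) -> float:
    """Duration in seconds during which both segments are active."""
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def cooccurrence(seq: Sequence, track_a: str = "speaker", track_b: str = "face") -> dict:
    """Total overlap time for each (label_a, label_b) pair across two tracks."""
    totals = {}
    for a in seq.tracks.get(track_a, []):
        for b in seq.tracks.get(track_b, []):
            d = overlap(a, b)
            if d > 0:
                key = (a.label, b.label)
                totals[key] = totals.get(key, 0.0) + d
    return totals


# Hypothetical input; real detector output will differ.
EXAMPLE_XML = """
<sequence>
  <detector name="face">
    <segment label="F1" start="0.0" end="10.0"/>
    <segment label="F2" start="10.0" end="18.0"/>
  </detector>
  <detector name="costume">
    <segment label="C1" start="0.0" end="9.5"/>
  </detector>
  <detector name="speaker">
    <segment label="S1" start="1.0" end="8.0"/>
    <segment label="S2" start="9.0" end="17.0"/>
  </detector>
</sequence>
"""

if __name__ == "__main__":
    sequence = parse_sequence(EXAMPLE_XML)
    for pair, seconds in sorted(cooccurrence(sequence).items()):
        print(pair, seconds)
```

Keeping one list of segments per detector lets an interface draw each detector as its own timeline row, while the overlap table gives a rough view of which voice tends to co-occur with which face.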