Speaker identification and video analysis for hierarchical video shot classification
By: Nam, J.; Tewfik, A.H.; Enis Cetin, A.
1997 / IEEE / 0-8186-8183-7
This item was taken from the IEEE conference paper 'Speaker identification and video analysis for hierarchical video shot classification'.
We present a new video shot classification and clustering technique to support content-based indexing, browsing and retrieval in video databases. The proposed method is based on the analysis of both the audio and visual data tracks. The visual stream is analyzed using a 3-D wavelet transform and segmented into shot units, which are matched and clustered by visual content. Simultaneously, speaker changes are detected by tracking voiced phonemes in the audio signal. The clues obtained from the video and speech data are combined to classify and group the isolated video shots. This integrated approach also allows effective indexing of the audio-visual objects in multimedia databases.
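The abstract does not give implementation details, so the following is only a rough illustration of how a wavelet-style analysis along the time axis can segment a visual stream into shots. It is a simplified stand-in and not the authors' method: it assumes a grayscale video stored as a NumPy array of shape (frames, height, width), uses a one-level Haar filter (spatial low-pass followed by an undecimated temporal high-pass), and marks a boundary wherever the temporal detail energy exceeds a mean-plus-k-standard-deviations threshold. All function names and the parameter `k` are hypothetical.

```python
import numpy as np


def spatial_haar_lowpass(frames):
    """One-level 2-D Haar approximation band of every frame.

    Each 2x2 pixel block is summed and divided by 2 (orthonormal Haar scaling).
    Odd trailing rows/columns are dropped for simplicity.
    """
    _, h, w = frames.shape
    h2, w2 = h - h % 2, w - w % 2
    f = frames[:, :h2, :w2].astype(np.float64)
    return (f[:, 0::2, 0::2] + f[:, 0::2, 1::2]
            + f[:, 1::2, 0::2] + f[:, 1::2, 1::2]) / 2.0


def temporal_detail_energy(frames):
    """Mean squared Haar detail coefficient between consecutive frames.

    The temporal high-pass is undecimated (one value per frame pair t, t+1),
    so a cut is not missed because of its position in the sequence.
    """
    low = spatial_haar_lowpass(frames)
    detail = (low[1:] - low[:-1]) / np.sqrt(2.0)
    return (detail ** 2).mean(axis=(1, 2))


def detect_shot_boundaries(frames, k=3.0):
    """Return indices of frames that start a new shot.

    Assumption: a boundary is any frame pair whose detail energy is more
    than k standard deviations above the mean energy of the sequence.
    """
    energy = temporal_detail_energy(frames)
    threshold = energy.mean() + k * energy.std()
    return [int(i) + 1 for i in np.flatnonzero(energy > threshold)]


if __name__ == "__main__":
    # Synthetic clip: 10 dark frames followed by 10 bright frames.
    video = np.zeros((20, 8, 8))
    video[10:] = 255.0
    print(detect_shot_boundaries(video))
```

A global threshold like this only suits abrupt cuts in short clips; gradual transitions and the matching, clustering and speaker-change stages described in the abstract are outside the scope of this sketch.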
Video Shots Matching
Hierarchical Video Shot Classification
Visual Data Tracks
Audio Data Tracks
Visual Stream Analysis
3-D Wavelet Transform
Speaker Changes Detection
Voiced Phonemes Tracking
Audio-visual Objects Indexing
Content Based Retrieval
Video Signal Processing