Clustering of Imperfect Transcripts Using a Novel Similarity Measure
Abstract There has been a surge of interest in the last several years in methods for automatically generating content indices for multimedia documents, particularly video and audio documents. As a result, there is much interest in methods for analyzing transcribed documents from audio and video broadcasts and from telephone conversations and messages. The present paper addresses such analysis by presenting a clustering technique that partitions a set of transcribed documents into meaningful topics. Our method determines the intersection between matching transcripts, evaluates the information contribution of each transcript, assesses the information closeness of overlapping words, and calculates similarity based on the chi-square method. The main novelty of our method lies in the proposed similarity measure, which is designed to withstand the imperfections of transcribed documents. Experimental results on documents of varying transcription quality are presented to demonstrate the efficacy of the proposed methodology.
Ibrahimov, Oktay (author) / Sethi, Ishwar (author) / Dimitrova, Nevenka (author)
2002-01-01
13 pages
Article/Chapter (Book)
Electronic Resource
English
Clustering of Imperfect Transcripts Using a Novel Similarity Measure
British Library Conference Proceedings | 2002
A novel similarity measure for pseudo-generalized fuzzy rough sets
British Library Online Contents | 2019
A novel method to measure the semantic similarity of HPO terms
British Library Online Contents | 2017
Comparing and Clustering Residential Layouts Using a Novel Measure of Grating Difference
Springer Verlag | 2021
Fault detection and isolation of DURUMI-II using similarity measure
British Library Online Contents | 2009