Use this resource - and many more! - in your textbook!
AcademicPub holds over eight million pieces of educational content for you to mix-and-match your way.
Generating phonetic cognates to handle named entities in English-Chinese cross-language spoken document retrieval
By: Tang, K.; Berlin Chen; Wai-Kit Lo; Meng, H.M.;
2001 / IEEE / 0-7803-7343-X
This item was taken from the IEEE Conference ' Generating phonetic cognates to handle named entities in English-Chinese cross-language spoken document retrieval ' We have developed a technique for automatic transliteration of named entities for English-Chinese cross-language spoken document retrieval (CL-SDR). Our retrieval system integrates machine translation, speech recognition and information retrieval technologies. An English news story forms a textual query that is automatically translated into Chinese words, which are mapped into Mandarin syllables by pronunciation dictionary lookup. Mandarin radio news broadcasts form spoken documents that are indexed by word and syllable recognition. The information retrieval engine performs matching in both word and syllable scales. The English queries contain many named entities that tend to be out-of-vocabulary words for machine translation and speech recognition, and are omitted in retrieval. Names are often transliterated across languages and are generally important for retrieval. We present a technique that takes in a name spelling and automatically generates a phonetic cognate in terms of Chinese syllables to be used in retrieval. Experiments show consistent retrieval performance improvement by including the use of named entities in this way.
Phonetic Cognate Generation
English-chinese Spoken Document Retrieval
Cross-language Spoken Document Retrieval
English News Story
Pronunciation Dictionary Lookup
Mandarin Radio News Broadcasts
Digital Multimedia Broadcasting
Natural Language Interfaces