Publication in refereed journal
香港中文大学研究人员 ( 现职)
程伯中教授 (电子工程学系) |
蒙美玲教授 (系统工程与工程管理学系) |
卢伟杰博士 (系统工程与工程管理学系) |
全文
没有全文档案提供 |
引用次数
Scopushttp://aims.cuhk.edu.hk/converis/portal/Publication/8Scopus source URL
其它资讯
摘要This paper presents the application of a multi-scale paradigm to Chinese spoken document retrieval (SDR) for improving retrieval performance. Multi-scale refers to the use of both words and subwords for retrieval. Words are basic units in a language that carry lexical meaning, and subword units (such as phonemes, syllables or characters) are building components for words. Retrieval using subword indexing units is better than retrieval using words because of the robustness of subword units to out-of-vocabulary (OOV) words during speech recognition and ambiguities in word segmentation. Experimental results have demonstrated that subword bigrams can bring improvement in retrieval performance over words (~9.56%). Application of multi-scale fusion to SDR aims at combining the lexical information of words and the robustness of subwords. This work presents the first detailed investigation for a Cantonese broadcast news retrieval task using two different multi-scale fusion approaches: pre-retrieval fusion and post-retrieval fusion. Multi-scale retrieval using both words and syllable bigrams achieves improvement in retrieval performance (~ 1.90%) over retrieval on the composite scales.
着者Lo W.-K., Meng H.M., Ching P.C.
期刊名称International Journal of Speech Technology
出版年份2004
月份4
日期1
卷号7
期次2-3
出版社Kluwer Academic Publishers
出版地Netherlands
页次203 - 219
国际标準期刊号13http://aims.cuhk.edu.hk/converis/portal/Publication/81-2416
语言英式英语
关键词Cantonese speech recognition, Multi-scale fusion, Speech retrieval