Title | Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpora for Concatenation-based TTS |
---|---|
Authors | Lin, Cheng-yuan; Jang, Roger Jyh-shing; Chen, Kuan-ting |
Journal | International Journal of Computational Linguistics & Chinese Language Processing |
Volume/Issue | 10:2, June 2005 |
Pages | pp. 145-166 |
Classification No. | 312.85 |
Keywords | Speech assessment methods phonetic alphabet; Speech corpus; Sequential forward selection; K-nearest neighbor rule; Leave-one-out; Speaker-adapted model; Context-dependent hidden Markov model; HMM |
Language | English |
Abstract | Precise phone/syllable boundary labeling of the utterances in a speech corpus plays an important role in constructing a corpus-based TTS (text-to-speech) system. However, automatic labeling based on Viterbi forced alignment does not always produce satisfactory results. Moreover, a labeling method that suits one language does not necessarily produce desirable results for another. Hence, in this paper, we propose a new procedure for refining the boundaries of utterances in a Mandarin speech corpus. This procedure employs different sets of acoustic features for four different phonetic categories. In addition, a new scheme is proposed to deal with the “periodic voiced + periodic voiced” case, which produced most of the segmentation errors in our experiment. Several experiments were conducted to demonstrate the feasibility of the proposed approach. |