Title | Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpora for Concatenation-based TTS |
---|---|
Authors | Lin, Cheng-yuan; Jang, Roger Jyh-shing; Chen, Kuan-ting |
Journal | International Journal of Computational Linguistics & Chinese Language Processing |
Volume/Issue | 10:2 (June 2005) |
Pages | pp. 145-166 |
Classification no. | 312.85 |
Keywords | Speech assessment methods phonetic alphabet; Speech corpus; Sequential forward selection; K-nearest neighbor rule; Leave-one-out; Speaker-adapted model; Context-dependent hidden Markov model; HMM |
Language | English |
English abstract | Precise phone/syllable boundary labeling of the utterances in a speech corpus plays an important role in constructing a corpus-based TTS (text-to-speech) system. However, automatic labeling based on Viterbi forced alignment does not always produce satisfactory results. Moreover, a suitable labeling method for one language does not necessarily produce desirable results for another language. Hence in this paper, we propose a new procedure for refining the boundaries of utterances in a Mandarin speech corpus. This procedure employs different sets of acoustic features for four different phonetic categories. In addition, a new scheme is proposed to deal with the “periodic voiced + periodic voiced” case, which produced most of the segmentation errors in our experiment. Several experiments were conducted to demonstrate the feasibility of the proposed approach. |
The Chinese and English abstracts in this system are taken from the articles as published.