查詢結果分析
來源資料
相關文獻
- 塔米爾文語音辨識系統之設計研究
- 西文語音辨識系統之設計研究
- 以連續型隱藏式馬可夫模型來計算中文簽名之動態相似度值
- An Overview of RNN-Based Mandarin Speech Recognition Approaches
- 以色彩、紋理、和外形為內容之影像擷取方法
- 基於VQ/HMM之國語文句翻語音中音節音長與振幅參數產生之方法
- An Overview of Mandarin-Speech Tone Recognition
- A Survey on Automatic Speech Recognition with an Illustrative Example on Continuous Speech Recognition of Mandarin
- 中文連續音節辨識之研究
- 隱藏式馬可夫模型介紹
頁籤選單縮合
題 名 | 塔米爾文語音辨識系統之設計研究 |
---|---|
作 者 | 陳志堅; 林威廷; | 書刊名 | 資訊科技國際期刊 |
卷 期 | 8:1 2014.06[民103.06] |
頁 次 | 頁60-69 |
分類號 | 312.8454 |
關鍵詞 | 塔米爾文語音辨識系統; 梅爾頻率倒頻譜係數; 線性預估倒頻譜係數; 隱藏式馬可夫模型; 音位結構學; Tamil speech recognition system; Mel-frequency cepstral coefficients; Linear predictive cepstral coefficients; Hidden Markov model; Phonotactics; |
語 文 | 中文(Chinese) |
中文摘要 | 本論文針對南印度、斯里蘭卡及新加坡等地常用之塔米爾語,作語音辨識之研究,根據塔米爾語之發音方式,找出 149類常用單音節,以每輪每次錄製兩個一四聲不同聲調之相同單音,錄製五輪共十次之策略,作為單音訓練之方式,建立塔米爾語資料庫中,共 149類的聲紋特徵。本系統萃取梅爾頻率倒頻譜係數與線性預估倒頻譜係數作為語音之特徵,並利用隱藏式馬可夫模型及音位結構學比對,建立辨識系統。在 CPU時脈 2.5 GHz的 Intel Core 2 Quad個人電腦與 Ubuntu 10.04作業系統環境下,本論文實作了 3,500筆塔米爾文常用語詞辨識系統,系統之正確率可達到 88.74%,平均辨識時間約在 1.5秒以內。而系統之總訓練時間約為 1.5小時。 |
英文摘要 | Tamil language is widely used in southern India, Sri Lanka and Singapore. This paper investigates the design and implementation strategies for a Tamil speech recognition system. It utilizes the speech features of the 149 common Tamil mono-syllables as the major training and recognition methodology. A training database of 10 utterances per mono-syllable is established by applying Tamil pronunciation rules. These 10 utterances are collected through reading five rounds of the same mono-syllables twice with different tones. The first pronounced pattern has high pitch of tone one, while the second one has falling pitch of tone four. Mel-frequency cepstral coefficients, linear predicted cepstral coefficients, and hidden Markov model are used as the two feature models and the syllable recognition model respectively. The recognized syllable strings are then refined by phonotactical rules to obtain the optimal result. Under the Intel Core 2 Quad 2.5 GHz personal computer and Ubuntu 10.04 operating system environment, a correct phrase recognition rate of 88.74% can be reached for a 3,500 Tamil phrase database. The average computation time for the system is within 1.5 seconds, and the training time for the systems is about one and half hours. |
本系統中英文摘要資訊取自各篇刊載內容。