頁籤選單縮合
題名 | The Formosan Language Archive: Linguistic Analysis and Language Processing= |
---|---|
作者 | Zeitoun, Elizabeth; Yu, Ching-hua; |
期刊 | International Journal of Computational Linguistics & Chinese Language Processing |
出版日期 | 20050600 |
卷期 | 10:2 民94.06 |
頁次 | 頁167-199 |
分類號 | 312.13 |
語文 | eng |
關鍵詞 | Formosan languages; Formosan language archive; Corpora; Linguistic analysis; Language processing; |
英文摘要 | In this paper, we deal with the linguistic analysis approach adopted in the Formosan Language Corpora, one of the three main information databases included in the Formosan Language Archive, and the language processing programs that have been built upon it. We first discuss problems related to the transcription of different language corpora. We then deal with annotation rules and standards. We go on to explain the linguistic identification of clauses, sentences and paragraphs, and the computer programs used to obtain an alignment of words, glosses and sentences in Chinese and English. We finally show how we try to cope with analytic inconsistencies through programming. This paper is a complement to Zeitoun et al. [2003] in which we provided an overview of the whole architecture of the Formosan Language Archive. |
本系統之摘要資訊系依該期刊論文摘要之資訊為主。