頁籤選單縮合
題 名 | 資訊檢索之中文詞彙擴展=Expansion of Chinese Words in Information Retrieval |
---|---|
作 者 | 陳光華; 莊雅蓁; | 書刊名 | 資訊傳播與圖書館學 |
卷 期 | 8:1 2001.09[民90.09] |
頁 次 | 頁59-75 |
分類號 | 028.7 |
關鍵詞 | 資訊檢索; 查詢問句擴展; 索引典; 同義詞; Information retrieval; Query expansion; Thesaurus; Synonym; |
語 文 | 中文(Chinese) |
中文摘要 | 本研究主要探討議題有三:一,自動建構之同義詞典對資訊檢索之輔助效益;二,以何種索引典詞彙關係來擴展查詢問句可得到最佳的效益;三,同義詞典與索引典整合輔助檢索的效益分析。限於詞彙資源取得不易,本研究採用實驗文件資料庫為基礎,以進行查詢問句擴展的實驗。首先,蒐集原始查詢問句,再以不同的詞彙來源,包括以同義詞典及索引典分別擴展查詢問句,以及整合兩者再擴展查詢問句,建構多組不同的查詢問句擴展模式。實驗結果的效益評估,由人工進行相關判斷,再依判斷所得計算檢索結果的求準率。研究顯示,以同義詞典詞彙群內詞彙數量較少的層次來擴展查詢問句,可得到較好的檢索效益;不過以索引典各種詞彙關係來擴展查詢問句時,檢索結果沒有顯著的差異。整體而言,以整合所有詞彙關係的擴展模式有較好的檢索效益可略為提升。但如再以索引典進行二次擴展時,檢索效益反而附低。實驗亦發現自重建構的同義詞典內容,受斷詞品質的優劣所影響,因此,對查詢問句擴展的檢索效益而言,字串比對方式亦是重要的影響因素。 |
英文摘要 | This thesis aims at three important issues for query expansion: whether the automatic constructed synonym dictionary could enhance the retrieval effectiveness, which relationship of thesaurus has the best performance, and the effectiveness of the integration of synonym dictionary and thesaurus. In the experiments of query expansion, the queries are expanded in different models, including expanding by either synonym dictionary or thesaurus or both. Finally, performance is evaluate din precision. The results show that query expansion using second level of synonym dictionary has better performance. Though the effects of different relationships prescribed in the thesaurus are similar, expanding by union of all relationships shows better performance. The model of first expanding query by synonym dictionary then modifying it by thesaurus has improved retrieval performance slightly, but the performance is decreased in further expansion. We also find that the correctness of word segmentation has a great impact on the quality of synonym dictionary. The mode of string mapping is another important factor. |
本系統中英文摘要資訊取自各篇刊載內容。