Related literature
- 以文件倉儲概念實現動態群聚與多重文件摘要之研究--以中文電子新聞為例
- 網路論壇FAQ知識之自動轉換設計
- WWW資訊檢索的新趨勢--欄位檢索
- Social Dimensions of the Digital Revolution
- 網路文件自動分類
- Searching for Information on the Internet Using Medical World Search
- Information Extraction: Beyond Document Retrieval
- An Assessment of Character-based Chinese News Filtering Using Latent Semantic Indexing
- 架構在WWW與Z39.50上的近似自然語言OPAC檢索系統
- 中文全文資訊檢索研究架構與重要議題探討
Title | 以文件倉儲概念實現動態群聚與多重文件摘要之研究--以中文電子新聞為例 = A Study on Multi-Document Summarization Based on Document Warehousing and Dynamic Clustering--Using Internet News as Examples |
---|---|
Author(s) (Chinese) | 魏玲玉; 曾守正 |
Journal | 資訊管理學報 |
Volume/Issue | 13:3, 民95.07 (July 2006) |
Pages | 153-176 |
Classification number | 028.7 |
Keywords | 資訊檢索; 文件倉儲; 多文件摘要; 文件群聚; Information retrieval; Document warehouse; Multi-document summarization; Document clustering |
Language | Chinese |
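Chinese abstract (translated) | With the explosive growth in the number of electronic documents, organizing documents efficiently so that they can later be browsed and queried quickly has become an urgent issue in knowledge management. Traditional full-text retrieval based on an inverted index file often returns a large and disorganized set of documents, which must be filtered further before the truly useful ones are found. This mode of use no longer meets users' need for quick browsing and querying. In this paper, we apply the concept of document warehousing to store documents in a structured form and, together with a multi-dimensional query mechanism, find related documents for multi-document summarization and dynamic clustering. The overall concept is validated by implementing DNCSS (Dynamic News Clustering and Summarization System). We carry the way data warehousing handles numerical data over to document data: we build a document warehouse that uses the structured information contained in documents for storage, search, and integration, and that supports multi-dimensional queries. We further use dynamic clustering to help users organize the results returned by queries against the document warehouse. Finally, a multi-document summarization system produces one multi-document summary for each document cluster, so that users can browse the essential content of a document collection and obtain useful information more efficiently. We verify the system using online news documents from the major news sites in Taiwan. Manual evaluation yielded quite positive results, indicating that this research does help users obtain documents that meet their needs quickly and effectively. |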
English abstract | As electronic documents proliferate rapidly, contemporary knowledge management needs a mechanism for integrating and sorting huge volumes of documents to support quick browsing and efficient query processing. Traditional full-text search systems are usually based on inverted indexes, and the results they return are usually huge in volume and unsorted, which makes it hard for users to determine the information embedded in the collection. For document searching over the Internet, such systems therefore no longer satisfy users' needs. In this paper, we propose a general framework for document clustering and multi-document summarization based on the concept of document warehousing. Based on this framework, we have implemented a prototype system, DNCSS (Dynamic News Clustering and Summarization System), as the test bed for our approach. The system adopts the concept of document warehousing, which models text-oriented documents along multiple dimensions. The document warehouse serves as the main repository of the system and flexibly organizes document structure information for searching and querying. The documents retrieved from the warehouse are then clustered to provide a more organized structure. Finally, the system generates a multi-document summary for each cluster to help users find distilled information more efficiently. We collected news from the most popular online news sources in Taiwan as test examples to verify the effectiveness of the system. The evaluation results show that our approach relieves users of reading large amounts of related news and helps them reach the necessary conclusions effectively. |
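
The pipeline outlined in the abstract (structured storage, multi-dimensional query, clustering of the retrieved documents, one summary per cluster) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the names `NewsDoc`, `query`, `cluster`, and `summarize`, the `source` and `date` dimensions, single-pass cosine clustering, and frequency-based sentence extraction are stand-ins, not the algorithms used in DNCSS. Tokenization is by whitespace, so Chinese news text would first need word segmentation.

```python
from collections import Counter
from dataclasses import dataclass
import math


@dataclass
class NewsDoc:
    """Hypothetical warehouse record: structured dimensions plus body text."""
    doc_id: int
    source: str
    date: str  # ISO date string, so string comparison orders correctly
    text: str


def tokens(text):
    """Lowercase whitespace tokens with surrounding punctuation stripped."""
    words = (w.strip(".,").lower() for w in text.split())
    return [w for w in words if w]


def query(warehouse, source=None, date_from=None, date_to=None):
    """Multi-dimensional filter: every dimension that is given must match."""
    return [d for d in warehouse
            if (source is None or d.source == source)
            and (date_from is None or d.date >= date_from)
            and (date_to is None or d.date <= date_to)]


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def cluster(docs, threshold=0.3):
    """Single-pass clustering: a document joins the first cluster whose
    seed document is similar enough, otherwise it starts a new cluster."""
    clusters = []
    for d in docs:
        vec = Counter(tokens(d.text))
        for c in clusters:
            if cosine(vec, c["seed"]) >= threshold:
                c["docs"].append(d)
                break
        else:
            clusters.append({"seed": vec, "docs": [d]})
    return [c["docs"] for c in clusters]


def summarize(docs, n=1):
    """Extractive summary: the n sentences whose words are, on average,
    most frequent across the whole cluster."""
    freq = Counter(t for d in docs for t in tokens(d.text))
    sentences = [s.strip() for d in docs for s in d.text.split(".") if s.strip()]

    def score(sentence):
        ts = tokens(sentence)
        return sum(freq[t] for t in ts) / len(ts) if ts else 0.0

    return sorted(sentences, key=score, reverse=True)[:n]
```

A typical call sequence would be `hits = query(warehouse, date_from="2006-07-01")`, then `for group in cluster(hits): print(summarize(group))`. The clustering runs on query results rather than on the whole collection, which is the sense in which the abstract calls it dynamic.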
The abstract information in this system is based primarily on the abstract published with the journal article.