頁籤選單縮合
題 名 | Parallel Information Retrieval on Cluster of Workstations |
---|---|
作 者 | 馬詠程; 陳添福; 鍾崇斌; | 書刊名 | 中華民國資訊學會通訊 |
卷 期 | 3:3 2000.09[民89.09] |
頁 次 | 頁11-21 |
專 輯 | 網際網路與分散式系統 |
分類號 | 028.7 |
關鍵詞 | 並行資訊檢索; Parallel information retrieval; |
語 文 | 英文(English) |
英文摘要 | The rapid growth of Internet brings new challenges on designing a scalable informationretrieval system. To reduce the user response time, we investigate the problem of parallelizingBoolean query processing on a cluster of workstations. The key issue is to partition the posting filesuch that, during parallel query processing, each workstation consults only its own locally residentdata to complete its task. This is achieved by making all postings corresponding to a document asnon-separable objects in the posting file partitioning. Following the partitioning by document IDprinciple, we develop partitioning algorithms to transform a sequential information retrieval systemto a parallel information retrieval system. The partitioning schemes are designed to balanceworkload between workstations without increasing the average time to process a posting. Theexperiment shows that almost linear speed-up can be achieved. This work shows that parallelprocessing technique is feasible to build a scalable information retrieval system. |
本系統中英文摘要資訊取自各篇刊載內容。