As the Internet comes into widespread use in China and the volume of Chinese-language information on the WWW keeps growing, there is an urgent need for a Chinese-English Web indexing and retrieval service system suited to conditions in China. The information discovery and search engine of the WWW, also called a robot, searches for and retrieves the relevant data within a specified scope. This paper discusses and analyzes the working principles and key technologies of Web search engines, and describes our work on developing a Chinese-English Web indexing and retrieval server, including the overall system architecture and Chinese word segmentation.