论文部分内容阅读
“蝎子计划(Scorpion Project)”是美国 OCLC 利用《杜威十进分类法》电子编辑支持系统(ESS)对数字信息资源进行自动分类和主题识别的一个研究项目。本文简要介绍了该项目的进展情况、实施原理,描述了 Scorpion 对数字信息资源进行自动分类和主题识别的具体流程,并将其与我们自行研发的基于《中图法》知识库的中文信息自动标引和自动分类系统进行对比分析,以探讨 Scorpion 对中文信息自动分类和主题识别的借鉴意义。
“Scorpion Project” is a research project that OCLC uses the “Dewey Decimal Classification” Electronic Editing Support System (ESS) to automatically classify and identify digital information resources. This paper gives a brief introduction of the project’s progress and implementation principle, describes the specific process of Scorpion’s automatic classification and topic identification of digital information resources, and compares it with our self-developed Chinese information based on the “China Library Law” knowledge base Indexing and automatic classification system for comparative analysis to explore Scorpion Chinese automatic classification of information and the theme of reference significance.