Journal of East China Normal University (Natural Science) ›› 2006, Vol. 2006 ›› Issue (3): 93-98.

• Computer Science •

Design and Implement of a Knowledge-Based Crawler (Chinese)

YANG De-ren, GU Jun-zhong

  1. Institute of Computer Application, East China Normal University, Shanghai 200062, China
  • Received: 2005-07-30 Revised: 2006-01-09 Online: 2006-05-25 Published: 2006-05-25
  • Corresponding author: YANG De-ren

Abstract: This paper introduces the principle of web page reachability, a knowledge modeling method, and a mechanism for mapping between the knowledge model and the knowledge contained in web pages. It describes the main components of a knowledge-based web crawler and the key techniques used to implement it, and proposes a knowledge relevance calculation model that can compute the knowledge content of a page. This knowledge extraction method can be used to build a new generation of intelligent search engines.

Key words: knowledge, crawler, model, mapping mechanism, implementation techniques
