河北大学学报(自然科学版) ›› 2010, Vol. 30 ›› Issue (1): 107-112.DOI: 10.3969/j.issn.1000-1565.2010.01.023

• • 上一篇    

基于启发式规则的Deep Web接口发现

杨丽华,袁方,姚增利,王煜   

  1. 河北大学,数学与计算机学院,河北,保定,071002
  • 出版日期:2010-01-25 发布日期:2010-01-25
  • 基金资助:
    河北省教育厅科学研究重点项目

Discovery of Deep Web Interface Based on Heuristic Rules

YANG Li-hua,YUAN Fang,YAO Zeng-li,WANG Yu   

  • Online:2010-01-25 Published:2010-01-25

摘要: 为了有效地利用Deep Web资源,Deep Web数据集成成为当前研究的热点之一.能否高效地发现Deep Web站点是Deep Web数据集成的基础和关键.在此,提出了一种Deep Web接口发现方法,包括基于领域知识来确定合适的查询提交词和用启发式规则发现领域内Deep Web接口.实验结果表明,该方法达到了较高的准确率和召回率,具有良好的可行性和实用性.

关键词: 领域知识, 启发式规则, DeepWeb接口发现

Abstract: To make use of deep web resource effectively, Deep Web data integration has become one of the hot-spot in current study. It is the basis and crucial to integrate deep web data that whether or not discovery deep web sites efficiently. In this case, we present a deep web interface discovery method, which includes to deterimine the query terms based on domain knowledge and to discovery deep web interfaces with heuristic rules. The experimental results show that the method can achieve high accuracy and recall with good feasibility and practicability.

Key words: domain knowledge, heuristic rules, deep web interface discovery

中图分类号: