鲜学丰, 赵朋朋, 辛洁, 方巍, 崔志明. 基于领域样本查询的Deep Web数据库分类[J]. 微电子学与计算机, 2010, 27(3): 20-23.
XIAN Xue-feng, ZHAO Peng-peng, XIN Jie, FANG Wei, CUI Zhi-ming. Classification of Deep Web Databases Based on the Domain Sample Query[J]. Microelectronics & Computer, 2010, 27(3): 20-23.

基于领域样本查询的Deep Web数据库分类

Classification of Deep Web Databases Based on the Domain Sample Query

  • Abstract: This paper proposes an approach based on domain sample queries for classifying Deep Web databases that provide only a simple query interface. The approach automatically obtains a domain's main attributes by analyzing the descriptive attribute labels of the domain's advanced query interfaces, and uses domain knowledge to build sample queries for those main attributes. It then submits probing queries to the simple query interface and estimates the relevance between the Web database and the domain from the result schema and the record content of the returned result pages. Experiments on Web databases from several domains show that the approach is effective for classifying Web databases that provide only a simple query interface, achieving high classification precision, recall, and F-measure.
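The abstract does not give the scoring formula, so the following is only a minimal sketch of the idea of estimating domain relevance from the result schema and the record content of probing-query result pages. The names ResultPage, schema_score, content_score and domain_relevance, the equal weighting, and the substring-matching rule are all assumptions made for illustration and are not taken from the paper.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class ResultPage:
    """Parsed result page for one probing query (hypothetical structure)."""
    column_labels: list[str]           # labels found in the result schema
    records: list[dict[str, str]]      # one dict per returned record


def schema_score(page: ResultPage, domain_attributes: list[str]) -> float:
    """Fraction of the domain's main attributes that appear as result labels."""
    if not domain_attributes:
        return 0.0
    labels = {label.strip().lower() for label in page.column_labels}
    matched = sum(1 for attr in domain_attributes if attr.lower() in labels)
    return matched / len(domain_attributes)


def content_score(page: ResultPage, sample_value: str) -> float:
    """Fraction of returned records that contain the submitted sample value."""
    if not page.records:
        return 0.0
    needle = sample_value.lower()
    hits = sum(
        1 for record in page.records
        if any(needle in value.lower() for value in record.values())
    )
    return hits / len(page.records)


def domain_relevance(pages: dict[str, ResultPage],
                     domain_attributes: list[str],
                     schema_weight: float = 0.5) -> float:
    """Average a weighted mix of schema and content scores over all probes.

    `pages` maps each submitted sample value to the page it returned.
    The 0.5 default weight is an arbitrary assumption, not a value from the paper.
    """
    if not pages:
        return 0.0
    total = 0.0
    for sample_value, page in pages.items():
        total += (schema_weight * schema_score(page, domain_attributes)
                  + (1.0 - schema_weight) * content_score(page, sample_value))
    return total / len(pages)
```

Under these assumptions, a Web database would be assigned to the domain with the highest relevance score, or rejected if no score passes a chosen threshold.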

     
