数据抽取

  • 网络: extract; ETL; extraction; data extraction; data pump
  1. Web的半结构化数据抽取的方法及其实现

    Methods and Implementation of Semi-Structured Data Extraction on the Web

  2. 根据所要抽取网页的特点,提出了一种基于网页结构和ontology领域知识的自动网页数据抽取。

    Based on the characteristics of the web pages to be extracted, we propose an approach to automatic web data extraction that relies on web page structure and ontology domain knowledge.

  3. 一种可行的Web数据抽取包装器的设计方法

    A practical design method for Web data extraction wrappers

  4. deepweb数据抽取和语义标注技术研究

    Research on Deep Web Data Extraction and Semantic Annotation Technology

  5. 数据抽取及语义分析在Web数据挖掘中的应用

    Application of data extraction and semantic analysis in Web data mining

  6. 基于网页结构的Web数据抽取方法研究

    Study of Web Data Extraction Based on Webpage Structure

  7. 基于表格特征的Web数据抽取方法

    A Web Data Extraction Method Based on Table Features

  8. deepweb数据抽取及集成技术研究

    Research on Deep Web Data Extraction and Integration Technology

  9. 面向结构的Web表格数据抽取系统

    Structure-Oriented Web Table Data Extraction System

  10. 基于结果模式的deepweb数据抽取

    Deep Web Data Extraction Based on Result Pattern

  11. Web数据抽取技术研究进展

    Research Progress in Web Data Extraction Technology

  12. 本体驱动的半结构化Web生物数据抽取

    Ontology-Driven Extraction of Semi-Structured Web Biological Data

  13. 基于XML的分布式异构数据抽取系统设计与实现

    Design and Implementation of XML-based Extraction System for Distributed Heterogeneous Data

  14. XML在Web数据抽取中的应用研究

    Research on the Application of XML in Web Data Extraction

  15. 基于XML面向Web的数据抽取技术研究

    Research on XML-Based Web Data Extraction Technology

  16. JavaXML与面向Web的智能数据抽取

    Java XML and Web-Oriented Intelligent Data Extraction

  17. Java与XML实现数据抽取

    Implementing Data Extraction with Java and XML

  18. 基于本体的Web数据抽取Wrapper研究与实现

    Research and Implementation of an Ontology-Based Wrapper for Web Data Extraction

  19. XML技术的出现,为解决基于Web的数据抽取提供了一个良好的机遇。

    The emergence of XML technology provides a good opportunity to solve the problem of Web-based data extraction.

  20. 文章分析了基于XML的Web数据抽取模型,详细论述了如何利用XML技术从Web页面中抽取数据。

    This paper analyzes an XML-based Web data extraction model and discusses in detail how to use XML technology to extract data from Web pages.

  21. 本文在分析了大量网站后提出一种针对相似网页的半自动化WEB数据抽取器。

    After analyzing a large number of websites, this paper proposes a semi-automatic Web data extractor for similar web pages.

  22. HTML表格数据抽取与集成

    Data Extraction and Integration in HTML Tables

  23. 课题将结合实际,将理论与现实需求相结合,提炼出Web数据抽取在商业银行客户风险监控中的实际意义和影响。

    Combining theory with real-world requirements, this research identifies the practical significance and impact of Web data extraction in customer risk monitoring at commercial banks.

  24. 填入值之后,单击“ReadLDAP”按钮来启动数据抽取。

    Once the values have been entered, click the 'Read LDAP' button to start the data extraction.

  25. 由于XML形式的病历文档存在对应的XML模式(Schema),从而为标准电子病历消息的转换和数据抽取提供了可能。

    Because medical record documents in XML form have a corresponding XML Schema, transformation of standard electronic medical record messages and data extraction become possible.

  26. 实验表明,该方法的抽取性能在查全率和F值方面优于其它的一些数据抽取方法。

    Experimental results on data sets showed that the approach with tree automata compared favorably with some other data extraction approaches in recall and F-score.

  27. 论文以隐马尔科夫模型(HMM)进行数据抽取中的若干关键问题进行研究。

    This paper studies several key problems in data extraction using the hidden Markov model (HMM).

  28. 首先研究提出了一个包括网页浏览导航、原始数据抽取、以及数据语义化集成三阶段完整Web信息抽取的过程模型,以及面向复杂应用处理的抽取集成数据模型。

    First, we study and propose a complete Web information extraction (WIE) process model consisting of three stages: Web page browsing and navigation, raw data extraction, and semantic data integration, as well as an extraction and integration data model for complex application processing.

  29. 同时,研究了Web资源质量元数据的量化方法,分析了数据抽取技术,采用基于正则表达式的数据抽取技术来获取质量元数据。

    At the same time, quantification methods for Web resource quality metadata were studied, data extraction technologies were analyzed, and a data extraction technique based on regular expressions was used to obtain the quality metadata.

  30. 数据抽取是指通过技术手段将Web页面上的数据抽取出来,保存为XML文档或关系模式,作为下一步处理的基础。

    Data extraction refers to extracting the data on Web pages by technical means and saving it as XML documents or in a relational schema, as the basis for the next processing step.
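
The definition in example 30 (take data from a Web page, save it as an XML document) can be illustrated with a minimal sketch. It is not taken from any of the works quoted above. It assumes the page holds a single well-formed HTML table whose header cells are valid XML element names. The names `TableCellParser`, `extract_to_xml`, and `SAMPLE_PAGE`, as well as the sample rows, are made up for illustration. Only the Python standard library is used.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class TableCellParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # cells of the row currently being parsed
        self._cell = None   # text fragments of the cell currently being parsed

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def extract_to_xml(html):
    """Extract table rows from an HTML string and return them as XML text."""
    parser = TableCellParser()
    parser.feed(html)
    parser.close()

    root = ET.Element("records")
    if parser.rows:
        # Assumption: the first row is the header and names the fields.
        header, *records = parser.rows
        for cells in records:
            record = ET.SubElement(root, "record")
            for name, value in zip(header, cells):
                ET.SubElement(record, name).text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample page; the rows are invented for this sketch.
SAMPLE_PAGE = """
<table>
  <tr><th>title</th><th>year</th></tr>
  <tr><td>Example page A</td><td>2001</td></tr>
  <tr><td>Example page B</td><td>2002</td></tr>
</table>
"""

if __name__ == "__main__":
    print(extract_to_xml(SAMPLE_PAGE))
```

The same row lists could instead be inserted into a relational table (the other target the definition mentions). XML was chosen here only to keep the sketch short.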