数据清洗

  • 网络Data cleaning;data cleansing;data clearing
数据清洗数据清洗
  1. Web数据清洗研究

    Research of the Web Data Cleaning

  2. 基于XML数据清洗的应用研究

    Study on Data Cleaning Based on XML and Its Application

  3. 通过开发Web文本数据清洗系统,重点研究和讨论了所涉及的Web文本清洗的关键技术。

    Through the development of the system , the key technologies involved in the system are discussed .

  4. 数据清洗及XML技术在数字报刊中的研究与应用

    Research and Application of Data Cleaning and XML Technologies Based on Digital Newspaper

  5. 分析了XML语言在数据清洗上的应用优势;

    Analyze the privilege of XML on data cleansing ;

  6. XML与数据清洗的研究

    The Research of Data Cleansing with XML

  7. 基于RFID应用的综合性数据清洗策略

    Integrated Data Cleaning Strategy Based on RFID Applications

  8. 基于伪事件的RFID数据清洗方法

    RFID Data Cleaning Method Based on Pseudo Event

  9. 一种基于Token匹配的中文数据清洗方法

    An approach for Chinese data cleaning based on token

  10. 一种ODS环境下的混合数据清洗策略

    A Combined Data Cleansing Strategy Under ODS Environment

  11. 阐述Web挖掘和推荐系统的一些基本概念和基础知识,对推荐系统工作流程中的数据清洗进行了研究,并对数据清洗模块进行了设计与实现。

    Described some basic concepts and basic knowledge of recommendation systems ; researched the date preprocessing of the recommended work flow in E-Commerce recommendation system , designed and realized the date preprocessing module . 2 .

  12. 基于虚拟空间粒度的RFID数据清洗方法bspace

    BSpace : A Data Cleaning Approach for RFID Data Streams Based on Virtual Spatial Granularity

  13. 通过具体的应用验证了数据清洗系统对数据的正确性、有效性、完整性与一致性都有良好的检测与控制能力,由此证明了基于多Agent的数据清洗系统的实用性。

    Through specific application , this thesis verifies that Data cleansing system has good detection and control capability at the accuracy , effectiveness , integrity and consistency of data , and verifies the practicability of data cleansing system based on multi-agent .

  14. 针对现有检测复制记录技术存在的不足,提出了采用Canopy聚类技术进行聚类复制记录的数据清洗方法,并通过实验结果验证了所提算法的有效性和准确性。

    After analyzing problems of existing techniques for duplicate records detection , this paper proposes an approach of data cleaning , by using the Canopy clustering technique to cluster duplicate records . Experiment results show effectiveness and accuracy of these algorithms .

  15. 在给出ETL过程中数据清洗模型的基础上,针对已知和未知的错误类型,以及语义上的错误,提出了一种自动清洗和人为清洗相混合的数据清洗策略,具有较好的现实意义。

    After discussing the data cleansing model in ETL , and to solve the known or unknown error and semantic error , this paper proposes a data cleansing strategy of combination of automatic and manual methods that has a better realism significance .

  16. 借助于粗糙属性向量树(RAVT)的巧妙构造,提出了两种能同时完成属性约简、数据清洗和规则提取的快速递推矩阵算法(RMC)和分布式并行矩阵算法(PMC)。

    Based on a Rough Attribute Vector Tree ( RAVT ), two kinds of fast matrix computation algorithms & Recursive Matrix Computation ( RMC ) method and Parallel Matrix Computation ( PMC ) method are proposed for data cleaning and rules extraction finished synchronously in rough information system .

  17. 实际的开发案例证明:使用DCPM模型建模数据清洗流程并基于C+ADC框架进行数据清洗应用开发,能够快速地构建基于构件的灵活的、可扩展的数据清洗应用软件。

    A practical development case has proven that development of data cleansing application based on DCPM ( Data Cleansing Process Model ) and C + ADC ( Component-extended Agile Data Cleaning ), can construct quickly a flexible and extendable component-based data cleansing application software .

  18. 垂直搜索中的数据清洗和排序算法研究

    Research on Data Cleaning and Ranking Algorithm in Vertical Search Engine

  19. 一种基于聚类树的增量式数据清洗算法

    An incremental algorithms of data cleansing based on clustering tree

  20. 该文提出并实现了一个可扩展的数据清洗框架。

    This paper presents an open and extensible framework for data cleaning .

  21. 交通流数据清洗的关键理论及方法研究

    Study on Key Theory and Methods for Data Cleaning of Traffic Flow

  22. 数据清洗方法与构件的综合技术研究

    An integrated technology of method and component for data cleaning

  23. 定量专利分析的样本选取与数据清洗

    Sample selection and data cleansing for quantitative analysis of patents

  24. 系统提供了方便、易用的可视化的数据清洗流程定义环境。

    The system provides a visual environment to define the data cleaning workflow .

  25. 数据清洗技术在期刊元数据整合中的应用

    The Application of Data Cleaning in Periodical Metadata Integration

  26. 基于软件总线模型的数据清洗系统的研究与实现

    Research and Implementation of the Data Clean System Based on Software Bus Model

  27. 基于聚类分析技术的数据清洗研究

    Improved Algorithms for Data Cleansing Based on Clustering Analysis

  28. 基于聚类模式的数据清洗技术

    Towards Data-Mining : Data Cleaning Based on Clustering Techniques

  29. 然后总结了数据清洗技术的原理方法。

    Second , summarize the principle and the method of data cleansing techniques .

  30. 因此,必须进行数据清洗来提高信息系统的数据质量。

    So , data cleaning is vital to improve data quality of information system .