结构化文档
- 网络Structured document
-
作者把Word域和数据库结合,运用VBA编程,把数据库中的数据插入到文档中,解决了结构化文档的管理问题。
In this paper , we solve the problem of structured document management by inserting the data of database into the document with VBA and WORD TextFields .
-
一种基于内容权值的结构化文档检索方法
Method to Query Structured Document Based on Content Weight
-
如果完全忽略结构的话,应用于诸如HTML或XML之类半结构化文档的文本分析提供的见解很有限。
Textual analysis applied to semi-structured documents like HTML or XML provides limited insights if it completely ignores structure .
-
文档模型在一个文件中定义,它们指定对结构化文档(例如XML格式的文档)进行解析和建立索引的模型。
The document model , defined in a file , specifies a model for parsing and indexing structured documents of format XML , for example .
-
为了提高大规模半结构化文档集的聚类质量,提出了一种新的XML文档聚类方法。
To improve the clustering quality of massive extensible markup language ( XML ) document collections , this paper proposes a novel XML document clustering method .
-
可以使用NetSearchExtender文档模型配置结构化文档(比如XML)中要搜索的范围。
You can use a Net Search Extender document model to configure the scope of search within structured documents , such as XML .
-
IBMOptimDataRedaction是能够高效保护非结构化文档和表格中敏感信息的解决方案。
IBM InfoSphere Guardium Data Redaction is a solution to efficiently protect sensitive information in unstructured documents and forms .
-
提供了对不同数据源的支持,例如基于文件的半结构化文档、关系和层次数据库、Web和目录服务,以及应用服务器和事务服务器。
Support is provided for a diverse set of data sources , like file based semi-structured documents , relational or hierarchical databases , Web and directory services , and application and transaction servers .
-
基本文本搜索(BasicTextSearch)DataBlade模块允许在存储在表列中的非结构化文档库中搜索词和短语。
The Basic Text Search DataBlade module allows you to search words and phrases in an unstructured document repository stored in a column of a table .
-
数据增长的另一个趋势是非结构化文档在数据中的比重日益增大,而对非结构化文档的检索缺乏像数据库检索的SQL语言这样简单的工具。
Another trend is unstructured document data has a increasing proportion of all data , but we lack simple tool likes SQL that for database to retrieval unstructured documents .
-
通过分析WEB的特点,采用一种Fabric的索引技术,加强了搜索引擎对结构化文档特别是可以转化为XML文档的HTML页面的搜索能力。
By analyzing the WEB decided to use the Fabric as indexing XML document method , Enhanced the ability of search engine that searching for structure documents and especially for the HTML which can change to XML .
-
RDFa是将资源描述框架(RDF)模型编码到结构化文档中的标准方法。
RDFa is a standard way of encoding the Resource Description Framework ( RDF ) model into structured documents .
-
基于IHE技术框架设计思想和结构化文档技术,设计了Full-PACS中的报告系统,实现了诊断报告的跨科室共享和结构化诊断报告生成。
Design Full-PACS reporting system based on IHE and structured file techniques .
-
由于传统的财务报告都是PDF、DOC、HTML等格式的非结构化文档,甚至是纸介质的打印版,难于查询,更难于进行数据分析,无法获得信息使用者所需求的信息。
The traditional finical report is non-structural document such as PDF , DOC or HTML , even is printed with paper , which is very difficult to finding and analyzing data .
-
可扩展的标记语言(XML)的出观是Web发展的必然结果,它最初是为了解决HTML在结构化文档描述上的缺陷作为SGML的精简的子集在Web上来使用的。
XML ( extensible Markup Language ) is the answer . XML began as a project to address HTML 's limitations on structured documents , by selecting a simple to implement yet extensible subset of SGML for use on the Web .
-
分析了Web文档的特点,指出其主要形式HTML文档是一种结构化文档,结构由标签显式地定义,不同文档结构对检索性能的贡献不同。
Characters of Web documents are studied , the fact is most of them are HTML documents , a kind of structured documents . Its structure is defined explicitly by predefined HTML tags , which has different importance and influence on the performance of search engine .
-
本文介绍了一个协同编辑环境下结构化文档版本管理系统的设计过程,并在一个真实的协同编辑器Z-Office上完整地实现了该系统。
This paper introduces a design of structural document version management system which implements in a real cooperative editor ( Z-Office ) .
-
半结构化文档数据流的快速频繁模式挖掘
Fast mining frequent patterns in semi-structured data stream
-
半结构化文档中语义信息抽取方法的研究
Research on Semantic Information Extraction for Semi-structured Documents
-
基于语义的半结构化文档检索
Semantic Based Information Retrieval from Semi-structured Documents
-
半结构化文档中非标记化表格的抽取
Untagged Table Extraction in Semi-structured Documents
-
搜索结构化文档或附加列中的数值范围
Search on numeric ranges , which could be either in structured documents or within additional columns
-
但是,关系数据库的弱势在于存储半结构化文档方面。
However , a weakness of relational databases is in storing documents with a semi-structured nature .
-
本发明涉及结构化文档管理设备、搜索设备、存储和搜索方法及程序。
The present invention concerns a structured-document management apparatus , retrieval apparatus , storage and retrieval method and procedures .
-
从文本或半结构化文档中自动地抽取用户关心的内容信息且表示成计算机能理解的形式是一项极具实用价值的挑战性研究。
Automatic extraction of content information from text or semi - structured documents is a demanding and challenging technology .
-
半结构化文档的逻辑结构自动发现可以改善文档的浏览方式,提高文档内容构件的复用性,有效克服了半结构化文档难于利用的弱点。
Automatic identification of logical structures in semi-structured documents enables reading by browsing and the reuse of content components .
-
结构化文档由标题、章节、段落等逻辑结构组成。
Structured documents consist of a few logical components , such as title , sections , subsections and paragraphs .
-
企业合规部门可以创建和部署结构化文档模板,以在文档中实施品牌认知或其他合规原则。
A corporate compliance department can create and deploy structured document templates that enforce branding or other compliance guidelines in documents .
-
主要任务就是从底层的结构化文档集合中找到用户需要的最合适的答案。
The main task of a question-answer system is to locate the most matching answer from the underlying structured document collection .
-
这些非结构化文档信息的集成就成了信息整合的关键问题,成为了信息资源管理的核心。
These unstructured document information integration has become the key issues of information integration has become the core of the information resource management .