One of the difficulties in building an SQL-like query lange for the Web is the absence of a database schema for this huge, heter

admin2009-02-15 75

问题 One of the difficulties in building an SQL-like query lange for the Web is the absence of a database schema for this huge, heterogeneous repository of information. However, if we are interested in HTML documents only, we can construct a virtual(66)from the implicit structure of these files. Thus, at the highest level of(67), every such document is identified by its Uniform Resource Locator(URL), has a title and a text Also, Web servers provide some additional information such as the type, length, and the last modification date of a document. So, for data mining purposes, we can consider the site of all HTML documents as arelation:
Document(url,(68), text, type, length, modify)
Where all the(69)are character strings. In this framework, anindividual document is identified with a(70)in this relation. Of course, if some optional information is missing from the HTML document, the associate fields will de left blank, but this is not uncommon in any database.

选项 A、field
B、relation
C、script
D、tuple

答案D

解析在万维网上建立一个类似于SQL的查询语言的困难之一是缺乏一种适用于这种巨大的、异构型信息仓库的数据库模式。然而，如果仅限于HTML文档，我们就可以由这种文件的隐含结构建立一种虚拟模式。这样，在最高抽象级别，每个文档都可以由统一资源定位器(URL)来标识，有一个标题和一个文本。同时，由Web服务器了来提供某些附加的信息，例如，类型、长度和文档的最后修改日期。这样，对于数据挖掘应用来说，我们可以把所有HTML文档的集合看做一个关系：
Document (ur1，title，text，type，length，modify)
这里，所有的属性都是字符串。在这种框架下，一个单独的文档可以用这种关系的一个元组来标识。当然，如果某些任选信息在HTML文档中缺失，有关字段就留做空白，但这种情况在任何数据库中都是常见的。

转载请注明原文地址:https://kaotiyun.com/show/WIJZ777K

本试题收录于：网络工程师上午基础知识考试题库软考中级分类

网络工程师上午基础知识考试

软考中级

相关试题推荐

随机试题

最新回复(0)

One of the difficulties in building an SQL-like query lange for the Web is the absence of a database schema for this huge, heter

网络工程师上午基础知识考试

软考中级

()不是常用的缩短项目工期的方法。

在项目实施过程中，客户提出新的功能需求时，正确的做法是()。

项目管理过程中，()不完全属于监控过程组。

信息技术服务标准(ITSS)的IT服务生命周期模型中，()是在规划设计基础上，依据ITSS建立管理体系、提供服务解决方案。

GB/T16260—1996给出的质量特性中，不包括()。

在物联网的关键技术中，射频识别(RFID)是一种_________________。

在OSI参考模型中，数据链路层处理的数据单位是(22)。

30.以下关于数据仓库描述中，正确的是______。

Data　mining　is　an(66)research　field　in　database　and　artificial　intelligence.　In　this　paper,　the　data　mining　techniques　are　intro

Manyoftheworld’spollutionproblemshavebeencausedbythecrowdingoflargegroupsofpeopleintothecities.Tosupplyfor

对干眼症的诊断没有帮助的是

以下不是下颌骨薄弱部位的结构是()

选择不确定因素变化的百分率时，习惯上取()。

货币市场主要包括()。

【2013年下】《国家中长期教育改革和发展规划纲要(2010～2020年)》提出，对中小学教师实行()。

军队对于()相当于()对于人才

儿童以具体形象思维为主，逐步过渡到以抽象逻辑思维为主的关键年龄大约在()。

A、Useofbicycleamongalargepopulation.B、Developmentoftransportinfrastructure.C、Greenwayscrossingacityinalldirecti

One of the difficulties in building an SQL-like query lange for the Web is the absence of a database schema for this huge, heter

网络工程师上午基础知识考试

软考中级

()不是常用的缩短项目工期的方法。

在项目实施过程中，客户提出新的功能需求时，正确的做法是()。

项目管理过程中，()不完全属于监控过程组。

信息技术服务标准(ITSS)的IT服务生命周期模型中，()是在规划设计基础上，依据ITSS建立管理体系、提供服务解决方案。

GB/T16260—1996给出的质量特性中，不包括()。

在物联网的关键技术中，射频识别(RFID)是一种_________________。

在OSI参考模型中，数据链路层处理的数据单位是(22)。

30.以下关于数据仓库描述中，正确的是______。

Data mining is an(66)research field in database and artificial intelligence. In this paper, the data mining techniques are intro

Manyoftheworld’spollutionproblemshavebeencausedbythecrowdingoflargegroupsofpeopleintothecities.Tosupplyfor

对干眼症的诊断没有帮助的是

以下不是下颌骨薄弱部位的结构是()

选择不确定因素变化的百分率时，习惯上取()。

货币市场主要包括()。

【2013年下】《国家中长期教育改革和发展规划纲要(2010～2020年)》提出，对中小学教师实行()。

军队对于()相当于()对于人才

儿童以具体形象思维为主，逐步过渡到以抽象逻辑思维为主的关键年龄大约在()。

A、Useofbicycleamongalargepopulation.B、Developmentoftransportinfrastructure.C、Greenwayscrossingacityinalldirecti

Data　mining　is　an(66)research　field　in　database　and　artificial　intelligence.　In　this　paper,　the　data　mining　techniques　are　intro