Data conversion, extraction and record linkage using XML and RDF
[Abstract] SIMILE is a joint project between MIT Libraries, the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL), HP Labs and the World Wide Web Consortium (W3C). It investigates the application of Semantic Web tools, such as the Resource Description Framework (RDF), to the problem of dealing with heterogeneous metadata. This report describes how XML and RDF tools are used to perform data conversion, extraction and record linkage on sample datasets featuring visual images (ARTstor) and learning objects (OpenCourseWare) in the first SIMILE proof-of-concept demo. 18 pages.
[Publication date] [Publisher] HP Development Company
[Validity level] [Subject classification] Computer Science (General)
[Keywords] SIMILE; RDF; Semantic Web; thesauri; data conversion; extraction; screen scraping; record linkage [Timeliness]