loading...
SumatraTT: a Generic Data Pre-processing System
Prague, Czech Republic September 01-September 05
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/DEXA.2003.123201014th International Workshop on Databa ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Petr Aubrecht, Czech Technical University
Petr Miksovsk?, Czech Technical University
Lubos Kr?, Czech Technical University
A systematic process of indexing cultural heritage artefacts began well before the era of computers. The first step of digitising such archives of hand- and typewriter-written data was naturally focused on transfer of these files into a digital form - either by means of re-typing the original data manually or by applying OCR methods on scanned documents. As a result, there exist huge digital archives of data and metadata in Europe, which describe millions of artifacts kept by thousands of galleries, museums, and/or private collections. To explore such archives (incl. data mining methods), the data need to be converted into a unified format and data model. Moreover, the original indexing methodologies may also vary significantly. Thus, even conversion to a unified metadata (ontology) model is needed.
Any data transformation is a tedious task, which usually requires to design, implement, and test number of scripts, which will be executed in order to transform the data sets. To simplify such data transformation processes, a generic data transformation system called SumatraTT has been developed at the Gerstner laboratory of the Czech Technical University in Prague. The system has been verified on a number of applications, mostly as a data pre-processing system in the process of data mining.
Currently, the goals of the CIPHER project opened new research directions aimed at investigating the ontology transformation and unification problems using SumatraTT.
Citation:
Petr Aubrecht, Petr Miksovsk?, Lubos Kr?, "SumatraTT: a Generic Data Pre-processing System," dexa, pp.120, 14th International Workshop on Database and Expert Systems Applications (DEXA'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.