Jirka Kosek, University of Economics, Prague, Czech Republic
Jiří Bráza, University of Economics, Prague, Czech Republic
The Rainbow project aims to develop a reusable, modular architecture for web analysis, in particular website analysis. Individual knowledge-based modules separately analyse different types of web data and communicate their results via a web-service interface. The output of the analysis takes the form of classes (of web resources) predefined in an ontology, extracted text, and/or addresses of retrieved web resources. Within the project, several original methods of analysis, as well as of (analytic) knowledge acquisition, have been developed. The current domains of investigation are sites of small organisations offering products or services, and pornography sites. The paper is the first systematic overview of the diverse methods developed or envisaged in Rainbow.
Citation:
Vojtěch Svátek, Jirka Kosek, Martin Labský, Jiří Bráza, Martin Kavalec, Miroslav Vacura, Vladimír Vávra, Václav Snášel, "Rainbow - Multiway Semantic Analysis of Websites," dexa, pp. 635, 14th International Workshop on Database and Expert Systems Applications (DEXA'03), 2003