A Supervised Visual Wrapper Generator for Web-Data Extraction
Dallas, Texas November 03-November 06
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CMPSAC.2003.1245412
27th Annual International Computer So ...
Xiaofeng Meng, Renmin University of China, China
Haiyan Wang, Renmin University of China, China
Dongdong Hu, Renmin University of China, China
Chen Li, University of California, Irvine
Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interest. In this paper, we propose a novel schema-guided approach to wrapper generation. We provide a user-friendly interface that allows users to define the schema of the data to be extracted and to specify mappings from an HTML page to the target schema. Based on the mappings, the system can automatically generate an extraction rule to extract data from the page. Our approach to wrapper generation can significantly reduce the human effort involved in this process. Users never have to deal with the internal extraction rule, or even be familiar with the details of HTML.
Citation:
Xiaofeng Meng, Haiyan Wang, Dongdong Hu, Chen Li, "A Supervised Visual Wrapper Generator for Web-Data Extraction," compsac, pp.657, 27th Annual International Computer Software and Applications Conference, 2003