Web IR and IE 2005
(under construction)
Part I -- Web Information Extraction
Part II -- Traditional Information Retrieval
Part III -- Searching on the Web
Part IV -- Web Mining
Reading Assignment
- Nicholas Kushmerick, Daniel S. Weld, Robert B. Doorenbos: Wrapper
Induction for Information Extraction. IJCAI 1997, 729-737
- Chun-Nan Hsu, Ming-Tzung Dung:
Generating Finite-State Transducers for Semi-Structured Data Extraction from
the Web. Information System 1998, 521-538
- Ion Muslea, Steven Minton, Craig A. Knoblock:
A Hierarchical Approach to Wrapper Induction. Agents 1999,190-197
- Chia-Hui Chang, Shao-Chen Lui.
IEPAD: information extraction based on pattern discovery. WWW 2001, 681-688
- Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
RoadRunner: Towards Automatic Data Extraction from Large Web Sites.
VLDB 2001, 109-118
- Arvind Arasu, Hector Garcia-Molina:
Extracting Structured Data from Web Pages. SIGMOD 2003, 337-348
Please check here fore new announcement
Use your student ID number as login account.