INFORMATION RETRIEVAL AND EXTRACTION
Course Schedule 1999


EXAMS & GRADING


COURSE CONTENT

  1. Introduction to Information Retrieval and Extraction -- [slide]
  2. Conventional Information Retrieval Systems (Salton, Chapter 8) -- [slide]
  3. Automatic Indexing (Salton, Chapter 9) -- [slide]
  4. Information-Retrieval Models (Baeza-Yates, 1999, Chapter 2) -- [Slide]
  5. Retrieval Performance Evaluation (Baeza-Yates, 1999, Chapter 3) -- [slide]
* Acknowledgement: Special thanks to Prof. Chen, Hsin-Hsi for providing the course material.

PAPER PRESENTATION

The students are asked to read IR-related papers and prepare 20-30 minutes presentation on the class.

Presentation Guideline

Presentation Date: Dec. 8 - Dec. 23, 1999

Note:
Please mail me the paper you want to present as soon as you make the decision, including paper title, authors, year, origin, URL, etc.
Please also hand in one copy of the paper as well as the slides you make for the presentation on the day you present the paper.


PROJECT -- A Virtual IR system

In this project, you don't realy have to implement your Web search engine. Instead, you are asked to make a proposal of an IR system. You should describe as detail as possible in your proposal where your IR system is applied, how it is designed, the features of your design, etc. In particular, your proposal should include the following sections:
  1. Describe where your IR system is applied.
  2. Your IR system can be applied for example on the World Wide Web, a news center, or a bibliography collection, etc.
  3. Details how your Web search engine works.
  4. You can design your IR system from draft, using inverted file indexing your corpus and some model to rank documents, or you can design a multi-engine search engine assuming there's already some IR systems providing ranked documents for furthur processing. Be sure that at least one feature is described clearly such that it makes your IR system distinguished.
  5. Outline the features of your design.
  6. Depends on your design, describe how your text or query are processed, or what techniques are applied to your initial query result.
  7. What do you expect about your design?
  8. With your special design, how much improvement you expect to obtain, or what benefits the user can take from your design?
  9. What experiments do you need to validate your claims in section 4?
  10. Draw the figures or tables as well.
  11. What problems you might encounter?
  12. Challenge yourself about your design to adjust your system. Prepare as many problems as possible.
Note: The proposal should be printed using in 12pt font, single line spacing, and should not exceed 15 pages. Please also prepare 15-minute slides to present your work.

Proposal Due Date: Dec. 29, 1999

Presentation Date: Dec. 29, 1999 - Jan. 12, 2000