News

  • I am currently not offering this course. Please check my.smu.edu

Course Description

Introduces the field of information retrieval with an emphasis on its application in Web search. Introduces the basic concepts of stemming, tokenizing and inverted indices, text similarity metrics and the vector-space model. Popular Web search engines are studied and the concepts are applied in several Java-based projects using state-of-the-art frameworks like MapReduce. Web search frameworks and indexing tools like Apache Nutch and Lucene will also be reviewed.

Text

  • Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008. (Online Edition)

Slides

Assignments

Project

Tools for distance students

  • CamStudio: Free Screen recording software for Windows.