CS-463, Information Retrieval Systems
Instructor: |
Yannis Tzitzikas
(email: tzitzik "at" csd.uoc.gr)
Office: G 111, Tel: 2810 393 523 (ICS-FORTH Tel: 391633). Office hours: Tuesday 11:00-13:00 and Thursday 15:00-16:00, or by appointment. |
Teaching Assistant(s): | Stavros Saxtouris, email: saxtouri AT csd.uoc.gr; Akkus Zebide, email: zebide AT csd.uoc.gr; Periklis Tziabas, email: ptziavas AT csd.uoc.gr |
Course Hours: | Lectures: Tuesday 9:00-11:00 (RA 201) and Thursday 13:00-15:00 (RA 201). Tutorials: Monday 15:00-17:00 (L206) |
Course Syllabus: | English version, Greek version. |
Course Attendance: | Attendance at lectures is expected but not required. |
Credits: | 4 |
Grading: |
total = (0.2 * homework) + (0.3 * project) + (0.2 * midterm) + (0.3 * final)
To pass the course you need (total > 5) AND (final > 5) |
Prerequisite: | CS-240 - Data Structures |
Course Enrollment: | To enroll in the course mailing list, everyone must send an email to majordomo@csd.uch.gr with the content "subscribe hy463-list" before the lectures begin. |
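The grading rule above can be checked with a few lines of Python. This is only an illustrative sketch: the function names `course_total` and `passes` are made up for this example, and it assumes all four components are graded on the same scale (for instance 0-10), which this page does not state explicitly.

```python
def course_total(homework, project, midterm, final):
    """Weighted total: 0.2*homework + 0.3*project + 0.2*midterm + 0.3*final."""
    return 0.2 * homework + 0.3 * project + 0.2 * midterm + 0.3 * final


def passes(homework, project, midterm, final):
    """Pass condition from the grading rule: (total > 5) AND (final > 5)."""
    total = course_total(homework, project, midterm, final)
    return total > 5 and final > 5


# Hypothetical grades, assuming a common 0-10 scale for every component.
print(passes(6, 7, 5, 6))     # True: the total is about 6.1 and the final is above 5
print(passes(10, 10, 10, 5))  # False: the total is high, but the final is not above 5
```

Note that both comparisons are strict, so a final exam grade of exactly 5 does not satisfy the pass condition regardless of the other components.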
No. | Date | Lecture topics and slides | Reading, handouts and comments |
1 | Feb 22 | Administrative information, Introduction to IR (Motivation, Information Retrieval vs Data Retrieval, Retrieval vs Browsing, Flashback) (Slides L0) (Slides L1) |
Reading: [1] Ch. 1, Sec. 2.10 |
2 | Feb 24 | Evaluation of IR systems (I) (Recall and Precision, Alternative Measures, Reference Collections and Evaluation of Systems) (Slides L2) |
Reading: [1] Ch. 3 |
3 | Mar 1 | Models of Information Retrieval (I) (Boolean Model, Vector Space Model, Probabilistic Model) (Slides L3) |
Reading: [1] Ch. 2
Assignment 1 |
4 | Mar 3 | Models of Information Retrieval (II) (Fuzzy Model, Extended Boolean Model, Latent Semantic Indexing, Neural Network Model) (Slides L4) |
Reading: [1] Ch. 2 |
5 | Mar 8 | Models of Information Retrieval (III) (Inference Network Model, Belief Network Model), Query Languages for IR (Keywords, Boolean Queries, Context Queries, Natural Language Queries, Pattern Matching) (Slides L5), (Slides L5b) |
Reading: [1] Ch. 2, Ch. 4 |
6 | Mar 10 |
Advanced Query Operations (Relevance Feedback, Query Expansion, Automatic Local Analysis, Thesauri, Automatic Global Analysis) |
Reading: [1] Ch. 5
Assignment 2 |
7 | Mar 15 |
Advanced Query Operations (II), Document Clustering (introduction) (Slides L67) |
Reading: [1] Ch. 7.3 |
8 | Mar 17 |
Document Clustering (II) (Slides L8) |
|
9 | Mar 22 | Text Preprocessing, Indexing and File Organization (stopwords, stemming, thesauri, inverted files, suffix trees, signature files) (Slides L9) |
Reading: [1] Sec. 7.2 (stopwords), Sec. 8.1-8.4 (text indexing and file organization)
Assignment 3 |
10 | Mar 24 | Mid-term Exam | Contents of lectures 1-8 |
11 | Mar 29 | Text statistics and compression |
Reading: [1] Ch. 6 (text statistics), Sec. 7.4-7.5, 8.8 (text compression)
Assignment 4 |
12 | Mar 31 | Text Searching (Knuth-Morris-Pratt, Boyer-Moore family, Suffix automaton, Phrases and Proximity) |
Reading: [1] Ch. 8.5, 8.6 |
13 | Apr 5 | Structured Text Retrieval Models, XML Indexing and Searching | Reading: [1] Ch. 4, Sec. 2.9 (Structured Text Retrieval Models) |
14 | Apr 7 | Multimedia Information Retrieval (Similarity Queries, Feature-based Indexing and Searching, Spatial Access Methods, Searching in Multidimensional Spaces) |
Reading: [1] Ch. 12 (see articles)
Project |
15 | Apr 12 | Parallel and Distributed IR (MIMD and SIMD Architectures, Collection Partitioning, Source Selection, Query Processing) |
Reading: [1] Ch. 9 |
16 | Apr 14 | Parallel and Distributed IR (II) (Peer-to-Peer Architectures and Systems) |
Reading: [1] Ch. 9 + articles |
17 | Apr 19 | Meta-Ranking (Integrated vs Isolated Methods) |
Reading: [1] Ch. |
18 | Apr 21 | Web Search (I) (History of Web, Indexing, Spidering/Crawling, Link Analysis (HITS, PageRank)) |
Reading: article ACM TOIT 1(1) |
Apr 23 - May 8 | Easter Holidays |
19 | May 10 | Web Search (II) | Reading: article VLDB'04 |
20 | May 12 | Web Search (III) | Reading: [1] Ch. |
21 | May 17 | User Interfaces, Visualization and Interaction Models | Reading: [1] Ch. 10 |
22 | May 19 | User Profiles, Multiple Reference Point Systems, Case Studies |
Reading: [4] Ch. 6, 7 |
23 | May 24 | Current Trends in IR Research | Reading: [1] Ch. |
24 | May 26 | Retrospection, Discussion, Course Evaluation | Reading: [1] Ch. |
25 | May 30 | Demonstration of Projects | |
June X | Final Exam |
Assignment | Date due | Additional information |
Assignment 1 | Mar 1 until Mar 8 |
Subject: Retrieval Evaluation |
Assignment 2 | Mar 10 until Mar 17 |
Subject: Retrieval Models |
Assignment 3 | Mar 14-18 until Mar 21-25 |
Subject: Text file organization |
Assignment 4 | Mar 21-25 until Mar 28-Apr 1 |
Subject: Text properties |
Project | Apr 1 until May 15 |
Subject: Implementation of an Information Retrieval System with pseudo relevance feedback |