Content of the course:
This course will cover the following topics:
-
Information Retrieval and Web Search:
Basic Concepts of Information Retrieval
Information Retrieval Models
Relevance Feedback
Evaluation Measures
Text and Web Page Pre-Processing
Inverted Index and Its Compression
Latent Semantic Indexing
Web Search
Meta-Search: Combining Multiple Rankings -
Web Crawling:
A Basic Crawler Algorithm
Implementation Issues
Universal Crawlers
Focused Crawlers
Topical Crawlers -
Structured Data Extraction:
Wrapper Induction
Instance-Based Wrapper Learning
Automatic Wrapper Generation
String Matching and Tree Matching
Multiple Alignment
Building DOM Trees
Extraction Based on a Single List Page or Multiple Pages -
Information Integration:
Schema-Level Matching
Domain and Instance-Level Matching
Combining Similarities
1:m Match
Integration of Web Query Interfaces
Constructing a Unified Global Query Interface -
Opinion Mining and Sentiment Analysis:
Document Sentiment Classification
Sentence Subjectivity and Sentiment Classification
Opinion Lexicon Expansion
Aspect-Based Opinion Mining
Opinion Search and Retrieval