Data mining and Recommender Systems

Course ID
CSIS1-3
Direction
1st, 2nd
Semester
Winter
Type
1rd direction elective, 2nd direction mandatory

Learning Outcomes

Proficiency in Data Mining Techniques:
- Understanding Algorithms: Mastery of various data mining algorithms for clustering, association rule mining, anomaly detection, etc.
- Application Skills: Ability to apply these algorithms to different datasets and scenarios effectively.
Expertise in Recommender Systems:
- Comprehensive Knowledge: Understanding different types of recommender systems and their implementation.
- Hands-on Experience: Practical experience in building and evaluating recommendation models.
Data Handling and Analysis Skills:
- Data Preprocessing Mastery: Proficiency in data cleaning, transformation, and preparation for analysis.
- Dimensionality Reduction Techniques: Understanding and applying methods like PCA/SVD for efficient data representation.
- Feature Selection: Ability to select relevant features for analysis and model building.
- Anomaly Detection: Capability to identify outliers and anomalies within datasets.
Time Series Analysis Competency:
- Understanding Time Series Data: Proficiency in analyzing time-dependent data using techniques like ARMA/ARIMA.
- Forecasting Skills: Ability to make predictions and forecasts based on time series analysis
User Profiling and Content Filtering Skills:
- User Profiling: Understanding user behavior and creating profiles for personalized services or recommendations.
- Collaborative Filtering: Ability to utilize collaborative filtering techniques, including matrix/tensor factorization.
- Content Filtering: Understanding and implementing content-based recommendation methods.
Practical Application and Research:
- Hands-on Project Work: Experience in implementing learned concepts in real-world datasets.
- Research Skills: Ability to explore advanced topics and potentially contribute to research in the field.
Critical Thinking and Problem-Solving:
- Analytical Skills: Enhanced ability to analyze complex datasets and derive meaningful insights.
- Problem-Solving Approach: Proficiency in addressing data-related challenges using appropriate methods and tools.

Course Content

Classification
Clustering
Data preprocessing
Dimensionality reduction
PCA / SVD
Feature selection
Anomaly detection

Timeseries analysis (ARMA / ARIMA)

Association rule mining
User profiling
Content filtering
Collaborative filtering, incl. Matrix / tensor factorization

General Skills

Search, analysis and synthesis of data and information with the use of the assorted technologies

Independent work

Team work

Promoting free, creative and deductive reasoning

Learning and Teaching Methods - Evaluation

Teaching methods: On site 

Use of Information and Communication Technologies: η-τάξη, παρουσιάσεις, παραδείγματα κώδικα

Activity Work load
Semester
Lectures 20
Lab exercises 6
Thesis 60
Independent Study 64
Total 150

Assessment

Group project with presentation and final exam

Literature

Carlo Vercellis. Business Intelligence: Data Mining and Optimization for Decision Making Wiley. 2009
Charu Aggarwal, ChengXiang Zhai Mining Text Data, Springer 2012.
Bing Liu, Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data Data-Centric Systems and Applications, Springer 2008.
Ian Witten, Eibe Frank, Data Mining, Practical machine learning tools and techniques Elsevier, Morgan Kaufmann, 2005
Rob Sullivan, Introduction to Data Mining for the Life Science. Springer 2012.
Robert Stackowiak, Joseph Rayman, Rick Greenwald. Oracle Data Warehousing and Business Intelligence Solutions, Wiley, 2007
Jiawei Han, Micheline Kamber, Data Mining: Concepts and Techniques, Elsevier, Morgan Kaufmann, 2006