Learning Outcomes
The aim of this course is to help students get to know and learn a variety of basic statistical tools, useful for for Data Science. Students will learn how to convert raw data into descriptive summaries that can be easily visualized and understood. In addition, it will introduce students to the fundamental concepts of Statistical Inference, such as parameter estimation and Hypothesis Control, as well as multivariate statistical tools useful in Business Analytics, such as Regression Analysis, Factor Analysis and Analytical Analysis. To implement all of the above, the R language will be used, so that students become familiar with the specific software and can perform any data analysis.
Course Content
Advanced Topis in Probabity Theory (Stochastic Processes, Queuing Theory). Theory of Point Estimation and Statistical Inference. Simple and Multiple Linear Regression. General Linear Models (Logistic Regression). Multivariate Statistical Analysis- Dimension Reduction (Principal Components Analysis, Factor Analysis)-Cluster Analysis (Hierarchical and k-means). Methods of Data Visualization. R Language.