Statistics for Data Science (Wintersemester 2024/25)
News
- The fifth exercise sheet has been published.
Dates
Lectures | Mon 10:15-11:45 | A6/032 | Dr. Vesa Kaarnioja |
Exercises | Tue 10:15-11:45 | A7/031 | Dr. Vesa Kaarnioja |
Course exam | Mon February 10, 2025 10:00-12:00 |
A6/032 | |
Make-up exam | Mon February 24, 2025 10:00-12:00 |
A6/032 |
General Information
Description
This course serves as an introduction to foundational aspects of modern statistical data analysis. Frequentist and Bayesian inference are presented from the perspective of probabilistic modeling. The course will consist of three main parts:
- Probability foundations: probability spaces, random variables, distribution of a random variable, expectation and covariance, important limit theorems and inequalities.
- Frequentist inference: point estimators, confidence intervals, hypothesis testing.
- Bayesian inference: conjugate inference, numerical models, data assimilation.
Prerequisites
Basic set theory (inclusion, union, intersection, difference of sets), basic analysis (infinite series, calculus), matrix algebra, some knowledge of probabilistic foundations (discrete probability, Gaussian distributions) is helpful.
Completing the course
The conditions for completing this course are (1) successfully completing at least 60% of the course's exercises, and (2) successfully passing the course exam.
Registration
Please register to the course via Campus Management (CM), then you will be automatically registered in MyCampus/Whiteboard as well. Please note the deadlines indicated there. For further information and in case of any problems, please consult the Campus Management's Help for Students.
Lecture notes
Lecture notes will be published here after each week's lecture.
- Week 1: Introduction, probability space, conditional probability, independence of events
- Week 2: Random variables
- Week 3: Joint distributions, independent random variables, conditional distribution, transformations of random variables
- Week 4: Expected value and covariance
- Week 5: Inequalities and limits
- Week 6: Introduction to statistical inference, descriptive statistics, confidence interval
Exercise sheets
Weekly exercises will be published here after each lecture.
- Assignment Submission Guidelines
- Exercise 1 (model solutions, w1e2.py)
- Exercise 2 (model solutions, w2e1.py, w2e2.py, w2e3.py)
- Exercise 3 (model solutions)
- Exercise 4 (model solutions, w4e2.py, w4e3.py, note on least squares regression)
- Exercise 5 (files: mtcars.txt, mtcars.xlsx, HW.txt, FT.txt, BP2.txt, due by: November 26, 10:15 am)
Contact
Dr. Vesa Kaarnioja | vesa.kaarnioja@fu-berlin.de | Arnimallee 6, room 212 Consulting hours: By appointment |
Literature
- Larry Wasserman. All of Statistics: A Concise Course in Statistical Inference. Springer Science & Business Media, 2004.
- Morris H. DeGroot and Mark J. Schervish. Probability and Statistics. 4th edition, Pearson Education, 2013.
- José M. Bernardo and Adrian F. M. Smith. Bayesian Theory. 2nd edition, Wiley, 2007.
- Leonhard Held and Daniel Sabanés Bové. Applied Statistical Inference: Likelihood and Bayes. Springer Science & Business Media, 2013.
- Sebastian Reich and Colin Cotter. Probabilistic Forecasting and Bayesian Data Assimilation. Cambridge University Press, 2015.