Statistical Foundations of Data Science

Subject associations
ORF 525
Term
Spring 2025
Instructors
Registrar description

A theoretical introduction to statistical machine learning for data science. It covers multiple regression, kernel learning, sparse regression, high dimensional statistics, sure independent screening, generalized linear models, covariance learning, factor models, principal component analysis, supervised and unsupervised learning, deep learning, and related topics such as community detection, item ranking, and matrix completion.These methods are illustrated using real world data sets and manipulation of the statistical software R.