STAT 5600/6600 – Probability and Statistics for Data Science

Undergraduate/graduate course introducing probability and statistical inference with a strong emphasis on data science applications, computation in R (and Python), and modern inferential tools.


Syllabus

Lecture Slides

Lecture Notes

When both typed and handwritten notes are available, they offer complementary perspectives on the same material.

Homework Assignments

Homework assignments emphasize conceptual understanding, analytical reasoning, and computation in R. Selected solution files and source materials are available upon request.

Sample Exams & Practice Problems

These materials are provided for practice and illustration purposes only.

Code Examples (R & Python)

The code repositories include illustrative scripts and notebooks used throughout the course, covering probability simulations, Monte Carlo methods, resampling, Markov chains, regression, and related computational topics.

The code provided here was developed for instructional purposes and reflects working implementations used in this course. While these scripts ran correctly in the instructor’s environment, no guarantee is made regarding correctness, efficiency, or suitability for other settings. Users are encouraged to adapt and validate the code for their own purposes. If you notice any errors, inconsistencies, or opportunities for improvement, I would greatly appreciate being contacted so that corrections can be made.

Datasets

Additional Materials

These notes are provided as optional background references to support probability and statistics concepts used in the course.


Materials are provided for educational use. While care has been taken in their preparation, errors or omissions may occur. If you notice any errors, inconsistencies, or ambiguities in the lecture notes, homework problems, or sample exams, I would appreciate being contacted so that corrections can be made. Please contact me before reuse or adaptation.