Back to Courses
MATH003 Undergraduate

Probability and Statistics: The Science of Uncertainty

A comprehensive introductory university-level course on the mathematical foundations of probability and statistics. Requiring one year of calculus, the course covers probability models, random variables, expectation, sampling distributions, likelihood and Bayesian inference, and relationships among variables.

4.9
33.0h
736 students
0 likes
Math
Start Learning

Lessons

Lesson

This lesson introduces formal probability models as a rigorous framework to replace subjective intuition, highlighting the relative frequency interpretation and the Law of Large Numbers. Students learn to apply these mathematical structures to quantify uncertainty and manage risk in complex, real-world scenarios where human cognitive biases often fail.

This lesson introduces random variables as deterministic functions that map sample space outcomes to real numbers, providing a quantitative framework for probability. Students learn to utilize indicator functions, understand probability distributions, and apply the continuity of probability to analyze complex events.

This lesson introduces mathematical expectation as the long-run average of a random variable and explores its core properties, including linearity, monotonicity, and independence. Students learn to apply the Law of the Unconscious Statistician (LOTUS) to efficiently calculate the expected values of transformed variables without needing to derive their specific probability distributions.

This lesson introduces sampling distributions as the probability laws governing statistics, which are functions of independent and identically distributed (i.i.d.) random variables. Students learn to derive exact distributions for small sample sizes and explore how these concepts bridge the gap between raw data and statistical inference.

This lesson introduces statistical inference as the formal process of using sample data to estimate the underlying probability distributions and mechanics of a system. It emphasizes that inference is necessary to distinguish between inherent random variation and structural uncertainty, allowing researchers to make robust predictions beyond simple descriptive summaries.

This lesson explores likelihood-based inference, focusing on how the likelihood function quantifies the support for different parameter values given observed data. Students learn to use log-likelihoods for computational efficiency, apply the Central Limit Theorem for asymptotic inference, and utilize Fisher Information to measure the precision of statistical estimates.

This lesson introduces the Bayesian paradigm, which treats unknown parameters as random variables rather than fixed constants to allow for direct probability statements about them. Students learn to construct a complete Bayesian model by combining a sampling model with a prior distribution to update beliefs through the joint distribution.

This lesson explores the mathematical foundations of optimal statistical inference by defining the Mean Squared Error (MSE) as the sum of an estimator's variance and squared bias. Students learn how to minimize this error to identify the best estimators and understand the role of sufficiency and posterior means in decision theory.

This lesson introduces model checking as a critical validation step that ensures statistical inferences are grounded in reality rather than mathematical fiction. Students will learn to distinguish between parameter estimation and model validation, emphasizing that even the most precise calculations are meaningless if the underlying model assumptions do not accurately reflect the data-generating process.

This lesson defines a statistical relationship as any change in the conditional distribution of a response variable $Y$ when a predictor $X$ varies, moving beyond simple correlation to include shifts in mean, variance, or shape. It also emphasizes that establishing causality requires rigorous experimental design, such as blinding and blocking, to account for confounding variables and eliminate bias.

This lesson introduces stochastic processes as systems that evolve over time through probabilistic rather than deterministic rules, with a primary focus on the Simple Random Walk. Students learn to calculate path probabilities and expected values while exploring key concepts like the parity rule, Martingale fairness, and the foundational mechanics of the Gambler’s Ruin model.

Course Overview

📚 Content Summary

A comprehensive introductory university-level course on the mathematical foundations of probability and statistics. Requiring one year of calculus, the course covers probability models, random variables, expectation, sampling distributions, likelihood and Bayesian inference, and relationships among variables.

Master the rigorous mathematical science of uncertainty through calculus-based probability and statistical inference.

Author: Michael J. Evans and Jeffrey S. Rosenthal

Acknowledgments: The authors acknowledge contributions from various reviewers and colleagues at institutions such as the University of Toronto, McMaster University, and Purdue University. Funding and infrastructure support from the University of Toronto are also noted.

🎯 Learning Objectives

  1. Define a formal probability model using sample spaces, events, and probability measures.
  2. Apply combinatorial principles (permutations, subsets, binomial coefficients) to solve uniform probability problems.
  3. Utilize the Law of Total Probability and Bayes' Theorem to analyze multi-stage systems and update beliefs based on new information.
  4. Define and distinguish between discrete and absolutely continuous random variables and their respective probability/density functions.
  5. Identify and apply key probability distributions (Bernoulli, Binomial, Poisson, Normal, etc.) to model real-world phenomena.
  6. Compute marginal densities, conditional distributions, and assess independence for multivariate distributions.
  7. Calculate the expected value, variance, and covariance for discrete, continuous, and mixed random variables.
  8. Apply the Law of the Unconscious Statistician (LOTUS) and linearity properties to compute expectations of transformed variables.
  9. Derive moments using Probability-Generating (PGF) and Moment-Generating Functions (MGF).
  10. Define and derive sampling distributions for functions of i.i.d. sequences.

Lessons