Link

Introduction to Probability and Statistics in Biology and Public Health

PH 142, Fall 2021

Corinne Riddell (she/her/hers)

Corinne Riddell (she/her/hers)

c.riddell@berkeley.edu

Office Hours: Wednesday 1-2pm

Location TBD: Considering in person either outside or inside, on zoom, or a combo of these

We will not be updating this page with announcements. For the latest announcements, make sure to check our Piazza.

Note: Course material, including all video and assigned content (e.g. labs, homework, and other shared materials), is copyrighted and reposting to third party sites or any other form of redistribution is prohibited.

Important Information

Goals

  • Build strong foundations in statistics and introduce students to programming to prepare students for more advanced courses involving data analysis.
  • Foster a mindset of statistical thinking to give students the tools to understand experiments and research studies and evaluate their validity.
  • Empower students to address real-world problems with technical prowess.

Course Description

This course is an introduction to statistics and data science, primarily for MPH and undergraduate public health majors, and others interested in public health topics. The course can be divided into three parts:

  • Part I: We will be focusing on learning to explore and summarize univariate and bivariate distributions. We will be using the R statistical software to do this, specifically, we will use the dplyr and ggplot2 packages.
  • Part II: We will be learning about classical problems in probability and the some commonly used probability distributions: the Normal, Binomial, and Poisson distributions. In Part II we will also make the link between theoretical distributions and sampled distributions and cover a key concept called the Central Limit Theorem.
  • Part III: We introduce statistical inference, the process of estimating statistics from samples to make inference about populations.

Throughout the course, we will follow the PPDAC model, which stands for “Problem, Plan, Data, Analysis, and Conclusion”.