Spring 2021 - STAT207
Data Science Exploration


Watching Course Lectures
  • Synchronous Option : Visit our Compass page for Zoom links for the synchronous lectures held on
    TuTh 3:30pm-4:50pm CST.
  • Asynchronous Option : Videos of the synchronous lecture will be recorded and posted shortly after the lecture on our Compass page.
Lecture Materials
  • The links for the each new day's lecture will be available for printing/viewing by 8am CST (see schedule below).
Schedule

  • COVID-19 NOTE: Due to potential unforeseen circumstances regarding COVID-19, this schedule may be subject to change. Please check back regularly. I will send email notifications if something on schedule has to change.


On the day of the lecture:
Date Topics Materials Assignments
Tuesday, January 26, 2021 Introduction to STAT207, Github, Python, and the Full Data Science Pipeline
Pdfs
Class Introduction
01_Intro_to_Python_and_Github


Other materials (The lecture Jupyter notebooks and data files will always be in the _classnotes repository.)

Wednesday, January 27, 2021


Thursday, January 28, 2021 Introduction to the Data Science Pipeline 02_Intro_to_the_Data_Science_Pipeline

Tuesday, February 2, 2021 Introduction to Dataframe Manipulation and Data Cleaning (Notes below were originally posted on Thursday, but we're just now getting to this material.)
03_Intro_to_the_Dataframe_Manipulation_and_Data_Cleaning

Wednesday, February 3, 2021
Lab 1 due on Github 11:59pm CST.
Thursday, February 4, 2021 Descriptive Analytics for Numerical Variables 04_Descriptive_Analytics_for_Numerical_Variables
04_Descriptive_Analytics_for_Numerical_Variables_Notebook


Tuesday, February 9, 2021 Probability and Random Sampling 05_Probability_and_Random_Sampling
05_Probability_and_Random_Sampling_Notebook


Wednesday, February 10, 2021

Lab 2 due on Github 11:59pm CST.
Thursday, February 11, 2021 More Probability and Random Sampling

Tuesday, February 16, 2021 Creating a Sampling Distribution - Building Blocks for Inference
06_Creating_Sampling_Distributions_slides
06_Creating_Sampling_Distributions_notebook

Wednesday, February 17, 2021

Thursday, February 18, 2021 Introduction to Random Variables 07_Random_Variables_slides
07_Random_Variables_notebook

EXTENSION: Lab 3 due on Github 11:59pm CST.
Tuesday, February 23, 2021 Common Types of Random Variables and How to Use Them 08_Common_Random_Variables_and_How_to_Use_slides
08_Common_Random_Variables_and_How_to_Use_notebook


Wednesday, February 24, 2021

Lab 4 due on Github 11:59pm CST.
Thursday, February 25, 2021 Lecture

Tuesday, March 2, 2021 Midterm 1 Review

Wednesday, March 3, 2021


Thursday, March 4, 2021 Midterm 1 Takehome (No class)
 Midterm 1 Takehome Emailed to Students at 3:30pm CST


Friday, March 5, 2021 Midterm 1 Takehome Due on Compass at 3:30pm CST

Tuesday, March 9, 2021 Finishing Unit 8
Starting Unit 9: Central Limit Theorem and Confidence Intervals
09_Central_Limit_Theorem_and_Confidence_Intervals_slides
09_Central_Limit_Theorem_and_Confidence_Intervals_notebook

Wednesday, March 10, 2021

Lab 5 due on Github 11:59pm CST.
Thursday, March 11, 2021 Finishing Unit 9
Unit 10: Hypothesis Testing
10_Hypothesis_Testing_slides
10_Hypothesis_Testing_notebook

Tuesday, March 16, 2021 Working on Unit 10

Wednesday, March 17, 2021

Lab 6 due on Github 11:59pm CST.
Thursday, March 18, 2021 Inference for Associations (starting Unit 11)
11_Inference_for_Associations_slides
11_Inference_for_Associations_notebook

Tuesday, March 23, 2021 Working on Unit 11

Wednesday, March 24, 2021

Thursday, March 25, 2021 Finishing Unit 11
Starting Unit 12: Simple Linear Regression

12_simple_linear_regression_slides
12_simple_linear_regression_notebook

Lab 7 due on Github 11:59pm CST.
Tuesday, March 30, 2021 Finishing Unit 12
Starting Unit 13: Multiple Linear Regression

13_mutiple_linear_regression_slides
13_mutiple_linear_regression_notebook

Wednesday, March 31, 2021

Lab 8 due on Github 11:59pm CST.
Thursday, April 1, 2021 Working on Unit 13

Tuesday, April 6, 2021 Finishing Unit 13
Unit 14: ANOVA
Unit 15: Logistic Regression (Part 1)

14_ANOVA_slides
14_ANOVA_notebook
15_logistic_regression_part1_slides
15_logistic_regression_part1_notebook
Typo in old Unit 15 notebook. Please use the ones you see here and now on Github

Wednesday, April 7, 2021

Lab 9 due on Github 11:59pm CST.
Thursday, April 8, 2021 Finishing Unit 15
Unit 16: Logistic Regression - Part 2
Unit 17: Classification and ROC

16_logistic_regression_part2_slides
16_logistic_regression_part2_notebook
17_Classification_and_ROC_slides
17_Classification_and_ROC_notebook

Tuesday, April 13, 2021 BREAK - NO CLASSES

Wednesday, April 14, 2021

Thursday, April 15, 2021 Midterm 2 Review
Lab 10 due on Github 11:59pm CST.
Tuesday, April 20, 2021 Midterm 2 Takehome (No class)
 Midterm 2 Takehome Emailed to Students at 3:30pm CST


Wednesday, April 21, 2021 Midterm 2 Takehome Due on Compass at 3:30pm CST

Thursday, April 22, 2021 Finishing Unit 16 and 17
Starting Unit 18: Training Data vs. Test Data

18_Train_Test_slides
18_Train_Test_notebook

Tuesday, April 27, 2021 Unit 18 and Unit 19
19_Variable_Selection_for_Logistic_Regression_slides
19_Variable_Selection_for_Logistic_Regression_notebook
Wednesday, April 28, 2021


Lab 11 due on Github 11:59pm CST.
Thursday, April 29, 2021 Unit 19 and Unit 20
20_More_Variable_Selection_Methods_slides
20_More_Variable_Selection_Methods_notebook

Tuesday, May 4, 2021 Last Lecture: Final Exam Review

Wednesday, May 5, 2021

Lab 12 due on Github 11:59pm CST.
Thursday, May 6, 2021 Reading Day

Friday, May 7, 2021 Takehome Final Exam Emailed at 8am CST

Saturday, May 8, 2021 Takehome Final Exam Due on Compass at 8am CST

Friday, May 14, 2021 Optional Project due at 8pm CST