Course Assignments
Homework Labs
-
lab_exam_final: Exam Final (Due: May 14, 2020 at 11:59pm)
Final Exam notebook due Thursday, May 14, 11:59 pm
-
lab_project: Data Science Project (Due: May 12, 2020 at 11:59pm)
Optional data science project -- use methods learned in the class to go further and deeper
-
lab_12: Regularized Linear Regression (Due: May 6, 2020 at 11:59pm)
Use train/test methodology to compare ordinary least squares and regularized least squares regression with many feature variables
-
lab_11: Regularized Logistic Regression (Due: April 29, 2020 at 11:59pm)
Train a model for classifying email messages as spam or non-spam and evaluate using train / test splitting and cross validation
-
lab_10: Logistic Regression Model Selection (Due: April 22, 2020 at 11:59pm)
Train a logit model and evaluate on test data
-
lab_exam_02: Exam 2 (Due: April 14, 2020 at 11:59pm)
Exam 2 notebook due Tuesday, April 14, 11:59 pm
-
lab_09: Odds Ratios and Logistic Regression (Due: April 8, 2020 at 11:59pm)
Study association between categorical variables using odds ratio analysis and logit models
-
lab_08: Multiple Regression and ANOVA (Due: April 1, 2020 at 11:59pm)
Geographic analysis of US melanoma mortality rates and analysis of Iris species differences
-
lab_7: Regression models and inference (Due: March 25, 2020 at 11:59pm)
Linear regression modeling and math for real and simulated data.
-
lab_6: Hypothesis Tests (Due: March 11, 2020 at 11:59pm)
Analyze lead exposure data and birthweight data while exploring connections between confidence intervals and hypothesis tests.
-
lab_5: Sampling Distribution, Standard Error and Confidence Interval (Due: February 26, 2020 at 11:59pm)
Work with samples from uniform and normal distributions, using normal approximations for the sample mean.
-
lab_4: Random variables, parameters and samples (Due: February 19, 2020 at 11:59pm)
This lab covers the normal distribution, Bernoulli distribution, parameters and random samples.
-
lab_3: Sampling, probability and looping (Due: February 12, 2020 at 11:59pm)
This lab covers sampling, probability, for loops, and defining your own function.
-
lab_2: Data Frames (Due: February 5, 2020 at 11:59pm)
In this lab you will learn more about data structure in Python, read external data from csv files, and perform basic data extraction, analytics, and visualization.
-
lab_1: Data Science Setup (Due: January 29, 2020 at 11:59pm)
In this first lab, you will set up your account and computer for Data Science Exploration and begin to work with Python notebooks