Spring 2022 - STAT207
Data Science Exploration


Lab Assignments

  • lab_01: Data Science Setup (Due: Wednesday January 26, 11:59pm CST)
    In this first lab, you will set up your account and computer for Data Science Exploration and begin to work with Python notebooks

  • lab_02: Dataframes (Due: Wednesday February 2, 2022 at 11:59pm CST)
    In this second lab, we will practice the dataframe manipulation.

  • lab_03: Data Cleaning and Descriptive Analytics for Numerical Variables (Due: Feburary 9, 2022 at 11:59pm CST)
    In this third lab, we will clean a dataset and describe numerical variables.

  • lab_04: Probability (Due: February 16, 2022 at 11:59pm CST)
    In this fourth lab, we will calculate the theoretical and approximated probabilities of single and compound types of events.

  • lab_5: Sampling Distributions and Random Variables (Due: February 23, 2022 at 11:59pm CST)
    In this fifth lab, we will build sampling distributions and explore random variables.

  • lab_6: Random Variables (Due: March 2, 2022 at 11:59pm CST)
    In this sixth lab, we will explore the theoretical properties of sampling distributions and more random variables.

  • lab_7: Confidence Intervals (Due: March 9, 2022 at 11:59pm CST)
    In this seventh lab, we explore confidence intervals for a population mean and a population proportion.

  • lab_8: Hypothesis Testing (Due: March 23, 2022 at 11:59pm CST)
    In this eigth lab, we will conduct hypothesis testing.

  • lab_9: Inference for Associations (Due: March 30, 2022 at 11:59pm CST)
    In this ninth lab, we will conduct inference on the difference between two population means and the difference between two population proportions.

  • lab_10: Linear Regression (Due: April 6, 2022 at 11:59pm CST)
    In this tenth lab, we will fit simple and multiple linear regression models, assess the fit of the models, use them to make predictions, and conduct inference on the corresponding population slopes.

  • lab_11: Linear Transformations and ANOVA (Due: April 13, 2022 at 11:59pm CST)
    In this eleventh lab conduct linear transformations on some of our variables in an attempt to meet our conditions for inference. We will also conduct ANOVA.

  • 🎉 🥳lab_12: Logistic Regression 🎉 🥳 (Due: April 20, 2022 at 11:59pm CST)
    In this final lab we will fit logistic regression model, conduct inference on the slopes, and use them to make predictions.


  • project: Instructions and Materials for the Final Project (Due: May 13, 2022 at 1:30pm CST)
    The materials in this folder give information about how to complete and submit the final project. Please read over the instructions in the downloaded pdfs thoroughly before beginning the project. Don't wait until the last minute to start working on this project!