Spring 2022 - STAT207
Data Science Exploration


Course Components

This course is comprised of the following four components.

Main Lecture

  • Time: MWF 10:00am-10:50pm CST
  • Location: 106B8 Engineering Hall
  • Instructor: Dr. Tori Ellison
  • Attendance Policy: *not* part of participation grade, but highly encouraged!

Students enrolled in both lab sections will meet together on Tuesdays and Thursdays in the main lecture hall.

During the lectures we will work through the lecture materials for that day. New lecture materials for the given day will be posted on the course schedule (in a zip file) *at least* one hour before class.

The course lectures materials will be deliverd via about 20 units. Each new lecture unit will most likely contain the following files.

Lecture Unit Materials
  • Slides pdf
  • This pdf will focus more on the theoretical content of the class. This pdf contains "skeleton notes" that we will fill out in class. I would suggest downloading (perhaps printing) these pdf's yourself and taking notes in class.

  • csv file (usually)

  • Jupyter notebook file
  • The Jupyter notebook file will introduce new Python functions, parameters, packages etc. We will use it to go through real-world statistical applications of the theoretical content that we talk about in the slides pdf.

    I would suggest opening this Jupyter notebook and going through the code yourself in class. Try changing some of the parameters/values in the code to see what it does!

  • Notebook pdf
  • This is simply just a pdf copy of the Jupyter notebook file. I would suggest downloading (perhaps printing) these pdf's yourself and taking notes in class.

Assigned Lab Sections

These are the lab sections/times that you specifically enrolled in. You should only attend the lab that you enrolled in.

  • Section 1: Th 1:00pm-2:50pm CST
  • Section 2: Th 3:00pm-4:50pm CST
  • Location: 1066 Lincoln Hall
  • Teaching Assistant: Wenzhuo Zhou
  • Attendance Policy: Your attendance at the labs is part of your participation grade.
  • For each lab section that you attend and participate in you will get 5 points towards your participation grade. A perfect participation grade in the class is worth 50 points. (Thus this means that you can miss up to 4 labs penalty free).

    Wenzhuo will take attendance at each lab. Arriving more than 10 minutes late to the lab or leaving more than 10 minutes early from the lab counts as an absence.

    If you have to miss more than 4 labs due to health issues, religious observances, varsity obligations, or other extenuating circumstances you should email Dr. Ellison and we can discuss options at that point.

The purpose of the assigned lab sections is to give you a place to work on the weekly lab assignments where you can ask for help and get quick feedback from the teaching assistant and course assistants assigned to your lab section. Later in the semester, you will use these lab sections to work on your final projects.

Your weekly lab assignments will be comprised of two parts: a.) an individual lab assignment and b.) a group lab assignment. Thus, these assigned labs will give you the space to meet and collaborate with your group in person.

Tip: You should try to work on both the group and individual parts of the lab assignment in your assigned lab. Furthermore, you should try to get as much as you can done in these labs, while you have the immediate feedback of the TA and CAs.

Additional (Optional) Lab

The lab assignments will generally be due Tuesday at 11:59pm CST. So to get additional help answering any last minute questions we will have additional (optional) lab times on Wednesdays 5pm-7pm CST, Foreign Languages Building G13. There will be several course assistants to help answer your questions. You are not required to attend these additional open labs.

These optional labs will be held in-person, however you can also attend online if you'd like (see Canvas for Zoom links.)

Instructor Office Hours

These office hours will be held in-person, however you can also attend online if you'd like (see Canvas for Zoom links.)
Mondays 1:30pm-3:30pm CST
Computer Applications Building 138





STAT207 Instruction Team


Course Related Websites


Learning Outcomes

Overview:Building on the foundation of STAT 107, Data Science Discovery, we use Python, Jupyter notebooks, and GitHub to explore data science techniques, statistical concepts, and data analytic workflows, combined with the statistical analysis of STAT 200. As we explore data science we:

  • Develop understanding of probability models for noisy data and how these translate into uncertainty analysis and statistical inference
  • Understand how modeling assumptions and sampling frames affect our conclusions
  • Become adept with multiple regression modeling, basic machine learning, and inference
  • Become proficient in Python coding for data management, analytics, visualization
  • Understand and use GitHub repositories, the industry standard for submitting code and reports


Preqrequisites

STAT107



Course Topics

See: Course Topics



Course Materials


Course Assignments and Grades

Graded Components

Course grades are computed based on your percentage out of 800 points for the course. The graded components are as follows:

  • Participation (Lab Attendance) - 50 points
  • Two Midterm Exams - 200 pts total
  • Individual Lab Assignment Part - 250 pts
  • Group Lab Assignment Part - 50 pts
  • Project- 50 points
  • Final Exam - 200 pts

Total: 800 pts

Drop the 2 Lowest Assignments Policy:
  • Each lab assignment will be worth a total of 30 points total: 25 points for the individual lab assignment part, 5 points for the group lab assignment part. There will be 12 lab assignments, however we will drop the lowest two assignment grades. Thus a perfect total lab assignment score will amount to 300.
Late Policies:
  • Homework that is late by 5 minutes up to 24 hours will be deducted 30% of the assignment.
  • Homework that is late by more than 24 hours will receive 0 points.
Regrade Policies: You have ONE week to request a grade correction after a homework score is posted. You should clearly present the following information to the head TA (Wenzhuo Zhou, wenzhuo3@illinois.edu):
  • Which lab assignment is involved (e.g. lab assignment #6)
  • A detailed explanation of the suspected error
  • The number of points you feel you should have received for the question.

Final Course Grade

Course points will be translated into a course grade at the end of the semester. The grade thresholds will be based on your percentage score out of 800:

Grade Min Pct Min Pts Grade Min Pct Min Pts Grade Min Pct Min Pts
A+ 97 776 A 93 744 A- 90 720
B+ 87 696 B 83 664 B- 80 640
C+ 77 616 C 73 584 C- 70 560
D+ 67 536 D 63 504 D- 60 480

Participation

For each lab section that you attend and participate in you will get 5 points towards your participation grade. A perfect participation grade in the class is worth 50 points. (Thus this means that you can miss up to 4 labs penalty free).

Attendance at the lectures is highly encouraged, but not part of your grade.

Midterm Exams and Final Exam

Each of these exams will be takehome exams. There will be multiple versions of the exam. On the dates/times listed on the course schedule you will be emailed a personalized copy of your exam. You will then have 24 hours to complete the exam and submit it on Canvas.

You are allowed to use the notes/book. You are NOT allowed to discuss this exam, open someone else's exam, or share/send your exam to anyone else except for Dr. Ellison until after the 24 hour exam period is over.

Lab Assignments

Individual Lab Assignment

The individual part of the lab assignment is to be completed by each student individually. See the assignments tab for how to download and submit the individual lab assignment to your Github repository for each week.

Group Lab Assignment

The group part of the lab assignment is to be completed in groups of 2-3 from students in your assigned lab section. If you are unable to attend your lab section that week and all of the students in your lab section have already started working on the group lab assignment, then you can complete the group part on your own. See the assignments tab for how to download and submit the individual lab assignment to your Github repository for each week.

The group lab assignments will be more flexible in the questions that we are asking and will be more project-based.

Project

This project will be completed by the end of the semester. This will involve a final report (in a Jupyter notebook file) and giving a presentation during your lab sections. More information on this later! You can work in groups of 2-3 or your can complete it by yourself.

Learning Collaboratively

Working Together

We encourage you to discuss all of your course activities (with the exception of exams) with your friends and classmates! You will learn more though talking through the problems, teaching others, and sharing ideas.

Continue to read on “Academic Integrity” to understand the difference between collaboration and giving an answer away.

Academic Integrity

Collaboration is about working together. Collaboration is not giving the direct answer to a friend or sharing the source code to an assignment. Collaboration requires you to make a serious attempt at every assignment and discuss your ideas and doubts with others so everyone gets more out of the discussion Your answers must be your own words and your code must be typed (not copied/pasted) by you.

Academic dishonesty is taken very seriously in STAT 207 and all cases will be brought to the University, your college, and your department. You should understand how academic integrity applies specifically to STAT 207: the sanctions for cheating on an assignment includes a loss of all points for the assignment and that the final course grade is lowered by one whole letter grade (70 points). A second incident, or cheating on an exam, results in an automatic F in the course.

Academic integrity includes protecting your work. If you work ends up submitted by someone else, we have considered this a violation of academic integrity just as though you submitted someone else’s work.



Email Note
Given that this course is completely online, please check your email regularly for important class communications.



COVID-19 Note
Given the uncertain nature of how this semester might unfold due to COVID-19. The instructor reserves the right to make any changes she considers academically advisable. Such changes, if any, will be announced in class. Please note that it is your responsibility to attend the class and keep track of the proceedings. However, we will try to adhere to this syllabus and course schedule as much as possible, and will send an email informing you of any changes.