Course materials for ISE:4172 Big Data Analytics (Prof. Stephen Baek; University of Iowa).
Read getting_started.md to configure your system for the course materials.
See ICON. (Iowa students only)
| Index | Description | Materials |
|---|---|---|
| Lecture 1 Lab 1 |
Introduction Course introduction What is big data? Python & Numpy basics |
slides lab getting started |
| Lecture 2 Lab 2 |
Read & Represent Data Data types & formats Pandas basics |
slides lab |
| Lecture 3 Lab 3 |
Data Mining Public datasets Web crawling Application Programming Interfaces |
slides lab |
| Lecture 4 Lab 4 |
Data Preprocessing and Visualization Data cleaning Sampling Feature extraction (preview) Dimension reduction (preview) |
slides lab |
| No Class |
||
| Lecture 5 Lab 5 |
Supervised Learning Hypothesis Linear regression Complexity |
slides #1 slides #2 lab |
| Lecture 6 Lab 6 |
Distance and Similarity K-nearest neighbors Distance functions Nystrom approximation |
slides #1 slides #2 slides #3 lab Nystrom |
| Lecture 7 Lab 7 |
Cluster Analysis. |
slides #1 slides #2 lab |
| Final Project Assigned | ||
| No Class |
||
| No Class |
||
| Lecture 8 Lab 8 |
Neural Networks Introduction to neural networks Backpropagation Deep neural networks |
slides #1 slides #2 lab |
| Lecture 9 Lab 9 |
Dimensionality Reduction Principal Component Analysis Multi-dimensional Scaling Isomap & LLE t-SNE Self Organizing Maps |
slides #1 slides #2 |
| Lecture 10 Lab 10 |
Model Selection |
slides |
| Thanksgiving Recess |
||
| Thanksgiving Recess |
||
| Final Project Presentation |
||
| Final Project Presentation |