How to install, maintain, and use the data scientific software "stack". The Unix operating system, integrated development environments, and problem solving strategies.
By the end of the course, students are expected to:
- Use the Unix command line to navigate their computer's filesystem.
- Define and distinguish between absolute file paths and relative file paths.
- Effectively use local and remote version control software (e.g., Git and GitHub) to organize projects and manage file versions.
- Create, edit and run reproducible literate Python and R code documents (e.g., reports and presentations) using Jupyter and RMarkdown.
- Write and edit Markdown and in-line LaTeX syntax within literate code documents.
- Define and correctly use a project working directory.
- Diagnose and troubleshoot programming and development environment problems, and explain how such problems can be avoided.
This is an assignment-based course. You'll be evaluated as follows:
Assessment | Weight | Due Date | Location |
---|---|---|---|
iClicker | 2% | iClicker and Canvas | |
Policies | 1% | 2024-09-04 23:59 PT | Canvas |
Setup | 1% | 2024-09-04 23:59 PT | Canvas |
Lab 0a | 1% | 2024-09-05 18:00 PT | Gradescope |
Lab 0b | 1% | 2024-09-05 18:00 PT | Gradescope |
Lab 1 | 11% | 2024-09-07 18:00 PT | Github & Gradescope |
Lab 2 | 11% | 2024-09-14 18:00 PT | Github & Gradescope |
Quiz 1 | 25% | Week of 2024-09-17 | ICICS Testing Center |
Lab 3 | 11% | 2024-09-21 18:00 PT | Github & Gradescope |
Lab 4 | 11% | 2024-09-28 18:00 PT | Github & Gradescope |
Quiz 2 | 25% | Week of 2024-10-01 | ICICS Testing Center |
Tip: Use the lecture learning objectives as beacons when studying for your quizzes!
Follow the installation guide for the UBC Master of Data Science Software Stack here: https://ubc-mds.github.io/resources_pages/installation_instructions/
See the course lecture book.
Materials were inspired, re-used and re-mixed from the following sources:
- Software Carpentry, specifically the Unix Shell and Git lessons
- Happy Git and GitHub for the useR by Jenny Bryan and the STAT 545 TAs
- Data 8: The Foundations of Data Science, specifically Lab 01
- Data Carpentry Reproducible Science Workshop
The UBC Master of Data Science DSCI 521: Computing Platforms for Data Science course materials here are licensed under the Creative Commons Attribution 2.5 Canada License (CC BY 2.5 CA). If re-using/re-mixing please provide attribution and link to this webpage.