Commit 0792674

Merge pull request #223 from cmckee786/patching
fixes/chores: links, docs, styling, formatting + implemented proposal
2 parents fb6f6d1 + 1aa9dc0 commit 0792674

67 files changed: +1002 -1475 lines changed

README.md

Lines changed: 2 additions & 0 deletions
@@ -1,5 +1,7 @@
 # The Professional Linux Users Group (ProLUG) Systems Administration Course Book

+Current deployment: <https://professionallinuxusersgroup.github.io/lac/>
+
 ## Goals

 The overarching goal of this project is to provide a free, open source Systems Administration course

book.toml

Lines changed: 1 addition & 1 deletion
@@ -10,5 +10,5 @@ default-theme = "navy"
 preferred-dark-theme = "navy"
 no-section-label = true
 git-repository-url = "https://github.com/ProfessionalLinuxUsersGroup/lac"
-edit-url-template = "https://github.com/ProfessionalLinuxUsersGroup/lac/edit/main/{path}"
+edit-url-template = "https://github.com/ProfessionalLinuxUsersGroup/lac/issues"
 git-repository-icon = "fa-github"

src/SUMMARY.md

Lines changed: 3 additions & 2 deletions
@@ -1,8 +1,9 @@
 # Summary

-- [Intro](./intro.md)
 - [About ProLUG](./prolug.md)
-- [Final Project Outline](./project.md)
+- [Table of Contents](./unitindex.md)
+- [Course Syllabus](./syllabus.md)
+- [Final Project Outline](./project.md)
 - [Unit 1](./u1intro.md)
 - [Work Sheet](./u1ws.md)
 - [Lab](./u1lab.md)

src/assets/downloads/u1/u1_lab.txt

Lines changed: 0 additions & 54 deletions
This file was deleted.
-14.2 KB
Binary file not shown.

src/assets/downloads/u1/u1_worksheet.txt

Lines changed: 2 additions & 1 deletion
@@ -2,7 +2,8 @@ ProLUG 101
 Unit 1 Worksheet

 Instructions
-Fill out this sheet as you progress through the lab and discussions. Turn in the sheet to [email protected] for grading and feedback.
+Fill out this sheet as you progress through the lab and discussions. Hold onto all of your work to send to me at the end of the course.
+
 Discussion Questions:
 Unit 1 Discussion Post 1: Using a 0-10 system, rate yourself on how well you think you know each topic in the image. (You do not have to post this rating). Answer these questions (here):
 1. What do you hope to learn in this course?
Lines changed: 54 additions & 0 deletions
@@ -0,0 +1,54 @@
+ProLUG 101
+Unit 10 Worksheet
+
+Instructions
+Fill out this sheet as you progress through the lab and discussions. Hold onto all of your work to send to me at the end of the course.
+
+Discussion Questions:
+Unit 10 Discussion Post 1: Read this document: https://kubernetes.io/docs/concepts/overview/
+
+1.What are the two most compelling reasons you see to implement Kubernetes in your organization?
+2.When the article says Kubernetes is not a PaaS, what do they mean by that? What is a PaaS in comparison?
+
+Unit 10 Discussion Post 2: You get a ticket about your new test cluster. The team is unable to deploy some of their applications. They suspect there is a problem and send you over this output:
+
+[root@Test_Cluster1 ~]# kubectl version
+Client Version: v1.31.6+k3s3
+Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
+Server Version: v1.30.6+k3s1
+[root@rocky15 ~]# kubectl get nodes
+NAME            STATUS     ROLES                  AGE   VERSION
+Test_Cluster1   Ready      control-plane,master   17h   v1.30.6+k3s1
+Test_Cluster2   NotReady   worker                 33m   v1.29.6+k3s1
+Test_Cluster3   Ready      worker                 17h   v1.28.6+k3s1
+
+1.What are you checking on the cluster to validate you see their error?
+2.What do you think the problem could be?
+3.Do you think someone else has tried anything to fix this problem before you? Why or why not?
+
+Unit 10 Discussion Post 3: You are the network operations center (NOC) lead. Your team has recently started supporting the dev, test, and QA environments for your company’s K8s cluster. Write up a basic checkout procedure for your new NOC personnel to verify operation of the cluster before escalating on critical alerts.
+
+1.What information online helped you figure this out? What blogs or tools did you use?
+2.What did you learn in the process of writing this up?
+
+Definitions/Terminology
+
+Kubernetes/K8s
+K3s
+Controller Manager
+ETCD
+Kubelet
+Kube-proxy
+Controlplane
+Node
+Static Pod
+Scheduler
+API Server
+
+Notes During Lecture/Class:
+
+Links:
+
+Terms:
+
+Useful tools:
Lines changed: 65 additions & 0 deletions
@@ -0,0 +1,65 @@
+ProLUG 101
+Unit 11 Worksheet
+
+Instructions
+Fill out this sheet as you progress through the lab and discussions. Hold onto all of your work to send to me at the end of the course.
+
+Discussion Questions:
+
+Unit 11 Discussion Post 1: You’ve heard the term “loose coupling” thrown around the office about a new monitoring solution coming down the pike. You find a good resource and read the section on “Prefer Loose Coupling” https://sre.google/workbook/monitoring/
+
+1.What does “loose coupling” mean, if you had to summarize to your junior team members?
+2.What is the advantage given for why you might want to implement this type of tooling in your monitoring? Do you agree? Why or why not?
+3.They mention “exposing metrics”. What does it mean to expose metrics? What happens to metrics that are exposed but never collected?
+
+Unit 11 Discussion Post 2: Your HPC team is asking for more information about how CPU0 is behaving on a set of servers. Your team has node exporter writing data out to Prometheus (Use this to simulate https://promlabs.com/promql-cheat-sheet/).
+
+1.Can you see the usage of CPU0 and what is the query?
+2.Can you see the usage of CPU0 for just the last 5 minutes and what is the query?
+3.You know that CPU0 is excluded from Slurm. Can you exclude it and pull only the user and system usage for the remaining CPUs, and what is that query?
+
+Definitions/Terminology
+
+SLO
+SLA
+SLI
+KPI
+Span
+Trace
+Prometheus
+Node_Exporter
+Grafana
+Dashboard
+Heads up Display
+
+Notes During Lecture/Class:
+
+Links:
+
+Terms:
+
+Useful tools:
+
+Lab and Assignment
+
+Unit 11 Lab Monitoring
+
+Continue working on your project from the Project Guide
+Topics:
+1.System Stability
+2.System Performance
+3.System Security
+4.System monitoring
+5.Kubernetes
+6.Programming/Automation
+
+You will research, design, deploy, and document a system that improves your administration of Linux systems in some way.
+
+Digging Deeper (optional)
+1.Read the rest of the chapter https://sre.google/workbook/monitoring/ and note anything else of interest when it comes to monitoring and dashboarding.
+2.Look up the “ProLUG Prometheus Certified Associate Prep 2024” in Resources -> Presentations in our ProLUG Discord. Study that for a deep dive into Prometheus.
+3.Complete the project section of “Monitoring Deep Dive Project Guide” from the prolug-projects section of the Discord. We have a YouTube video on that project as well. https://www.youtube.com/watch?v=54VgGHr99Qg
+
+Reflection Questions
+1.What questions do you still have about this week?
+2.How can you apply this now in your current role in IT? If you’re not in IT, how can you look to put something like this into your resume or portfolio?
Lines changed: 101 additions & 0 deletions
@@ -0,0 +1,101 @@
+ProLUG 101
+Unit 12 Worksheet
+
+Instructions
+Fill out this sheet as you progress through the lab and discussions. Hold onto all of your work to send to me at the end of the course.
+
+Discussion Questions:
+
+Unit 12 Discussion Post 1: Your manager has come to you with another emergency. He has a meeting next week to discuss capacity planning and usage of the system with IT upper management. He doesn’t want to lose his budget, but he has to prove that the system utilization warrants spending more.
+
+1.What information can you show your manager from your systems?
+2.What type of data would prove system utilization? (Remember the big 4: compute, memory, disk, networking)
+3.What would your report look like to your manager?
+
+Unit 12 Discussion Post 2: You are in a capacity planning meeting with a few of the architects. They have decided to add 2 more agents to your Linux systems, a Bacula Agent and an Avamar Agent. They expect these agents to run their work starting at 0400 every morning.
+7 day view
+24 hour view
+
+1.What do these agents do? (May have to look them up)
+2.Do you think there is a good reason not to use these agents at this timeframe?
+3.Is there anything else you might want to point out to these architects about these agents they are installing?
+
+Unit 12 Discussion Post 3: Your team has recently tested a proof of concept of a new storage system. The vendor has published the blazing fast speeds that are capable of being run through this storage system. You have a set of systems connected to both the old storage system and the new storage system.
+
+1.Write up a test procedure of how you may test these two systems.
+2.How are you assuring these tests are objective?
+a. What is meant by the term Ceteris Paribus, in this context?
+
+Definitions/Terminology
+
+Baseline
+Benchmark
+High watermark
+Scope
+Methodology
+Testing
+Control
+Experiment
+Analytics
+- Descriptive
+- Diagnostic
+- Predictive
+- Prescriptive
+
+Notes During Lecture/Class:
+
+Links:
+
+Terms:
+
+Useful tools:
+Spyder IDE
+
+Lab and Assignment
+Unit 12 Lab Baselining and Benchmarking
+
+Continue working on your project from the Project Guide
+
+Topics:
+1.System Stability
+2.System Performance
+3.System Security
+4.System monitoring
+5.Kubernetes
+6.Programming/Automation
+
+You will research, design, deploy, and document a system that improves your administration of Linux systems in some way.
+
+Digging Deeper (optional)
+
+1.Analyzing data may open up a new field of interest to you. Go through some of the free lessons on Kaggle, here: https://www.kaggle.com/learn
+a. What did you learn?
+b. How will you apply these lessons to data and monitoring you have already collected as a system administrator?
+2.Find a blog or article that discusses the 4 types of data analytics.
+a. What did you learn about past operations?
+b. What did you learn about predictive operations?
+3.Download Spyder IDE (Open source)
+a. Find a blog post or otherwise try to evaluate some data.
+b. Perform some Linear regression.
+
+My block of code (but this requires some additional libraries to be added. I can help with that if you need it.)
+
+import matplotlib.pyplot as plt
+from sklearn.linear_model import LinearRegression
+size = [[5.0], [5.5], [5.9], [6.3], [6.9], [7.5]]
+price =[[165], [200], [223], [250], [278], [315]]
+plt.title('Pizza Price plotted against the size')
+plt.xlabel('Pizza Size in inches')
+plt.ylabel('Pizza Price in cents')
+plt.plot(size, price, 'k.')
+plt.axis([5.0, 9.0, 99, 355])
+plt.grid(True)
+model = LinearRegression()
+model.fit(X = size, y = price)
+#plot the regression line
+plt.plot(size, model.predict(size), color='r')
98+
Reflection Questions
99+
100+
1.What questions do you still have about this week?
101+
2.How can you apply this now in your current role in IT? If you’re not in IT, how can you look to put something like this into your resume or portfolio?
Lines changed: 68 additions & 0 deletions
@@ -0,0 +1,68 @@
+ProLUG 101
+Unit 13 Worksheet
+
+Instructions
+Fill out this sheet as you progress through the lab and discussions. Hold onto all of your work to send to me at the end of the course.
+
+Discussion Questions:
+
+Unit 13 Discussion Post 1: Your security team comes to you with a discrepancy between the production security baseline and something that is running on one of your servers in production. There are 5 servers in a web cluster and only one of them is showing this behavior. They want you to account for why something is different.
+
+1.How are you going to validate the difference between the systems?
+2.What are you going to look at to explain this?
+3.What could be done to prevent this problem in the future?
+
+Unit 13 Discussion Post 2: Your team has been giving you more and more engineering responsibilities. You are being asked to build out the next set of servers to integrate into the development environment. Your team is going from RHEL 8 to Rocky 9.4.
+
+1.How might you start to plan out your migration?
+2.What are you going to check on the existing systems to baseline your build?
+3.What kind of validation plan might you use for your new Rocky 9.4 systems?
+
+Definitions/Terminology
+
+Hardening
+Pipeline
+Change management (IT)
+Security Standard
+Security Posture
+Acceptable Risk
+NIST 800-53
+STIG
+CIS Benchmark
+OpenSCAP
+SCC Tool
+HIDS
+HIPS
+
+Notes During Lecture/Class:
+
+Links:
+
+Terms:
+
+Useful tools:
+
+Lab and Assignment
+Unit 13 Lab System Hardening
+
+Continue working on your project from the Project Guide
+
+Topics:
+1.System Stability
+2.System Performance
+3.System Security
+4.System monitoring
+5.Kubernetes
+6.Programming/Automation
+
+You will research, design, deploy, and document a system that improves your administration of Linux systems in some way.
+
+Digging Deeper (optional)
+1.Run through this lab: https://killercoda.com/het-tanis/course/Linux-Labs/107-server-startup-process
+a. How does this help you better understand the discussion 13-2 question?
+2.Run through this lab: https://killercoda.com/het-tanis/course/Linux-Labs/203-updating-golden-image
+a. How does this help you better understand the process of hardening systems?
+
+Reflection Questions
+1.What questions do you still have about this week?
+2.How can you apply this now in your current role in IT? If you’re not in IT, how can you look to put something like this into your resume or portfolio?
