homework_1 #13

alexanderquispe · 2025-03-26T21:36:51Z

📚 Data Science Homework 1 — Web Scraping

🔧 Instructions

Dear all,

Please follow the instructions below to complete Homework 1:

1. Git Branch and Folder Structure

Each student must create a new branch named:
```
[UPID]_hw1_2025_1
```
Example: 123456_hw1_2025_1
Using this branch, create a folder named exactly as your branch in the folder hw1:
```
123456_hw1_2025_1/
```
Inside your folder, include the following:
- requirements.txt
- Your scraping code (your jupyter notebook). The format name should be 123456_hw1_2025_1.ipynb
- Your resulting CSV file
Save everything under the main homework1/ directory in the repo.

2. Task Description

Scrape all Data Science job offers from the Bumeran platform that match the following filters(using code not by hand!):

3. Suggested Scraping Strategy (Two Stages)

✅ Stage 1: Extract Job Posting Links

Scrape all the job listing URLs based on the filters above.
Navigate across all pages if necessary.

✅ Stage 2: Scrape Job Details

For each job URL collected in Stage 1, extract the following:
- Job Title
- Description (up to the "Benefits" section)
- District
- Work Mode (e.g., on-site, remote, hybrid)

4. Output

Your final output must be a CSV file with the following columns:
```
Job Title | Description | District | Work Mode
```

5. 📹 Short Explanation Video

Create a 3-minute video explaining your work.
Your video should include:
- A short explanation of your environment setup.
- A walk-through of your code and any specific functions/classes you used.
- A sample run showing the output.
Upload your video link to the next Google sheet.

Deadline - April 2 23:59 p.m. NO EXTENSION!
Let us know if you have any questions. Good luck!

The text was updated successfully, but these errors were encountered:

Avanve: 1)Git Branch and Folder Structure

+ cvs Homework #1

Homework_1 + excel

homework 1 + excel

Update example homework

update my homework

Update my homwork

update mi tarea

Update my homework

Update final de la tarea. El csv es hasta la fecha 4.2.2025. Este puede variar si se corre el código otro día

#13

#13 homework

homerwork update

#13

tarea

#13

Update my homework

UPDATE MY HOMEWORK

update my homework 1

Last update of the homework

#13

Update my homework

Envío de Tarea 1

#13

Tarea

Update my homework

my homework

#13

tarea

cam

my homework

intento de subida 2

My homework

#13

Se subieron los files anteriormente a la hora limite de la tarea a la carpeta con mi codigo, ahora se vuelven a subir pero en la carpeta con mi codigo dentro de la carpeta de homework.

#13 _2

Update task development

#13

Daf1807 added a commit that referenced this issue Mar 28, 2025

#13

3b23296

Avanve: 1)Git Branch and Folder Structure

Daf1807 added a commit that referenced this issue Mar 30, 2025

#13

11d9523

Daf1807 added a commit that referenced this issue Mar 30, 2025

#13

e613e41

Daf1807 added a commit that referenced this issue Mar 30, 2025

#13

8f0d8e5

Daf1807 added a commit that referenced this issue Mar 30, 2025

#13

401ed72

Daf1807 added a commit that referenced this issue Mar 30, 2025

#13

9c94b61

Daf1807 added a commit that referenced this issue Mar 30, 2025

#13

f21f735

+ cvs Homework #1

Daf1807 added a commit that referenced this issue Mar 30, 2025

#13

8c40bec

Daf1807 added a commit that referenced this issue Apr 1, 2025

#13

2bd34e7

Homework_1 + excel

Daf1807 added a commit that referenced this issue Apr 1, 2025

#13

f460f1a

homework 1 + excel

The-Paul2002 added a commit that referenced this issue Apr 1, 2025

#13

7b03605

Update example homework

The-Paul2002 added a commit that referenced this issue Apr 1, 2025

Merge pull request #28 from d2cml-ai/12345_Hw1_2025_1_example

f00df9e

#13

The-Paul2002 added a commit that referenced this issue Apr 1, 2025

#13

f1caeb8

update my homework

The-Paul2002 added a commit that referenced this issue Apr 1, 2025

Merge pull request #29 from d2cml-ai/12345_Hw1_2025_1_example

f584cec

#13

The-Paul2002 added a commit that referenced this issue Apr 1, 2025

#13

887e62f

Update my homwork

The-Paul2002 added a commit that referenced this issue Apr 1, 2025

Merge pull request #30 from d2cml-ai/123456789_hw1_EXAMPLE

3d8c213

#13

fabianlo003 pushed a commit that referenced this issue Apr 1, 2025

#13

2dc8002

update mi tarea

fabianlo003 added a commit that referenced this issue Apr 1, 2025

Merge pull request #31 from d2cml-ai/246672_hw1_2025_1

18a73cb

#13

JosueChumpitazi added a commit that referenced this issue Apr 2, 2025

#13

659185d

Update my homework

rominaratto added a commit that referenced this issue Apr 2, 2025

#13

cb6ed65

#13

JosueChumpitazi added a commit that referenced this issue Apr 2, 2025

Merge pull request #33 from d2cml-ai/160238_hw1_2025_1

e8f8167

#13

Hide801 added a commit that referenced this issue Apr 2, 2025

#13

7219a58

Update final de la tarea. El csv es hasta la fecha 4.2.2025. Este puede variar si se corre el código otro día

Hide801 added a commit that referenced this issue Apr 2, 2025

Merge pull request #35 from d2cml-ai/246653_hw1_2025_1

814fac9

#13

fabianlo003 added a commit that referenced this issue Apr 2, 2025

Merge pull request #38 from d2cml-ai/main

bb0485d

#13 homework

fabianlo003 mentioned this issue Apr 2, 2025

Merge pull request #38 from d2cml-ai/main #39

Merged

fabianlo003 added a commit that referenced this issue Apr 2, 2025

#13

5b1a22e

homerwork update

fabianlo003 added a commit that referenced this issue Apr 2, 2025

Merge pull request #40 from d2cml-ai/246672_hw1_2025_1

4018847

#13

fabianlo003 added a commit that referenced this issue Apr 2, 2025

#13

ce8e07f

tarea

fabianlo003 added a commit that referenced this issue Apr 2, 2025

Merge pull request #41 from d2cml-ai/246672_hw1_2025_1

d0ce847

#13

AbigailMontanez added a commit that referenced this issue Apr 2, 2025

#13

137a05a

Update my homework

Fer20ca added a commit to Fer20ca/Data-Science-Python that referenced this issue Apr 3, 2025

d2cml-ai#13

1499710

UPDATE MY HOMEWORK

Fer20ca added a commit to Fer20ca/Data-Science-Python that referenced this issue Apr 3, 2025

d2cml-ai#13

af757c7

UPDATE MY HOMEWORK

NadiaCopello added a commit that referenced this issue Apr 3, 2025

#13

0ddc392

update my homework 1

NadiaCopello added a commit that referenced this issue Apr 3, 2025

#13

ae28d02

update my homework 1

josezh07 added a commit that referenced this issue Apr 3, 2025

#13

c946516

josezh07 added a commit that referenced this issue Apr 3, 2025

#13

aadd057

Hide801 added a commit that referenced this issue Apr 3, 2025

#13

c61c41e

Last update of the homework

Hide801 added a commit that referenced this issue Apr 3, 2025

Merge pull request #50 from d2cml-ai/246653_hw1_2025_1

0106374

#13

milkoguz23 added a commit that referenced this issue Apr 3, 2025

#13

eb46b44

Update my homework

margaretteys added a commit that referenced this issue Apr 3, 2025

#13

3bc4171

Envío de Tarea 1

milkoguz23 added a commit that referenced this issue Apr 3, 2025

Merge pull request #51 from d2cml-ai/216580_hw1_2025_1

5fdb3da

#13

margaretteys added a commit that referenced this issue Apr 3, 2025

Merge pull request #52 from d2cml-ai/248329_hw1_2025_1

c5cbfda

#13

Sebasgp29 added a commit that referenced this issue Apr 3, 2025

#13

251ebc1

Tarea

legion8423 added a commit that referenced this issue Apr 3, 2025

#13

fca0424

Update my homework

Daf1807 added a commit that referenced this issue Apr 3, 2025

#13

c94e426

my homework

Sebasgp29 added a commit that referenced this issue Apr 3, 2025

Merge pull request #53 from d2cml-ai/233531_hw1_2025_1

0b33b3e

#13

legion8423 added a commit that referenced this issue Apr 3, 2025

#13

c26e5fb

tarea

legion8423 added a commit that referenced this issue Apr 3, 2025

#13

8e3131b

cam

Daf1807 added a commit that referenced this issue Apr 3, 2025

#13

2b1a510

my homework

legion8423 added a commit that referenced this issue Apr 3, 2025

#13

c432571

intento de subida 2

Daf1807 added a commit that referenced this issue Apr 3, 2025

#13

6a921c3

My homework

Daf1807 added a commit that referenced this issue Apr 3, 2025

Merge pull request #57 from d2cml-ai/241878]_hw1_2025_1

264aa98

#13

Sebasgp29 added a commit that referenced this issue Apr 3, 2025

#13 _2

1ab86e5

Se subieron los files anteriormente a la hora limite de la tarea a la carpeta con mi codigo, ahora se vuelven a subir pero en la carpeta con mi codigo dentro de la carpeta de homework.

Sebasgp29 added a commit that referenced this issue Apr 3, 2025

#13

89edab7

Sebasgp29 added a commit that referenced this issue Apr 3, 2025

Merge pull request #58 from d2cml-ai/233531_hw1_2025_1

268ee9c

#13 _2

Sebasgp29 added a commit that referenced this issue Apr 3, 2025

#13

eadb407

Sebasgp29 added a commit that referenced this issue Apr 3, 2025

Merge pull request #59 from d2cml-ai/233531_hw1_2025_1

cc966dc

#13 _2

The-Paul2002 added a commit that referenced this issue Apr 8, 2025

#13

62afdb2

Update task development

The-Paul2002 added a commit that referenced this issue Apr 8, 2025

Merge pull request #63 from d2cml-ai/task_development

2d8ecb0

#13

josezh07 added a commit that referenced this issue Apr 10, 2025

#13

9841515

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

homework_1 #13

homework_1 #13

alexanderquispe commented Mar 26, 2025 •

edited

Loading

homework_1 #13

homework_1 #13

Comments

alexanderquispe commented Mar 26, 2025 • edited Loading

📚 Data Science Homework 1 — Web Scraping

🔧 Instructions

1. Git Branch and Folder Structure

2. Task Description

3. Suggested Scraping Strategy (Two Stages)

✅ Stage 1: Extract Job Posting Links

✅ Stage 2: Scrape Job Details

4. Output

5. 📹 Short Explanation Video

alexanderquispe commented Mar 26, 2025 •

edited

Loading