Skip to content

aiden10/jobs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LinkedIn and Indeed Job Scraper

Usage

Install the required packages by typing the following command into your terminal: pip install -r requirements.txt

Modify the config.json file to your liking.

To scrape LinkedIn, add your cookies to the directory as cookies.json. This extension can be used to obtain the cookies.

  • queries: what you want to search

  • locations: where you want to search

  • includes: string that the job title must include (at least one string must be in job title)

  • must_include: kind of a misnomer since it's the same as 'includes' (at least one string must be in job title)

  • exlcudes: excludes jobs if the title contains any of the strings in the list

  • age_limit (days): jobs older than this are not added and once jobs in jobs.json become older than this, they are removed

  • distance: not really sure if this is miles or kilometres, but it's the distance from the location

Then run the jobs.bat file or call the main function directly. And for added convenience you could also add the jobs.bat to your start menu or desktop.

After the script finishes, you can open the index.html file and view the jobs that have been scraped in your browser.

job_scroll-ezgif com-video-to-gif-converter

Example:

{
    "queries": [
        "student",
        "co-op",
        "intern"
    ],
    
    "locations": [
        "Whitby, ON",
        "Markham, ON",
        "Waterloo, ON",
        "Toronto, ON",
        "Ottawa, ON"
    ],
    "includes": [
        "software engineering",
        "software engineer",
        "software",
        "technology",
        "computer", 
        "full stack",
        "full-stack",
        "backend",
        "frontend",
        "qa",
        "automation",
        "data",
        "developer",
        "mobile",
        "web",
        "it",
        "quality assurance"
    ],
    "must_include": [
        "intern",
        "co-op",
        "student",
        "coop"
    ],
    "excludes": [
        "marketing",
        "business",
        "administrative",
        "sales",
        "economics",
        "writer",
        "finance",
        "marketeer",
        "civil",
        "mechanical",
        "accounting",
        "accountant"
    ],
    "age_limit": 14,
    "distance": 25
}

Note

  • Random delays have been added to prevent the script from immediately being blocked. I might implement proxy switching eventually.
  • Indeed might flag you as a bot for a little bit if this runs too much. It'll go away if you visit the site yourself and click the Cloudflare button to verify yourself enough times.

About

Job Scraper for LinkedIn and Indeed

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published