Skip to content

An emergency script to clone the results of the "all" search for AHRQ sites taken down

License

Notifications You must be signed in to change notification settings

planetscape/AHRQ_search_clone

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AHRQ Mirror Project

This is a project to backup important resources from https://guideline.gov/ and https://www.qualitymeasures.ahrq.gov/ The reasons for this are explained here: http://www.fredtrotter.com/2018/07/15/emergency-ahrq-backup/

Project by data journalists working at CareSet Systems

General functions

Guidelines

  • get_guidelines.php - A simple script that downloads all guidelines search results
  • extract_guideline_links.php - Once get_guidelines.php is run, use this to download from wayback, the latest version of guidelines. Create guideline_links.csv which shows what was gettable and how old it was.
  • guidelines_links.csv - shows the guidelines and which timestamp that wayback machine got for them.
  • www.guideline.gov - contains the actual mirror

Measures Clearinghouse

Measures inventory

Expert Commentary

There are three sections of commentary and synthesis that appear to be original articles hosted on these websites. The urls are:

These are backed up by downloading the main page to the one_off_mirror directory and then parsing that html for links that match pattern in the all of the urls.

manual backups

  • one_off_mirror - things I thought were worth downloading that do not belong anywhere else

About

An emergency script to clone the results of the "all" search for AHRQ sites taken down

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 100.0%