How it Works

Step 1 | Spider the Website by running 'main.py'

Spider the Website of your choice.
- This algorithm won't wander away from the inputted website. For ex if the domain is 'www.wikipedia.org' then it will stay on 'www.wikipedia.org'.
Grab all the links from the page, then randomly select a link, open the link and get all the links from that newly opened page.
The above process goes on until the specified number of pages are retrieved.

All you have to do after the first three steps is to open 'view.html' to view the results.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
scripts		scripts
.gitignore		.gitignore
createJson.py		createJson.py
main.py		main.py
pagerank.py		pagerank.py
readme.md		readme.md
resetRanks.py		resetRanks.py
spider.py		spider.py
view.html		view.html
wipeDB.py		wipeDB.py