-
Spider the Website of your choice.
- This algorithm won't wander away from the inputted website. For ex if the domain is 'www.wikipedia.org' then it will stay on 'www.wikipedia.org'.
-
Grab all the links from the page, then randomly select a link, open the link and get all the links from that newly opened page.
-
The above process goes on until the specified number of pages are retrieved.
- After spidering has finished, calculate the page ranks of the retrieved pages.
- Generate a JSON file to help with D3JS visualization.
- All you have to do after the first three steps is to open 'view.html' to view the results.