Written specifically for a job application to Travellizy.
- First install Docker and check that it works.
- Then clone the repository
- Build the image like
docker build --tag crawler:0.2.1.2 .
from within the cloned directory.
- Run the image like
docker run -it crawler:0.2.1.2 bash
- Run the crawler like
php ./crawl.php http://www.dolekemp96.org/main.htm
within the container. If the URL has a trailing slash (for example, right after a domain name), please include it.
- Optionally copy the HTML report file back to the Docker host (using your own container ID and report file name) like
docker cp a307572a9698:/usr/src/crawler/report_08.05.2020.html ./logs/
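
The steps above could be wrapped in a small shell helper. This is only a sketch: `normalize_url`, `crawl_and_fetch`, and the `crawler-run-…` container name are hypothetical names that are not part of the project. It also assumes that the image's working directory contains `crawl.php`, so that the crawler can be started without an interactive `bash` session. Naming the container avoids having to look up its ID for `docker cp`.

```shell
#!/bin/sh
# Sketch only: helper names, the container name, and the non-interactive
# invocation are assumptions, not part of the original project.

# Append a trailing slash when the URL is a bare domain (no path after the host).
normalize_url() {
    case "$1" in
        *://*/*) printf '%s\n' "$1" ;;   # already has a path or a trailing slash
        *://*)   printf '%s/\n' "$1" ;;  # bare domain: add the slash
        *)       printf '%s\n' "$1" ;;   # not an absolute URL: leave untouched
    esac
}

# Run the crawler in a named container, then copy one report out of it.
# $1 = URL to crawl, $2 = report file name inside /usr/src/crawler
crawl_and_fetch() {
    url=$(normalize_url "$1")
    report=$2
    name="crawler-run-$$"   # hypothetical container name; $$ is the shell's PID
    mkdir -p ./logs
    # Assumes the image's working directory contains crawl.php.
    docker run --name "$name" crawler:0.2.1.2 php ./crawl.php "$url" &&
        docker cp "$name:/usr/src/crawler/$report" ./logs/
    status=$?
    # Remove the stopped container whether or not the steps above succeeded.
    docker rm "$name" >/dev/null 2>&1
    return "$status"
}
```

After sourcing the file, a call such as `crawl_and_fetch http://www.dolekemp96.org/main.htm report_08.05.2020.html` would run the crawl and copy the report into `./logs/`. The report file name has to be supplied by hand, since it appears to depend on the date of the run.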