-
Notifications
You must be signed in to change notification settings - Fork 0
/
README
22 lines (17 loc) · 932 Bytes
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
Dependencies that must be installed to run the software
==== Ubuntu =====
sudo apt-get install libtry-tiny-perl
sudo apt-get install libwww-mechanize-perl
sudo apt-get install libyaml-perl
sudo apt-get install libhtml-treebuilder-xpath-perl
sudo apt-get install libdbix-class-schema-loader-perl
sudo apt-get install libcompress-bzip2-perl
Test it like this :
perl -I lib/ bin/aspxtraktor.pl --term "software"
That only processes index pages and saves them but does not download the details.
Detail Pages are processed like this :
perl -I lib/ bin/aspxtraktor.pl --term "softwa" --recurse
Read in a file into the database like this :
perl -I lib/ bin/aspxtraktor.pl --file=output_test/DataExtractor_IPKO_P1_Data4.htm.bz2
if you want to load the business activity types : add the --loadtype arguement
perl -I lib/ bin/aspxtraktor.pl --loadtype --file=output_test/DataExtractor_IPKO_P1_Data4.htm.bz2