Open
27 commits
4fde7d1
Update spiders.txt
acidvertigo Dec 9, 2014
49ac9ae
change 008 spider to voltron
acidvertigo Dec 9, 2014
9200ef4
Remove AnyApexBot as it no longer exists
acidvertigo Dec 9, 2014
c6e4b5d
Remove B-l-i-t-z-B-O-T as it no longer exists
acidvertigo Dec 9, 2014
8d040f6
Remove BillyBobBot as it no longer exists
acidvertigo Dec 9, 2014
a984faf
Remove Boithobot as it no longer works
acidvertigo Dec 9, 2014
f0dda94
Remove btbot as it uses Google search results
acidvertigo Dec 9, 2014
8104436
Remove CatchBot as it no longer exists
acidvertigo Dec 9, 2014
7bb8b73
Remove Cerberian Drtrs as it is a filtering tool
acidvertigo Dec 9, 2014
3c4f48b
Remove Charlotte as it uses Yahoo search results
acidvertigo Dec 9, 2014
ee739dc
Remove ConveraCrawler cosmos spiders as they do not index HTML pages
acidvertigo Dec 9, 2014
87c7dcb
Remove Covario IDS as it does not seem to work anymore
acidvertigo Dec 9, 2014
1a6046a
Remove DiamondBot as it seems to no longer exist
acidvertigo Dec 9, 2014
5102b28
Remove discobot as it no longer exists
acidvertigo Dec 9, 2014
d75908a
Remove EARTHCOM.info as it no longer exists
acidvertigo Dec 9, 2014
4832988
Remove EmeraldShield.com WebBot as it no longer exists
acidvertigo Dec 9, 2014
10575a9
Remove EsperanzaBot as it no longer exists
acidvertigo Dec 9, 2014
2963a62
Remove FDSE robot as it is an internal search engine for sites
acidvertigo Dec 9, 2014
9dc86a0
Remove FindLinks as it is currently not working
acidvertigo Dec 9, 2014
e753ed5
Remove g2crawler as it is not used anymore
acidvertigo Dec 9, 2014
08b4fee
Remove Gaisbot as it no longer exists
acidvertigo Dec 9, 2014
8a9ee7e
Remove genieBot as it no longer exists
acidvertigo Dec 9, 2014
c1c02a0
Remove GurujiBot as it no longer exists
acidvertigo Dec 9, 2014
bcf88ce
Remove HappyFunBot as it no longer exists
acidvertigo Dec 9, 2014
b6b6e07
Remove small and nonexistent spider user agents
acidvertigo Dec 9, 2014
a958130
Convert all to lowercase
acidvertigo Dec 9, 2014
a8097b4
Fix typo
acidvertigo Dec 9, 2014
127 changes: 84 additions & 43 deletions catalog/includes/spiders.txt
@@ -1,54 +1,95 @@
$Id$
almaden.ibm.com
appie 1.1
architext
ask jeeves
asterias2.0
augurfind
abachobot
accoona-ai-agent
addsugarspiderbot
arachmo
baiduspider
bannana_bot
bdcindexer
crawler
crawler@fast
docomo
becomebot
beslistbot
bimbot
bingbot
dataparksearch
dotbot
envolk[its]spider
exabot
fast enterprise crawler
fast-webcrawler
fluffy the spider
frooglebot
geobot
furlbot
fyberspider
galaxybot
gigabot
girafabot
googlebot
gulliver
henrythemiragorobot
googlebot-image
holmes
ia_archiver
infoseek
kit_fireball
lachesis
lycos_spider
mantraagent
mercator
moget/1.0
muscatferret
nationaldirectory-webspider
naverrobot
ncsa beta
ichiro
igdespyder
irlbot
l.webis
larbin
ldspider
lexxebot
linguee bot
linkwalker
lmspider
lwp-trivial
mabontland
magpie-crawler
mediapartners-google
mj12bot
mnogosearch
mojeekbot
moreoverbot
morning paper
msnbot
mxbot
netresearchserver
ng/1.0
osis-project
netseer crawler
newsgator
ng-search
nicebot
noxtrumbot
nutchcvs
obot
oozbot
orangebot
polybot
pompos
scooter
seventwentyfour
sidewinder
sleek spider
slurp/si
[email protected]
steeler/1.3
szukacz
t-h-u-n-d-e-r-s-t-o-n-e
psbot
pycurl
scoutjet
scrubby
seekbot
seochat::bot
seznambot
shim-crawler
shopwiki
shoula robot
silk
snappy
sogou spider
sosospider
speedy spider
stackrambler
suggybot
synoobot
teoma
thumbnail.cz robot
tineye
turnitinbot
ultraseek
twengabot
urlfilebot
vagabondo
voilabot
w3c_validator
zao/0
zyborg/1.0
voltron
websquash.com
yacy
yahoo! slurp
yahooseeker
yandexbot
yandeximages
yeti
yodaobot
youdaobot
zealbot
zyborg
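
For context on the "Convert all to lowercase" commit, here is a minimal sketch of how a list like this might be consumed. It assumes each entry is matched as a substring of the lowercased User-Agent header. The function names `load_spiders` and `is_spider` and the sample list are hypothetical and are not taken from this pull request or from the code that reads `spiders.txt`.

```python
def load_spiders(lines):
    """Return lowercase, non-empty signatures, skipping the $Id$ header line."""
    signatures = []
    for line in lines:
        entry = line.strip().lower()
        # Skip blank lines and the version-control keyword line at the top.
        if entry and not entry.startswith("$id"):
            signatures.append(entry)
    return signatures


def is_spider(user_agent, signatures):
    """True if any signature occurs as a substring of the lowercased User-Agent."""
    ua = (user_agent or "").lower()
    return any(sig in ua for sig in signatures)


# Hypothetical sample: a few entries from the list above.
SAMPLE = ["$Id$", "googlebot", "bingbot", "yandexbot", "yahoo! slurp"]
signatures = load_spiders(SAMPLE)
```

Under this assumption, keeping every entry lowercase means only the incoming User-Agent has to be normalized at request time, and a short entry such as `crawler` or `silk` would match any User-Agent containing that substring.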