Old 07-17-2004, 11:49 AM   #1
b2l_grefix
custom depth of search per site in cgi (including urllist)

Hi,

Since I use phpdig to spider several sites, I wanted to be able to set a custom search depth per site in a urllist when spidering from a cron job.

So I made a few modifications to spider.php and robot_functions.php.

I thought this might be useful for some of you, so I'd like to share it.

IMPORTANT: modifications are made for version 1.8.3!

You need to add an extra column (site_limit) to the sites table in the phpdig database:

ALTER TABLE PREFIXsites ADD site_limit SMALLINT( 6 ) ;
------------------------------------------------------------------------------

Example of list.txt:
Not specifying the depth, or setting it to 0, makes the spider use the default depth set in config.php.

shell command:

Code:
#php -f [PHPDIG_DIR]/admin/spider.php list.txt
site1.com will be spidered with a depth of 2.
site2.com and site3.com will use the default value specified in config.php.
site4.com will be spidered with a depth of 2.
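
The defaulting rule for the four sites can be sketched in a few lines of shell. This is only an illustration, not the code from the attached zip: it assumes each line of the list holds a URL optionally followed by a depth, separated by whitespace (mirroring the single-site command line, not confirmed here), and `DEFAULT_DEPTH`, `effective_depth` and `show_depths` are hypothetical names, with `DEFAULT_DEPTH=3` standing in for whatever is set in config.php.

```shell
#!/bin/sh
# Hypothetical stand-in for the default depth configured in config.php.
DEFAULT_DEPTH=3

# Print the depth that would be used for a requested depth:
# an empty or 0 value falls back to the default.
effective_depth() {
    requested="${1:-0}"
    if [ "$requested" -eq 0 ]; then
        echo "$DEFAULT_DEPTH"
    else
        echo "$requested"
    fi
}

# Read "URL [depth]" lines (assumed format) from stdin and print
# each URL followed by the depth that would apply to it.
show_depths() {
    while read -r url depth; do
        [ -z "$url" ] && continue
        echo "$url $(effective_depth "$depth")"
    done
}

# Sample input in the assumed format, matching the four sites above.
printf '%s\n' \
    'http://site1.com 2' \
    'http://site2.com' \
    'http://site3.com 0' \
    'http://site4.com 2' | show_depths
```

With the assumed default of 3, this prints depth 2 for site1.com and site4.com and depth 3 for site2.com and site3.com.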

It can also be used to spider a single site, with the depth as an optional second argument.

shell command:
Code:
#php -f [PHPDIG_DIR]/admin/spider.php http://host.mydomain.com depth
examples:
Code:
#php -f [PHPDIG_DIR]/admin/spider.php http://host.mydomain.com 5
This will spider the site with a depth of 5.
Code:
#php -f [PHPDIG_DIR]/admin/spider.php http://host.mydomain.com
Code:
#php -f [PHPDIG_DIR]/admin/spider.php http://host.mydomain.com 0
If the depth is omitted or set to 0, the default depth from config.php is used.
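
If you call the spider from a cron script, a small helper that checks the depth before building the command line may be handy. The following is a rough dry-run sketch that only prints the command instead of running it. `spider_command` is a hypothetical helper name, and `/path/to/phpdig` is a placeholder for your own [PHPDIG_DIR].

```shell
#!/bin/sh
# Placeholder install location; set PHPDIG_DIR to your own [PHPDIG_DIR].
PHPDIG_DIR="${PHPDIG_DIR:-/path/to/phpdig}"

# Print the spider command for one site (dry run, nothing is executed).
# $1 = site URL, $2 = optional depth (omit it or pass 0 for the default).
spider_command() {
    url="$1"
    depth="${2:-}"
    case "$depth" in
        '')
            # No depth given: the spider falls back to its default.
            echo "php -f $PHPDIG_DIR/admin/spider.php $url" ;;
        *[!0-9]*)
            # Anything other than digits is rejected.
            echo "depth must be a non-negative integer: $depth" >&2
            return 1 ;;
        *)
            echo "php -f $PHPDIG_DIR/admin/spider.php $url $depth" ;;
    esac
}

spider_command http://host.mydomain.com 5
spider_command http://host.mydomain.com
```

To actually run the spider, replace the `echo` in the two valid branches with the command itself.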
Attached Files
File Type: zip phpdig_customdepth_cgi.zip (23.8 KB, 32 views)
__________________
Grefix