PhpDig.net

Old 10-01-2003, 12:50 PM   #1
jimigisme
Green Mole
 
Join Date: Sep 2003
Posts: 10
Error while spidering

I've got phpdig running from the command line as follows:
C:\php>php.exe -f C:\phpdig\admin\spider.php sites.txt
The spider runs for a bit:
Spidering in progress...
Then:
Notice: Undefined index: path in C:\phpdig\admin\robot_functions.php on line 347
Any ideas would be helpful,
Thank you!
Old 10-01-2003, 06:26 PM   #2
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
What's in sites.txt, i.e., is there a full URL, one per line?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Old 10-02-2003, 08:27 AM   #3
jimigisme
Example of sites.txt
http://www.sitename.com
http://www.sitename.com
http://www.sitename.com
http://www.sitename.com
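A quick way to confirm that every line of sites.txt is a full URL is to run each line through PHP's parse_url() and check for a scheme and a host. This is only a minimal sketch for a current PHP version: check_site_lines() is a hypothetical helper, not part of PhpDig, and the two sample lines stand in for the real file.

```php
<?php
// Hypothetical helper (not PhpDig code): list the lines that are not full URLs.
function check_site_lines(array $lines): array
{
    $problems = [];
    foreach ($lines as $i => $line) {
        $line = trim($line);
        if ($line === '') {
            continue; // ignore blank lines
        }
        $parts = parse_url($line);
        // A full URL needs at least a scheme (http) and a host.
        if ($parts === false || !isset($parts['scheme'], $parts['host'])) {
            $problems[] = ($i + 1) . ': ' . $line;
        }
    }
    return $problems;
}

// Sample input standing in for file('sites.txt').
$problems = check_site_lines(["http://www.sitename.com\n", "www.sitename.com\n"]);
print_r($problems);
```

To check the real file, pass file('sites.txt') instead of the sample array. An empty result means every non-blank line has both a scheme and a host.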
Old 10-05-2003, 09:37 AM   #4
Charter
Where is sites.txt located? Line 347 in robot_functions.php is:
PHP Code:
$path = $newurl["path"];
Could you change that to:
PHP Code:
$path = $newurl["path"];
echo $path . " <--- Path<br><br>";
to see what $path is?
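One possible cause of the notice, assuming $newurl is an array shaped like the result of parse_url() (the thread does not confirm how PhpDig builds it): a URL with nothing after the host name has no "path" key at all. The sketch below shows that case and a guarded read that falls back to "/". It is an illustration for a current PHP version, not a patch to robot_functions.php.

```php
<?php
// Assumption: $newurl has the same shape as a parse_url() result.
$newurl = parse_url('http://www.sitename.com');

// The 'path' key is absent when the URL has nothing after the host.
$has_path = isset($newurl['path']);

// Guarded read: use the root path instead of triggering a notice.
$path = $has_path ? $newurl['path'] : '/';

echo $path . " <--- Path\n";
```

If this is what is happening, adding a trailing slash to the URLs in sites.txt would give parse_url() a path of "/" to return.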
Old 10-06-2003, 08:03 AM   #5
jimigisme
Thank you for your help, here is what I got:

SITE : http://www.welcometoourworldofgifts.com/
Exclude paths :
- @NONE@
/welcome/default.asp <--- Path<br><br>+1:http://www.welcom
(time : 00:00:28)
/welcome/default.asp <--- Path<br><br>/welcome/default.asp
Old 10-06-2003, 02:51 PM   #6
Charter
Hmm... How about trying the following:

In the config file, set the following:
PHP Code:
define('PHPDIG_SESSID_REMOVE',true); // remove SIDS from indexed URLS
define('PHPDIG_SESSID_VAR','mscssid'); // name of the SID variable 
Anyone out there ever have problems indexing *.asp files?
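To illustrate what removing a session ID variable from a URL amounts to, here is a rough sketch for a current PHP version. strip_query_var() is a hypothetical helper, not PhpDig's own implementation, and the sample URL and its mscssid value are made up. It ignores URL fragments for brevity.

```php
<?php
// Hypothetical helper (not PhpDig code): drop one query parameter from a URL.
function strip_query_var(string $url, string $var): string
{
    $pos = strpos($url, '?');
    if ($pos === false) {
        return $url; // no query string, nothing to strip
    }
    $base = substr($url, 0, $pos);
    // Split the query string into an array of name => value pairs.
    parse_str(substr($url, $pos + 1), $params);
    unset($params[$var]);
    // Rebuild the query from whatever parameters remain.
    $query = http_build_query($params);
    return $query === '' ? $base : $base . '?' . $query;
}

// Made-up URL for illustration only.
echo strip_query_var('http://www.sitename.com/welcome/default.asp?mscssid=ABC123&page=2', 'mscssid'), "\n";
```

With the session variable stripped, two links to the same page that differ only in their session ID reduce to the same URL.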