PhpDig.net

PhpDig.net (http://www.phpdig.net/forum/index.php)
-   Troubleshooting (http://www.phpdig.net/forum/forumdisplay.php?f=22)
-   -   Command line spider ignoring "filelist.txt" (http://www.phpdig.net/forum/showthread.php?t=1149)

lighthouse 08-03-2004 06:12 AM

Command line spider ignoring "filelist.txt"
 
I can run a command line spider by doing the following:

>> php - f search.php
The above indexes everything...

However - I want to just update a few sites, so I have a file called "filelist.txt" which contains 2 URLS's (each on a diffent line)...

When I do the following:

>>php - f search.php filelist.txt
It makes no difference... phpdig proceeds to update everything!!!!

Any ideaS?...
I need to resolve this so i can setup a cron....

Also - FYI:
>> php -f search.php http://www.anydomain.com
This works as expected... It's just with specifying a filelist that there is problems.

Thanks in advance
LH

Charter 08-03-2004 11:29 AM

Hi. What version of PhpDig are you using?

lighthouse 08-09-2004 03:31 AM

1.6.x (as far as i can see - taken from "spider.php" file)

lighthouse 08-09-2004 05:08 AM

Quote:

Originally posted by lighthouse
1.6.x (as far as i can see - taken from "spider.php" file)

Update - I upgraded to latest version, and STILL it ignores the filelist.txt and simply begins indexing everything!!!

Please help!!
LH

lighthouse 08-10-2004 05:09 AM

Update:
Have tried and tried with this, but no luck.

I've opted to just run 6 crons (instead of 1), in the format:
php - f path/to/dir/spider.php http://theurliwanttoindex.com

Not pretty, but it works....
Still open to any suggestions if you have any...

LH

Charter 08-15-2004 03:03 PM

Hi. Try updating to 1.8.3.

lighthouse 08-16-2004 05:32 AM

Quote:

Originally posted by Charter
Hi. Try updating to 1.8.3.

Charter - I did upgrade - and still the same result....
:(

Charter 08-17-2004 02:27 PM

Is the tempspider table empty?

lighthouse 08-18-2004 01:01 AM

That seems to have done the trick!!!! :)
Does the Temporary table always need to be empty then?...
Only I see that it's often got entries in it...

LH

Charter 08-18-2004 07:35 AM

Hi. The tempspider table should try and empty after PhpDig is done, but if it doesn't (for example, the spidering process prematurely ends) then just go and empty that table.


All times are GMT -8. The time now is 07:50 AM.

Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.