PhpDig.net

wx3 08-13-2004 09:57 AM

Command line vs. admin indexing
 
Hi all,

I've installed phpdig, and it seems to work fine, except that I can't get it to index regularly via cron or command line.

If I log in through the admin interface and click on the green checkbox, phpdig spends about 2-3 minutes indexing the site. When it's done, all the changes to the site are reflected in the search.

If I run the spider from the command line:
Code:

/usr/local/bin/php -f /usr/www/users/username/search/admin/spider.php forceall

I get this:
39258: old priority 0, new priority 18

Spidering in progress...
-----------------------------
SITE : http://www.mywebsite.com/
Exclude paths :
- @NONE@
No link in temporary table
links found : 0
Optimizing tables...
Indexing complete !

What am I doing wrong?

Thanks much,
Kevin

vinyl-junkie 08-13-2004 05:41 PM

Welcome to the forum, wx3. :D

Try this instead:
Code:

#php -f [PHPDIG_DIR]/admin/spider.php http://host.mydomain.com

Substitute your directory name for [PHPDIG_DIR], and of course your domain name, too.

Let us know how that works for you.

Charter 08-15-2004 03:28 PM

Hi, sounds like maybe you need to add the full path to the if statement in the config file?

wx3 08-16-2004 09:44 AM

Hi,

Thanks to both of you for the suggestions. I'm pretty sure the path is set correctly in that first IF statement in the config file (the one that checks to see if relative_script_path is valid). I tried adding the URL explicitly like vinyl suggested, and this is the result:

Spidering in progress...
-----------------------------
SITE : http://www.mywebsite.com/
Exclude paths :
- @NONE@
1:http://www.mywebsite.com/
(time : 00:00:05)
No link in temporary table
links found : 1
Optimizing tables...
Indexing complete !

It found 1 link instead of 0, but still failed to index the site. Any other suggestions?

Thanks,
Kevin

Charter 08-17-2004 03:00 PM

What search depth and links-per settings did you use? What happens if you now go and update the site?

mirdin 09-01-2004 07:13 AM

How do you set the depth and links-per values from the command line?

Code:

#php -f [PHPDIG_DIR]/admin/spider.php http://host.mydomain.com

If this is not possible, where can I set the default depth and links-per values in config.php?

Spider 09-06-2004 07:03 AM

I seem to have the same problem wx3 has. When I run the spider from the command line with:

Code:

/usr/bin/php -f /home/sites/www.vematex.nl/web/phpdig/admin/spider.php forceall

I get:

Code:

<b>Warning</b>:  Undefined variable:  argv in <b>/home/sites/site6/web/phpdig/admin/spider.php</b> on line <b>36</b><br>
<br>
<b>Warning</b>:  Undefined variable:  argc in <b>/home/sites/site6/web/phpdig/admin/spider.php</b> on line <b>49</b><br>
12114: old priority 0, new priority 18
Bezig met spideren...
-----------------------------
SITE : http://www.vematex.nl/.....etc

I use phpdig 1.8.3

Charter 09-07-2004 04:07 PM

See around line 25 of the spider.php file.
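
The undefined argv/argc warnings above are printed with HTML tags. That can be a sign that the php binary is the CGI build rather than the CLI build. The CLI build always fills in argv and argc, while the CGI build does so only when register_argc_argv is on in php.ini. Below is a rough sketch for checking this, not something taken from the PhpDig docs. The sapi_of helper is a made-up name, and the PHP_BIN default is just the path from the post above.

```shell
#!/bin/sh
# Sketch: report which PHP SAPI a binary is, based on the first line of `php -v`.
# PHP_BIN defaults to the path used in the post above; adjust as needed.
PHP_BIN="${PHP_BIN:-/usr/bin/php}"

# sapi_of (hypothetical helper): classify one `php -v` banner line.
sapi_of() {
    case "$1" in
        *"(cli)"*) echo cli ;;
        *"(cgi"*)  echo cgi ;;      # also matches "(cgi-fcgi)"
        *)         echo unknown ;;
    esac
}

# Only probe the binary if it actually exists on this machine.
if [ -x "$PHP_BIN" ]; then
    echo "SAPI: $(sapi_of "$("$PHP_BIN" -v 2>/dev/null | head -n 1)")"
    # -i prints the effective configuration; show the argv-related setting.
    "$PHP_BIN" -i 2>/dev/null | grep -i 'register_argc_argv' || true
fi
```

If this reports cgi with register_argc_argv off, then either turn the setting on in php.ini or point the cron job at a CLI build of php, if one is installed.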

Spider 09-08-2004 12:31 AM

Hi,

I turned on register_argc_argv on our server. No more 'Undefined variable' warnings.

Instead of 'forceall', I put two spider lines in the crontab, each pointing to a different dir. That works.

For now I'm happy. Thanks!
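
For anyone who wants the same setup as a single cron entry, here is a minimal sketch of a wrapper that runs spider.php once per start URL. It is not the exact crontab used above. The paths, URLs, script name and the DRY_RUN switch are all placeholders or assumptions.

```shell
#!/bin/sh
# Sketch: spider several start URLs one after another, e.g. from cron.
# Paths below are placeholders; set PHP_BIN and PHPDIG_DIR for your server.
PHP_BIN="${PHP_BIN:-/usr/bin/php}"
PHPDIG_DIR="${PHPDIG_DIR:-/path/to/phpdig}"

# Run spider.php once for each URL given as an argument.
# With DRY_RUN=1 the commands are only printed, not executed.
spider_all() {
    for url in "$@"; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            echo "$PHP_BIN -f $PHPDIG_DIR/admin/spider.php $url"
        else
            "$PHP_BIN" -f "$PHPDIG_DIR/admin/spider.php" "$url"
        fi
    done
}

# When called with URLs on the command line, spider each of them.
if [ "$#" -gt 0 ]; then
    spider_all "$@"
fi

# Example crontab entry (hypothetical script name, schedule and URLs):
# 30 3 * * * /path/to/spider_all.sh http://www.example.com/dir1/ http://www.example.com/dir2/
```

Running it by hand with DRY_RUN=1 first shows the commands cron would execute.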

