ARGH - Indexing only ever gets 1 link! HELP
Hi,
I have installed PhpDig on my server, but I cannot figure out why it's not indexing. I have checked the MySQL privileges and file permissions, and looked at the HTTP headers that are sent; they look correct. All I get from any domain is the following:
Spidering in progress...
SITE : http://freebox.mine.nu/
Exclude paths :
- @NONE@
1:http://freebox.mine.nu/
(time : 00:00:00)
No link in temporary table
links found : 1
http://freebox.mine.nu/
Optimizing tables...
Indexing complete !
[Back] to admin interface.
Help!
Regards, Daniel. |
Re: ARGH - Indexing only ever gets 1 link! HELP
Quote:
Please post OS and PHP-Version - it is important ! |
Sorry my foobar
Sorry, I haven't slept in 36 hours...
Running:
Vanilla Kernel 2.4.22
Apache 2.0.47
MySQL 4.0.13-r4
PHP 4.3.3-r2
I have also added the l_time FLOAT field to the database, and changed \n to \r\n in the headers.
Cheers, Dan. |
Also
Also, I have tested on many different domains. For the ones hosted on the same box as PhpDig, my BIND setup returns the internal IP rather than the global one, so that's not causing it.
I'm really confused as to why it's not working... |
Hi. Does anything in this thread help?
|
Nope
I'd already read that one; that was why I pointed out that BIND (the DNS server) has two "views": if an external IP asks about my domain, it gets the 'real' IP; if an internal IP asks about the domain, it gets the internal 192.168.x.x address.
I have tried setting PHPDIG_DEFAULT_INDEX as well. I'm lost! |
Oops, this thread got lost.
Has anything changed? |
Nope
No change. I've had a bit of a further fiddle, to no joy. :(
I ended up installing htdig instead. |
Hi. I can't seem to duplicate the problem. I crawled your site at level one and found nine links. Did anything in this thread help?
|
Same problem here
Same problem here.
I'm running PHP version 4.3.2, BUT the Server API is CGI (could this be a problem?), and it's running on AIX. The output is:
Spidering in progress...
--------------------------------------------------------------------------------
SITE : http://www.kvis.org/
Exclude paths :
- @NONE@
No link in temporary table
--------------------------------------------------------------------------------
links found : 0
...Was recently indexed
Optimizing tables...
Indexing complete !
--------------------------------------------------------------------------------
[Back] to admin interface.
And I've tested tons of sites, with the same result. Can I turn some kind of debugging on? |
Hi. There seems to be a problem with PhpDig and CGI mode, but I currently don't have access to PHP in CGI mode. If someone out there with PHP in CGI mode, who is having problems like those posted above, can offer access, then I could try to locate the problem.
|
Hi. To schade: I purchased a hosting account that runs PHP in CGI mode, and I also set an open_basedir restriction. I was able to crawl several sites using this account without incident, except for your site, where I received the same results as you did. I no longer think there is a CGI mode problem. Rather, the PhpDig problem seems to be related to something site-specific. Are you able to set up a plain demo page without JavaScript on your account and crawl it?
|
I've been digging into robot_functions.php and found the reason for my error: fsockopen() fails.
This small program demonstrates the error:
Code:
<?php
$fp = fsockopen("www.schade.dk", 80, $errno, $errstr, 30); // 30-second connect timeout
if (!$fp) {
    // Connection failed: print the reason reported by fsockopen().
    echo "$errstr ($errno)<br>\n";
} else {
    // Send a minimal request; every header line ends with \r\n.
    fputs($fp, "GET / HTTP/1.0\r\nHost: www.schade.dk\r\n\r\n");
    while (!feof($fp)) {
        echo fgets($fp, 128);
    }
    fclose($fp);
}
?>
It returns this error message:
----
Warning: fsockopen(): php_network_getaddresses: getaddrinfo failed: System error returned in errno (is your IPV6 configuration correct? If this error happens all the time, try reconfiguring PHP using --disable-ipv6 option to configure) in /home/www/php/test.php on line 2
Warning: fsockopen(): unable to connect to www.schade.dk:80 in /home/www/php/test.php on line 2
No such file or directory (2)
----
Hmmm, searching the net I found: http://bugs.php.net/bug.php?id=11058 ... That's all for now, but I'll keep digging :-) |
Hi. When I run your snippet, I get the following output:
Code:
HTTP/1.1 200 OK |
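The fsockopen() warning quoted earlier in the thread reports a getaddrinfo failure, which points at name resolution on the crawling server rather than at the HTTP request. A minimal sketch for separating the two stages might look like the following. The helper names resolve_host(), build_request() and probe() are hypothetical and not part of PhpDig, and gethostbyname() only performs IPv4 lookups, so this is a diagnostic aid under those assumptions, not a fix.

```php
<?php
// Hypothetical helper (not part of PhpDig): resolve a host name up front so
// that a DNS failure can be told apart from a connection failure.
function resolve_host($host)
{
    // gethostbyname() returns its input unchanged when the lookup fails (IPv4 only).
    $ip = gethostbyname($host);
    if ($ip === $host && ip2long($host) === false) {
        return false; // not an IP literal and not resolvable
    }
    return $ip;
}

// Hypothetical helper: build a minimal HTTP/1.0 request with \r\n line endings.
function build_request($host, $path = '/')
{
    return "GET $path HTTP/1.0\r\nHost: $host\r\n\r\n";
}

// Hypothetical helper: report which stage fails (DNS, connect) or the status line.
function probe($host, $port = 80, $timeout = 30)
{
    $ip = resolve_host($host);
    if ($ip === false) {
        return "DNS lookup failed for $host";
    }
    // Connect by IP so fsockopen() does not need to resolve the name again.
    $fp = @fsockopen($ip, $port, $errno, $errstr, $timeout);
    if (!$fp) {
        return "Connect to $ip:$port failed: $errstr ($errno)";
    }
    fwrite($fp, build_request($host));
    $status = fgets($fp, 128); // first response line, e.g. the HTTP status
    fclose($fp);
    return "OK: " . trim((string) $status);
}
```

Running `echo probe('www.schade.dk');` on the affected server should indicate whether the lookup or the connection is the failing step. If the lookup fails there but works elsewhere, the resolver configuration on that machine is the likelier suspect.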