PDA

View Full Version : HELP! Yes..another site not being indexed post.


benjamintr
07-27-2005, 08:18 PM
I've read through as many messages as I could find here, and yet still, no matter what I do, I get this message when trying to index ANY site -- not just the site I'm trying to use PHPDIG on:

SITE : http://www.url.com/
Exclude paths :
- @NONE@
1:http://www.url.com/
(time : 00:00:05)
No link in temporary table

I'm modified the config file, I've changed the link depth and links per page, I've entered in URL's manually. What else can I do at this point? I'm ready to give up. :bang:

Charter
07-29-2005, 08:59 AM
Is allow_url_fopen set to on?

benjamintr
07-31-2005, 12:55 PM
I just checked again. Yes, it's ON.

What's next? ;)

Charter
07-31-2005, 01:16 PM
Are the following directories set to 777 permissions?

> [PHPDIG_DIR]/text_content
> [PHPDIG_DIR]/includes
> [PHPDIG_DIR]/admin/temp

Do you see any error messages onscreen if you uncomment error_reporting(E_ALL); in the config file, and then try to index?

benjamintr
08-01-2005, 01:47 PM
It's a windows server, but I've chmod'd everything.

Unfortunately, I don't see any error messages after uncommenting error reporting.

...?

Charter
08-02-2005, 06:23 PM
Do you see anything in the server access or error log? One thing noticed was that domain.com gets redirected to 123.123.123.123 and this might be an issue, as PhpDig is coded to stay in the domain given for index. You can try changing this by editing the phpdigCompareDomains function in the robot_functions.php file. Also, PhpDig doesn't follow links in 'heavy' JavaScript, but this (http://www.phpdig.net/forum/showthread.php?t=1139) thread might be useful. Check out this (http://www.phpdig.net/forum/showthread.php?t=310) thread too.

benjamintr
08-15-2005, 03:05 PM
I'm a little confused... This happens no matter what URL I tell it to crawl.

Does the phpdig code need to be on the same site as the domain URL?

Charter
08-19-2005, 08:03 AM
No, PhpDig does not need to be on the same server as the domain, but if your OS/setup is, for example, a CGI loadbalanced cluster of servers, it may not possible to index sites on the cluster as there cannot be a connection back to the loadbalanced address. Presumably not every site you've tried is on the same server, so it seems like some sort of setting issue but I'm not sure. Slap in a bunch of echo "A"; echo "B"; etcetera into spider.php and see what prints onscreen. Hopefully that will provide a clue.