PhpDig.net

PhpDig.net (http://www.phpdig.net/forum/index.php)
-   How-to Forum (http://www.phpdig.net/forum/forumdisplay.php?f=33)
-   -   Too many results per site spidered and redirections (http://www.phpdig.net/forum/showthread.php?t=437)

paullind 01-24-2004 12:05 PM

Too many results per site spidered and redirections
 
Hi

Sometimes I spider a site that doesn't exist anymore but it gets forwarded to a default directory with an enormous number of links on them and begins spidering that page.

Here's an example page I enter to spider:
http://www.jacksondiamondkats.com

I am using windows2000 server and entering this page in a bat file like this:
php.exe -f "C:\InetPub\wwwroot\phpdig\admin\spider.php" http://www.jacksondiamondkats.com

I have 3 questions after dealing with this site:

1] Can I prevent my spider from being redirected?
PHPDIG_IN_DOMAIN set to false already

2] Can I limit the number of sites spidered in case there are too many links on a page?
- there is a variable in config.php, called NUMBER_OF_RESULTS_PER_SITE, which I set to 10 for example, but it still tries to spider however many links are on the page above ie >70.

Any recomondations on how to deal with a site like:
http://www.jacksondiamondkats.com

3] I just want to spider a main/homepage for a site and the links from this page only, are these variables set corrrectly?
SPIDER_MAX_LIMIT 1
SPIDER_DEFAULT_LIMIT 1
RESPIDER_LIMIT 1

Any assistance would be appreciated,

Paul L

Charter 01-24-2004 09:04 PM

Hi. For one, perhaps try modifying some code in the phpdigTestUrl function. For two, NUMBER_OF_RESULTS_PER_SITE is for the max number of results to display per site from a search, but perhaps this thread might help. For three, yes a limit of one indexes a given page and links from that page.


All times are GMT -8. The time now is 08:16 AM.

Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.