![]() |
New Exclude Option
greetings!
I am trying to accomplish the spidering of a site which has an event calendar that I would like to index. This site is not my own, so the usual methods for excluding links do not apply. My question is this: how hard would it be to add a "do not follow" rule for links? The site I am spidering has a series of links which follow this pattern: index.htm?cYear=2002 index.htm?cYear=2003 index.htm?cYear=2004 index.htm?cYear=2005 In the interest of not spending a ton of time indexing things that I don't want, and to minimize the intrusion on their site (I am using their bandwidth), I would like to have a rule, perhaps in the config file that basically says: if cYear is not = 2005 then don't follow the link. Any thoughts on this would be appreciated. I am ok with PHP, but not good enough to dig into the code and find where to hard code the exception statement. Once I knew where the statement went, I could code it though... Thanks, -josegringo |
config file...
FORBIDDEN_EXTENSIONS in PhpDig < 1.8.8 FORBIDDEN in PhpDig 1.8.8+ http://www.phpdig.net/forum/showthread.php?t=1684 |
Thanks
Charter,
Thanks for quickly setting me in the right direction. I spent a good part of the day running in circles. I guess I just wasn't searching for the right term in the forum. Cheers, -Joey |
All times are GMT -8. The time now is 01:53 AM. |
Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.