PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > How-to Forum

Reply
 
Thread Tools
Old 10-26-2005, 04:47 AM   #1
Drexel
Green Mole
 
Join Date: Oct 2005
Posts: 2
Not index certain links?

Hello,

how can I configure that certain links are not indexed and contained in the search?

Example: I have a link on one site to mydomain.com/mysite.html, I don't want the spider to follow this link and index it to the search database.

Thanks for help!
Drexel is offline   Reply With Quote
Old 10-26-2005, 10:42 PM   #2
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Add mysite.html to your robots.txt file or click the 'no way' sign for the page from the admin panel. Alternatively, check out the following code in the config file and see this thread.

Code:
// in PhpDig 1.8.8 RC1

// regexp for forbidden links - some links may return text/html mime-type but should not be indexed so forbid them !!!
// you can also expand the FORBIDDEN value by writing a regex to forbid certain links containing 'word' from being indexed
define('FORBIDDEN','\.(js|inc|rm|ico|cab|swf|css|gz|z|tar|zip|tgz|msi|arj|zoo|rar|r[0-9]+|exe|bin|pkg|rpm|deb|bz2)$');

// in PhpDig 1.8.7

// regexp forbidden extensions - return sometimes text/html mime-type !!!
define('FORBIDDEN_EXTENSIONS','\.(rm|ico|cab|swf|css|gz|z|tar|zip|tgz|msi|arj|zoo|rar|r[0-9]+|exe|bin|pkg|rpm|deb|bz2)$');
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 10-27-2005, 09:55 PM   #3
Drexel
Green Mole
 
Join Date: Oct 2005
Posts: 2
Sorry i found out by myself how it works. You only have to create a robots.txt on the root fo the server with the following content:

User-agent: PhpDig
Disallow: /folder/filetoexclude.extension
Drexel is offline   Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
re-index links jjoel How-to Forum 3 10-29-2005 05:19 PM
the same problem - not index all links redlock Troubleshooting 0 12-28-2004 06:36 AM
how to manual insert into base links for index? zaartix How-to Forum 3 12-09-2004 12:36 PM


All times are GMT -8. The time now is 08:36 PM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.