How to dig only 1 page


zaartix
05-09-2004, 11:19 PM
I have already dug my site, but now I need to dig only one page of it. If I paste the link into the URI, PhpDig starts indexing from the beginning.
For example, I have domain.com
and need to dig domain.com/somepage/index.php?action=news

How do I do this?

zaartix
05-09-2004, 11:27 PM
In the manual I found this:

Indexing by command line interface
The script [PHPDIG_DIR]/admin/spider.php can be launched from the shell so as not to overload the web server.

Launching the script :

#php -f [PHPDIG_DIR]/admin/spider.php [option]

List of options :
- all (default) : Update all hosts ;
- forceall : Force update all hosts ;
- http://mondomaine.tld : Add or update the url ;
- path/file : Add or update all urls listed in the given file.

Examples :

#php -f [PHPDIG_DIR]/admin/spider.php http://host.mydomain.com
#php -f [PHPDIG_DIR]/admin/spider.php [File containing an urls list]


But that is for the shell. How do I do this from a URL request?
spider.php?url=http://www.domain.com/somepage/index.php?action=news
is not working :bang:
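
For reference, the shell route from the manual excerpt, applied to the page in question, might look like the sketch below. The PHPDIG_DIR default and the phpdig-urls.txt file name are placeholders, and the one-URL-per-line layout of the list file is an assumption; the excerpt does not say whether the spider stops at that one page or goes on to follow its links. The URL is quoted so the shell leaves the ? and = alone.

```shell
#!/bin/sh
# Assumption: PHPDIG_DIR is your PhpDig install directory; the default is a placeholder.
PHPDIG_DIR="${PHPDIG_DIR:-/path/to/phpdig}"
SPIDER="$PHPDIG_DIR/admin/spider.php"
PAGE_URL='http://domain.com/somepage/index.php?action=news'

# Option "path/file" from the manual: a list file holding the URL.
# Assumption: one URL per line; the file name is a placeholder.
URL_LIST="$PWD/phpdig-urls.txt"
printf '%s\n' "$PAGE_URL" > "$URL_LIST"

if [ -f "$SPIDER" ]; then
    # Option "http://..." from the manual: pass the quoted URL directly.
    php -f "$SPIDER" "$PAGE_URL"
    # Alternative, using the list file instead:
    # php -f "$SPIDER" "$URL_LIST"
else
    echo "spider.php not found; would run: php -f $SPIDER '$PAGE_URL'"
fi
```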

bloodjelly
05-10-2004, 12:48 AM
Hi zaartix -

First, (1) go to the administration panel and highlight the site you want to update. Click "update form" and you'll be taken to a listing of pages already spidered on this URL. (2) Click the right-pointing purple arrow on the left, next to the directory your file is in. (3) Your file should then show up in a list on the right. Click the green arrow there to update it. Sorry if that's confusing, but I think once you see it you'll understand. Good luck.

zaartix
05-10-2004, 01:16 AM
:) I know this way, but the page I need has no link pointing to it, so PhpDig never found a link to that page. I need to add only that one page to the PhpDig database.

bloodjelly
05-10-2004, 01:42 AM
Maybe you could create a robots.txt file to exclude all but the file you want indexed. This thread (http://www.phpdig.net/showthread.php?s=&threadid=55&highlight=robot.txt) might help get you started. :)
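
A minimal sketch of such a file, written from the shell. The directory names are made up for illustration; the original robots.txt convention only has Disallow lines, so "everything but one file" means listing the parts of the site to skip. It assumes the spider reads robots.txt from the site root, so the file would need to be uploaded there.

```shell
#!/bin/sh
# Hypothetical directory names: replace them with the parts of your own site to skip.
cat > robots.txt <<'EOF'
User-agent: *
Disallow: /forum/
Disallow: /archive/
Disallow: /images/
EOF
```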

zaartix
05-10-2004, 01:48 AM
Do you mean that PhpDig can understand robots.txt?

vinyl-junkie
05-10-2004, 06:05 AM
Originally posted by bloodjelly
First, (1) go to the administration panel and highlight the site you want to update. Click "update form" <snip>

Believe it or not, I never knew how to do this. :D I do have a question, though. Yesterday I noticed a misspelling on one of the pages on my site, so I went in and corrected it (a name spelling, from Weir to Wier). When I followed your directions for re-indexing a single URL, I got "duplicate of an existing document!" Any idea why that would happen? It's only a single-word change, but it still seems like the page should get re-indexed.

ibrown
05-11-2004, 01:06 AM
Originally posted by bloodjelly
First, (1) go to the administration panel and highlight the site you want to update. Click "update form" and you'll be taken to a listing of pages already spidered on this URL. ... <snip>

I too am having trouble updating the index for just four pages of a 9000-page Website ... and I have discovered the listing page as described by Bloodjelly.

However, and interestingly, my missing four pages don't show up in this list! There is nothing fanciful about the filenames (msg00118.htm, msg05327.htm, msg05452.htm ...).

Does PHPDig only list those pages which it has indexed, rather than all files in the directory?
:bang:

ibrown
05-11-2004, 01:23 AM
I wrote:
... my missing four pages don't show up in this list! There is nothing fanciful about the filenames (msg00118.htm, msg05327.htm, msg05452.htm ...).

Does PHPDig only list those pages which it has indexed, rather than all files in the directory?

I have just taken the brute force approach for my site: made a temporary Web page containing links to my missing four pages and uploaded it to my site, giving it the filename of an existing page.

I then went to the list in the update interface and clicked the green arrow to update. Bingo, all missing pages now indexed. I then deleted the temporary Web page and restored the original page.

I am now a happy camper. :D
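
For anyone repeating this, the temporary page can be generated from the shell. This is only a sketch: the three file names are the ones given above, the output name temp-links.htm is a placeholder, and the page still has to be uploaded under the name of an existing page as described.

```shell
#!/bin/sh
# Build a throwaway page that links to the pages the spider missed.
# Placeholder output name; upload it under the name of an already-indexed page.
OUT="temp-links.htm"
{
    echo '<html><body>'
    for page in msg00118.htm msg05327.htm msg05452.htm; do
        # One plain link per missing page, relative to the same directory.
        printf '<a href="%s">%s</a><br>\n' "$page" "$page"
    done
    echo '</body></html>'
} > "$OUT"
```

Relative links assume the missing pages sit in the same directory as the page being replaced.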