PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > External Binaries

Reply
 
Thread Tools
Old 09-30-2003, 12:45 PM   #1
chazter
Green Mole
 
Join Date: Sep 2003
Posts: 8
Add PDF files to be indexed

Maybe I'm not reading the documentation correctly but Ill go ahead and ask another question since I cant seem seem to grasp this yet.

I realize that I can index my pdf file if its coded on a link of a page, but how do I index pdf files that isnt coded on a link. For example in my test site:

http://www.ricalliance.org/newrica/news/subnewsarchive.php?ID=1&Title=Filings

This page has list of PDF files within the last 30 days but there is another link called ARCHIVES

If you click on the link ARCHIVES, it will take you to

http://www.ricalliance.org/newrica/news/subnewsarchive.php?ID=1&Title=Filings

On this page a user must specify the how far do they need to go back for a list of PDF files. Once it is specified it will return a results page with the PDF files that fall into that range.

Its those PDF files I would love to have indexed.

The question is how do I do it? They all reside in a specified directory in my website.

I appreciate any assistance and Thanks in advance.
chazter is offline   Reply With Quote
Old 10-01-2003, 06:08 PM   #2
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
If you have access to shell, you could make a text file with the full URL to each PDF, each URL on one line. Otherwise, you should be able to type the full URL to a PDF file into the browser interface to crawl one PDF at a time.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 10-02-2003, 06:54 AM   #3
chazter
Green Mole
 
Join Date: Sep 2003
Posts: 8
Quote:
Originally posted by Charter
If you have access to shell, you could make a text file with the full URL to each PDF, each URL on one line. Otherwise, you should be able to type the full URL to a PDF file into the browser interface to crawl one PDF at a time.
At first your suggestion didnt make sense but after a nights sleep I was able to figure out what you were saying.

What I did was something similar to your solution. In my PHP page I created an array variable that captured all the PDF files and the associate URL. I then linked that variable to a hidden form tag. The phpdig spidered that particular page and was able to index all of my URL's and the associated PDF's from the hidden tag. This way if I ever add to my PDF table, it updates it automatically.

Thanks for pointing my in the right direction.

Have a great day.
chazter is offline   Reply With Quote
Old 10-05-2003, 09:26 AM   #4
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Great, glad it's working.

If you would, could you write up what you did and post it in the Mod Submissions forum in case others might want to try it?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 10-07-2003, 06:43 AM   #5
chazter
Green Mole
 
Join Date: Sep 2003
Posts: 8
Sure thing I hope it makes sense. I posted as Add PDF files to be indexed - Solution
chazter is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
searching PDF files bcunico External Binaries 3 02-24-2006 01:40 AM
Not indexing pdf files jayhawk External Binaries 13 02-18-2004 05:13 AM
How to index a directory with pdf files simonced How-to Forum 3 02-13-2004 10:41 AM
Add PDF files to be indexed - Solution chazter Mod Submissions 0 10-07-2003 06:42 AM
Search PDF files chazter External Binaries 4 10-02-2003 06:47 AM


All times are GMT -8. The time now is 08:07 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.