View Single Post
Old 07-10-2004, 09:01 AM   #8
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. At FooLabs is a mirror to PlanetMirror where you can find compiled versions of pdftotext.

Go to PlanetMirror and download xpdf-3.00-linux.tar.gz (assuming linux is your operating system).

Unzip xpdf-3.00-linux.tar.gz and extract only the pdftotext file (it's already been compiled and is a binary file).

FTP just the pdftotext file in binary mode to your account.

Once the file is over, change its permission to rwxr-xr-x (755 permission).

Now in the PhpDig config file, set the following:
PHP Code:
define('PHPDIG_INDEX_PDF',true); // set to true
define('PHPDIG_PARSE_PDF','/the/full/path/to/pdftotext'); // assuming linux
define('PHPDIG_OPTION_PDF',''); // two single quotes, no space inbetween 
Also be sure to set the following in the PhpDig config file too:
PHP Code:
define('PHPDIG_PDF_EXTENSION','.txt'); // don't forget the period in .txt 
Give PhpDig a whirl and see if it indexes PDF files.

From the admin panel of PhpDig version 1.8.1, just type in the link to a PDF file, and set search depth to zero and set links per to one, to test pdftotext on the one PDF file.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote