PhpDig.net

PhpDig.net (http://www.phpdig.net/forum/index.php)
-   External Binaries (http://www.phpdig.net/forum/forumdisplay.php?f=36)
-   -   A question about pdftotext installation, thanks m(_ _)m (http://www.phpdig.net/forum/showthread.php?t=1845)

mynamesucks 02-15-2005 06:26 PM

A question about pdftotext installation, thanks m(_ _)m
 
Hi,

I am not sure how to install pdftotext on a Linux server.
I have downloaded xpdf-3.00.tar.gz and unpacked it.
Do I have to ask my ISP to run make and make install on the server?
Or do I only need to upload pdftotext.cc to the server?
Thanks in advance.
m (_ _) m

mynamesucks 02-15-2005 08:07 PM

Well, I have now uploaded pdftotext to the server
and set:
define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','/path/cgi-bin/pdftotext');
define('PHPDIG_OPTION_PDF','');

define('PHPDIG_PDF_EXTENSION','.txt');

Then I set the dig URL to "http://www.domain.com/1.pdf"
with depth=1 and link=1, and started the dig.
But spider.php writes nothing to the database.

Can anyone help me?
Thanks
Waiting...

mynamesucks 02-15-2005 10:31 PM

Hello, everyone

I can now convert a PDF file to a text file from the command line,
but I still cannot index PDF files with PhpDig.

The config is:
define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','/path/cgi-bin/pdftotext');
define('PHPDIG_OPTION_PDF','');

define('PHPDIG_PDF_EXTENSION','.txt');

Can anyone tell me what I should do?
m (_ _) m Thanks

mynamesucks 02-16-2005 12:42 AM

I modified the code, and it now displays the following:

Is result test http an array: 1
What is result test http status: PDF

Is result test an array: 1
What is result test status: PDF
Use is executable is set to: 1
Index the pdf is set to: 1
Parse the pdf is set to: /var/www/cgi-bin/xpdf/pdftotext
Does parse pdf exist:
Is parse pdf executable:
1:http://www.mizuno-solution.com/1.pdf
(time : 00:00:05)
No link in temporary table

What does this mean?

Charter 02-17-2005 05:20 AM

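It means that PHP thinks the file doesn't exist, and therefore that it isn't executable either: the "Does parse pdf exist" and "Is parse pdf executable" lines in your output are both empty. Make sure you FTP the compiled pdftotext binary (not the pdftotext.cc source), and set 755 permissions on the directories leading to pdftotext and on pdftotext itself.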
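
The permission check described above can be roughed out from a shell login, if the host provides one. What follows is only a sketch: `check_path` is a hypothetical helper, not part of PhpDig or xpdf, and it looks at the execute bit for "others" as a stand-in for what the web server user can reach. PHP may still be restricted by other server settings that this does not detect.

```shell
#!/bin/sh
# check_path PATH: walk PATH one component at a time and report the first
# component that is missing or lacks the execute bit for "others".
# Hypothetical helper for illustration; not part of PhpDig or xpdf.
check_path() (
    target=$1
    # Absolute paths are rebuilt from "/", relative ones from "./".
    case $target in
        /*) current="" ;;
        *)  current="." ;;
    esac
    set -f      # no filename expansion while splitting
    IFS=/       # split on "/"; local to this subshell
    set -- $target
    for part in "$@"; do
        [ -n "$part" ] || continue      # skip the empty field before a leading "/"
        current="$current/$part"
        if [ ! -e "$current" ]; then
            echo "missing: $current"
            exit 1
        fi
        # Character 10 of the ls mode string is the execute bit for "others".
        mode=$(ls -ld "$current" | cut -c1-10)
        case $mode in
            ?????????[xt]) ;;           # x (or t on a sticky directory) is fine
            *) echo "not executable by others ($mode): $current"
               exit 1 ;;
        esac
    done
    echo "ok: $target"
)
```

Running `check_path /var/www/cgi-bin/xpdf/pdftotext` (the path from the debug output above) prints the first directory or file that needs attention. For a component you own, `chmod 755` on it is the fix already suggested; directories owned by the host would have to be changed by the ISP.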

mynamesucks 02-22-2005 10:00 PM

I figured out the issue.
Thanks
m (_ _) m


All times are GMT -8.
