PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > External Binaries

Reply
 
Thread Tools
Old 04-09-2004, 06:04 AM   #1
DoWn
Green Mole
 
Join Date: Apr 2004
Location: Bordeaux, France
Posts: 3
Unhappy pstotext problem.

Hi. Again a problem trying to index pdf files.

First : the environment

Debian linux running Apache 1.3.26 . PHP 4.1.2.

PHP dig 1.8.0

Succesfully installed pstotext.

In console mode, pstotext runs very well :

The command 'pstotext file.pdf ' display the text contained in the pdf on the screen.


I also tried to redirect output of pstotext in a text file successfully.

phpdig config :

define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','/usr/bin/pstotext');
define('PHPDIG_OPTION_PDF','-cork');

verified (twice) that pstotext is in /usr/bin/ directory

The trouble is the following :

phpdig seems to read correctly pdf files but doensn't index them at all.

help me please.
__________________
Even though he's free, can't fly with these heavy chains
DoWn is offline   Reply With Quote
Old 04-09-2004, 07:24 AM   #2
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. Are the directories to pstotext and the pstotext file itself set to 755 permissions?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 04-10-2004, 01:14 AM   #3
DoWn
Green Mole
 
Join Date: Apr 2004
Location: Bordeaux, France
Posts: 3
Hi.

Thank you for answering so quiclky.

The directories and pstotext file itself are set to 755 rights (rwxr-xr-x)

phpdig reads the pdf files but doesn't index them.

__________________
Even though he's free, can't fly with these heavy chains
DoWn is offline   Reply With Quote
Old 04-10-2004, 12:44 PM   #4
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. Maybe something in this thread will help.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 04-12-2004, 11:19 PM   #5
DoWn
Green Mole
 
Join Date: Apr 2004
Location: Bordeaux, France
Posts: 3
Thumbs up

Hi.

Thank you for your help.

I patched spider.php and robot_functions.php and it seems to be working now.

Phpdig now index some of my pdf.

I still have some problems when trying to index a directory containing only pdf files, but i'm still searching.

Thank you again
__________________
Even though he's free, can't fly with these heavy chains
DoWn is offline   Reply With Quote
Old 04-13-2004, 06:32 PM   #6
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
>> I still have some problems when trying to index a directory containing only pdf files, but i'm still searching.

Hi. Are there links to all these PDF files? As PhpDig follows links, it won't index a standalone directory of files. Also, it seems some PDF files just take too much memory. See this thread for more details.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
problems with pstotext - path-problem? jmeyerdo External Binaries 1 01-26-2006 11:30 AM
How do you install pstotext krugar External Binaries 2 12-08-2004 11:53 AM
problem with pstotext loicoco External Binaries 2 07-16-2004 02:17 AM
pstotext issue killer27 External Binaries 7 05-12-2004 01:28 PM
Pstotext problem!!!!! mario External Binaries 14 03-24-2004 03:16 AM


All times are GMT -8. The time now is 07:29 PM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2019, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.