PhpDig.net

Old 06-18-2004, 09:40 AM   #1
N100101
Green Mole
 
Join Date: Jun 2004
Posts: 3
Indexing PDFs doesn't really work

OS: Linux
PHP Version 4.3.2


**********************************
Spidering in progress...

SITE : http://localhost/
Exclude paths :
- @NONE@

Is result test an array: 1
What is result test status: PDF
Use is executable is set to: 1
Index the pdf is set to: 1
Parse the pdf is set to: /usr/local/bin/pdftotext
Does parse pdf exist: 1
Is parse pdf executable: 1

Command is: /usr/local/bin/pdftotext ../admin/temp/7672.tmp
Result contains: Array ( )
Return value is: 3

1:http://localhost/pub/info/info_st.pdf
(time : 00:00:05)
No link in temporary table

links found : 1
http://localhost/pub/info/info_st.pdf
Optimizing tables...
Indexing complete
**********************************

Indexing via terminal works without any problems.

Any hints?

Thanks in advance.
Old 06-18-2004, 12:46 PM   #2
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Hi. In robot_functions.php try changing:
PHP Code:
$command = PHPDIG_PARSE_PDF.' '.PHPDIG_OPTION_PDF.' '.$tempfile2;
to the following:
PHP Code:
$command = PHPDIG_PARSE_PDF.' '.PHPDIG_OPTION_PDF.' '.$tempfile2.' 2>&1';
And see if it will echo the problem.
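
Why this helps: PHP's exec() only collects what the command writes to standard output, so error messages written to standard error are lost unless they are redirected with 2>&1. Below is a minimal sketch of that idea, assuming a Unix-like shell. The helper name run_and_capture and the demo command are made up for illustration and are not part of PhpDig.

```php
<?php
// Hypothetical helper (not part of PhpDig): run a shell command and
// return its combined stdout/stderr lines together with its exit status.
function run_and_capture($command)
{
    $output = array();
    $status = 0;
    // "2>&1" sends stderr to stdout so exec() can collect error messages too.
    exec($command . ' 2>&1', $output, $status);
    return array('output' => $output, 'status' => $status);
}

// Demo command (an assumption for illustration): it writes only to stderr
// and exits with status 3, similar in shape to the failing pdftotext call.
$result = run_and_capture("sh -c 'echo oops >&2; exit 3'");

echo "Result contains: ";
print_r($result['output']);
echo "Return value is: " . $result['status'] . "\n";
```

Without the redirect, the output array for a command like this would stay empty, which matches the "Result contains: Array ( )" line in the first post.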
Old 06-18-2004, 02:32 PM   #3
N100101
Green Mole
 
Join Date: Jun 2004
Posts: 3
Here is the result:

Command is: /usr/local/bin/pdftotext ../admin/temp/5952.tmp 2>&1
Result contains: Array ( [0] => Error: Bad annotation action [1] => Error: Copying of text from this document is not allowed. )
Return value is: 3

Hm, what does this mean?
Old 06-18-2004, 03:59 PM   #4
N100101
Green Mole
 
Join Date: Jun 2004
Posts: 3
Arrgh, of course: that PDF is protected against copying, so its text cannot be extracted...

I have tested it with another PDF and it works!

Thanks a lot.
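
For anyone landing here later: the return value 3 is consistent with the exit codes listed in the xpdf pdftotext man page, where 3 means an error related to PDF permissions. A rough sketch of turning the status into a readable message follows. The function name describe_pdftotext_status is hypothetical, and the table should be checked against the man page of the pdftotext version actually installed.

```php
<?php
// Hypothetical helper (not part of PhpDig): map a pdftotext exit status
// to a short description. Codes follow the xpdf pdftotext man page;
// verify them against your installed version.
function describe_pdftotext_status($status)
{
    $meanings = array(
        0  => 'No error',
        1  => 'Error opening the PDF file',
        2  => 'Error opening the output file',
        3  => 'Error related to PDF permissions',
        99 => 'Other error',
    );
    return isset($meanings[$status])
        ? $meanings[$status]
        : 'Unknown status ' . $status;
}

// The status reported in this thread.
echo describe_pdftotext_status(3) . "\n";
```

Logging this message next to "Return value is:" would have pointed at the copy protection right away.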