PDA

View Full Version : indexing pdf problems


hetest
01-25-2008, 09:35 AM
Hello,

I installed phpdig successfully and it indexes a local site.
windows 2003 php 4.4.2

However, I also need to index pdfs. I installed pdftotext.exe.
Indexing found the pdfs but failed to open them.

I included all of the suggested debug output in robot_functions.php
It would print out but stopped after showing that pdftotext was executable (=1).

I ran again without the debug and it stopped in the middle of indexing (presumably when it came to a pdf) with no error.

The server log showed that it stopped with the wonderfully helpful error message of "Unknown error".

Any ideas out there?

hetest
01-25-2008, 03:21 PM
Update......

I was able to write a short php script that used the pdftotext.exe to successfully create a plain text version.

So why phpdig cannot do it...I have no idea.

phpdig will work if pdf is not enabled.

If it is, phpdig does not just ignore them...it crashes.