|04-05-2006, 06:18 PM||#1|
Join Date: Feb 2006
Location: Dublin, Georgia USA
pdftotext works! but dies after a few pages
phpdig is great! I've even got it working with pdftotext, somewhat. The problem I'm seeing is that during indexing pdftotext dies after the 2nd or 3rd .pdf file. The first pdf is completely indexed and I can search for unique text that I would only find in it and I'll get good results.
I've tried running pdftotext from the command line on the offending pdfs and get an empty text file. I don't think it's php as I've set the script memory limit pretty high (128mb) and the script timeout to an hour. The biggest hint, though, is that windows tells me that pdftotext.exe has encountered a fatal error and needs to close.
Here's my setup, let me know what other information might be useful:
I know this forum doesn't offer support for external binaries themselves, but maybe others have run into this frequently.
Any help would be greatly appreciated! Thanks in advance,
|Thread||Thread Starter||Forum||Replies||Last Post|
|command line indexing that actually works||carlaron||Troubleshooting||0||11-06-2006 08:48 PM|
|Student who try to works with Msword!||davids211082||External Binaries||1||03-15-2005 09:09 AM|
|How phpdig works with composed keywords ?||julien||How-to Forum||3||03-01-2005 11:12 PM|
|Indexing doesn't works properly||Alysum||Troubleshooting||7||12-16-2004 07:53 PM|
|converted from html pages to php pages now no pages will index!!! help!!||bigals||Troubleshooting||24||04-01-2004 09:34 AM|