More Research
I have looked at the temp files in the text_content dir, and all of the junk that i am getting in the database is in this file. How is the file getting created? When i run the pdftotext in bash everything works just fine, what is the difference?
|