View Single Post
Old 02-16-2005, 02:38 AM   #1
mynamesucks
Green Mole
 
Join Date: Feb 2005
Posts: 8
Can phpdig index Japanese PDF file???

Hi,

I converted a PDF file to TXT file thouth pdftotext in linux commend line.
Following is the commend:
./pdftotext test.pdf test.txt
And then I can get the TXT file.

But when I converted a PDF file written with Japanese, it will occur a problem.
Pdftotext just converted the English and numbers except Japanese.
After converting a Japanese Pdf file, I just got a blank TXT file.

Then I tried to set encode in commend:
./pdftotext -enc Shift-JIS test.pdf test.txt
It will display:
Error: Couldn't find unicodeMap file for the 'Shift-JIS' encoding
Error: Couldn't get text encoding

Can anyone tell me what should I do next?
Thanks indeed.
Waiting for your help!!!
mynamesucks is offline   Reply With Quote