PhpDig.net

PhpDig.net (http://www.phpdig.net/forum/index.php)
-   External Binaries (http://www.phpdig.net/forum/forumdisplay.php?f=36)
-   -   Can phpdig index Japanese PDF file??? (http://www.phpdig.net/forum/showthread.php?t=1847)

mynamesucks 02-16-2005 02:38 AM

Can phpdig index Japanese PDF file???
 
Hi,

I converted a PDF file to TXT file thouth pdftotext in linux commend line.
Following is the commend:
./pdftotext test.pdf test.txt
And then I can get the TXT file.

But when I converted a PDF file written with Japanese, it will occur a problem.
Pdftotext just converted the English and numbers except Japanese.
After converting a Japanese Pdf file, I just got a blank TXT file.

Then I tried to set encode in commend:
./pdftotext -enc Shift-JIS test.pdf test.txt
It will display:
Error: Couldn't find unicodeMap file for the 'Shift-JIS' encoding
Error: Couldn't get text encoding

Can anyone tell me what should I do next?
Thanks indeed.
Waiting for your help!!! :cry: :cry: :cry: :cry: :cry: :cry:

mynamesucks 02-16-2005 02:39 AM

Can phpdig index Japanese PDF file???
 
Hi,

I converted a PDF file to TXT file thouth pdftotext in linux commend line.
Following is the commend:
./pdftotext test.pdf test.txt
And then I can get the TXT file.

But when I converted a PDF file written with Japanese, it will occur a problem.
Pdftotext just converted the English and numbers except Japanese.
After converting a Japanese Pdf file, I just got a blank TXT file.

Then I tried to set encode in commend:
./pdftotext -enc Shift-JIS test.pdf test.txt
It will display:
Error: Couldn't find unicodeMap file for the 'Shift-JIS' encoding
Error: Couldn't get text encoding

Can anyone tell me what should I do next?
Thanks indeed.
Waiting for your help!!! :cry: :cry: :cry:

Charter 02-17-2005 05:27 AM

If you FTP the external binary itself without configuring any options, pdftotext doesn't know what to do with Japanese. See this (in Japanese).

mynamesucks 02-22-2005 09:59 PM

Thanks Charter


All times are GMT -8. The time now is 09:26 PM.

Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.