PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > External Binaries

Reply
 
Thread Tools
Old 02-16-2005, 02:38 AM   #1
mynamesucks
Green Mole
 
Join Date: Feb 2005
Posts: 8
Can phpdig index Japanese PDF file???

Hi,

I converted a PDF file to TXT file thouth pdftotext in linux commend line.
Following is the commend:
./pdftotext test.pdf test.txt
And then I can get the TXT file.

But when I converted a PDF file written with Japanese, it will occur a problem.
Pdftotext just converted the English and numbers except Japanese.
After converting a Japanese Pdf file, I just got a blank TXT file.

Then I tried to set encode in commend:
./pdftotext -enc Shift-JIS test.pdf test.txt
It will display:
Error: Couldn't find unicodeMap file for the 'Shift-JIS' encoding
Error: Couldn't get text encoding

Can anyone tell me what should I do next?
Thanks indeed.
Waiting for your help!!!
mynamesucks is offline   Reply With Quote
Old 02-16-2005, 02:39 AM   #2
mynamesucks
Green Mole
 
Join Date: Feb 2005
Posts: 8
Can phpdig index Japanese PDF file???

Hi,

I converted a PDF file to TXT file thouth pdftotext in linux commend line.
Following is the commend:
./pdftotext test.pdf test.txt
And then I can get the TXT file.

But when I converted a PDF file written with Japanese, it will occur a problem.
Pdftotext just converted the English and numbers except Japanese.
After converting a Japanese Pdf file, I just got a blank TXT file.

Then I tried to set encode in commend:
./pdftotext -enc Shift-JIS test.pdf test.txt
It will display:
Error: Couldn't find unicodeMap file for the 'Shift-JIS' encoding
Error: Couldn't get text encoding

Can anyone tell me what should I do next?
Thanks indeed.
Waiting for your help!!!
mynamesucks is offline   Reply With Quote
Old 02-17-2005, 05:27 AM   #3
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
If you FTP the external binary itself without configuring any options, pdftotext doesn't know what to do with Japanese. See this (in Japanese).
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 02-22-2005, 09:59 PM   #4
mynamesucks
Green Mole
 
Join Date: Feb 2005
Posts: 8
Thanks Charter
mynamesucks is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
phpdig not index file if name contain "!" or ' loic@83 Troubleshooting 0 12-14-2007 11:32 AM
can phpdig index PDF server-side served from php? Sybolt How-to Forum 1 02-18-2005 12:16 PM
can't index pdf using pdftotext rom External Binaries 22 08-27-2004 04:11 PM
Install phpdig in a file named phpdig doesn't work Sansnom Script Installation 1 05-09-2004 03:13 PM
How to index a directory with pdf files simonced How-to Forum 3 02-13-2004 10:41 AM


All times are GMT -8. The time now is 01:34 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.