PhpDig.net

11-14-2006, 03:40 AM   #1
DeSoto
Green Mole
 
Join Date: Nov 2006
Posts: 2
Swedish characters in catdoc

Hi!
I'm about to develop a site that has a search engine, and I've been looking at PhpDig, which seems really nice! I saw that it uses catdoc to extract text from MS Word files, so I started testing it, but it doesn't seem to work.

The two lines:
define('PHPDIG_PARSE_MSWORD','W:\www\catdoc\catdoc.exe');
define('PHPDIG_OPTION_MSWORD','-s 8859-1');

are in my config.php, but some characters are not translated correctly. If I test it with a Word file I created, "ä" becomes "d", "ö" becomes "tz" and "å" becomes "e", and in RTF files they become other weird characters. I've spent the last few hours googling it, but I can't make it work. Am I missing something? I've read about character substitution in catdoc, but I really don't know how to do it.

All help is appreciated!
11-16-2006, 01:21 AM   #2
DeSoto
Green Mole
 
Join Date: Nov 2006
Posts: 2
Ah, stupid me! You just edit the charsets/ascii.rpl file so that "ö" gets replaced with something unique, which you can then swap back to the real character before you insert the text into the database.
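
For anyone wondering what the second half of that looks like, here is a minimal sketch of the step that turns the unique tokens back into real characters before the text goes into the database. It is only an illustration: the token names ([[a-uml]] and so on) and the function restore_placeholders() are made up, they assume ascii.rpl has already been edited to emit exactly those tokens, and the byte values assume the database expects ISO-8859-1. Where in PhpDig the call belongs is not covered here.

```php
<?php
// Hypothetical tokens: assumed to be what the edited charsets/ascii.rpl
// now outputs instead of each Swedish letter. Any string works as long as
// it can never occur in real document text.
$placeholders = array(
    '[[a-uml]]'  => "\xE4", // ä in ISO-8859-1
    '[[o-uml]]'  => "\xF6", // ö
    '[[a-ring]]' => "\xE5", // å
    '[[A-uml]]'  => "\xC4", // Ä
    '[[O-uml]]'  => "\xD6", // Ö
    '[[A-ring]]' => "\xC5", // Å
);

// Swap every token in the extracted text for the character it stands for.
// strtr() with an array does all replacements in one pass and does not
// rescan text it has already replaced.
function restore_placeholders($text, $map)
{
    return strtr($text, $map);
}

// Example: text as it might come out of catdoc after editing ascii.rpl.
$extracted = 'sm[[o-uml]]rg[[a-ring]]s';
$clean = restore_placeholders($extracted, $placeholders);
// $clean now holds the ISO-8859-1 bytes and can be passed on for indexing.
```

If the database uses another encoding such as UTF-8, the values in the array would need to be the matching byte sequences for that encoding instead.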
11-21-2006, 05:47 AM   #3
JohanLM
Green Mole
 
Join Date: Feb 2006
Posts: 2
Can you explain that in Swedish... erhm... English...
Just tell me what you did. :p

How do I replace it with something "unique"?