Old 12-08-2004, 03:12 AM   #1
xperienss
catdoc problem with WinXP

Hi all

I am using phpdig 1.8.4 on WinXP (Windows NT SERVER 5.1 build 2600) with EasyPHP 1.7 (PHP Version 4.3.3).

I am trying to index .doc files (to start with) with the spider but so far no luck...

When I run catdoc from the command line, I get this:
---
catdoc ./test.doc
Banane
Fruit
Abricot
---
Those are the words in my .doc file, so I guess catdoc.exe is working.
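
The command-line test only shows that catdoc works from an interactive shell; the web server's PHP may behave differently. A check from PHP itself might look like the sketch below. `run_tool()` is a made-up helper name, not a phpdig function, and the catdoc call in the comment uses a placeholder path for the test file.

```php
<?php
// Hypothetical helper: run an external command the way a PHP script would,
// and return its output lines together with the exit status.
function run_tool($command)
{
    $output = array();
    $status = -1;
    // exec() appends each output line to $output and stores the exit code in $status.
    // 2>&1 folds error messages into the captured output.
    exec($command . ' 2>&1', $output, $status);
    return array('output' => $output, 'status' => $status);
}

// Harmless demo command. For the real check, replace it with something like
// 'D:/serveur/www/moteur/catdoc/catdoc.exe path/to/test.doc' (placeholder file path).
$result = run_tool('echo hello');
print_r($result);
```

If the words from the document show up in `output` and `status` is 0, PHP can run the tool; an empty `output` or a non-zero `status` would point at the path or at permissions.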

But when I try to index the file using phpdig, here is what I get:
---
SITE : http://server/
Chemins exclus :
- @NONE@
1:http://server/moteur/catdoc/test.doc
(temps : 00:00:07)
Pas de liens dans la table temporaire
liens trouvés : 1
http://server/moteur/catdoc/test.doc
Optimizing tables...
Indexation terminée !
---
It looks like it is not indexing that file.

Here is my config file:

PHP Code:
define('LIMIT_DAYS',0);                 //default days before reindex a page

//---------EXTERNAL TOOLS SETUP
// if set to true is_executable used - set to '0' if is_executable is undefined
define('USE_IS_EXECUTABLE_COMMAND','0'); //use is_executable for external binaries

// if set to true, full path to external binary required
define('PHPDIG_INDEX_MSWORD',true);
//define('PHPDIG_PARSE_MSWORD','D:\\serveur\\www\\moteur\\catdoc\\catdoc.exe');
//define('PHPDIG_PARSE_MSWORD','D:\serveur\www\moteur\catdoc\catdoc.exe');
//define('PHPDIG_PARSE_MSWORD','D:\\\\serveur\\\\www\\\\moteur\\\\catdoc\\\\catdoc.exe');
define('PHPDIG_PARSE_MSWORD','D:/serveur/www/moteur/catdoc/catdoc.exe');
define('PHPDIG_OPTION_MSWORD','');

define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','D:\\serveur\\www\\moteur\\catdoc\\pdftotext.exe');
define('PHPDIG_OPTION_PDF','');

define('PHPDIG_INDEX_MSEXCEL',true);
define('PHPDIG_PARSE_MSEXCEL','D:\\serveur\\www\\moteur\\catdoc\\xls2csv.exe');
define('PHPDIG_OPTION_MSEXCEL','');

define('PHPDIG_INDEX_MSPOWERPOINT',false);
define('PHPDIG_PARSE_MSPOWERPOINT','/usr/local/bin/ppt2text');
define('PHPDIG_OPTION_MSPOWERPOINT',''); 
---
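
About the commented-out path variants in the config: in a single-quoted PHP string, `\\` collapses to one backslash and a lone `\` before an ordinary letter is kept as is, so the first two spellings should give the same string, while the third ends up with doubled backslashes. A small sketch to compare them, assuming nothing beyond the paths already shown (the `$variants` array and its labels are only for illustration):

```php
<?php
$variants = array(
    'escaped'   => 'D:\\serveur\\www\\moteur\\catdoc\\catdoc.exe',
    'bare'      => 'D:\serveur\www\moteur\catdoc\catdoc.exe',
    'quadruple' => 'D:\\\\serveur\\\\www\\\\moteur\\\\catdoc\\\\catdoc.exe',
    'forward'   => 'D:/serveur/www/moteur/catdoc/catdoc.exe',
);

foreach ($variants as $label => $path) {
    // file_exists() shows whether PHP itself can see the binary at that path.
    echo $label, ': ', $path, ' => ', (file_exists($path) ? 'found' : 'not found'), "\n";
}

// Compare the first two spellings after PHP has parsed them.
var_dump($variants['escaped'] === $variants['bare']);
```

Running this on the server itself should show which spelling, if any, PHP resolves to the real catdoc.exe.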
PHP INFO :
Safe_mode OFF
allow_url_fopen ON
---

robot_functions.php :
PHP Code:
case 'MSWORD':
$usetool = true;
//$command = PHPDIG_PARSE_MSWORD.' '.PHPDIG_OPTION_MSWORD.' '.$tempfile2;
$command = PHPDIG_PARSE_MSWORD.' '.PHPDIG_OPTION_MSWORD.' '.$tempfile2.' 2>&1';
break;
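
To see exactly what string ends up being handed to the shell, a rough sketch like the following could help. `build_parse_command()` is a hypothetical helper, not part of phpdig; it mirrors the concatenation above but quotes the file argument with `escapeshellarg()` so a temp path containing spaces stays in one piece. The temp file path is invented for the example.

```php
<?php
// Hypothetical helper (not part of phpdig): builds the same kind of command
// as the MSWORD case above, with the file argument quoted for the shell.
function build_parse_command($binary, $options, $file)
{
    $parts = array($binary);
    // Skip the options part when it is empty, to avoid a stray double space.
    if (trim($options) !== '') {
        $parts[] = trim($options);
    }
    $parts[] = escapeshellarg($file);
    // 2>&1 sends stderr to stdout so error messages appear in the captured output.
    return implode(' ', $parts) . ' 2>&1';
}

// Assumed values for illustration only; the temp file path is made up.
$command = build_parse_command('D:/serveur/www/moteur/catdoc/catdoc.exe', '', 'D:/temp/test.doc');
echo $command, "\n";
```

Pasting the printed command into a command prompt is a quick way to check whether it behaves the same as the manual catdoc test.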
Is there anything else I can try to make it work?
Thanks for your help...