12-15-2005, 01:58 PM   #1
navanick
catdoc catppt xls2csv problems PLEASE HELP

I have installed PHPDig on:

WinXP Pro
PHP Version 5.0.4

I installed PHP, Apache, and MySQL using the WAMP package from e-novative available at:

http://www.e-novative.info/software/wamp.php

which installs Apache as a service.

My problem is that PHPDig is not indexing Office files (.doc, .xls, .ppt), although it indexes PDF files just fine.

Also, when I run the catdoc binaries (catdoc, catppt, xls2csv) from the command prompt, they display the file contents correctly.

I have already gone through the thread at http://www.phpdig.net/forum/showthread.php?t=799

Below is the output for the office file indexing:

Is result test http an array: 1
What is result test http status: MSWORD

Is result test an array: 1
What is result test status: MSWORD
Use is executable is set to: 0
Index the pdf is set to: 1
Parse the pdf is set to: C:\binaries\pdf\pdftotext.exe
Does parse pdf exist: 1
Is parse pdf executable: 1

Command is: C:\binaries\catdoc\catdoc.exe -s 8859-1 -a ../admin/temp/74465382.tmp 2>&1
Result contains: Array ( [0] => The system cannot execute the specified program. )
Return value is: 1

9:http://d-slc-10329/test/test.doc
(time : 00:01:03)



Is result test http an array: 1
What is result test http status: MSEXCEL

Is result test an array: 1
What is result test status: MSEXCEL
Use is executable is set to: 0
Index the pdf is set to: 1
Parse the pdf is set to: C:\binaries\pdf\pdftotext.exe
Does parse pdf exist: 1
Is parse pdf executable: 1

Command is: C:\binaries\catdoc\xls2csv.exe ../admin/temp/92277562.tmp 2>&1
Result contains: Array ( [0] => The system cannot execute the specified program. )
Return value is: 1

4:http://d-slc-10329/test/test.xls
(time : 00:00:27)



Is result test http an array: 1
What is result test http status: MSPOWERPOINT

Is result test an array: 1
What is result test status: MSPOWERPOINT
Use is executable is set to: 0
Index the pdf is set to: 1
Parse the pdf is set to: C:\binaries\pdf\pdftotext.exe
Does parse pdf exist: 1
Is parse pdf executable: 1

Command is: C:\binaries\catdoc\catppt.exe ../admin/temp/49149262.tmp 2>&1
Result contains: Array ( [0] => The system cannot execute the specified program. )
Return value is: 1

5:http://d-slc-10329/test/test.ppt
(time : 00:00:35)


Below is the output for the PDF file indexing:

Is result test http an array: 1
What is result test http status: PDF

Is result test an array: 1
What is result test status: PDF
Use is executable is set to: 0
Index the pdf is set to: 1
Parse the pdf is set to: C:\binaries\pdf\pdftotext.exe
Does parse pdf exist: 1
Is parse pdf executable: 1

Command is: C:\binaries\pdf\pdftotext.exe ../admin/temp/84798822.tmp 2>&1
Result contains: Array ( )
Return value is: 0

7:http://d-slc-10329/test/test.pdf
(time : 00:00:53)
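
To reproduce the failure outside PHPDig, the same command can be run from a standalone PHP script under Apache. This is only a minimal sketch: it assumes PHPDig builds the command as binary, options, temp file and "2>&1" and runs it through PHP's exec(), which is what the "Command is / Result contains / Return value is" lines above suggest but which I have not confirmed in the PHPDig source. The function names build_parser_command and run_parser_command are made up for this example.

```php
<?php
// Hypothetical helpers, NOT part of PHPDig. They imitate the command shown
// in the debug output so it can be tested in isolation under the web server.

// Build "binary [options] file 2>&1", skipping the options when empty.
function build_parser_command($binary, $options, $file) {
    return trim($binary . ' ' . $options) . ' ' . $file . ' 2>&1';
}

// Run the command with exec() and return what the debug output reports:
// the command string, the captured output lines, and the return value.
function run_parser_command($command) {
    $output = array();
    $retval = -1;
    exec($command, $output, $retval);
    return array(
        'command' => $command,
        'output'  => $output,
        'retval'  => $retval,
    );
}

// Example (paths taken from the post; adjust the temp file to a real .doc):
// $cmd = build_parser_command('C:\binaries\catdoc\catdoc.exe', '-s 8859-1 -a', 'test.doc');
// print_r(run_parser_command($cmd));
```

If this script fails with the same "The system cannot execute the specified program." message when requested through Apache but works when run with the PHP command-line binary, that would point at the environment of the Apache service rather than at PHPDig itself.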


Below is my config.php script:

//---------EXTERNAL TOOLS SETUP
// if set to true is_executable used - set to '0' if is_executable is undefined
define('USE_IS_EXECUTABLE_COMMAND','0'); //use is_executable for external binaries

// if set to true, full path to external binary required
define('PHPDIG_INDEX_MSWORD',true);
define('PHPDIG_PARSE_MSWORD','C:\binaries\catdoc\catdoc.exe');
define('PHPDIG_OPTION_MSWORD','-s 8859-1');

define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','C:\binaries\pdf\pdftotext.exe');
define('PHPDIG_OPTION_PDF','');

define('PHPDIG_INDEX_MSEXCEL',true);
define('PHPDIG_PARSE_MSEXCEL','C:\binaries\catdoc\xls2csv.exe');
define('PHPDIG_OPTION_MSEXCEL','');

define('PHPDIG_INDEX_MSPOWERPOINT',true);
define('PHPDIG_PARSE_MSPOWERPOINT','C:\binaries\catdoc\catppt.exe');
define('PHPDIG_OPTION_MSPOWERPOINT','');

//---------EXTERNAL TOOLS EXTENSIONS
// if external binary is not STDOUT or different extension is needed
// for example, use '.txt' if external binary writes to filename.txt
define('PHPDIG_MSWORD_EXTENSION','');
define('PHPDIG_PDF_EXTENSION','.txt');
define('PHPDIG_MSEXCEL_EXTENSION','');
define('PHPDIG_MSPOWERPOINT_EXTENSION','');
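
The debug output above only reports the "exist" and "executable" checks for pdftotext.exe, even when the file being indexed is a Word, Excel or PowerPoint document. A minimal sketch for running the same two checks against every configured parser path follows. The function name check_parsers is made up for this example, and it assumes is_executable() is available in this PHP build (the config comment above warns that it may not be).

```php
<?php
// Hypothetical helper, NOT part of PHPDig: reports whether each configured
// parser path exists and whether PHP considers it executable.
function check_parsers($parsers) {
    $report = array();
    foreach ($parsers as $name => $path) {
        $report[$name] = array(
            'exists'     => file_exists($path),
            'executable' => is_executable($path),
        );
    }
    return $report;
}

// Example, using the paths from the config above:
// print_r(check_parsers(array(
//     'MSWORD'       => 'C:\binaries\catdoc\catdoc.exe',
//     'PDF'          => 'C:\binaries\pdf\pdftotext.exe',
//     'MSEXCEL'      => 'C:\binaries\catdoc\xls2csv.exe',
//     'MSPOWERPOINT' => 'C:\binaries\catdoc\catppt.exe',
// )));
```

These checks only confirm that the files are where the config says they are. They would not catch a binary that exists but cannot start, for example because a library it depends on is not found when launched from the Apache service.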


I will be forever grateful if you can help me get this set up. Thanks in advance for any help!