I have installed PHPDig on:
WinXP Pro
PHP Version 5.0.4
I installed PHP, Apache, and MySQL using the WAMP package from e-novative available at:
http://www.e-novative.info/software/wamp.php
which installs Apache as a service.
My problem with PHPDig is that it is not indexing office files(.doc, .xls, .ppt). However, it is indexing PDF files just fine.
Also, when I run the catdoc(catdoc catppt xls2csv) binaries from the command prompt they do display the file output fine.
I did reference the post of
http://www.phpdig.net/forum/showthread.php?t=799
Below is the output for the office file indexing:
Is result test http an array: 1
What is result test http status: MSWORD
Is result test an array: 1
What is result test status: MSWORD
Use is executable is set to: 0
Index the pdf is set to: 1
Parse the pdf is set to: C:\binaries\pdf\pdftotext.exe
Does parse pdf exist: 1
Is parse pdf executable: 1
Command is: C:\binaries\catdoc\catdoc.exe -s 8859-1 -a ../admin/temp/74465382.tmp 2>&1
Result contains: Array ( [0] => The system cannot execute the specified program. )
Return value is: 1
9:http://d-slc-10329/test/test.doc
(time : 00:01:03)
Is result test http an array: 1
What is result test http status: MSEXCEL
Is result test an array: 1
What is result test status: MSEXCEL
Use is executable is set to: 0
Index the pdf is set to: 1
Parse the pdf is set to: C:\binaries\pdf\pdftotext.exe
Does parse pdf exist: 1
Is parse pdf executable: 1
Command is: C:\binaries\catdoc\xls2csv.exe ../admin/temp/92277562.tmp 2>&1
Result contains: Array ( [0] => The system cannot execute the specified program. )
Return value is: 1
4:http://d-slc-10329/test/test.xls
(time : 00:00:27)
Is result test http an array: 1
What is result test http status: MSPOWERPOINT
Is result test an array: 1
What is result test status: MSPOWERPOINT
Use is executable is set to: 0
Index the pdf is set to: 1
Parse the pdf is set to: C:\binaries\pdf\pdftotext.exe
Does parse pdf exist: 1
Is parse pdf executable: 1
Command is: C:\binaries\catdoc\catppt.exe ../admin/temp/49149262.tmp 2>&1
Result contains: Array ( [0] => The system cannot execute the specified program. )
Return value is: 1
5:http://d-slc-10329/test/test.ppt
(time : 00:00:35)
Below is the output for the PDF file indexing:
Is result test http an array: 1
What is result test http status: PDF
Is result test an array: 1
What is result test status: PDF
Use is executable is set to: 0
Index the pdf is set to: 1
Parse the pdf is set to: C:\binaries\pdf\pdftotext.exe
Does parse pdf exist: 1
Is parse pdf executable: 1
Command is: C:\binaries\pdf\pdftotext.exe ../admin/temp/84798822.tmp 2>&1
Result contains: Array ( )
Return value is: 0
7:http://d-slc-10329/test/test.pdf
(time : 00:00:53)
Below is my config.php script:
//---------EXTERNAL TOOLS SETUP
// if set to true is_executable used - set to '0' if is_executable is undefined
define('USE_IS_EXECUTABLE_COMMAND','0'); //use is_executable for external binaries
// if set to true, full path to external binary required
define('PHPDIG_INDEX_MSWORD',true);
define('PHPDIG_PARSE_MSWORD','C:\binaries\catdoc\catdoc.exe');
define('PHPDIG_OPTION_MSWORD','-s 8859-1');
define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','C:\binaries\pdf\pdftotext.exe');
define('PHPDIG_OPTION_PDF','');
define('PHPDIG_INDEX_MSEXCEL',true);
define('PHPDIG_PARSE_MSEXCEL','C:\binaries\catdoc\xls2csv.exe');
define('PHPDIG_OPTION_MSEXCEL','');
define('PHPDIG_INDEX_MSPOWERPOINT',true);
define('PHPDIG_PARSE_MSPOWERPOINT','C:\binaries\catdoc\catppt.exe');
define('PHPDIG_OPTION_MSPOWERPOINT','');
//---------EXTERNAL TOOLS EXTENSIONS
// if external binary is not STDOUT or different extension is needed
// for example, use '.txt' if external binary writes to filename.txt
define('PHPDIG_MSWORD_EXTENSION','');
define('PHPDIG_PDF_EXTENSION','.txt');
define('PHPDIG_MSEXCEL_EXTENSION','');
define('PHPDIG_MSPOWERPOINT_EXTENSION','');
I will be forever grateful if you can help me get this setup. Thanks in advance for any help!