PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > External Binaries

Reply
 
Thread Tools
Old 04-01-2004, 10:58 PM   #1
Kylord
Green Mole
 
Join Date: Mar 2004
Posts: 2
no indexing with catdoc and xls2csv

hello,

well I have a problem with catdoc and xls2csv (on linux system) :
I have set correctly the path, and change all variables in config.php like this :

define('PHPDIG_INDEX_MSWORD',true);
define('PHPDIG_PARSE_MSWORD','/usr/local/httpd/cgi-bin/catdoc');
define('PHPDIG_OPTION_MSWORD','-s 8859-1');

define('PHPDIG_INDEX_MSEXCEL',true);
define('PHPDIG_PARSE_MSEXCEL','/usr/local/httpd/cgi-bin/xls2csv');
define('PHPDIG_OPTION_MSEXCEL','');

but phpdig doesn't index these files.
it's strange because with pdf (with pdftotext) , I have no problem

I wonder if it is because catdoc and xls2csv return on STDOUT
whereas pdftotext writes in a file .txt.

the path of catdoc and xls2csv are symbolic links but I don't think it's a problem. By the way, when I execute catdoc on line command, it works normally.

What can I do to solve this problem ?
Kylord is offline   Reply With Quote
Old 04-05-2004, 01:23 AM   #2
Kylord
Green Mole
 
Join Date: Mar 2004
Posts: 2
well i've added some lines in robot_functions.php to find the problem. here they are :

echo $command . "<br>"; // try running this from shell in admin dir
print_r($result); // holds the output sent to STDOUT
echo "<br>" . $retval; // is zero if command succeeded

and when spidering comes through a pdf, a doc or a xls file, I can read this :
/usr/opt/www/juju/catdoc/bin/catdoc -s 8859-1 ../admin/temp/53771962.tmp
Array ( )

so it appears it returns nothing, even for pdf files (i had believed that it works for pdf files because of the green quote but actually it seems it doesn't)
its very strange, maybe the temp files aren't created ?
Kylord is offline   Reply With Quote
Old 04-09-2004, 07:19 AM   #3
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. What version of PHP? Perhaps try this thread.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
catdoc catppt xls2csv problems PLEASE HELP navanick External Binaries 0 12-15-2005 01:58 PM
catdoc not indexing all files brianread External Binaries 1 11-30-2005 08:14 AM
PDF and CATDOC indexing chrisdgreen External Binaries 7 11-01-2005 02:50 PM
catdoc and xls2csv not indexing greener_02445 External Binaries 14 04-13-2004 07:33 PM
catdoc & xls2csv binaries Hitman External Binaries 1 01-13-2004 09:52 AM


All times are GMT -8. The time now is 07:22 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.