spider.php blocked when indexing
Hello,
I've installed PhpDig v.1.8.8 with EasyPHP on Windows, and I would like to index PDF files. I've added the three pieces of code from "read me before...". When I try to index PDF files, the spider stops after this output:
SITE : http://192.168.1.28/
Chemins exclus [Excluded paths] :
- @NONE@
Is result test http an array: 1
What is result test http status: HTML
Is result test an array: 1
What is result test status: HTML
Use is executable is set to: 1
Index the pdf is set to: 1
Parse the pdf is set to: //.../phpdig/xpdf/pdftotext.exe
Does parse pdf exist: 1
Thanks for your help. |
When I comment out this line:
//echo "Is parse pdf executable: " . is_executable(PHPDIG_PARSE_PDF) . "<br>";
I get this result:
SITE : http://192.168.1.28/
Chemins exclus [Excluded paths] :
- @NONE@
Is result test http an array: 1
What is result test http status: PDF
Is result test an array: 1
What is result test status: PDF
Use is executable is set to: 1
Index the pdf is set to: 1
Parse the pdf is set to: //.../phpdig/xpdf/pdftotext.exe
Does parse pdf exist: 1
Command is: //.../phpdig/xpdf/pdftotext.exe ../admin/temp/69288482.tmp 2>&1
Result contains: Array ( [0] => Error: Couldn't open file '../admin/temp/69288482.tmp' )
Return value is: 1
1:http://192.168.1.28/espace-dpi/directives/dir117.pdf
(temps : 00:00:01)
Pas de liens dans la table temporaire [No links in the temporary table]
Also, the temp directory contains a file named 69288481.tmp (1 KB), not 69288482.tmp. |
What did you set in the config file for the following?
|
define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','\\\\..\\..\\phpdig\\xpdf\\pdftotext.exe');
define('PHPDIG_OPTION_PDF','');
define('PHPDIG_PDF_EXTENSION','.txt'); |
Did "Is parse pdf executable" come out as zero, blank, or one? If it was zero or blank, try setting the PHPDIG_PARSE_PDF constant in the config file to the full server path instead of a relative path. Also, if you are not using PHP5, set the USE_IS_EXECUTABLE_COMMAND constant in the config file to the number zero.
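To see how PHP itself resolves the parser path, a small standalone check can help. The following is only a sketch: `describe_parser_path()` is a hypothetical helper, not part of PhpDig, and it assumes `is_executable()` may be missing on older Windows builds of PHP, so that call is guarded.

```php
<?php
// Hypothetical helper (not PhpDig code): reports how PHP sees a parser path.
function describe_parser_path($path)
{
    $report = array(
        'path'     => $path,
        'exists'   => file_exists($path),
        'is_file'  => is_file($path),
        'realpath' => realpath($path), // false when the path cannot be resolved
    );
    // Assumption: is_executable() may be unavailable on Windows before PHP 5,
    // so guard the call instead of letting the script halt.
    $report['executable'] = function_exists('is_executable')
        ? is_executable($path)
        : null;
    return $report;
}

// Usage: only runs when the PhpDig config has already defined the constant.
if (defined('PHPDIG_PARSE_PDF')) {
    print_r(describe_parser_path(PHPDIG_PARSE_PDF));
}
```

If `realpath` comes back as false, the path in the config file is probably not one PHP can resolve from the server's point of view.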
|
I'm not using PHP5, so I set define('USE_IS_EXECUTABLE_COMMAND','0'); and "Is parse pdf executable" comes out blank.
I get the output shown in my second post when I comment out this line:
//echo "Is parse pdf executable: " . is_executable(PHPDIG_PARSE_PDF) . "<br>";
PHPDIG_PARSE_PDF is already a full server path (I'm not working on the server machine). |
Try running pdftotext.exe dir117.pdf from the command prompt. Does it work? Are you able to index non-PDF files or HTML pages?
|
pdftotext.exe runs fine from the DOS command prompt; it only fails when called from PHP. When I run a .bat file from PHP, a DOS window opens and closes, but no .txt file is created.
|
It doesn't work for DOC or XLS files with catdoc or antiword either. Only indexing of HTML pages works fine.
|
If HTML pages are indexed, but not DOC, PDF, PPT, or XLS files, then it seems that EasyPHP might not be allowing the PHP exec function:
Code:
exec($command,$result,$retval);
Code:
<?php |
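One way to check whether exec is usable is sketched below. This is not the script from the reply above, which is cut off in the thread. The names `exec_is_disabled()` and `try_command()` are hypothetical helpers, and the sketch assumes the only restriction worth checking is the `disable_functions` setting in php.ini.

```php
<?php
// Hypothetical diagnostic (not PhpDig code): is exec() usable from this PHP setup?
function exec_is_disabled()
{
    if (!function_exists('exec')) {
        return true;
    }
    // disable_functions is a comma-separated list taken from php.ini
    $disabled = array_map('trim', explode(',', (string) ini_get('disable_functions')));
    return in_array('exec', $disabled);
}

// Runs a command the same way as the exec() call quoted above and returns
// both the captured output lines and the return value.
function try_command($command)
{
    $result = array();
    $retval = null;
    exec($command . ' 2>&1', $result, $retval);
    return array('output' => $result, 'retval' => $retval);
}

if (exec_is_disabled()) {
    echo "exec() is disabled in this PHP configuration\n";
} else {
    // A harmless command that should work in both cmd.exe and a Unix shell
    print_r(try_command('echo phpdig-test'));
}
```

If the harmless command succeeds but pdftotext.exe still fails, the problem is more likely in the command line or file paths passed to the parser than in exec itself.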