
Shell command no indexing


noel
10-26-2005, 03:47 PM
Hello,

If I run:

php -f spider.php http://www.a site.com

the spider finds links.

If I run

php -f spider.php forceall

to force an update of the whole database, it doesn't find any links. I get @NONE@ for all the sites.


What do I have to do?

Thank you

Noël

P.S.: When you come back, you will have work ;)))) :santa:

Charter
10-26-2005, 11:23 PM
The @NONE@ refers to what PhpDig finds in a site's robots.txt file; it is listed under the excluded paths when there are none to exclude. Try setting up the shell index so PhpDig runs in the background and logs its output to a file. There are instructions for how to do this here (http://www.phpdig.net/navigation.php?action=doc#toc7). Also fix this (http://www.phpdig.net/forum/showthread.php?t=2005) bug. After another forceall index is complete, what do you see in the log file?
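
In case it helps, here is a rough sketch of what running the index in the background with a log file could look like. The helper name run_logged and the log file name spider.log are my own assumptions, not something from the PhpDig documentation, so follow the linked instructions if they differ.

```shell
# Hypothetical helper (not part of PhpDig): run a command in the
# background and append all of its output to a log file.
run_logged() {
    log_file=$1
    shift
    # nohup keeps the job running after the terminal is closed;
    # 2>&1 sends error messages to the same log as normal output.
    nohup "$@" >> "$log_file" 2>&1 &
    echo "started PID $!"
}

# Assumed usage, from the PhpDig admin directory:
#   run_logged spider.log php -f spider.php forceall
#   tail -f spider.log    # watch progress while it runs
```

With a plain >> redirect only normal output goes to the file, so the 2>&1 is there to capture error messages in the log as well.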

noel
10-27-2005, 08:07 AM
[root@wpc1460 admin]# php -f spider.php forceall >> spider.log
Failed loading /usr/lib/ioncube/Ioncube_loader_lin_4.4.so: /usr/lib/ioncube/Ioncube_loader_lin_4.4.so: cannot open shared object file: No such file or directory

But the spider is working, because the log file is being updated. Here is a small excerpt, since it is still spidering (there are 700 sites):

5726: old priority 0, new priority 18
Exploration des liens en cours...
-----------------------------
SITE : http://www.fromage-corse.com/
Chemins exclus :
- @NONE@
Pas de liens dans la table temporaire
liens trouvés : 0

-----------------------------
SITE : http://www.corse-piscines.com/
Chemins exclus :
- @NONE@
Pas de liens dans la table temporaire
liens trouvés : 0

-----------------------------
SITE : http://www.corsenature.com/
Chemins exclus :
- @NONE@
Pas de liens dans la table temporaire
liens trouvés : 0
*http://www.corse-moto-service.com/ Verrouillé*

-----------------------------
SITE : http://www.corsematin.com/
Chemins exclus :
- admin__nm/
- upload/
Pas de liens dans la table temporaire
liens trouvés : 0



I also get this, with the number 92. What is it?
-----------------------------
SITE : http://www.centralhotel.fr/
Chemins exclus :
- @NONE@
Pas de liens dans la table temporaire
liens trouvés : 92

-----------------------------
SITE : http://www.ccm-airlines.com/
Chemins exclus :
- @NONE@
Pas de liens dans la table temporaire
liens trouvés : 92

-----------------------------
SITE : http://www.castelbr****.com/
Chemins exclus :
- @NONE@
Pas de liens dans la table temporaire
liens trouvés : 92


I still have a problem with accents, and the excluded paths still show @NONE@, so why didn't it index the sites?

Thank You

Noël

Charter
10-27-2005, 10:22 AM
The 'failed loading - cannot open shared object file' message is not a PhpDig issue. You'll need to look at your ionCube setup for that. PhpDig 1.8.8 RC1 is set to use UTF-8 as the character encoding, so set your browser character encoding to UTF-8 when viewing the admin panel. Otherwise an é stored as UTF-8 is displayed as the two characters Ã and ©, because that is how its two bytes read in ISO-8859-1. The number 92 is telling you how many links have been found so far. As PhpDig is still spidering, check how many links were found once it is done.
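
If you want to see the accent effect for yourself, this small sketch reproduces it at the shell. It assumes the iconv utility is installed and that your terminal is set to UTF-8; the variable names are only for illustration.

```shell
# "é" as its two UTF-8 bytes (0xC3 0xA9), written in octal for portability.
utf8_e_acute=$(printf '\303\251')

# Treat those two bytes as ISO-8859-1 and convert them to UTF-8 for display.
# This mimics a browser that reads a UTF-8 page as ISO-8859-1.
misread=$(printf '%s' "$utf8_e_acute" | iconv -f ISO-8859-1 -t UTF-8)

# On a UTF-8 terminal this shows the two characters Ã and ©.
printf '%s\n' "$misread"
```

Each byte of the UTF-8 é becomes its own ISO-8859-1 character, which is why a single accented letter turns into two.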