Numbers everywhere...
Hi,
I'm encountering some problems while indexing a website with phpdig.
There is no prolem with the indexing itself, but it's the text that is stored in the txt files of the text_content directory.
All the text file contain text with numbers and letters (ex:19b)placed almost every where.
On first indexation, there are few but on re-indexing, these alpha-numeric "bugs" begin to invade all the text. especially in the begining of the text
Here's an example after 3rd indexation :
"b3 46 19b
198 19b ee
119 66 6e 10 Le livre du Mois 15 2 1c Miró, un feu dans les ruines 1a 1d5 Sans doute êtes vous déjÃ* nombreux Ã* avoir vu ou Ã* revoir la très importante exposition consacrée"
The text is the one that is shown in the result page, so it is really annoying.
It's like some ereg_replace/eregi stuff did'nt do its job well.
If somebody can tell me what's wrong, I'll be grateful.
Thx.
|