baskamer
03-31-2004, 06:00 AM
Hi, am I correct to assume that an the index of a page is not automaticly removed?
I have a situation were users can 'depublish' their own pages. these pages still show up in the search results, with the old content. When a page (which has its own url) is depublished, a visit to that page will yeild an other tekst, than the published page.
there are no other links to those pages, so the spider doesnot find those pages, and therefore the index is not updated.
i am not sure if this is a bug or expected behavior. I mean is it correct behavior not to visit pages that have previously been linked (spidered) but are 'orphant' (no links point to them).
Is this something that will be changed in a future release?
Can you please give me any insight on this?
a possible solution is to have a cron job delete all the indexed files, just before spidering. Or is this not the way to go...?
I have a situation were users can 'depublish' their own pages. these pages still show up in the search results, with the old content. When a page (which has its own url) is depublished, a visit to that page will yeild an other tekst, than the published page.
there are no other links to those pages, so the spider doesnot find those pages, and therefore the index is not updated.
i am not sure if this is a bug or expected behavior. I mean is it correct behavior not to visit pages that have previously been linked (spidered) but are 'orphant' (no links point to them).
Is this something that will be changed in a future release?
Can you please give me any insight on this?
a possible solution is to have a cron job delete all the indexed files, just before spidering. Or is this not the way to go...?