Okay, I ran a test using the following two pages:
Code:
http://www.phpdig.net/temp/published.html
http://www.phpdig.net/temp/16634f0b.html
published.html contained only the first three links:
Code:
<A HREF="16634f0b.html">chinese orch </A><BR>
<A HREF="18634f0b.html">the bit </A><BR>
<A HREF="1c634f0b.html">wt:pegasus </A><BR>
PhpDig v.1.8.7 was limited to indexing only a couple of links.
PhpDig printed out the following:
Spidering in progress... [Stop spider]
SITE : http://www.phpdig.net/
Exclude paths :
- @NONE@
1:http://www.phpdig.net/temp/published.html
(time : 00:00:06)
+
level 1...
2:http://www.phpdig.net/temp/16634f0b.html
(time : 00:00:16)
No link in temporary table
links found : 2
http://www.phpdig.net/temp/published.html
http://www.phpdig.net/temp/16634f0b.html
Optimizing tables...
Indexing complete ! [Back] to admin interface.
A test search for "orch" returned the result shown in the attached image.
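For reference, the relative HREFs in published.html resolve against the URL of the page that contains them, which is why 16634f0b.html was spidered under /temp/ in the output above. Here is a rough Python sketch of that resolution. It is not PhpDig's code, just standard URL joining from the Python standard library, and the names LinkCollector and resolve_links are made up for illustration:

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# URL and markup taken from the test setup in this post.
BASE_URL = "http://www.phpdig.net/temp/published.html"
PAGE = """<A HREF="16634f0b.html">chinese orch </A><BR>
<A HREF="18634f0b.html">the bit </A><BR>
<A HREF="1c634f0b.html">wt:pegasus </A><BR>"""


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects the raw href value of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <A HREF=...> matches.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def resolve_links(base_url, html):
    """Return absolute URLs for all links in html, resolved against base_url."""
    collector = LinkCollector()
    collector.feed(html)
    return [urljoin(base_url, href) for href in collector.hrefs]


if __name__ == "__main__":
    for url in resolve_links(BASE_URL, PAGE):
        print(url)
```

If the same relative links sit on a page under a different path, they resolve under that path instead, so the answer to the questions below determines which URLs the spider should be able to reach.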
What happens if you index the following URL directly, replacing the ?'s with the year and month info?
http://archive/archives/????/???/16634f0b.html
If you want to view 16634f0b.html, what URL do you type in your browser:
http://archive/archives/YYYY/MMM/16634f0b.html, or something else?