PhpDig.net

Old 08-02-2005, 03:28 PM   #1
tscholle
Green Mole
 
Join Date: Jul 2005
Posts: 4
Indexing Help... I am missing something

Hello All:

I have not had any luck finding posts exactly like this and need a little help. Unfortunately, my site is on an intranet and I cannot provide a link for you to review, so I will do the best I can to explain this.

PhpDig v.1.8.7

All of my data is stored in a directory on the site that is broken into a year directory, then a month directory. So it looks like this:

-->Archives
   -->2005
      -->January

All of the month directories contain a bunch of HTML files that are listed in an HTML file called published.html, which is also in the month directory.

Everything seems to go fine when I set up PhpDig, and the index database looks fine. However, when I search, every link takes you to the published.html file and not the HTML page that has the data you really want.

What am I doing wrong? Am I choosing something wrong in the search depth?

When I enter what should be indexed, I put in something like this:

http://archive/archives/1990/jan/published.html
http://archive/archives/1990/feb/published.html
http://archive/archives/1990/mar/published.html
http://archive/archives/1990/apr/published.html
http://archive/archives/1990/may/published.html
http://archive/archives/1990/jun/published.html
http://archive/archives/1990/jul/published.html
http://archive/archives/1990/aug/published.html
http://archive/archives/1990/sep/published.html
http://archive/archives/1990/oct/published.html

Is that wrong?
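
For what it's worth, a list like the one above does not have to be typed by hand. Here is a minimal Python sketch, assuming the layout shown (`http://archive/archives/<year>/<mon>/published.html` with three-letter lowercase month names). The `start_urls` helper and the `MONTHS` list are made up for illustration and are not part of PhpDig; it only prints URLs that can then be pasted into the admin panel.

```python
# Hypothetical helper: build one published.html start URL per month.
# The base URL and month abbreviations are assumptions taken from the
# example URLs in this post, not anything PhpDig itself defines.
MONTHS = ["jan", "feb", "mar", "apr", "may", "jun",
          "jul", "aug", "sep", "oct", "nov", "dec"]


def start_urls(year, base="http://archive/archives"):
    """Return the published.html URL for every month of the given year."""
    return [f"{base}/{year}/{month}/published.html" for month in MONTHS]


for url in start_urls(1990):
    print(url)
```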

Any help or advice that anyone could offer would be GREATLY appreciated, and I thank you in advance!

Tom Scholle
tjscholle@cbs.com
Old 08-02-2005, 06:00 PM   #2
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
So each published.html page contains links to other pages in the archives/year/month/ directory? Try setting LIMIT_TO_DIRECTORY to false and PHPDIG_IN_DOMAIN to true (both in the config file) and then, from the admin panel, use a large search depth, set links per to zero, and use the no option.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Old 08-04-2005, 11:11 AM   #3
tscholle
Green Mole
 
Join Date: Jul 2005
Posts: 4
No luck...

I am afraid everything still comes back pointing to published.html. Should I change how I set the dig

from http://archive/archives/1990/jan/published.html

to http://archive/archives/1990/jan

Would that fix it? Am I limiting it too much?
Old 08-04-2005, 11:55 AM   #4
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
What does the HTML from one of the published.html look like? Just attach one of the published.html files, if you will, so I can have a look-see. Also, if you can, attach a screenshot showing the trouble area. This will help me get a better understanding.
Old 08-04-2005, 02:39 PM   #5
tscholle
Green Mole
 
Join Date: Jul 2005
Posts: 4
I have added a zip file with a published.html and a screen shot of the results. I hope that helps. Thank you for your help!
Attached Files
File Type: zip cbs4 stuff.zip (51.9 KB, 9 views)
Old 08-04-2005, 02:56 PM   #6
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
When you click the published.html link, like the one shown in the screenshot, where are you taken? Also attach one of those 16634f0b.html type files so I can look-see and test.
Old 08-04-2005, 04:47 PM   #7
tscholle
Green Mole
 
Join Date: Jul 2005
Posts: 4
Here you go....

I guess at this point it would help you to know that these pages get created by our NRCS (newsroom computer system). This is an archive of a show's rundown.

When I click on a link like the one in the screenshot, I am taken directly to the published.html for that month and not to the story it is referencing.

I hope I am answering the questions correctly here...

Again I thank you for this help!

Tom
Attached Files
File Type: zip 16634f0b.zip (1.1 KB, 7 views)
Old 08-04-2005, 09:20 PM   #8
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Okay, I did a test using the following setup:
Code:
http://www.phpdig.net/temp/published.html
http://www.phpdig.net/temp/16634f0b.html
Where published.html contained the first three links:
Code:
<A HREF="16634f0b.html">chinese orch        &nbsp;</A><BR>
<A HREF="18634f0b.html">the bit             &nbsp;</A><BR>
<A HREF="1c634f0b.html">wt:pegasus          &nbsp;</A><BR>
And PhpDig v.1.8.7 was limited to indexing a couple of links.

PhpDig printed out the following:

Spidering in progress... [Stop spider]
SITE : http://www.phpdig.net/
Exclude paths :
- @NONE@
1:http://www.phpdig.net/temp/published.html
(time : 00:00:06)
+
level 1...
2:http://www.phpdig.net/temp/16634f0b.html
(time : 00:00:16)
No link in temporary table
links found : 2
http://www.phpdig.net/temp/published.html
http://www.phpdig.net/temp/16634f0b.html
Optimizing tables...
Indexing complete ! [Back] to admin interface.

A test search on orch yielded the attached image.

What happens if you directly index the following:

http://archive/archives/????/???/16634f0b.html
(replacing the ?'s with year and month info)

If you want to see 16634f0b.html, what do you type in your browser:

http://archive/archives/YYYY/MMM/16634f0b.html or something else?
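
One way to answer that question without PhpDig in the picture is to work out which absolute URLs the relative HREFs in published.html point to. The Python sketch below is only a standalone check, not how PhpDig resolves links internally. It assumes the three sample links quoted earlier in this post and the test URL on phpdig.net; `HrefCollector` and `resolve_links` are hypothetical names. It also ignores any `<base href>` tag, which would change how a browser resolves the links.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class HrefCollector(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <A HREF=...> matches.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def resolve_links(page_url, html):
    """Return each link in html as an absolute URL relative to page_url."""
    collector = HrefCollector()
    collector.feed(html)
    return [urljoin(page_url, href) for href in collector.hrefs]


# Sample markup copied from the published.html excerpt in this thread.
SAMPLE = """<A HREF="16634f0b.html">chinese orch        &nbsp;</A><BR>
<A HREF="18634f0b.html">the bit             &nbsp;</A><BR>
<A HREF="1c634f0b.html">wt:pegasus          &nbsp;</A><BR>"""

for link in resolve_links("http://www.phpdig.net/temp/published.html", SAMPLE):
    print(link)
```

If the URLs printed for one of the intranet published.html pages differ from what you type in the browser to reach a story, that difference is worth posting here.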
Attached Images
File Type: gif screenshot.gif (26.2 KB, 7 views)
All times are GMT -8.

