PhpDig.net

03-31-2006, 11:17 AM   #1
dreamingdigital
Green Mole
 
Join Date: Mar 2006
Posts: 3
Anybody else can't get it to spider Miva Merchant files?

I can't get it to spider a Miva Merchant Web site. It will, however, spider an individual page if I give it the exact URL. Example: http://www.violinshop.ca/miva/mercha...ory_Code=_HOME. But it will not follow the links from that page and continue. It works fine on PHP and HTML sites. Just wondering if anybody has had any luck spidering a Miva site?

Colin
04-27-2006, 12:42 PM   #2
Dave A
Purple Mole
 
Join Date: Aug 2004
Location: North Island New Zealand
Posts: 170
Hi, from what I can see, the most likely cause is that the site uses Java applets instead of HTML links.
Most web spiders can't follow Java links.
A heap of the site seems to be Java based, so that may be the problem.
04-27-2006, 08:11 PM   #3
dreamingdigital
Well, it's Miva Merchant. I don't know whether that's Java or not, but the Web pages are generated dynamically from a database by the Miva engine.

PHP- and ASP-generated pages work with PhpDig, but not Miva, I guess. I thought Miva was XML, but who knows.

CP
04-27-2006, 08:15 PM   #4
Dave A
I tell you what:
I will try to index the site from my end in a few minutes and let you know what happens. Maybe this will shed some light on it.
04-27-2006, 08:17 PM   #5
dreamingdigital
You're the best!

I can get it to index individual pages if I explicitly tell it which page to index, but I can't get it to follow any links and index the site. The root index page links to pretty much all the root categories, so just setting http://www.violinshop.ca/ as the site to index "should" be enough. Good luck. You rock.

CP
04-27-2006, 09:26 PM   #6
Dave A
Okay, having played with it myself, I can see what might be a problem.
The page shows the menus at the top of the screen in odd colours: I have to move the mouse over them to change the colour from black to white, and at first they appear as odd symbols until the page sorts itself out.

The source for that reads as follows:
<TABLE WIDTH=636 BORDER=0 CELLPADDING=0 CELLSPACING=0>
<TR>
<TD>
<A HREF="http://www.violinshop.ca/miva/merchant.mvc?Screen=CTGY&Category_Code=_HOME"
ONMOUSEOVER="DD_images_to_normal(); changeImages('home', 'images/layout/home-over.gif'); dd_show_selects(); MM_showHideLayers('hider1','','hide', 'hider2','','hide', 'divs_instruments','','hide', 'divs_instruments_beginners','','hide', 'divs_strings','','hide', 'divs_strings_violin','','hide', 'divs_strings_viola','','hide', 'divs_strings_cello','','hide', 'divs_strings_bass','','hide', 'divs_cases','','hide', 'divs_access','','hide', 'divs_access_stands','','hide', 'divs_access_endpins','','hide', 'divs_music_cds','','hide', 'divs_gifts','','hide', 'divs_music','','hide', 'divs_music_violin','','hide', 'divs_music_cello','','hide', 'divs_books','','hide', 'divs_specials','','hide', 'divs_repair','','hide', 'divs_rental','','hide', 'divs_faqs','','hide', 'divs_contact','','hide');"
ONMOUSEOUT="changeImages('home', 'images/layout/home.gif');">
<IMG NAME="home" SRC="images/layout/home.gif" WIDTH=56 HEIGHT=21 BORDER=0></A></TD>
<TD>
<A HREF="http://www.violinshop.ca/miva/merchant.mvc?Screen=CTGY&Category_Code=GAME"
ONMOUSEOVER="DD_images_to_normal(); changeImages('games', 'images/layout/games-over.gif'); dd_show_selects(); MM_showHideLayers('hider1','','hide', 'hider2','','hide', 'divs_instruments','','hide', 'divs_instruments_beginners','','hide', 'divs_strings','','hide', 'divs_strings_violin','','hide', 'divs_strings_viola','','hide', 'divs_strings_cello','','hide', 'divs_strings_bass','','hide', 'divs_cases','','hide', 'divs_access','','hide', 'divs_access_stands','','hide', 'divs_access_endpins','','hide', 'divs_music_cds','','hide', 'divs_gifts','','hide', 'divs_music','','hide', 'divs_music_violin','','hide', 'divs_music_cello','','hide', 'divs_books','','hide', 'divs_specials','','hide', 'divs_repair','','hide', 'divs_rental','','hide', 'divs_faqs','','hide', 'divs_contact','','hide');"
ONMOUSEOUT="changeImages('games', 'images/layout/games.gif');">

<IMG NAME="games" SRC="images/layout/games.gif" WIDTH=52 HEIGHT=21 BORDER=0></A></TD>
<TD>
<A HREF="http://www.violinshop.ca/miva/merchant.mvc?Screen=CTGY&Category_Code=_REPAIR"
ONMOUSEOVER="DD_images_to_normal(); changeImages('repair', 'images/layout/repair-over.gif'); dd_hide_selects(); MM_showHideLayers('hider1','','show', 'hider2','','show', 'divs_instruments','','hide', 'divs_instruments_beginners','','hide', 'divs_strings','','hide', 'divs_strings_violin','','hide', 'divs_strings_viola','','hide', 'divs_strings_cello','','hide', 'divs_strings_bass','','hide', 'divs_cases','','hide', 'divs_access','','hide', 'divs_access_stands','','hide', 'divs_access_endpins','','hide', 'divs_music_cds','','hide', 'divs_gifts','','hide', 'divs_music','','hide', 'divs_music_violin','','hide', 'divs_music_cello','','hide', 'divs_books','','hide', 'divs_specials','','hide', 'divs_repair','','show', 'divs_rental','','hide', 'divs_faqs','','hide', 'divs_contact','','hide');"
ONMOUSEOUT="changeImages('repair', 'images/layout/repair.gif');">
<IMG NAME="repair" SRC="images/layout/repair.gif" WIDTH=49 HEIGHT=21 BORDER=0></A></TD>

That appears to show the wrong graphics, and the menu bars don't drop down cleanly to show the links, so this looks like where the web spider is falling over.
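One way to check whether the links in the markup above can be reached without running any script is to parse it and list the HREF values. Below is a minimal sketch in Python using the standard library's `html.parser`. It is not PhpDig's own parser; `HrefCollector`, `extract_hrefs` and `SAMPLE` are hypothetical names, and the ONMOUSEOVER handler in `SAMPLE` is abbreviated from the markup quoted above.

```python
from html.parser import HTMLParser


class HrefCollector(HTMLParser):
    """Collect the href of every <a> tag; script event handlers are ignored."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names,
        # so <A HREF=...> arrives here as tag "a" with attribute "href".
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_hrefs(markup):
    """Return the href values of all anchors found in the markup."""
    parser = HrefCollector()
    parser.feed(markup)
    parser.close()
    return parser.hrefs


# First menu cell from the quoted source, with the handler shortened.
SAMPLE = '''<TD>
<A HREF="http://www.violinshop.ca/miva/merchant.mvc?Screen=CTGY&Category_Code=_HOME"
ONMOUSEOVER="changeImages('home', 'images/layout/home-over.gif');"
ONMOUSEOUT="changeImages('home', 'images/layout/home.gif');">
<IMG NAME="home" SRC="images/layout/home.gif" WIDTH=56 HEIGHT=21 BORDER=0></A></TD>'''

if __name__ == "__main__":
    for href in extract_hrefs(SAMPLE):
        print(href)
```

If the category URL is printed, the anchor itself is an ordinary HTML link, and the mouse-over handlers sit alongside it rather than replacing it. Whether a particular spider then follows that link is a separate question that this sketch does not answer.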

The indexing continued, but it was very, very slow. In the end it found and indexed:

URI: http://www.violinshop.ca/ (8 pages)
Root: 1 page
/miva: 7 pages
-merchant.mvc?Screen=CTGY&Category_Code=_BOWS
-merchant.mvc?Screen=CTGY&Category_Code=_CASES
-merchant.mvc?Screen=CTGY&Category_Code=_CONSIGNMENT
-merchant.mvc?Screen=CTGY&Category_Code=_MUSIC
-merchant.mvc?Screen=CTGY&Category_Code=_RENTAL
-merchant.mvc?Screen=CTGY&Category_Code=_REPAIR
-merchant.mvc?Screen=CTGY&Category_Code=_STRINGS


So it will get there in the end, but it did take about twenty minutes (the spider is still working).

When you start to index it, set the depth to eight and the links per level to eight. It will take a while, but it appears to get there in the end.
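To picture what those two limits do, here is a rough sketch in Python of a crawl bounded by depth and by links per level. It is only an illustration, not PhpDig's code: `crawl`, `links_of` and the `SITE` link map are made up, and it assumes that "links per level" caps how many new links are followed at each depth level, which may not match PhpDig's exact behaviour.

```python
def crawl(start, get_links, max_depth=8, max_links_per_level=8):
    """Walk a site level by level, visiting at most max_depth levels below
    the start page and at most max_links_per_level new links per level."""
    seen = {start}
    order = [start]
    current = [start]
    for _ in range(max_depth):
        next_level = []
        for page in current:
            for link in get_links(page):
                if len(next_level) >= max_links_per_level:
                    break  # this level is full; skip the remaining links
                if link not in seen:
                    seen.add(link)
                    next_level.append(link)
        if not next_level:
            break  # nothing new was found, so the crawl is finished
        order.extend(next_level)
        current = next_level
    return order


# Hypothetical site: the root links to ten categories, each with one item page.
SITE = {"/": ["/cat%d" % i for i in range(10)]}
for i in range(10):
    SITE["/cat%d" % i] = ["/cat%d/item" % i]


def links_of(page):
    """Return the outgoing links of a page in the made-up SITE map."""
    return SITE.get(page, [])


if __name__ == "__main__":
    pages = crawl("/", links_of, max_depth=8, max_links_per_level=8)
    print(len(pages), "pages reached")
```

With the limit at eight, two of the ten made-up categories are never visited, so a site with wide menus may need a higher links-per-level value or several passes.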

I will leave the links into your website running live; that way, if anyone searches for any of the content, they will pay you a visit.

Try it set at eight and eight and wait a while; it will do it in the end.


Heaps of regards
Dave Andrews
From the Linknz search engine.