![]() |
Spidering vBulletin web sites?
Has anyone spidered a vBulletin site? It seems like the same pages are getting reindexed with different session variables. I am using PHPdig 1.8
Jamison |
Hi. If not already done, use this file and set the following in the config file to match the session variable name:
PHP Code:
|
match the session variable name?
how do i ascertain that? |
Hi. The session variable name can sometimes be seen in the address bar of a web browser or in the HTML source of a page.
|
so for a vbulletin page would it be... ???
|
the spider can deal with the forum pages in vbulletin but not the thread pages...
Duplicate of an existing document 21:http://www.archiseek.com/content/showthread.php (time : 00:03:09) 22:http://www.archiseek.com/content/for...php?forumid=16 (time : 00:03:16) Duplicate of an existing document 23:http://www.archiseek.com/content/showthread.php (time : 00:04:12) 24:http://www.archiseek.com/content/for...php?forumid=22 (time : 00:04:18) |
Hi. I haven't experienced that issue on PhpDig.net. What are you using for a session variable name?
|
Hi, I must be stupid... please help.
Hi Charter,
Thanks for all the great advice you give on this site. I'm reading your info and I'm still a bit confused (sorry, I'm still learning php and xml). The part of the config file you listed was: Quote:
Sorry, I guess I need to read up a bit more about linux, php, and xml and... and.. yep. I'm a newbie. Thanks in advance, Charter. |
Hi. You'd need to provide the name of the SID variable in the config file:
PHP Code:
|
Oh okay. I think I understand. So if they are using "sid" as their session id then I guess I should go into the config file, set the PHPDIG_SESSID_VAR to 'sid' and then reindex my database? If so, cool. I'll try it out. Thanks a bunch!
|
Hi. Yep, that's it. You might have to delete the site, clean the dictionary, and then index anew so PhpDig doesn't continue to store old session info in the tables.
|
isn't it possible to set more then just 1 sid-variables? would be good when u are spidering some strange sites and they use different sid's
|
Quote:
|
Is there a way to set more than one "PHPSESSID" string? I index many different "related" sites but some use "SID" some "SESS" some "SESSID" some "PHPID" etc.,
|
Quote:
|
All times are GMT -8. The time now is 03:34 PM. |
Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.