PDA

View Full Version : А есть суппорт на русском?


@Cramac
01-02-2005, 12:46 PM
Всем привет. Собственно сабж.

@Cramac
01-02-2005, 01:53 PM
All greetings. At once I am sorry for bad English (PromtXT)
At me such problem.
Has established search, has corrected in configs the data on access to base. Has tried to go but at input of the password and a login, it{he} did not let, threw back on the form. Has disconnected authorization. Has come. But it is impossible on normal about to index all site (www.Elgorsk.ru)
That writes at attempt about to index (repeatedly) the basic site:

SITE : http://elgorsk.ru/
Exclude paths :
- @NONE@
Duplicate of an existing document
1:http://elgorsk.ru/
(time : 00:00:06)

No link in temporary table

--------------------------------------------------------------------------------

links found : 1
http://elgorsk.ru/
Optimizing tables...
Indexing complete !


And on it business rises.
Tried to index a forum (forum.elgorsk.ru)
That it{he} has found a maximum, it about 60-70 pages... But there it is more than them!

Help the beginner.

vinyl-junkie
01-02-2005, 09:16 PM
Pay particular attention to post #6 in this thread:
http://www.phpdig.net/forum/showthread.php?t=1692

:)

@Cramac
01-02-2005, 11:14 PM
Tried in all variants and helps nothing.

vinyl-junkie
01-02-2005, 11:32 PM
When I tried indexing your site just now, I was able to index 3 pages before I stopped the spider, but I indexed it as:

http://www.elgorsk.ru/

not as:

http://elgorsk.ru/

That's one problem.

Another thing I noticed is that you have a lot of subdirectories on your site. Phpdig interprets those as separate domains and will not index them as part of the process of indexing your main domain. You'll have to list those on separate lines on the admin page for phpdig to spider them.

Charter
01-03-2005, 06:04 AM
Another thing I noticed is that you have a lot of subdirectories on your site. Phpdig interprets those as separate domains and will not index them as part of the process of indexing your main domain. You'll have to list those on separate lines on the admin page for phpdig to spider them.
Not quite... ;)

If LIMIT_TO_DIRECTORY is true then the index is limited to the given (sub)directory. The dropdown on the search box, assuming it's enabled via the config file, is for searching within a (sub)domain/(sub)directory. If interested, see this (http://www.phpdig.net/forum/showpost.php?p=4797&postcount=9) for some distinction between (sub)direcories and (sub)domains.


//for limit to directory, URL format must either have file at end or ending slash at end
//e.g., http://www.domain.com/dirs/ (WITH ending slash) or http://www.domain.com/dirs/dirs/index.php
define('LIMIT_TO_DIRECTORY',true); //limit index to given (sub)directory, no sub dirs of dirs are indexed

vinyl-junkie
01-03-2005, 08:41 AM
Welcome back, Charter! :D Hope you had at least a semi-restful time away from the forums (and that you got your furnace fixed. It's cold here in the US!).

Thanks for the correction on the (sub)directory issue. I guess I had been giving out a bit of misinformation. I'll file this one away for future reference.

Charter
01-03-2005, 02:45 PM
Thanks, my time away was filled with sadness and reflection, but I did get my furnace fixed. It had dropped to about 50 °F (10 °C) before the heat came back.

Anyway, @Cramac should try setting LIMIT_TO_DIRECTORY to false and PHPDIG_IN_DOMAIN to true, both in the config file.


define('PHPDIG_IN_DOMAIN',false); //allows phpdig jump hosts in the same
//domain. If the host is "www.mydomain.tld",
//domain is "mydomain.tld"

@Cramac
01-03-2005, 10:14 PM
Thanks. I shall try once again.

P.S.I have tried yesterday to start a spider through crones so then the hosting - provider called to me and have told that I strongly him{it} load a server.

@Cramac
01-04-2005, 01:24 AM
And still a question: whether it is possible to forbid indexation (temporarily) pages already proindexed?

vinyl-junkie
01-04-2005, 03:19 AM
Thanks. I shall try once again.

P.S.I have tried yesterday to start a spider through crones so then the hosting - provider called to me and have told that I strongly him{it} load a server.
How did you set it up to run? Unless you set off multiple spiders at once, running phpdig through cron shouldn't create a large server load.

@Cramac
01-04-2005, 11:47 AM
Like indexed all that could, it has turned out about 1000 pages. And where the others?
According to search machines yandex.ru at me on a site over 22000 pages....

Charter
01-05-2005, 01:23 AM
In the config file set the following:

define('SPIDER_MAX_LIMIT',100);
define('RESPIDER_LIMIT',100);
define('LINKS_MAX_LIMIT',100);
define('RELINKS_LIMIT',100);
define('LIMIT_TO_DIRECTORY',false);
define('PHPDIG_IN_DOMAIN',true);

From the PhpDig admin panel use the following:

set "search depth" to 100
set "links per" to zero
use "no" option this time
click the dig button

@Cramac
01-06-2005, 02:08 AM
Whether and it is possible to start indexation for all site not from the beginning, and during unfinished? I.e. if I shall casually stop the robot in the middle of work as it to start from a place of end instead of is constant from the beginning?

vinyl-junkie
01-06-2005, 03:23 AM
If you stop spidering in the middle, just don't get rid of the contents of your tempspider table. That's what phpdig uses to figure out where it's been.

Charter, please correct me if I didn't quite explain that correctly, but that's my understanding of how it works.