![]() |
google like search engine
Hi All,
I've gone up and done these forums, and I am a bit confused (nothing new!) ;) I want to create a search engine for a niche market. There may several thousand sites in this niche. 1) I want to start by listing a few big ones... 2) have the ability for the people to come and request to be indexed (the process would have a screen for them to list their URL, and after approval, they would get indexed based on some sort of schedule) So some questions: a) How is the data stored? - Does the PhpDig store the URL, Title, Description, Keywords of the "crawled" pages in the mySQL database? - Where are the actual INDEXED content of the pages stored? b) How much storage is needed? - i.e. if we have 1000 sites, with 15 pages each... a total of 15,000 pages, How much storage would be needed? c) How quick is the code? - Using the above example (15,000 pages), how long would a 2 word search take? d) And most importantly, has someone put together a Moded version for this kind of application? thanks, Sam |
Quote:
Quote:
Quote:
Quote:
|
Dear junkie,
Thanks for the reply. However, you did not give ANY answers :( I do understant EVRYTHING depends on something else! I have not downloaded the code, or installed it yet. My host is on "all Windows" platform. All I wanted to get some estimates before I went and paid for a linux hosting just to try the code. You seem to know a lot about this code... so here are a few questions: a) How is the data stored? - Does the PhpDig store the URL, Title, Description, Keywords of the "crawled" pages in the mySQL database? - Where are the actual INDEXED (text) content of the pages stored? b) In your own installation, what are the sizes of the database and how long would a 2 word search take? |
Quote:
One thing that you might not be aware of is that phpdig will work on a Windows server. However, my own experience with that has been that it doesn't work very well there. You might have better luck than me though. I have been told that a Windows server settings can be tweaked so that phpdig will work pretty well, but I've never pursued that myself so I can offer you any insight on that. Quote:
Quote:
Quote:
Quote:
Hope this answers your questions. :) |
All times are GMT -8. The time now is 11:55 PM. |
Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.