Crawl for illegal content
Hi,
I need to crawl the sites of customers hosted with us and try to assess if any of them have illegal/pornographic material hosted on their servers.
I am currently trying to use Metis, but it is horribly error-prone and slow.
I want each site to be crawled for certain keywords. If the content of any URL matches the keywords, I want a report of all such URLs for that customer to be emailed to my support desk for verification/action.
Has anyone done this with PHPDig? Is this possible, or is PHPDig going to give me reports only when I query it? That is, would I have to run a query against the indexed information every day and mail myself the report?
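To make the workflow concrete, here is a rough Python sketch of the matching and reporting step I have in mind. It is only an illustration under assumptions: the function names (`find_flagged_urls`, `build_report`), the example URLs and the addresses are made up, it has nothing to do with PHPDig's internals, and it assumes the page text has already been fetched by some crawler.

```python
import re
from email.message import EmailMessage


def find_flagged_urls(pages, keywords):
    """Return {url: [matched keywords]} for every page containing a keyword.

    `pages` is assumed to be a dict of URL -> already-fetched page text.
    """
    # Case-insensitive, whole-word patterns (assumes keywords are plain words).
    patterns = {
        kw: re.compile(r"\b" + re.escape(kw) + r"\b", re.IGNORECASE)
        for kw in keywords
    }
    flagged = {}
    for url, text in pages.items():
        hits = [kw for kw, pat in patterns.items() if pat.search(text)]
        if hits:
            flagged[url] = hits
    return flagged


def build_report(customer, flagged, sender, recipient):
    """Build (but do not send) a plain-text report email for one customer."""
    msg = EmailMessage()
    msg["Subject"] = f"Flagged URLs for {customer}: {len(flagged)}"
    msg["From"] = sender
    msg["To"] = recipient
    lines = [
        f"{url}  (matched: {', '.join(hits)})"
        for url, hits in sorted(flagged.items())
    ]
    msg.set_content("\n".join(lines) or "No matches.")
    return msg
```

Run daily from cron, the resulting message could then be handed to something like `smtplib.SMTP("localhost").send_message(msg)`, assuming a local mail relay is available.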
Thanks,
Siddhartha