Results 1 to 13 of 13

Thread: Boitho stats

Hybrid View

Previous Post Previous Post   Next Post Next Post
  1. #1
    Administrator Bok's Avatar
    Join Date
    Oct 2003
    Location
    Wake Forest, North Carolina, United States
    Posts
    24,468
    Blog Entries
    13

    Boitho stats

    Added today..

    http://stats.free-dc.org/new/projpage.php?proj=boi

    They are slightly different to the projects own stats as they have a leaderboard based on a combination of the pages and robots. I'm just ranking on the pages.

    Bok

  2. #2
    Keeper of the Fridge PY 222's Avatar
    Join Date
    Jul 2002
    Location
    San Jose, CA
    Posts
    2,706
    Where did this come from?

    A linky would be good.

  3. #3
    Administrator Bok's Avatar
    Join Date
    Oct 2003
    Location
    Wake Forest, North Carolina, United States
    Posts
    24,468
    Blog Entries
    13
    I'll leave the explanations to PCZ as I know very little about this..

    the link I'm using for the stats is

    http://dcsetup.boitho.com/cgi-bin/dc/topTeams.cgi

    Bok

  4. #4

  5. #5
    Keeper of the Fridge PY 222's Avatar
    Join Date
    Jul 2002
    Location
    San Jose, CA
    Posts
    2,706
    Ahh...

    This is a distributed crawler, where everybody can donate there superfluous computer resources and spare bandwidth, to help us create a bigger and better search engine.

    Our goal is to make a general internet search engine with a thumbnail picture of all the pages. The problem is that the bandwidth and computer resources needed to make a thumbnail of a internet page is several times the resources needed just to download a HTML page. This means that Boitho has to spend more resources on crawling than other search engines.

    To make the most of our available resources, and to allow volunteers to donate their superfluous bandwidth and idle CPU time, we have developed a distributed crawler for Boitho, like seti@home and Grub. That way people can install a program on their computers and help us with the crawling.
    Just like another Majestic12.

  6. #6
    Administrator PCZ's Avatar
    Join Date
    Jun 2003
    Location
    Chertsey Surrey UK
    Posts
    2,428
    A small warning.

    The client can crawl robots.txt and URL's.
    Crawling robots.txt the load on the PC is small but when it starts crawling URL's it a CPU hog.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •