Post Reply 
Random Google Propaganda
Author Message
ZiNgA BuRgA
Smart Alternative

Posts: 17,022.2988
Threads: 1,174
Joined: 19th Jan 2007
Reputation: -1.71391
E-Pigs: 446.1274
Offline
Post: #1
Random Google Propaganda
...but still an interesting read (at least for me):

Quote:Wee've known it for a long time: the web is big. The first Google index in 1998 already had 26 million pages, and by 2000 the Google index reached the one billion mark. Over the last eight years, wee've seen a lot of big numbers about how much content is really out there. Recently, even our search engineers stopped in awe about just how big the web is these days -- when our systems that process links on the web to find new content hit a milestone: 1 trillion (as in 1,000,000,000,000) unique URLs on the web at once!

How do wee find all those pages? Wee start at a set of well-connected initial pages and follow each of their links to new pages. Then wee follow the links on those new pages to even more pages and so on, until wee have a huge list of links. In fact, wee found even more than 1 trillion individual links, but not all of them lead to unique web pages. Many pages have multiple URLs with exactly the same content or URLs that are auto-generated copies of each other. Even after removing those exact duplicates, wee saw a trillion unique URLs, and the number of individual web pages out there is growing by several billion pages per day.

So how many unique pages does the web really contain? Wee don't know; wee don't have time to look at them all! :-) Strictly speaking, the number of pages out there is infinite -- for example, web calendars may have a "next day" link, and wee could follow that link forever, each time finding a "new" page. Wee're not doing that, obviously, since there would be little benefit to you. But this example shows that the size of the web really depends on your definition of what's a useful page, and there is no exact answer.

Wee don't index every one of those trillion pages -- many of them are similar to each other, or represent auto-generated content similar to the calendar example that isn't very useful to searchers. But wee're proud to have the most comprehensive index of any search engine, and our goal always has been to index all the world's data.

To keep up with this volume of information, our systems have come a long way since the first set of web data Google processed to answer queries. Back then, wee did everything in batches: one workstation could compute the PageRank graph on 26 million pages in a couple of hours, and that set of pages would be used as Google's index for a fixed period of time. Today, Google downloads the web continuously, collecting updated page information and re-processing the entire web-link graph several times per day. This graph of one trillion URLs is similar to a map made up of one trillion intersections. So multiple times every day, wee do the computational equivalent of fully exploring every intersection of every road in the United States. Except it'd be a map about 50,000 times as big as the U.S., with 50,000 times as many roads and intersections.

As you can see, our distributed infrastructure allows applications to efficiently traverse a link graph with many trillions of connections, or quickly sort petabytes of data, just to prepare to answer the most important question: your next Google search.
- Source: [Googleblog]
10/08/2008 06:01 PM
Visit this user's website Find all posts by this user Quote this message in a reply
Slushba132
BustyLoli-Chan

Posts: 3,125.3993
Threads: 508
Joined: 20th Feb 2008
Reputation: -8.27558
E-Pigs: 73.1299
Offline
Post: #2
RE: Random Google Propaganda
SUGOI!!!!
I don't understand. Too lazy to comprehend.

And they just realized the internet was big?
like double you tee eff.
do you look at the sun and think ha ha ha that thing is tiny.
NO! It's the sun of course it's big. Biggest fricken thing wee have.


Maybe tomorrow I will go prove gravity exists and then make an article about it.

10/08/2008 06:14 PM
Visit this user's website Find all posts by this user Quote this message in a reply
feinicks
One day... we Fly...

Posts: 6,124.6050
Threads: 531
Joined: 27th Mar 2008
Reputation: 2.35695
E-Pigs: 210817.3958
Offline
Post: #3
RE: Random Google Propaganda
web + infinite urls= Major pain in donkey for indexing!

◄◄••• 天使たちの夢か? •••►►

[Image: ewualizer.gif]
My works!
10/08/2008 10:01 PM
Find all posts by this user Quote this message in a reply
Slushba132
BustyLoli-Chan

Posts: 3,125.3993
Threads: 508
Joined: 20th Feb 2008
Reputation: -8.27558
E-Pigs: 73.1299
Offline
Post: #4
RE: Random Google Propaganda
indexing is stupid.
It slows down your computer and rarely comes in handy...
I guess it would be good for the web though.


Hey, did you know gravity exists?

10/08/2008 10:09 PM
Visit this user's website Find all posts by this user Quote this message in a reply
boogschd
boogyman
Worlds End

Posts: 4,954.3196
Threads: 90
Joined: 29th Nov 2007
Reputation: 4.19708
E-Pigs: 43.6852
Offline
Post: #5
RE: Random Google Propaganda
Slushba132 Wrote:indexing is stupid.
It slows down your computer and rarely comes in handy...
I guess it would be good for the web though.

yes it would :D ... faster search results from google
10/08/2008 10:35 PM
Visit this user's website Find all posts by this user Quote this message in a reply
Tetris999
..............................

Posts: 2,390.4622
Threads: 298
Joined: 15th Apr 2007
Reputation: -6.7936
E-Pigs: 82.5657
Offline
Post: #6
RE: Random Google Propaganda
boogschd Wrote:
Slushba132 Wrote:indexing is stupid.
It slows down your computer and rarely comes in handy...
I guess it would be good for the web though.

yes it would :D ... faster search results from google

arent they already fast? or do i not get the joke? Yay

MY SIG IS FUCKING DEAD
10/08/2008 10:38 PM
Find all posts by this user Quote this message in a reply
Nacos
Soon to be Moderator?

Posts: 2,004.2538
Threads: 181
Joined: 21st May 2007
Reputation: -0.41086
E-Pigs: 12.1482
Offline
Post: #7
RE: Random Google Propaganda
i has a bigger local area network.

[Image: 17312564gf1.png]
10/08/2008 10:55 PM
Visit this user's website Find all posts by this user Quote this message in a reply
Sparker
Super Lame Productions

Posts: 8,165.3369
Threads: 549
Joined: 19th Jan 2007
Reputation: 10.74638
E-Pigs: 187.8972
Offline
Post: #8
RE: Random Google Propaganda
Slushba132 Wrote:Hey, did you know gravity exists?
Yes I do, in fact there's a law for it apparently.

10/08/2008 11:15 PM
Find all posts by this user Quote this message in a reply
Slushba132
BustyLoli-Chan

Posts: 3,125.3993
Threads: 508
Joined: 20th Feb 2008
Reputation: -8.27558
E-Pigs: 73.1299
Offline
Post: #9
RE: Random Google Propaganda
No wai

10/08/2008 11:16 PM
Visit this user's website Find all posts by this user Quote this message in a reply
boogschd
boogyman
Worlds End

Posts: 4,954.3196
Threads: 90
Joined: 29th Nov 2007
Reputation: 4.19708
E-Pigs: 43.6852
Offline
Post: #10
RE: Random Google Propaganda
Tetris999 Wrote:
boogschd Wrote:
Slushba132 Wrote:indexing is stupid.
It slows down your computer and rarely comes in handy...
I guess it would be good for the web though.

yes it would :D ... faster search results from google

arent they already fast? or do i not get the joke? Yay

what joke ?

reason its fast cause they index uRL's

that's why tis good for the web :D

* boogschd is confoooozd
11/08/2008 03:39 AM
Visit this user's website Find all posts by this user Quote this message in a reply
Post Reply 


Forum Jump:


User(s) browsing this thread: 1 Guest(s)

 Quick Theme: