Everything you always wanted to know about Google… :Everything you always wanted to know about Google…
Slide 3:Index Who and what is Google?
Google’s vision, mission and values
Google’s History & milestones
Googleplex
Google Web Search
Google ImageSearch
Google Maps
Google Products
Google Labs
Google API’s
Google’s future: Beyond 2009
Slide 4:Who and what is Google
Slide 5:Google (nicknamed "BackRub") began in January 1996 as a research project by
Larry Page and Sergey Brin, a Ph.D.
student at Stanford, working on the
Stanford Digital Library Project (SDLP).
Google search is a web search engine owned by Google Inc.
Beyond the original word-search capability, Google Search provides more than 22 Special features Don’t Be Evil
Original Hardware :Original Hardware In 1998 Co-founders Larry Page and Sergey Brin used Stanford equipment and donated machines to run Google's search engine duties:-
Two 300MHz Dual Pentium II servers with 512 MB of memory
A four-processor F50 IBM RS6000 computer with 512 MB of memory
A dual-processor Sun Ultra II computer with 256 MB of memory
Several hard drives ranging from 4 to 9 GB for a total of more than 350 GB of storage space
History :History By early 1997, the backrub page described the state as follows:
Some Rough Statistics (from August 29th, 1996)Total indexable HTML urls: 75.2306 MillionTotal content downloaded: 207.022 gigabytes...
BackRub is written in Java and Python and runs on several Sun Ultras and Intel Pentiums running Linux. The primary database is kept on an Sun Ultra II with 28GB of disk.
-Larry Page page@cs.stanford.edu
Slide 8:2. Google's Vision, Mission and values
Google Vision :Google Vision "The perfect search engine," says Google co-founder Larry Page, "would understand exactly what you mean and give back exactly what you want."
Given the state of search technology today, that's a far-reaching vision requiring research, development and innovation to realize.
Google is committed to blazing that trail.
Slide 10:History & milestones
Google’s History and milestones (1/14) :Google’s History and milestones (1/14) 1998: In September 1998, Google Inc. opened its door in Menlo Park, California.
2000: Google introduces Ad Words, a self-service ad program that could be activated online with a credit card in a matter of minutes. Google Toolbar. 100 million search queries/day.
2001: Google captured the top trending searches and institutionalized them as the Google Zeitgeist.
Google hires Dr. Eric Schmidt as new CEO. Google penetrates further into Asia and Latin America.
2002: Introduction of the Google Search Appliance for corporate Search (Googlebot search robot). Release of Google API.
Google’s History and milestones (2/14) :Google’s History and milestones (2/14) 2002 (cont.): The launch of Google Labs enabled Google’s engineers to present their pet ideas proudly to an adventurous audience.
Google News launched in beta in September of 2002, offering access to 4,500 leading news sources from around the world.
Google launches Google Sets for product search.
2003: Google acquires Pyra Labs and became the home for Blogger.
Google Desktop launched and Google Toolbar improved.
Google’s History and milestones (3/14) :Google’s History and milestones (3/14) 2004: New web-based mail service called Gmail, which at launch included a gigabyte of free storage for each user.
Acquisition of Picasa, Inc. This Pasadena, Calif.-based digital photo management company helps users to organize, manage and share their digital photos.
Acquisition of Keyhole Corp., a digital and satellite image mapping company based in Google's own headquarter town, Mountain View, Calif. The start of Google Earth.
Google’s History and milestones (4/14) :Google’s History and milestones (4/14) 2004 (cont.):Launch of Google Groups (Former usenet groups)
New R&D center to attract the best and brightest among Japanese and other Asian engineers in Tokyo (Japan). Additional engineering center in Washington. 8 billion Web pages indexed.
The Google Books program announced agreements with leading libraries. Enables users worldwide to search scanned books in Google.
Google’s History and milestones (5/14) :Google’s History and milestones (5/14) 2005: The Google Search Appliance spawned a new blue Google Mini. The Mini is the first Google hardware product to be sold only through the Google Store
Google Maps launched for North America
Google introduces My Search History (saving time by knowing, and showing, what you've searched before) and the Google Web Accelerator (saving time serving web pages by "pre-fetching" them – delivering only updated content).
Google acquires San Diego-based web analytics firm Urchin Software. Google Analytics.
Google’s History and milestones (6/14) :Google’s History and milestones (6/14) 2005 (cont.): Own Google Blog runs frequent postings about Google products and people by those who know them best.
Personalized Homepage (iGoogle).
Google Sitemaps. Webmasters are able to prioritize the pages they want crawled first and speed up updates.
Launch of Google Talk.
Launch of Google Blog Search for live content.
Google’s History and milestones (7/14) :Google’s History and milestones (7/14) 2005 (cont.)
We now offer Local service via mobile phones with Google Mobile.
Google Base to upload content in a structured searchable format is launched.
Google acquires Orkut
5,000 Googlers around the world
Google’s History and milestones (8/14) :Google’s History and milestones (8/14) 2006: Google.org aspires to use the power of information and technology to address the global challenges of our age: Climate change, Poverty and Emerging disease.
New Launches:
Page creator
Google Finance
Google Calendar
Custom Search
Trends
Notebook
Google’s History and milestones (9/14) :Google’s History and milestones (9/14) 2006 (cont.): Introduced U.S. Government Search to make it easier to find Federal information and keep up-to-date on government news. Later U.S. Patents.
Multi-Year Pact to Provide Search and Advertising Across Fox Interactive Media's Growing Online Network Including the MySpace Community.
Launch of Checkout.
Tied to Google Search and Google AdWords, improving the user experience with those products and making e-commerce more efficient for everyone
Google’s History and milestones (10/14) :Google’s History and milestones (10/14) 2006 (cont.):Google Packages for specific communities:
Education: Getting into the Education Space. Google for educators and Google Applications.
SME’s: Google Apps for SME’s. Deals with leading ISP’s to promote Google Apps + Domains.
Families: Google Apps for Families.
Mobile Devices: Google for mobile.
Internet Access: Wi-Fi initiatives
Partnerships Matter: Adobe (Toolbar), eBay (Advertising), Intuit (SME’s tools), Dell (SW)
Google’s History and milestones (11/14) :Google’s History and milestones (11/14) 2006 (cont.):
More Google services for Mobile: News, Gmail, Maps, Mobile Ads, SMS Scrap for Orkut users
Video: Acquisition of YouTube
Ecologically compromised: Installation of a large solar panel installation at Mountain View campus.
More Products and Services: Web site Optimizer for Webmasters; Docs&Spreadsheets
Wiki-Platforms acquisition Jot Spot creates Google Sites. Allows easy home page creation.
10,674 Googlers around the world – the first time employee base hit 5 digits
Slide 22:2007: Partnerships and Business Deals .Mobile matters!
China Mobile cooperation: to provide mobile and Internet search services in China
Samsung: Samsung mobile phones equipped with Google TM applications will enable consumers to search information, find locations, and manage their email on the move
More Google Mobile services: Calendar, Picasa Photos
Android Mobile Applications Development: Total of $10 million for best applications developed Google’s History and milestones (12/14)
Slide 23:2007: M&A’s
In-Game Advertising Provider AdScape acquired for ($23 million)
Gapminder's Trendalyzer software, which generates dynamic graphics and other novel effects in displaying facts, figures, and statistics in presentations.
Advertising Network DoubleClick acquired for ($3.24 billion)
Postini; ($625 million) provides on-demand solutions that help protect businesses worldwide from malicious internet-based attacks, and help ensure compliance with government, industry, and internal mandates.
Feedburner acquired ($100 million) Google’s History and milestones (13/14)
Google’s History and milestones (14/14) :2007 :Partnerships > Entering the offline Advertising market
With EchoStar and Astound Cable on a TV Ads Trial
Radio advertising services expanded thanks to a partnership with Clear Channel as well as new support from leading radio station systems for Google AdSense for Audio.
2008 :
Google launches knol,Chrome
Google launches first Android powered T-Mobile G1
Google introduces Search wiki
Gmail tasks-task management tool in Gmail labs Google’s History and milestones (14/14)
Slide 25:Googleplex
Googleplex :Googleplex In Mountain View, Calif.,there's a complex of buildings that serves as the headquarters for one of the most powerful companies in the technology industry. It's the home of Google, the Internet search engine giant.
Inside the Googleplex :Inside the Googleplex Every hour Google receives 144 resumes. That’s 1.26 million resumes a year.
How does Google do it ?!!!!!!!! Just look at the perks.
Google offers employees at its HQ…
On-site physician and dentist
New moms and dads can expense $500 worth of take-out meals
the first month after birth
On-site massage and yoga
Free breakfast, lunch and dinner every day
Private Google child care five minutes from the HQ
On-site dry cleaning and coin laundry
On-site haircuts
Flexible work schedules (roll out of bed at 11 a.m.)
Lap pool
Pool tables and ping pong
Contd….. :Contd….. The 20% time perk – to pursue projects they’re passionate about
Toys, toys and more toys
Beach volleyball
Mini-kitchens throughout
Bring your dogs to work (whenever)
On-site fitness centers
Ride motorized scooters to and from meetings
On-site author presentations and forums
Pianos and lava lamps
High-powered technology and equipment
Video games
Bicycles and large rubber exercise balls
Work with the most brilliant minds in the industry… from all over the world
Slide 29:Life as a Googler…
Slide 30:Google WebSearch
Gooooogle web search :Gooooogle web search ‘What really happens behind the scenes at Google when you hit the “Google Search” button?’
Google uses a special algorithm to generate search results.
While Google shares general facts about its algorithm, the specifics are a company secret
Google has three distinct parts:
Google Web Crawler (Googlebot), which finds and fetches web pages.
Google Indexer, which sorts every word on every page and stores the resulting index of words in a huge database.
Google Query Processor, which compares your search query to the index and recommends the documents that it considers most relevant.
Crawling :Crawling Google sends out Web crawlers, that go through all the Web pages on the World Wide Web. Its principle is simple (but not its implementation!)
Google bots are made up of millions of computers that surf the Web much like how people do it, except they do it much more quickly.
Googlebot, Google’s Web Crawler :Googlebot, Google’s Web Crawler Google uses spiders or crawlers -automated programs, just like most search engines.
To make sure that bots have the most current sites, these Googlebots are working all the time, crawling and re-crawling websites.
How often a site is crawled depends on how quickly its content changes.
For example, a newspaper site that has frequent updates will get crawled more often than a static company site.
The bots also detect all links on the site, which are then put into a queue system for crawling later on. These links are also important in deciding the search ranking of a site.
Indexing :Indexing The Google Indexing software rips through the page and pulls keywords out of it.
Copies of all these pages are stored in Google's index database. The index works like the contents page of a book and contains all the possible search terms.
Query processor :Query processor When a user types in a query in the search engine, the index will match the query with all the webpages in which the query terms appear and then grab the pages for the search results.
Result’s Page :Result’s Page Web pages are ranked before they are displayed on the results page. Google considers some 200 factors when ranking sites.
One of them is PageRank
Page Rank 1/2 :Page Rank 1/2 Page rank is Google’s link analysis algorithm that measure the probability that a page will be relevant to user’s query:
Based on the correlation between the amount of links toward a page & their relevance
Page Rank 2/2 :Page Rank 2/2 PageRank’s simplified formula is: PR(u)=Σ PR(v)
L(v)
Google claims that Page Rank is one of its search engine’s main competitive advantage.
The Search engine’s algorithm was subjected to 450 modification in 2007
How PageRank is determined? :How PageRank is determined? A Web page's PageRank depends on a few factors:
The frequency and location of keywords within the Web page
How long the Web page has existed
The number of other Web pages that link to the page in question: Google looks at how many Web pages link to a particular site to determine its relevance
Current Hardware :Current Hardware Servers are commodity-class x86 PCs running customized versions of Linux.
Estimates of the power required for over 450,000 servers range upwards of 20 megawatts, which cost on the order of US$2 million per month in electricity charges.
Specifications:
Upwards of 15,000 servers ranging from 533 MHz Intel Celeron to dual 1.4 GHz Intel Pentium III (as of 2003). A 2005 estimate by Paul Strassmann has 200,000 servers, while unspecified sources claimed this number to be upwards of 450,000 in 2006.
One or more 80GB hard disks per server (2003)
2–4 GB of memory per machine (2004)
Data centres :Data centres How many data centers does Google have?Nobody knows for sure, and the company isn’t saying.
Google has numerous data centers scattered around the world.
At least 12 significant Google data center installations are located in the United States.
Google Servers :Google Servers Google uses servers for different tasks.
Web servers receive & process user queries, sending the request on to the next appropriate server.
Index servers store Google's indexes and search results.
Document servers to store search summaries, user information, gmail and Google Docs files.
Ad servers store the advertisements Google displays on search pages.
Spelling servers make suggestions about the spelling of queries.
Google Query Processor :Google Query Processor
Technological infrastructure behind Google :Technological infrastructure behind Google Google Web Server (GWS) is the name for the web server software that Google uses for their web infrastructure.
Some people estimate that Google maintains over 450,000 servers, arranged in racks located in clusters in cities around the world.
When an attempt to connect to Google is made, DNS servers resolve www.google.com to multiple IP addresses, which acts as a first level of load balancing by directing clients to different Google clusters.
The exact size and whereabouts of the data centers Google uses are unknown, and official figures remain intentionally vague.
Google for the first time showed off its server design :Google for the first time showed off its server design
Google's big surprise :Google's big surprise Each server has its own 12-volt battery to supply power if there's a problem with the main source of electricity.
The company also revealed for the first time that since 2005, its data centers have been composed of standard 1AAA shipping containers—
each with 1,160 servers and a power consumption that can reach 250 kilowatts.
Google Software 1/2 :Google Software 1/2 Most of the software stack that Google uses on their servers was developed in-house.
It is believed that C++, Java and Python are favored over other programming languages.
Google has acknowledged that Python has played an important role from the beginning, and that it continues to do so as the system grows and evolves.