Search CS site
Search WWW
Maintained by web@cs.rutgers.edu

Rutgers University
DCIS Colloquium
Date: Friday, April 2, 2004
Time: 2:00 PM
Location: CoRE Building First Floor Auditorium, Busch Campus, Rutgers University

Title: Finding needles in a 20 TB haystack, 200 million times per day


Speaker: Craig Nevill-Manning, Google


Faculty Host: Thu D. Nguyen

Abstract:

Google faces two large technical challenges: ensuring that our search results are as relevant as possible, and serving hundreds of millions of queries in a fraction of a second at a reasonable cost. To solve the first problem, we perform an offline matrix computation to produce PageRank, a query independent measure of page reputation, and combine it with more traditional query-specific scoring. To solve the distributed computing problem, we use tens of thousands of commodity PCs and highly fault-tolerant software. I will discuss some details of these solutions, and also share some interesting statistical tidbits about search and the web.

Speaker Bio:

Craig Nevill-Manning is a Senior Staff Research Scientist at Google, and Director of the New York Engineering center in Manhattan. He is responsible for the research behind Froogle and web definitions. Prior to his three years at Google, Dr. Nevill-Manning was an assistant professor in the Computer Science Department at Rutgers University.