From: Kendra Smith
Sent: Friday, April 21, 2000 11:01 PM
To: M?crosöft Research Tech Talk, Sem. Notice
Cc: Kendra Smith
Subject: UW-CSE Colloq / 4-26-2000 / Maier / OGI / A Petabyte in Your Pocket: Directions for Net Data Management
UW-CSE Colloq / 4-26-2000 / Maier / OGI / A Petabyte in Your Pocket: Directions for Net Data Management
NOTE: This lecture will NOT be videotaped.
UNIVERSITY OF WASHINGTON
Seattle, Washington 98195
Department of Computer Science and Engineering
Box 352350
(206) 543-1695
COLLOQUIUM
SPEAKER: David Maier, Oregon Graduate Institute
TITLE: A Petabyte in Your Pocket: Directions for Net Data Management
DATE: Wednesday, April 26, 2000
TIME: 12noon - 1:20pm
PLACE: EE1-025
HOST: Alon Levy
ABSTRACT:
In 2015, for a few hundred dollars a year, you can have a personal
petabyte database (PetDB) that you can access from any point of
connection, with any device. It stores and organizes any kind of digital
data you want to have, without losing structure or information. All this
data is queryable and it is arranged by type, content, structure,
association and multiple categorizations and groupings. You can also
locate items by when or how you encountered them, what you have done with
them, where you were when you accessed them.
What could you fit in a personal petabyte store? Some possibilities:
*The contents of every book and magazine you ever bought.
*Every web page you've ever visited.
*Every email you've ever sent or received, including all attachments.
*Every version of any piece of software you have ever used.
*Maps and images of every place you have ever been.
*Descriptions and price lists for every kind of product you might want
to buy-a "universal, personal catalog."
*Portions of the web you might want to browse, including snapshots at past
points of time.
Your PetDB doesn't appear to reside on any particular computer-you are
never on the "wrong" machine to access it. More importantly, you don't
have to take any explicit action to insert data into you PetDB; your PetDB
doesn't appear to have an "outside" where data is concerned. Thus your
PetDB is also your personal Internet portal: your evolving and customized
view of all on-line digital data.
The PetDB isn't predicated upon some massive improvement in holographic
memory technology or DNA-based storage units. Rather, it is an example of
what could be done with a new generation of software infrastructure we
term Net Data Managers (NDMs). NDMs are a departure from the capabilities
and structure of current database management systems. They focus on data
movement, rather than data storage, working equally well with live streams
of data as with files in secondary storage. They will be capable of
storing data of arbitrary types, without a matching database schema having
been defined previously. They will efficiently execute queries over
thousands or tens of thousands of information sites. They will locate and
select data items by both internal content as well as a variety of
external contexts. NDMs will also support monitoring rapidly changing
information sources in a way that scales to thousands or even millions of
triggers.
This talk lays out the requirements for Net Data Management, and reports
on the research directions being pursued by the NIAGARA project, a joint
undertaking with David DeWitt and Jeffrey Naughton at the University of
Wisconsin. I will focus in particular on incremental query processing, the
merge operator and some ideas about stream semantics.
Email: talk-info@cs.washington.edu
Info: http://www.cs.washington.edu