CS 553
 
Internet Services
Spring 2006

Time: Wednesdays, 5th and 6th Period (3:20-6:20PM)
 Place: Hill 254

Instructor: Richard Martin

Announcements

Overview

This course is about the technologies needed to construct large-scale, highly available services. We will utilize the new Web services set of standards to ease the construction of distributed client server computing.  This class will explore how web services can enable highly available large-scale services can be realized with a minium of human labor.

There will be a research component (50%) and a technology exploration component (50%).
For the research component, each student will present 1-2 papers on related background or research results to the application of these emerging WS technologies.  Students will also write, critique and revise a short position paper. For the tech exploration components, students will (1) extend a web service, (2) write an assertion program to test the correctnes of the service and (3) try to cause anther group's assertions to fail (e.g. The web service does not operate correctly, but no automatic system catches the fault.

Prerequisites

Students should be very familiar with  material in cs352 (networking) and cs336 (databases). In addition, students should know some of the material in cs 416 (operating systems).

Objectives

At the end of course, students should have a deep understanding of web services and their strengths, weaknesses, and resulting potential to deliver on their expectations of distributed computing.

Students should also learn how to critically evaluate the various design schemes and evaluation methodologies proposed in the literature. In addition, students are expected to articulate why various designs and methodologies have, or will have, succeeded or failed.

Course Structure and Expected Work

We will focus on a suite of commercial services, as opposed to scientific or medical, because these are currently the most well-defined as to how the interconnections between services. 

As an advanced graduate-level course, student participation is essential. Students are not only expected to have read the paper, but also are expected to form critical judgments bring them to class, and express them during the discussion period.

Students will be required to write a 5-6 page position paper on related to a topic provided by the instructor .  In addition, as part of the grade for the position paper, students will evaluate the position papers of two another students (anonymously).

Grading and Evaluation

Schedule

Week

Dates

Assignment

Topic

Readings

Presenters

Presentation

1

01/18/06

 

 

 

R. Martin 

Introduction 

 


 


 


PDF

2

01/25/06




S. Smalldone


 




 

T. Uphill


3

02/01/06


 


R. Martin


 




 


 

4

02/08/06




 S. Alexandrov

 

 





 A. Tjang

 

5

02/15/06

 

 


T. Chumash







M. Qadir


6

02/22/06


 

 

 S. Smalldone

 

 


 


 

 S. Bhagat

 

7

03/01/06

 

 


 K. Ramachandran

 

 



 

 

 A. Stere

 

8

03/08/06


 

 

 L. Han

 

 


 


 

 T. Chumash

 

9

03/15/06

 

SPRING

 

 

 

 


 

BREAK

 

 

 

10

03/22/06

 

 


 S. Bhagat


 


 

 


 A. Stere

 

11

03/29/05

 

 

 

 T. Uphill

 

 


 

 


 S. Sayyid

 

12

04/05/06


 

 

 M. Qadir

 

 



 


 S. Sayyid


13

04/12/06

 

 

 

 S. Alexandrov

 

 



 


 L. Han

 

14

04/19/06


 


 

 

 



 


 

 

15

04/26/06


 


 


 


 

 

 

 

 

16      

04/31/06


 


 


17     



 

 

 

 

 

Reading List

Introduction and Motivation

  1. Presenter: T. Uphill
    Author: Eric A. Brewer
    Title: Lessons from Giant-Scale Services
    Published: IEEE Internet Computing, Vol 5, Num. 4
    URL: readings/brewer01.pdf

  2. Presenter: Steve Smalldone
    Title: Why Do Internet Services Fail, and What Can Be Done About It?
    Author: David Oppenheimer, Archana Ganapathi, and David A. Patterson
    Published: USITS 2003
    URL:http://roc.cs.berkeley.edu/papers/usits03.pdf

Human Factors and Computer Systems

  1. Presenter: R. Martin
    Title: Understanding and Dealing with Operator Mistakes in Internet Services
    Author: Kiran Nagaraja, Fabio Olivera, Ricardo Bianchini, Richard P. Martin, Thu D. Nguyen
    Published: Proceedings of the 6th Symposium on Operating Systems Design and Implementation (OSDI '04), December, 2004.
    URL:

  2. Title: Towards Availability and Maintainability Benchmarks: A Case Study of Software RAID Systems
    Author: Aaron B. Brown
    Published: UC Berkeley Technical Report UCB/CSD-01-1132, Jan. 2001 (Note: Read the Full TR, not the Usenix version)
    URL: readings/brown01.pdf

Assertion Languages and Systems

  1. Presenter: A. Alexandrov
    Title: Performance Assertion Checking
    Author: S. E. Perl, W. E. Weihl
    Published: SOSP
    URL: readings/sharon_perl.pdf

  2. Presenter: Andrew Tjang
    Title: Writing assertion programs using the A language and run-time.
    URL:

  3. Presenter:
    Title: The Z notation
    Author:
    Published:
    URL:

Traditional Dependability Concepts

  1. Presenter: S. Smalldone
    Title: Fundamental concepts in dependability
    Author: A. Avizienis, J.-C. Laprie, B. Randell
    Published: Proceedings of the Third Information Survivability Workshop, October 2000.
    URL: readings/avizienis00.pdf

  2. Presenter: T. Chumash
    Title: Why do computers stop and what can be done about it?
    Author: J. Gray.
    Published: Proceedings Fifth Symposium on Reliability in Distributed Software and Database Systems, Jan. 1986.
    URL: readings/gray85.pdf

  3. Presenter: M. Qadir
    Title: A census of Tandem system availability between 1985 and 1990
    Author: Author = J. Gray
    Title: IEEE Transactions on Reliability
    URL: readings/gray-whystop-90.pdf

Availability Measurment, Modeling and Anaysis

  1. Presenter: S. Bhagat
    Title: Specification and Construction of Performability Models.
    Author: J. F. Meyer and W. H. Sanders.
    Published: Proceedings of the 2nd International Workshop on Performability Modeling of Computer and Communication Systems, June, 1993.
    URL: http://www.crhc.uiuc.edu/PERFORM/Papers/USAN_papers/93M01.pdf

  2. Presenter: K. Ramachandran
    Title: Approximate availability analysis of VAXcluster systems.
    Author: O. C. Ibe, R. C. Howe, and K. S. Trivedi
    Published: IEEE Trans. Rel., 38(1):146-152, Apr. 1989.
    URL: readings/ibe89.pdf

  3. Presenter:
    Title: Using Fault Injection and Modeling to Evaluate the Performability of Cluster-Based Services
    Author: Kiran Nagaraja, Xiaoyan Li, Ricardo Bianchini, Richard P. Martin, and Thu D. Nguyen,
    Published: USITS 2003
    URL: http://www.panic-lab.rutgers.edu/Research/mendosus/publications/DCS-TR-491.pdf

  4. Presenter:
    Title: Performability Analysis of Wirelss Cellular Networks
    Author: K. S. Trivedi, X. Ma
    Published: Proc. Symposium on Performance Evaluation of Computer and Telecommunication Systems, SPECTS2002, July 2002
    URL:

  5. Presenter: A. Stere
    Title: A Probabilistic Approach to Estimating Computer System Reliability
    Author: Robert Apthorpe
    Published: LISA 2001
    URL: http://www.usenix.org/events/lisa2001/tech/apthorpe.html

  6. Presenter: L. Han
    Title: An Approach to Benchmarking Configuration Complexity
    Author: A. B. Brown, J. L. Hellerstein
    Published: Proc. 2004 SIGOPS European Workshop

  7. Presenter:
    Title: Measuring End-User Availability on the Web: Practical Experience.
    Author: M. Merzbacher, Dan Patterson
    Published: International Performance and Dependability Symposium (DSN), Washington DC, June 2002.
    URL: readings/merzbacher02.pdf

Human Operators and Interaction

  1. Presenter: S. Bhagat
    Title: Task-Analytic Models of Human Operators: Designing Operator-Machine Interaction
    Author: Christine M. Mitchell
    Published: Technical report, Georgia Institute of Technology, 1996.
    URL: readings/mitchell96.pdf

  2. Presenter: A. Stere
    Title: Research Issues in No-Futz Computing
    Author: David A. Holland, William Josephson, Kostas Magoutis, Margo I. Seltzer, Christopher A. Stein, Ada Lim
    Published: HotOS 2001
    URL:

  3. Title: Risk Analysis, Impact and Interaction Modelling
    Author: A.M. Dearden, M.D. Harrison
    Publishbed: Proceedings of DSVIS 96
    URL: http://citeseer.nj.nec.com/35406.html

Misconfiguration Errors

  1. Presenter: T. Uphill
    Title: Understanding BGP Misconfiguration
    Author: Ratul Mahajan
    Published: SIGCOMM 2002
    URL:

  2. Presenter: S. Sayyid
    Title: Impact of Configuration Errors on DNS Robustness
    Author: Vasileios Pappas, Zhiguo Xu, Songwu Lu, Daniel Massey, Andreas Terzis , Lixia Zhang.
    Published: SIGCOMM 2004
    URL:

  3. Presenter: M. Qadir
    Title: Automatic Misconfiguration Troubleshooting with PeerPressure
    Author: Helen J. Wang, John C. Platt, Yu Chen, Ruyun Zhang, and Yi-Min Wang
    Publsihed: OSDI 2004

Fault Detection and Diagnosis

  1. Presenter: S. Sayyid
    Title: Pinpoint: Problem Determination in Large, Dynamic Internet Services
    Author: M. Chen, E. Kiciman, E. Brewer and A. Fox
    Published: In Proceedings of the International Conference on Dependable Systems and Networks (DSN-2002), Washington, D.C., June 2002.
    URL:http://roc.cs.berkeley.edu/papers/roc-pinpoint-ipds.pdf

  2. Presenter: S. Alexandrov
    Title: Combining Visualization and Statistical Analysis to Improve Operator Confidence and Efficiency for Failure Detection and Localization
    Author: Peter Bodik, Greg Friedman, Lukas Biewald, Helen Levine, George Candea, Kayur Patel, Gilman Tolle, Jon Hui, Armando Fox, Michael I. Jordan, David Patterson.
    Published: The 2nd IEEE International Conference on Autonomic Computing (ICAC '05), June 2005
    URL:

  3. Presenter: L. Han
    Title: Diagnosing Network-Wide Traffic Anomalies
    Author: Anukool Lakhina , Mark Crovella, Christophe Diot
    Published: SIGCOMM 2004
    URL:

  4. Presenter: K. Ramachandran
    Title: Detecting and handling MAC layer Misbehavior in wireless networks
    Author: Kyasanur and Vaidy
    Published:
    URL: