Vous êtes sur la page 1sur 21

Distributed Systems

Xining Li
Dept. of Computing and Information Science University of Guelph Canada
Distributed Systems 1

Chapter 1: Introduction

What is a Distributed system? Why do we need distributed Systems? History of Distributed Systems. Applications of Distributed Systems. Goals and Objectives.

Distributed Systems 2

Computer processor performance evolution:

Distributed Systems

Gordon Moore(1965) Prediction:

Distributed Systems

Definition of a Distributed System (Tanenbaum):


A distributed system is:

A collection of independent computers that appears to its users as a single coherent system.

Distributed Systems

Centralised Systems:
System shared by users all the time

All resources accessible


Software runs in a single process Single physical location Single point of control Single point of failure

Distributed Systems

Decentralised Systems:

Multiple autonomous components Components shared by users Some resources may not be accessible Software can run in concurrent processes on different processors Multiple physical locations Multiple points of control Multiple points of failure No global time No shared memory (in most cases)
Distributed Systems 7

Applications (Killer):
Computing dominated problems (distributed computing): Mathematical Computations, Environmental and Biological Modeling, Economic and Financial modeling, Graphics rendering for visualization, Network Simulations. Storage dominated problems (distributed data): Data Mining, Image Processing, Information retrival, Insurance Analysis. Communications dominated problems (network computing): Transaction processing, Video on Demand, Ecom, Electronic banking, electronic shopping
Distributed Systems 8

Common distributed computing examples


rlogin or telnet (for remote access) network file system, network printer etc ATM (cash machine) Distributed databases Network computing Global positioning systems Retail point-of-sale terminals Air-traffic control Enterprise computing WWW
Distributed Systems 9

SETI: Search for Extraterrestrial Intelligence

To look for aliens Radio Telescope: Arecibo (305m) Located in: Puerto Rico Accept 4,000,000 radio bands Screen saver to do analysis UC Berkeley

Distributed Systems

10

Radio Telescope: Arecibo (305m)


Puerto Rico

Distributed Systems

11

Distributed.com RC5

To find the correct solution for RSA Labs Secret Keys Award: $10,000USD RC_56: 250 days (1997) RC_64: 1,757 days (2002) RC_72: ? Ex: RC_64: 18,446,744,073,709,551,616 keys 160,000 PCs are working on this project
Distributed Systems 12

CERN: European Organization for Nuclear Research

CERN is the world's largest particle physics centre. Here physicists come to explore what matter is made of and what forces hold it together. A new Collider: Large Hadron Collider (2007) Expected data: 10,000,000GB Need 20,000,000 CDs to store Solution: Distributed systems
Distributed Systems 13

Brief History of Distributed Systems:


System CM* Cambridge DCS Locus V System Mach CORBA Distributed COM JINI Organization Network Computer PDP LSI-4 PC Sun Sun, PC Any PC Any Date 1975 1979 1980 1982 1985 1990 1996 2000
14

Carnegie Mellon Hierarchical Univ. Bus Cambridge Univ. UCLA Stanford Univ. Cambridge Ring Ethernet Ethernet

Carnegie Mellon Ethernet Univ. OMG Microsoft SUN Internet Internet Internet
Distributed Systems

Basics of Distributed Systems:


Networked computers (close or loosely coupled)

that provide a degree of operation transparency Distributed Computer System = independent processors + networking infrastructure Communication between processes (on the same or different computer) using message passing technologies is the basis of distributed computing

Distributed Systems

15

Goals of Distributed Systems:


Resource sharing: easy for users to access

remote resources. Transparency: to hide the fact that processes and resources are physically distributed across multiple computers. Openness: to offer services according to standard rules. Scalability: easy to expand and manage.
Distributed Systems 16

ISO RM-ODP: forms of transparency :


Transparency Access Location Migration Relocation Replication Concurrency Failure Persistence Description Hide differences in data representation and how a resource is accessed Hide where a resource is located Hide that a resource may move to another location Hide that a resource may be moved to another location while in use Hide that a resource may be shared by several competitive users Hide that a resource may be shared by several competitive users Hide the failure and recovery of a resource

Hide whether a (software) resource is in memory or on disk


Distributed Systems 17

Scalability Problems: Centralized paradigm

Concept

Example

Centralized services Centralized data Centralized algorithms

A single server for all users A single on-line telephone book Doing routing based on complete information

Distributed Systems

18

Scalability Problems: Decentralized paradigm

No machine has complete information about the systems state. Machines make decisions based only on local information. Failure of one machine does not ruin the algorithm. There is no implicit assumption that a global clock exists.
Distributed Systems 19

User Requirements :
What services the system can provide?

How easy to use and manage the system?


What benefits the system can offer? What is the ratio of performance/cost? How reliable the system is? How secure the system can guarantee?

Distributed Systems

20

Distributed Computer System Metrics


Latency network delay before any data is sent Bandwidth maximum channel capacity (analogue communication Hz, digital communication bps) Granularity relative size of units of processing required. Distributed systems operate best with coarse grain granularity because of the slow communication compared to processing speed in general Processor speed MIPS, FLOPS Reliability ability to continue operating correctly for a given time Fault tolerance resilience to partial system failure Security policy to deal with threats to the communication or processing of data in a system Administrative/management domains issues concerning the ownership and access to distributed systems components

Distributed Systems

21