Académique Documents
Professionnel Documents
Culture Documents
Teradata corporation
Teradata corporation is a vendor specializing in data warehousing and analytic applications . Its products are commonly used by companies to manage data warehouses for analytical and business intelligence purpose . Teradata was formerly a division of NCR Corporation
Features
Teradata is Relational Database Management System (RDBMS) Used for Data warehousing Executes on Unix ,Windows NT or Windows 2000 operating systems
Win NT Win 98
Teradata DATABASE
IBM Mainframe
UNIX
Models the Business - 3NF, robust view processing, & provides star schema capabilities Low TCO (Total Cost of Ownership) - ease of setup, maintenance, & administration; no re-orgs, lowest disk to data ratio, and robust expansion utility (recon fig). High Availability - no single point of failure. Parallel Load utilities - robust, parallel, and scalable load utilities such as Fast Load, MultiLoad, and TPump.
Gateway S/W
Parser Engine Parser Engine
Message Passing Layer AMP Vdisk AMP Vdisk AMP Vdisk AMP Vdisk
d) MOSI Micro Operating s/m interface -> It provides OS independent interface For connecting to Teradata RDBMS network attached system needs Ethernet adapter
The Parsing Engine is responsible for: Managing individual sessions (up to 120) Dispatching the optimized plan to the AMPs Input conversion (EBCDIC / ASCII) - if necessary Sending the answer set response back to the requesting client
Optimizer
Dispatcher
AMP
AMP
AMP
AMP
Checks for syntax errors Checks the access permission for requested db object Check the existence of requested object and also returns error message if appropriate access is not there.
b)Optimizer It prepares access plan and Explain plan which shows how my query is solved efficiently means it restructure the query in such a way that it will run more efficiently c)Generator Takes the explain plan created by optimizer and converts it into db understandable language called AMP steps
2) MPL Message passing layer -> Handles the internal communication of Teradata dbms -> AMP steps are distributed to corresponding AMP based on the Hashing algorithm over the Message Passing Layer (BYNET)
3) AMP Access module processor -> Virtual processors running under a multitasking environment -> BYNET interface -> Manage database -> Interface to disk subsystem
Parsing Engine
AMP
AMP
AMP
AMP
The AMPs are responsible for: - Finding the rows requested - Lock management - Sorting rows - Aggregating columns - Join processing - Output conversion and formatting - Creating answer set for client - Disk space management - Accounting - Special utility protocols - Recovery processing
Teradata
Parsing Engine(s)
AMP 1
AMP 2
AMP 3
AMP 4
2 18
54
41
12 90 75
80
25
32
67 6
The Parsing Engine dispatches request to insert a row. The Message Passing Layer insures that a row gets to the appropriate AMP (Access Module Processor). The AMP stores the row on its associated (logical) disk. An AMP manages a logical disk which is mapped to multiple physical disks in a disk array.
AMP 1
AMP 2
AMP 3
AMP 4
2 18
54 41
12
80
25 32
90
75
67
6
The Parsing Engine dispatches a request to retrieve one or more rows. The Message Passing Layer insures that the appropriate AMP(s) are activated. The AMP(s) locate and retrieve desired row(s) in parallel access. Message Passing Layer returns to retrieved rows to PE. The PE returns row's to requesting client application.
Teradata Parallelism
Each PE can handle up to 120 sessions in parallel. Each Session can handle multiple REQUESTS. The Message Passing Layer can handle all message activity in parallel. Each AMP can perform up to 80 tasks in parallel. All AMPs can work together in parallel to service any request. Each AMP can work on several requests in parallel.
AMP 1
AMP 2
AMP 3
Ideally, each AMP will hold roughly the same amount of data.
EMPLOYEE ROWS EMPLOYEE ROWS EMPLOYEE ROWS EMPLOYEE ROWS DEPARTMENT ROWS DEPARTMENT ROWS DEPARTMENT ROWS DEPARTMENT ROWS JOB ROWS JOB ROWS JOB ROWS JOB ROWS
SMP
SMP
SMP
SMP
DAC
DAC
DAC
DAC
DAC
DAC
DAC
DAC
Multiple nodes may be configured to provide a Massively Parallel Processing (MPP) system. A physical message passing layer called the BYNET is needed to interconnect multiple nodes. Teradata is a linearly expandable RDBMS - as your database grows, additional nodes may be added.
BYNET Features: Enables multiple SMP nodes to communicate. Automatic load balancing of message traffic. Automatic reconfiguration after fault detection. Fully operational dual BYNETs provide fault tolerance. Scalable bandwidth as nodes are added. Even though there are two physical BYNETs to provide redundancy and bandwidth, Teradata and TCP/IP software only see a single network.
BYNET 0
BYNET 1
SMP
SMP
SMP
SMP
SMP
SMP
SMP
SMP
Teradata file system Is a layer between Teradata RDBMS and PDE Also provides set of service calls that allows Teradata RDBMS to store and retrieve data efficiently
Disk Arrays
A disk array contains drive groups Drive groups contain set of drivers LUN (Logical units) contains a portion of every drive pdisk is a slice of LUN Group of pdisks assigned to an AMP is called vdisk
According to the primary index selected, the rows get distributed randomly among all the AMP's.
Data Distribution
More the unique the primary index, the more even the distribution will be.
THANKS