deleted from the original AIA text. Document B101 – 2017 ...
B101 Overview
Transcript of B101 Overview
-
8/22/2019 B101 Overview
1/13
Module 1: Teradata Product Overview
After completing this module, you will be able to:
Describe the purpose of the Teradata product
Give a brief history of the product
List major architectural features of the product
-
8/22/2019 B101 Overview
2/13
What is Teradata?
Teradata is a Relational Database Management System (RDBMS).
Designed to run the worlds largest commercial databases.
Preferred solution for enterprise data warehousing
Executes on UNIX MP-RAS and Windows 2000 operating systems
Compliant with ANSI industry standards
Runs on a single or multiple nodes Acts as a database server to client applications throughout the enterprise
Uses parallelism to manage terabytes of data
Capable of supporting many concurrent users from various client platforms (over aTCP/IP or IBM channel connection).
Win XPWin 2000
UNIXClient
MainframeClient
TeradataDATABASE
-
8/22/2019 B101 Overview
3/13
Teradata A Brief History
1979 Teradata Corp founded in Los Angeles, California
Development begins on a massively parallel computer
1982 YNET technology is patented
1984 Teradata markets the first database computer DBC/1012 First system purchased by Wells Fargo Bank of Cal. Total revenue for year - $3 million
1987 First public offering of stock
1989 Teradata and NCR partner on next generation of DBC
1991 NCR Corporation is acquired by AT&T Teradata revenues at $280 million
1992 Teradata is merged into NCR
1996 AT&T spins off NCR Corp. with Teradata product
1997 Teradata database becomes industry leader in data warehousing
2000 100+ Terabyte system in production
2002 Teradata V2R5 released 12/2002; major release including features such as PPI,
roles and profiles, multi-value compression, and more.2003 Teradata V2R5.1 released 12/2003; includes UDFs, BLOBs, CLOBs, and more.
-
8/22/2019 B101 Overview
4/13
How Large is a Trillion?
1 Kilobyte = 10
3
= 1000 bytes1 Megabyte = 106 = 1,000,000 bytes1 Gigabyte = 109 = 1,000,000,000 bytes1 Terabyte = 1012 = 1,000,000,000,000 bytes1 Petabyte = 1015 = 1,000,000,000,000,000 bytes
1 million seconds = 11.57 days1 billion seconds = 31.6 years1 trillion seconds = 31,688 years
1 million inches = 15.7 miles1 trillion inches = 15,700,000 miles (30 roundtrips to the moon)
1 million square inches = .16 acres = .0002 square miles1 trillion square inches = 249 square miles (larger than Singapore)
$1 million = < $ .01 for every person in U.S.$1 billion = $ 3.64 for every person is U.S.
$1 trillion = $ 3,636 for every person in U.S.
-
8/22/2019 B101 Overview
5/13
Designed for Todays Business
Teradatas Charter meets the business needs of today
and tomorrow with:
Relational databasestandard for database design
Enormous capacity billions of rows, terabytes ofdata
High performance parallel processing
Single database server for multiple clientsSingleVersion of the Truth
Network and mainframe connectivity
Industry standard access language StructuredQuery Language (SQL)
Manageable growth via modularity
Fault tolerance at all levels of hardware andsoftware
Data integrity and reliability
-
8/22/2019 B101 Overview
6/13
Evolution of Data Processing
Type Example Number of Rows ResponseAccessed Time
OLTP Update a checking account Small Secondsto reflect a deposit
DSS How many child size blue Large Seconds or minutesjeans were sold across
all of the our Eastern storesin the month of March?
OLCP Instant credit How much Small to moderate; Minutescredit can be extended to possibly across
this person? multiple databases
OLAP Show the top ten selling Large number of Seconds or minutesitems across all stores detail rows orfor 2003. moderate number
of summary rows
TRADIT
IONAL
T
oday
The need to process DSS, OLCP, and OLAP type requests across anenterprise and its data leads to the concept of a Data Warehouse.
-
8/22/2019 B101 Overview
7/13
What is a Data Warehouse?
A Data Warehouse is a central, enterprise-wide database that contains
information extracted from Operational Data Stores (ODS).
Based on enterprise-wide model Can begin small but may grow large rapidly Populated by extraction/loading data from operational systems Responds to end-user what if queries Can store detailed as well as summary data
OperationalData
Data Warehouse
Examples ofAccess Tools
End Users
ATM PeopleSoft Point of Service(POS)
Teradata Database
TeradataWarehouse Miner
Cognos MicroStrategies
-
8/22/2019 B101 Overview
8/13
Data Warehouse Usage Evolution
STAGE 1REPORTING
WHAThappened?
STAGE 2ANALYZING
WHYdid it happen?
STAGE 3PREDICTING
WHYwill it happen?
PrimarilyBatch
Increase inAd Hoc
Queries
AnalyticalModeling
Grows
Batch Ad Hoc Analytics
Continuous Update &Time Sensitive Queries
Become ImportantContinuous Update
Short Queries
STAGE4OPERATIONALIZING
WHATIS Happening?
STAGE 5ACTIVE WAREHOUSING
MAKINGit happen!
Event-Based
Triggering
Event BasedTriggering
Takes Hold
-
8/22/2019 B101 Overview
9/13
What is Active Data Warehousing?
Data Warehousing is the timely, integrated, logically consistent store of
detailed data available foranalytic business decision making.
Primarily batch feeds and updates Ad hoc queries to support strategic decisions that return in minutes and maybe
hours
Active Data Warehousing is the timely, integrated, logically consistentstore of detailed data available forstrategic, tactical driven businessdecisions.
Timely updates close to real time Short, tactical queries that return in seconds Event driven activity plus strategic queries
Business requirements for an ADW (Active Data Warehouse)?
Performance response within seconds Scalability support for large data volumes, mixed workloads, and concurrent
users Availability 7 x 24 x 365
Data Freshness Accurate, up to the minute, data
-
8/22/2019 B101 Overview
10/13
Teradatas Competitive Advantages
Unlimited, Proven Scalability amount of data and number of users; allows
for an enterprise wide model of the data.
Unlimited Parallelism parallel access, sorts, and aggregations.
Mature Optimizer handles complex queries, up to 64 joins per query, ad-hocprocessing.
Models the Business 3NF, robust view processing, & provides star schemacapabilities.
Provides a single version of the truth.
Low TCO (Total Cost of Ownership) ease of setup, maintenance, &administration; no re-orgs, lowest disk to data ratio, and robust expansionutility (reconfig).
High Availability no single point of failure.
Parallel Load and Unload utilities robust, parallel, and scalable load and
unload utilities such as FastLoad, MultiLoad, TPump, and FastExport.
-
8/22/2019 B101 Overview
11/13
Teradata Manageability
Things a Teradata DBAneverhas to do!
Reorganize data or index space
Pre-allocate table/index space, format partitions
Pre-prepare data for loading (convert, sort, split, etc.)
Ensure that queries run in parallel
Unload/reload data spaces due to expansion
Design, implement and support partition schemes.
Write or run programs to split the input source files into partitions forloading
A DBA knows that if the data doubles, the system canexpand easily to accommodate it.
The command and workload for creating a table that willhave 100,000 rows is the same as creating a table that will
have 1,000,000,000 rows!
-
8/22/2019 B101 Overview
12/13
Review Questions
1. Name the two primary operating systems that the Teradata RDBMS executes on.
______________________________________________
2. Which of the following represents a trillion bytes or a TB of data? ____
a. 106b. 109
c. 1012d. 1015
3. Which feature allows Teradata to process enormous volumes of data quickly? ____
a. High availability software and hardware components
b. Parallelismc. Proven Scalabilityd. High performance servers from Intel
4. The Teradata RBDMS is primary a ____ .
a. Server
b. Client
-
8/22/2019 B101 Overview
13/13
Module 1: Review Question Answers
1. Name the two primary operating systems that the Teradata RDBMS executes on.
UNIX MP-RASWindow s 2000
2. Which of the following represents a trillion bytes or a TB of data? ____
a. 106b. 109
c. 1012d. 1015
3. Which feature allows Teradata to process enormous volumes of data quickly? ____
a. High availability software and hardware components
b. Parallelismc. Proven Scalabilityd. High performance servers from Intel
4. The Teradata RBDMS is primary a ____ .
a. Server
b. Client