B101 Overview

download B101 Overview

of 13

Transcript of B101 Overview

  • 8/22/2019 B101 Overview

    1/13

    Module 1: Teradata Product Overview

    After completing this module, you will be able to:

    Describe the purpose of the Teradata product

    Give a brief history of the product

    List major architectural features of the product

  • 8/22/2019 B101 Overview

    2/13

    What is Teradata?

    Teradata is a Relational Database Management System (RDBMS).

    Designed to run the worlds largest commercial databases.

    Preferred solution for enterprise data warehousing

    Executes on UNIX MP-RAS and Windows 2000 operating systems

    Compliant with ANSI industry standards

    Runs on a single or multiple nodes Acts as a database server to client applications throughout the enterprise

    Uses parallelism to manage terabytes of data

    Capable of supporting many concurrent users from various client platforms (over aTCP/IP or IBM channel connection).

    Win XPWin 2000

    UNIXClient

    MainframeClient

    TeradataDATABASE

  • 8/22/2019 B101 Overview

    3/13

    Teradata A Brief History

    1979 Teradata Corp founded in Los Angeles, California

    Development begins on a massively parallel computer

    1982 YNET technology is patented

    1984 Teradata markets the first database computer DBC/1012 First system purchased by Wells Fargo Bank of Cal. Total revenue for year - $3 million

    1987 First public offering of stock

    1989 Teradata and NCR partner on next generation of DBC

    1991 NCR Corporation is acquired by AT&T Teradata revenues at $280 million

    1992 Teradata is merged into NCR

    1996 AT&T spins off NCR Corp. with Teradata product

    1997 Teradata database becomes industry leader in data warehousing

    2000 100+ Terabyte system in production

    2002 Teradata V2R5 released 12/2002; major release including features such as PPI,

    roles and profiles, multi-value compression, and more.2003 Teradata V2R5.1 released 12/2003; includes UDFs, BLOBs, CLOBs, and more.

  • 8/22/2019 B101 Overview

    4/13

    How Large is a Trillion?

    1 Kilobyte = 10

    3

    = 1000 bytes1 Megabyte = 106 = 1,000,000 bytes1 Gigabyte = 109 = 1,000,000,000 bytes1 Terabyte = 1012 = 1,000,000,000,000 bytes1 Petabyte = 1015 = 1,000,000,000,000,000 bytes

    1 million seconds = 11.57 days1 billion seconds = 31.6 years1 trillion seconds = 31,688 years

    1 million inches = 15.7 miles1 trillion inches = 15,700,000 miles (30 roundtrips to the moon)

    1 million square inches = .16 acres = .0002 square miles1 trillion square inches = 249 square miles (larger than Singapore)

    $1 million = < $ .01 for every person in U.S.$1 billion = $ 3.64 for every person is U.S.

    $1 trillion = $ 3,636 for every person in U.S.

  • 8/22/2019 B101 Overview

    5/13

    Designed for Todays Business

    Teradatas Charter meets the business needs of today

    and tomorrow with:

    Relational databasestandard for database design

    Enormous capacity billions of rows, terabytes ofdata

    High performance parallel processing

    Single database server for multiple clientsSingleVersion of the Truth

    Network and mainframe connectivity

    Industry standard access language StructuredQuery Language (SQL)

    Manageable growth via modularity

    Fault tolerance at all levels of hardware andsoftware

    Data integrity and reliability

  • 8/22/2019 B101 Overview

    6/13

    Evolution of Data Processing

    Type Example Number of Rows ResponseAccessed Time

    OLTP Update a checking account Small Secondsto reflect a deposit

    DSS How many child size blue Large Seconds or minutesjeans were sold across

    all of the our Eastern storesin the month of March?

    OLCP Instant credit How much Small to moderate; Minutescredit can be extended to possibly across

    this person? multiple databases

    OLAP Show the top ten selling Large number of Seconds or minutesitems across all stores detail rows orfor 2003. moderate number

    of summary rows

    TRADIT

    IONAL

    T

    oday

    The need to process DSS, OLCP, and OLAP type requests across anenterprise and its data leads to the concept of a Data Warehouse.

  • 8/22/2019 B101 Overview

    7/13

    What is a Data Warehouse?

    A Data Warehouse is a central, enterprise-wide database that contains

    information extracted from Operational Data Stores (ODS).

    Based on enterprise-wide model Can begin small but may grow large rapidly Populated by extraction/loading data from operational systems Responds to end-user what if queries Can store detailed as well as summary data

    OperationalData

    Data Warehouse

    Examples ofAccess Tools

    End Users

    ATM PeopleSoft Point of Service(POS)

    Teradata Database

    TeradataWarehouse Miner

    Cognos MicroStrategies

  • 8/22/2019 B101 Overview

    8/13

    Data Warehouse Usage Evolution

    STAGE 1REPORTING

    WHAThappened?

    STAGE 2ANALYZING

    WHYdid it happen?

    STAGE 3PREDICTING

    WHYwill it happen?

    PrimarilyBatch

    Increase inAd Hoc

    Queries

    AnalyticalModeling

    Grows

    Batch Ad Hoc Analytics

    Continuous Update &Time Sensitive Queries

    Become ImportantContinuous Update

    Short Queries

    STAGE4OPERATIONALIZING

    WHATIS Happening?

    STAGE 5ACTIVE WAREHOUSING

    MAKINGit happen!

    Event-Based

    Triggering

    Event BasedTriggering

    Takes Hold

  • 8/22/2019 B101 Overview

    9/13

    What is Active Data Warehousing?

    Data Warehousing is the timely, integrated, logically consistent store of

    detailed data available foranalytic business decision making.

    Primarily batch feeds and updates Ad hoc queries to support strategic decisions that return in minutes and maybe

    hours

    Active Data Warehousing is the timely, integrated, logically consistentstore of detailed data available forstrategic, tactical driven businessdecisions.

    Timely updates close to real time Short, tactical queries that return in seconds Event driven activity plus strategic queries

    Business requirements for an ADW (Active Data Warehouse)?

    Performance response within seconds Scalability support for large data volumes, mixed workloads, and concurrent

    users Availability 7 x 24 x 365

    Data Freshness Accurate, up to the minute, data

  • 8/22/2019 B101 Overview

    10/13

    Teradatas Competitive Advantages

    Unlimited, Proven Scalability amount of data and number of users; allows

    for an enterprise wide model of the data.

    Unlimited Parallelism parallel access, sorts, and aggregations.

    Mature Optimizer handles complex queries, up to 64 joins per query, ad-hocprocessing.

    Models the Business 3NF, robust view processing, & provides star schemacapabilities.

    Provides a single version of the truth.

    Low TCO (Total Cost of Ownership) ease of setup, maintenance, &administration; no re-orgs, lowest disk to data ratio, and robust expansionutility (reconfig).

    High Availability no single point of failure.

    Parallel Load and Unload utilities robust, parallel, and scalable load and

    unload utilities such as FastLoad, MultiLoad, TPump, and FastExport.

  • 8/22/2019 B101 Overview

    11/13

    Teradata Manageability

    Things a Teradata DBAneverhas to do!

    Reorganize data or index space

    Pre-allocate table/index space, format partitions

    Pre-prepare data for loading (convert, sort, split, etc.)

    Ensure that queries run in parallel

    Unload/reload data spaces due to expansion

    Design, implement and support partition schemes.

    Write or run programs to split the input source files into partitions forloading

    A DBA knows that if the data doubles, the system canexpand easily to accommodate it.

    The command and workload for creating a table that willhave 100,000 rows is the same as creating a table that will

    have 1,000,000,000 rows!

  • 8/22/2019 B101 Overview

    12/13

    Review Questions

    1. Name the two primary operating systems that the Teradata RDBMS executes on.

    ______________________________________________

    2. Which of the following represents a trillion bytes or a TB of data? ____

    a. 106b. 109

    c. 1012d. 1015

    3. Which feature allows Teradata to process enormous volumes of data quickly? ____

    a. High availability software and hardware components

    b. Parallelismc. Proven Scalabilityd. High performance servers from Intel

    4. The Teradata RBDMS is primary a ____ .

    a. Server

    b. Client

  • 8/22/2019 B101 Overview

    13/13

    Module 1: Review Question Answers

    1. Name the two primary operating systems that the Teradata RDBMS executes on.

    UNIX MP-RASWindow s 2000

    2. Which of the following represents a trillion bytes or a TB of data? ____

    a. 106b. 109

    c. 1012d. 1015

    3. Which feature allows Teradata to process enormous volumes of data quickly? ____

    a. High availability software and hardware components

    b. Parallelismc. Proven Scalabilityd. High performance servers from Intel

    4. The Teradata RBDMS is primary a ____ .

    a. Server

    b. Client