3 Databases Ver2

  • 7/31/2019 3 Databases Ver2

    Udayan Shinde 2011

    Information and Databases

  • Data

    Is a basic unit of information in a raw and unorganized format

    Bits

    On/Off

    0 or 1

    Text, Images, Audio, Video

    Character   A          z          9
    ASCII       65         122        57
    Binary      01000001   01111010   00111001
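
    The character, ASCII, and binary rows above can be reproduced with a short Python sketch. The helper name `describe_char` is an illustrative assumption and does not come from the slides.

```python
def describe_char(ch):
    """Return the character, its ASCII code, and the code as 8 binary digits."""
    code = ord(ch)               # numeric code of the character
    bits = format(code, "08b")   # zero-padded to one byte (8 bits)
    return ch, code, bits


for ch in "Az9":
    print(describe_char(ch))
```

    Each character is stored as a number, and each number is stored as a pattern of bits that are either on (1) or off (0).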

  • Data

    Image

  • Table

    NO | Firstname | Lastname | Birthdate  | Address                                              | Mobile     | Department | Picture | Position  | Salary
    1  | Amit      | Joshi    | 1/1/1980   | 121, Akruti, SB Road, Pune-001                       | 9898967676 | Marketing  |         | Executive | 15000
    2  | Amit      | Raj      | 2/1/1975   | 878/1, Galaxy Complex, Flat No. 301, MG Rd, Pune-004 | 8808811111 | Accounts   |         | Manager   | 22000
    3  | Madhuri   | Agarwal  | 13/11/1982 | 77/1, Air Palace, Wakad, Pune-033                    | 9823046789 | HR         |         | Director  | 30000

  • Table

    A table is a set of data elements (values) that is organized using a model of vertical columns (which are identified by their name) and horizontal rows.

    Rows

    Columns

    Fields

    Refer to the data in a row/column

    Records

    Contain several data fields

  • Database

    Several data fields form a record

    Several Records form a data file/table

    Several Data Files form a database

    Collection of data in an organized format

    Structured collection of electronically stored data, which is controlled and accessed through computers based on predefined relationships between predefined types of data items, related to a specific business, situation or problem
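
    The build-up from fields to records to tables to a database can be sketched with plain Python structures. The values come from the sample employee table in the slides; the key names and the `departments` table are assumptions made for illustration.

```python
# Several data fields form a record.
record_1 = {"no": 1, "firstname": "Amit", "lastname": "Joshi", "salary": 15000}
record_2 = {"no": 2, "firstname": "Amit", "lastname": "Raj", "salary": 22000}

# Several records form a data file/table.
employees = [record_1, record_2]

# Several data files form a database (second table is an assumed example).
departments = [{"name": "Marketing"}, {"name": "Accounts"}]
database = {"employees": employees, "departments": departments}

# A field is reached by table, then record, then field name.
first_salary = database["employees"][0]["salary"]
print(first_salary)
```

    A real DBMS adds storage, access control, and a query language on top of this basic shape.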

  • Database Management Systems

    A set of programs to define, update, control and maintain the database

    Stores data in an efficient and compact manner

    Provides data access to external applications

    Typically supports a query language
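
    As a rough illustration of defining, updating, and querying data through a DBMS, the sketch below uses SQLite via Python's standard `sqlite3` module. The choice of SQLite, the table and column names, and the updated salary are assumptions for illustration; the slides do not name a specific product.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database

# Define: create the table structure.
conn.execute(
    "CREATE TABLE employee (emp_no INTEGER PRIMARY KEY, firstname TEXT, salary INTEGER)"
)

# Update: insert rows, then change one of them (new salary is invented).
conn.executemany(
    "INSERT INTO employee (emp_no, firstname, salary) VALUES (?, ?, ?)",
    [(1, "Amit", 15000), (3, "Madhuri", 30000)],
)
conn.execute("UPDATE employee SET salary = ? WHERE emp_no = ?", (16000, 1))

# Query: SQL gives an external application access to the stored data.
rows = conn.execute(
    "SELECT firstname, salary FROM employee ORDER BY emp_no"
).fetchall()
print(rows)
conn.close()
```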

  • Data Tables

  • File system/DB Concerns

    Data size increases to a huge extent

    Lacks organization

    Redundancy

    Repeating groups

    Subject to modification anomalies

    Difficult to Manage

    Not Efficient

    Security
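
    Redundancy and modification anomalies from the list above can be illustrated with a small sketch. All names and values are invented, and the helper `managers_of` is hypothetical.

```python
# Flat file: the department's manager is repeated on every employee row.
flat_rows = [
    {"employee": "A", "department": "Sales", "dept_manager": "Rao"},
    {"employee": "B", "department": "Sales", "dept_manager": "Rao"},
    {"employee": "C", "department": "HR", "dept_manager": "Iyer"},
]

# A careless edit updates only one of the redundant copies.
flat_rows[0]["dept_manager"] = "Mehta"


def managers_of(rows, department):
    """Collect every manager name recorded for one department."""
    return {r["dept_manager"] for r in rows if r["department"] == department}


inconsistent = managers_of(flat_rows, "Sales")
print(inconsistent)  # the Sales department now has two recorded managers
```

    Because the same fact is stored twice, one partial update leaves the data contradicting itself.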

  • Database Structure

    Hierarchical Structure

    Network Structure

    Relational Structure

    Multidimensional Structure

  • Types of Database

    OLTP Vs OLAP

    Relational DBs

    Multidimensional DBs

    Data Warehouses

    Geographic Information Systems

    Text and Image DBs

    Hypermedia DB

  • Relational Database Management Systems

  • RDBMS

    Normalization

    Organize and store data in a relational model for efficient data management

    Entity Relations

  • Normalization

    Normalization

    Efficiently organize data

    Eliminate redundancy

    Storing only related data

    Expandable and easier to extend

    Make data model more informative

  • Normalization

    First Normal Form (1NF)

    Eliminate duplicative columns from the same table.

    Create separate tables for each group of related data and identify each row with a unique column or set of columns (the primary key).
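
    A minimal sketch of the first normal form steps above, assuming an invented table with the duplicative columns `phone1` and `phone2`. The data and names are hypothetical.

```python
# Un-normalized rows with duplicative columns (phone1, phone2).
unnormalized = [
    {"emp_no": 1, "name": "Amit", "phone1": "111", "phone2": "222"},
    {"emp_no": 2, "name": "Madhuri", "phone1": "333", "phone2": None},
]

# One table per group of related data; emp_no is the primary key here.
employees = [{"emp_no": r["emp_no"], "name": r["name"]} for r in unnormalized]

# Phones move to their own table, keyed by the pair (emp_no, phone).
phones = [
    {"emp_no": r["emp_no"], "phone": r[col]}
    for r in unnormalized
    for col in ("phone1", "phone2")
    if r[col] is not None
]
print(employees)
print(phones)
```

    An employee can now have any number of phone numbers without adding more columns.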

  • Normalization

    Second Normal Form (2NF)

    Meet all the requirements of the first normal form.

    Remove subsets of data that apply to multiple rows of a table and place them in separate tables.

    Create relationships between these new tables and their predecessors through the use of foreign keys.
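
    The sketch below illustrates the steps above with SQLite through Python's standard `sqlite3` module: department data that applies to many employee rows moves to its own table and is linked back by a foreign key. The schema and the use of SQLite are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

# Department details apply to many employee rows, so they get their own table.
conn.execute("CREATE TABLE department (dept_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    """CREATE TABLE employee (
           emp_no INTEGER PRIMARY KEY,
           firstname TEXT,
           dept_id INTEGER REFERENCES department(dept_id)
       )"""
)
conn.executemany("INSERT INTO department VALUES (?, ?)", [(1, "Marketing"), (2, "HR")])
conn.executemany(
    "INSERT INTO employee VALUES (?, ?, ?)", [(1, "Amit", 1), (3, "Madhuri", 2)]
)

# The foreign key lets a join rebuild the combined view.
joined = conn.execute(
    """SELECT e.firstname, d.name
       FROM employee AS e JOIN department AS d ON e.dept_id = d.dept_id
       ORDER BY e.emp_no"""
).fetchall()

# A row pointing at a missing department is rejected.
try:
    conn.execute("INSERT INTO employee VALUES (4, 'Nobody', 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True
print(joined, rejected)
conn.close()
```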

  • Normalization

    Third Normal Form (3NF)

    Meet all the requirements of the second normal form.

    Remove columns that are not dependent upon the primary key.

    Other Normal Forms

    Boyce-Codd Normal Form (BCNF)

    Fourth Normal Form (4NF)
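
    As a sketch of the third normal form rule above, assume an invented employee table in which `dept_city` depends on the department rather than on the primary key `emp_no`. All values are hypothetical.

```python
# Before 3NF: dept_city depends on department, not on the key emp_no.
employees_before = [
    {"emp_no": 1, "name": "A", "department": "Sales", "dept_city": "Pune"},
    {"emp_no": 2, "name": "B", "department": "Sales", "dept_city": "Pune"},
    {"emp_no": 3, "name": "C", "department": "HR", "dept_city": "Mumbai"},
]

# Move the dependent column into a table keyed by department.
departments = {
    r["department"]: {"dept_city": r["dept_city"]} for r in employees_before
}

# Keep only columns that depend on emp_no, plus the reference to department.
employees_after = [
    {k: r[k] for k in ("emp_no", "name", "department")} for r in employees_before
]
print(departments)
print(employees_after[0])
```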

  • Entity Relationship Diagrams

    To show how information is, or should be, stored and used within a business system

    Success of any IS depends on efficient flow and processing of the information

    Identifying entities and their relationships

  • Entity Relationship Diagrams

    Entities are the "things" about which we seek information.

    Attributes are the data we collect about the entities.

    Relationships provide the structure needed to draw information from multiple entities.

    ~ERwin, MS Visio, Oracle Designer, Toad Data Modeler

  • Relationship types

    One-to-Many

    One-to-One

    Many-to-Many
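
    One common way to express the three relationship types in a relational schema is sketched below with SQLite through Python's `sqlite3` module. The entities (department, employee, badge, project) are invented for illustration and do not come from the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- One-to-many: one department has many employees.
    CREATE TABLE department (dept_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE employee (
        emp_no INTEGER PRIMARY KEY,
        name TEXT,
        dept_id INTEGER REFERENCES department(dept_id)
    );

    -- One-to-one: the UNIQUE key allows at most one badge per employee.
    CREATE TABLE badge (
        badge_id INTEGER PRIMARY KEY,
        emp_no INTEGER UNIQUE REFERENCES employee(emp_no)
    );

    -- Many-to-many: a junction table links employees and projects.
    CREATE TABLE project (project_id INTEGER PRIMARY KEY, title TEXT);
    CREATE TABLE employee_project (
        emp_no INTEGER REFERENCES employee(emp_no),
        project_id INTEGER REFERENCES project(project_id),
        PRIMARY KEY (emp_no, project_id)
    );
    """
)
tables = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]
print(tables)
conn.close()
```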

  • Entity Relationship Diagrams

  • Types of Database

    Relational DBs

    Multidimensional DBs

    Data Warehouses

    Geographic Information Systems

    Text and Image DBs

    Hypermedia DB

  • Multidimensional Database

    Variation of relational structure

    Uses a multidimensional structure to organize data and express relationships between data

    Cubes of data

    Each side is considered a dimension

    Each cell contains aggregated data related to elements along each dimension
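
    A data cube can be sketched as a mapping from one element per dimension to an aggregated value. The dimensions (product, region, quarter), the sales figures, and the helper `roll_up` are assumptions made for illustration.

```python
from collections import defaultdict

# Invented sales facts: (product, region, quarter, amount).
facts = [
    ("Laptop", "West", "Q1", 100),
    ("Laptop", "West", "Q1", 50),
    ("Laptop", "East", "Q2", 70),
    ("Phone", "West", "Q1", 30),
]

# Each cell is keyed by one element from every dimension and holds
# the aggregated (summed) amount for that combination.
cube = defaultdict(int)
for product, region, quarter, amount in facts:
    cube[(product, region, quarter)] += amount


def roll_up(cube, axis):
    """Sum the cube over every dimension except the one at index `axis`."""
    totals = defaultdict(int)
    for key, amount in cube.items():
        totals[key[axis]] += amount
    return dict(totals)


by_product = roll_up(cube, 0)
print(cube[("Laptop", "West", "Q1")], by_product)
```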

  • Types of Database

    OLTP Vs OLAP

    Relational DBs

    Multidimensional DBs

    Data Warehouses

    Geographic Information Systems

    Text and Image DBs

    Hypermedia DB

  • Data Warehouse

  • Data Warehouse

    Storing company-wide data together

    Repository for historical data

    Organized data vital for creating reports and monitoring an organization's status

    Backbone of Business Intelligence

  • Data Warehouse

    Combination of DB and software to support business analysis and decision making

    Design steps

    Extraction, consolidation, filtering, cleansing, transformation, aggregation and updating

    Data Mart: Subset of a larger data warehouse devoted to a particular business function or department
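
    The design steps listed above can be sketched as a tiny pipeline. The two source systems, their records, and the function names are all hypothetical; real warehouse loading is considerably more involved.

```python
# Invented records from two hypothetical source systems.
sales_system = [
    {"customer": " amit ", "amount": "1500"},
    {"customer": "Madhuri", "amount": "n/a"},
]
web_system = [{"customer": "AMIT", "amount": "500"}]


def extract():
    """Extraction and consolidation: pull rows from every source into one list."""
    return sales_system + web_system


def cleanse(rows):
    """Filtering, cleansing and transformation of the raw rows."""
    cleaned = []
    for row in rows:
        if not row["amount"].isdigit():
            continue  # filter out records whose amount cannot be parsed
        cleaned.append(
            {
                "customer": row["customer"].strip().title(),  # cleanse the name
                "amount": int(row["amount"]),                 # transform text to int
            }
        )
    return cleaned


def aggregate(rows):
    """Aggregation: total amount per customer."""
    totals = {}
    for row in rows:
        totals[row["customer"]] = totals.get(row["customer"], 0) + row["amount"]
    return totals


warehouse = {}
warehouse.update(aggregate(cleanse(extract())))  # updating the warehouse
print(warehouse)
```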

  • Data Mining

    Analysis and Summarizing

    Discover patterns: Classes, Clusters, Associations, Sequential Patterns

    To help determine relationships among internal factors

    Data Mining is the discovery of useful patterns in data. It is used for prediction analysis and classification, e.g. what is the likelihood that a customer will migrate to a competitor.

    ~IBM Intelligent Miner, Cognos Scenario, SAS Enterprise Miner
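
    As one small example of discovering associations, the sketch below counts how often pairs of items occur together and keeps the frequent ones. The baskets and the 0.5 support threshold are invented; this is a simple co-occurrence count and not the method used by any of the tools named above.

```python
from collections import Counter
from itertools import combinations

# Invented shopping baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "milk", "eggs"},
    {"milk", "eggs"},
    {"bread", "milk"},
]

# Count how often each pair of items appears in the same basket.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# Support = share of baskets that contain the pair.
support = {pair: count / len(baskets) for pair, count in pair_counts.items()}
frequent = {pair for pair, s in support.items() if s >= 0.5}
print(frequent)
```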

  • Data as a resource

    Extremely Vital Resource

    Having quality data about the internal and external environment is a key success factor

    Information Quality

    Accuracy, Precision and Completeness, Source

    Information Accessibility

    Availability

    Information Presentation

    Easily available summarized data in a presentable format