1010-Chapter5

download 1010-Chapter5

of 61

Transcript of 1010-Chapter5

  • 7/31/2019 1010-Chapter5

    1/61

    ITEC 1010 Information and Organizations

    Chapter 5

    Organizing Data and

    Information

  • 7/31/2019 1010-Chapter5

    2/61

    ITEC 1010 Information and Organizations

    Data

    Data A necessity for almost any enterprise to carry out its

    business. Consists of raw facts, and when organizedmay be transformed into information

    Database A collection of data organized to meet users needs

    Database management system (DBMS) A group of programs that manipulate the database andprovide an interface between the database and the userof the database or other application programs

  • 7/31/2019 1010-Chapter5

    3/61

    ITEC 1010 Information and Organizations

    DBMS Discussion (1)

    A collection of programs that enables you to store, modify, and

    extract information from a database. There are many different types

    of DBMSs, ranging from small systems that run on personal

    computers to huge systems that run on mainframes. The following

    are examples of database applications: computerized library systems

    automated teller machines

    flight reservation systems computerized parts inventory systems

    From a technical standpoint, DBMSs can differ widely. The terms

  • 7/31/2019 1010-Chapter5

    4/61

    ITEC 1010 Information and Organizations

    DBMS Discussion (2)

    relational, network, flat, and hierarchical all refer to the way a

    DBMS organizes information internally. The internal organization can

    affect how quickly and flexibly you can extract information.

    Requests for information from a database are made in the form of a

    query, which is a stylized question. For example, the query

    SELECT ALL WHERE NAME = "SMITH" AND AGE > 35

    requests all records in which the NAME field is SMITH and the AGE

  • 7/31/2019 1010-Chapter5

    5/61

    ITEC 1010 Information and Organizations

    DBMS Discussion (3)

    field is greater than 35. The set of rules for constructing queries is

    known as a query language. Different DBMSs support different query

    languages, although there is a semi-standardized query language

    called SQL (structured query language). Sophisticated languages for

    managing database systems are called fourth-generation languages,

    or 4GLs for short.

    The information from a database can be presented in a variety of

    formats. Most DBMSs include a report writer program that enables

    you to output data in the form of a report. Many DBMSs also include

  • 7/31/2019 1010-Chapter5

    6/61

    ITEC 1010 Information and Organizations

    DBMS Discussion (4)

    a graphics component that enables you to output information in the

    form of graphs and charts.

  • 7/31/2019 1010-Chapter5

    7/61

    ITEC 1010 Information and Organizations

    Hierarchy of Data

    Schematic

  • 7/31/2019 1010-Chapter5

    8/61

    ITEC 1010 Information and Organizations

    Hierarchy of data Example

    DatabaseDatabase

    FilesFiles

    RecordsRecords

    FieldsFields

    Characters

    (bytes)

    Characters

    (bytes)

    Personel file

    Department file

    Payroll file

    (Project database)

    005-10-6321 Johns Francine 10-7-65

    549-77-1001 Buckley Bill 2-17-79

    098-40-1370 Fiske Steven 1-5-85

    (Personnel file)

    098-40-1370 Fiske Steven 1-5-85 598 (Record containing

    SSN, last name,

    first name, date of

    hire)Fiske (Last name field)

    1000100 (Letter F in ASCII)

  • 7/31/2019 1010-Chapter5

    9/61

    ITEC 1010 Information and Organizations

    Terminology

    Database A collection of integrated and related files

    File A collection of related records

    Record A collection of related fields

    Field A group of characters

    Character Basic building block of information, represented by a byte

  • 7/31/2019 1010-Chapter5

    10/61

    ITEC 1010 Information and Organizations

    Data Entities, Attributes, and

    Keys Entity

    A generalized class of people, places, or things (objects) forwhich data are collected, stored, and maintained

    E.g., Customer, Employee Attribute

    A characteristic of an entity; something the entity is identified by E.g., Customer name, Employee name

    Keys A field or set of fields in a record that is used to identify the

    record

    E.g, A field or set of fields that uniquely identifies the record

  • 7/31/2019 1010-Chapter5

    11/61

    ITEC 1010 Information and Organizations

    Keys and Attributes

    Key field Attributes (fields)Entities

    (records)

    Employee # Last name First name Hire date Dept. #

    005-10-6321 Johns Francine 10-7-65 257

    549-77-1001 Buckley Bill 2-17-79 650

    098-40-1370 Fiske Steven 1-5-85 598

  • 7/31/2019 1010-Chapter5

    12/61

    ITEC 1010 Information and Organizations

    The Traditional Approach

    The traditional approach

    Separate files are created and stored for each

    application program

    Schematic

  • 7/31/2019 1010-Chapter5

    13/61

    ITEC 1010 Information and Organizations

    PayrollPayroll

    programs

    Reports

    Data FilesApplication

    programs Users

    InvoicingInvoicing

    programsReports

    Inventory

    control

    Inventory

    control

    programs

    Reports

    Management

    inquiries

    Management

    inquiries

    programs

    Reports

  • 7/31/2019 1010-Chapter5

    14/61

    ITEC 1010 Information and Organizations

    Drawbacks

    Data redundancy

    Duplication of data in separate files

    Lack of data integrity The degree to which the data in any one file is accurate

    Program-data dependence

    A situation in which program and data organized for

    one application are incompatible with programs and

    data organized differently for another application

  • 7/31/2019 1010-Chapter5

    15/61

    ITEC 1010 Information and Organizations

    Database Approach

    The database approach

    A pool of related data is shared by multiple

    application programs Rather than having separate data files, each

    application uses a collection of data that is

    either joined or related in the database

    Schematic

  • 7/31/2019 1010-Chapter5

    16/61

    ITEC 1010 Information and Organizations

    Payroll data

    Inventory

    data

    InvoicingData

    Other

    data

    Database

    management

    system

    Payroll

    program

    Inventory

    program

    Invoicing

    program

    Otherprograms

    Reports

    Reports

    Reports

    Reports

    Database Interface Applications programs Users

  • 7/31/2019 1010-Chapter5

    17/61

    ITEC 1010 Information and Organizations

    Advantages

    Improved strategic use of corporate data

    Reduced data redundancy

    Improved data integrity

    Easier modification and updating

    Data and program independence

    Better access to data and information

    Standardization of data access

    A framework for program development

    Better overall protection of the data

    Shared data and information resources

  • 7/31/2019 1010-Chapter5

    18/61

  • 7/31/2019 1010-Chapter5

    19/61

    ITEC 1010 Information and Organizations

    Data Modeling and

    Database Models (1) Planned data redundancy

    A way of organizing data in which the logical

    database design is altered so that certain dataentities are combined

    Summary totals are carried in the data recordsrather than calculated from elemental data

    Some data attributes are repeated in more thanone data entity to improve database

    performance

  • 7/31/2019 1010-Chapter5

    20/61

    ITEC 1010 Information and Organizations

    Data Modeling and

    Database Models (2) Data model

    A map or diagram of entities and their relationships

    Enterprise data modeling Data modeling done at the level of the entire

    organization

    Entity-relationship (ER) diagrams

    A data model that uses basic graphical symbols to show

    the organization of and relationships between data

  • 7/31/2019 1010-Chapter5

    21/61

    ITEC 1010 Information and Organizations

    Example:

    Entity Relationship (ER)Diagram for a Customer

    Ordering Database

    Schematic

  • 7/31/2019 1010-Chapter5

    22/61

    ITEC 1010 Information and Organizations

    OrderCustomer Product

    1:N one-to-many

    relationship

    Last name Colour

    Identification

    number

    NameFirstname

    Identification

    number

    Attributes

    Entities

    1 N

  • 7/31/2019 1010-Chapter5

    23/61

    ITEC 1010 Information and Organizations

    Hierarchical Database Model

    Hierarchical database model

    A data model in which data are organized in a

    top-down, or inverted tree structure

    Schematic

  • 7/31/2019 1010-Chapter5

    24/61

    ITEC 1010 Information and Organizations

    Department C

    Employee

    1

    Employee

    2

    Employee

    3

    Employee

    4

    Employee

    5

    Employee

    6

    Department B

    Project 1

    Department A

  • 7/31/2019 1010-Chapter5

    25/61

    ITEC 1010 Information and Organizations

    Network Data Model

    Network data model An expansion of the hierarchical database model

    with an owner-member relationship in which amember may have many owners

    Project 1 Project 2

    Department A Department B Department C

  • 7/31/2019 1010-Chapter5

    26/61

  • 7/31/2019 1010-Chapter5

    27/61

  • 7/31/2019 1010-Chapter5

    28/61

    ITEC 1010 Information and Organizations

    Relational Database Terminology

    Selecting Data manipulation that eliminates rows according to

    certain criteria

    Projecting Data manipulation that eliminates columns in a table

    Joining

    Data manipulation that combines two or more tables Linked

    Relating tables in a relational database together

  • 7/31/2019 1010-Chapter5

    29/61

    ITEC 1010 Information and Organizations

    Linking Data Tables to Answer

    an Inquiry

    Schematic

  • 7/31/2019 1010-Chapter5

    30/61

    ITEC 1010 Information and Organizations

    Project Number Description Dept. Number

    155 Payroll 257

    498 Widgets 632

    226 Sales manager 598

    Dept. Number Dept. Name Manager SSN

    257 Accounting 421-55-99993

    632 Manufacturing 765-00-3192

    598 Marketing 098-40-1370

    SSN Last Name First Name Hire Date Dept. Number

    005-10-6321 Johns Francine 10-7-65 257

    549-77-1001 Buckley Bill 2-17-79 650

    098-40-1370 Fiske Steven 1-5-85 598

  • 7/31/2019 1010-Chapter5

    31/61

    ITEC 1010 Information and Organizations

    Building and Modifying a

    Relational DatabaseUsing MicrosoftAccess

    Screen snap

  • 7/31/2019 1010-Chapter5

    32/61

    ITEC 1010 Information and Organizations

  • 7/31/2019 1010-Chapter5

    33/61

    ITEC 1010 Information and Organizations

    Schemas and Subschemas

    Schema

    A description of the entire database

    Subschema

    A file that contains a description of a subset of

    the database and identifies which users can

    perform modifications on the data items in thatsubset

    Schematic

  • 7/31/2019 1010-Chapter5

    34/61

    ITEC 1010 Information and Organizations

    DBMS

    Schema

    Subschema

    B

    Subschema

    A

    Subschema

    C

    User

    1

    User

    2

    User

    3

    User

    4

    User

    5

  • 7/31/2019 1010-Chapter5

    35/61

    ITEC 1010 Information and Organizations

    Schema Discussion

    Pronouncedskee-ma, the structure of a database system, described

    in a formal language supported by the database management

    system (DBMS). In a relational database, the schema defines thetables, the fields in each table, and the relationships between fields

    and tables.

    Schemas are generally stored in a data dictionary. Although aschema is defined in text database language, the term is often used

    to refer to a graphical depiction of the database structure.

  • 7/31/2019 1010-Chapter5

    36/61

    ITEC 1010 Information and Organizations

    Data Definition Language

    Data Definition Language (DDL)

    A collection of instructions and commands used

    to define and describe data and datarelationships in a specific database

    Schematic

  • 7/31/2019 1010-Chapter5

    37/61

    ITEC 1010 Information and Organizations

    SCHEMA DESCRIPTION

    SCHEMA NAME IS XXXX

    AUTHOR XXXX

    DATE XXXX

    FILE DESCRIPTION

    FILE NAME IS XXXXASSIGN XXXX

    FILE NAME IS XXXX

    ASSIGN XXXX

    AREA DESCRIPTION

    AREA NAME IS XXXX

    RECORD DESCRIPTION

    RECORD NAME ISXXXXRECORD ID IS XXXX

    LOCATION MODE ISXXXX

    WITHIN XXX AREA FROM XXXX THRU XXXX

    SET DESCRIPTION

    SET NAME IS XXXX

    ORDER IS XXXX

    MODE IS XXXXMEMBER IS XXXX

    .

    .

    .

  • 7/31/2019 1010-Chapter5

    38/61

    ITEC 1010 Information and Organizations

    Data Dictionary

    Data Dictionary

    A detailed description of all data used in the

    database

    Schematic

  • 7/31/2019 1010-Chapter5

    39/61

    ITEC 1010Information and Organizations

    NORTHWESTERN MANUFACTURING

    PREPARED BY: D. BORDWELL

    DATE: 04 AUGUST

    APPROVED BY: J. EDWARDS DATE: 13 OCTOBER

    VERSION: 3.1

    PAGE: 1 OF 1

    DATA ELEMENT NAME: PARTNO

    DESCRIPTION: INVENTORY PART NUMER

    OTHER NAMES: PTNO

    VALUE RANGE: 100 TO 5000

    DATA TYPE: NUMERIC

    POSITIONS: 4 POSITIONS OR COLUMNS

  • 7/31/2019 1010-Chapter5

    40/61

    ITEC 1010Information and Organizations

    Data Dictionary Features

    Provide a standard definition of terms and dataelements

    Assist programmers in designing and writingprograms

    Simplify database modification

    Reduce data redundancy

    Increase data reliability Faster program development

    Easier modification of data and information

  • 7/31/2019 1010-Chapter5

    41/61

    ITEC 1010Information and Organizations

    Logical and Physical

    Access Paths

    Logical access path (LAP)

    Application requires information from the

    DBMS Physical access path (PAP)

    DBMS accesses a storage device to retrieve

    data

    Schematic

  • 7/31/2019 1010-Chapter5

    42/61

  • 7/31/2019 1010-Chapter5

    43/61

    ITEC 1010Information and Organizations

    Manipulating Data

    Concurrency control

    A method of dealing with a situation in which two or

    more people need to access the same record in a

    database at the same time

    Data manipulation language (DML)

    The commands that are used to manipulate the data in a

    database Structured query language (SQL)

    A standardized data manipulation language

  • 7/31/2019 1010-Chapter5

    44/61

    ITEC 1010Information and Organizations

    Structured Query Language (SQL)

    Invented at IBMs Almaden Research

    Centre (San Jose, CA) in the 1970s

    E.g.,

    Select all (*) columns from the EMPLOYEE

    table in which the JOB_CLASSIFICATION field

    is equal to C2

    SELECT * FROM EMPLOYEE WHERE

    JOB_CLASSIFICATION = C2

  • 7/31/2019 1010-Chapter5

    45/61

    ITEC 1010Information and Organizations

    SQL Discussion (1)

    Abbreviation of structured query language, and pronounced either

    see-kwellor as separate letters. SQL is a standardized query

    language for requesting information from a database. The original

    version called SEQUEL (structured English query language) was

    designed by an IBM research center in 1974 and 1975. SQL was first

    introduced as a commercial database system in 1979 by Oracle

    Corporation.

    Historically, SQL has been the favorite query language for database

    management systems running on minicomputers and mainframes.

  • 7/31/2019 1010-Chapter5

    46/61

    ITEC 1010Information and Organizations

    SQL Discussion (2)

    Increasingly, however, SQL is being supported by PC database

    systems because it supports distributed databases (databases that

    are spread out over several computer systems). This enables

    several users on a local-area network to access the same database

    simultaneously.

    Although there are different dialects of SQL, it is nevertheless the

    closest thing to a standard query language that currently exists. In

    1986, ANSI approved a rudimentary version of SQL as the official

    standard, but most versions of SQL since then have included many

  • 7/31/2019 1010-Chapter5

    47/61

    ITEC 1010Information and Organizations

    SQL Discussion (3)

    extensions to the ANSI standard. In 1991, ANSI updated the

    standard. The new standard is known as SAG SQL.

  • 7/31/2019 1010-Chapter5

    48/61

    ITEC 1010Information and Organizations

    Database Output

    Screen snap

  • 7/31/2019 1010-Chapter5

    49/61

    ITEC 1010Information and Organizations

  • 7/31/2019 1010-Chapter5

    50/61

    ITEC 1010Information and Organizations

    Popular Database Management

    Systems for End Users MicrosoftAccess 98

    LotusApproach 98

    Inprise (formerly Borland) dBASE

    DBMS Selection Criteria Database size Number of concurrent users Performance Integration Features The vendor Cost

  • 7/31/2019 1010-Chapter5

    51/61

    ITEC 1010Information and Organizations

    Distributed Databases

    Distributed database

    A database in which the actual data may be

    spread across several smaller databasesconnected via telecommunications devices

    Pretty picture

  • 7/31/2019 1010-Chapter5

    52/61

    ITEC 1010Information and Organizations

  • 7/31/2019 1010-Chapter5

    53/61

    ITEC 1010Information and Organizations

    Data Warehouse

    Data warehouse

    A relational database management system designed

    specifically to support management decision making

    Current evolution of Decision Support Systems (DSSs)

    Data mart

    A subset of a data warehouse for small and medium-

    size businesses or departments within larger companies

    Schematic

    R l ti l

  • 7/31/2019 1010-Chapter5

    54/61

    ITEC 1010Information and Organizations

    Relational

    databases

    Hierarchical

    databases

    Network

    databases

    Flat files

    Spreadsheets

    Data

    extraction

    process

    Query and

    analysis

    tools

    Data

    wharehouse

    Data

    cleanup

    process

    End user access

  • 7/31/2019 1010-Chapter5

    55/61

    ITEC 1010 Information and Organizations

    Designing a Customer Data Warehouse

    Sharply define your goals and objectives before

    you build the warehouse

    Choose the software that best fits your goals Determine who/what should be in the database

    Develop a plan

    Measure results

  • 7/31/2019 1010-Chapter5

    56/61

    ITEC 1010 Information and Organizations

    Data Mining Applications

    Data mining

    The automated discovery of patterns and relationships

    in a data warehouse

    Data mining applications Market segmentation

    Customer queries

    Fraud detection

    Direct marketing

    Market basket analysis

    Trend analysis

  • 7/31/2019 1010-Chapter5

    57/61

    ITEC 1010 Information and Organizations

    On-Line Analytical

    Processing (OLAP)

    On-line analytic processing (OLAP) Access to multidimensional databases providing

    managerially useful display techniques

    Now used to store and deliver data warehouseinformation

    Data warehouse and OLAP Provides top-down, query-driven analysis

    Data mining Provides bottom-up, discovery-driven analysis

  • 7/31/2019 1010-Chapter5

    58/61

    ITEC 1010 Information and Organizations

    Open Database

    Connectivity (ODBC)

    Open database connectivity (ODBC) A set of standards that ensures software written to

    comply with these standards can be used with anyODBC-compliant database

    Schematic

    dBASE

  • 7/31/2019 1010-Chapter5

    59/61

    ITEC 1010 Information and Organizations

    ACCESS

    database

    Lotus 1-2-3

    spreadsheet

    Paradox

    database

    ODBC Import

    ODBC Link

    ODBC Export

  • 7/31/2019 1010-Chapter5

    60/61

    ITEC 1010 Information and Organizations

    Object-Relational Database Management

    Systems (ORDBMS) Object-relational database management system (ORDBMS) A DBMS capable of manipulating audio, video, and graphical data.

    Hypertext

    Users can search and manipulate alphanumeric data in anunstructured way

    Hypermedia

    Allows businesses to search and manipulate multimedia forms ofdata

    Spatial data technology Use of an object-relational database to store and access data

    according to the location it describes and to permit spatial queriesand analysis

  • 7/31/2019 1010-Chapter5

    61/61

    ITEC 1010 I f i d O i i

    End of Chapter 5

    Chapter 6

    http://var/www/apps/conversion/current/tmp/scratch7088/1010-Chapter6.ppthttp://var/www/apps/conversion/current/tmp/scratch7088/1010-Chapter6.ppt