Database fundamentals(database)

35
ISOM3260 Database Design & Administration Dr. Muller Cheung (L1/L2) Dr. James Thong (L3) Office hour: By appointment Email: [email protected]; [email protected] Spring 2014

description

 

Transcript of Database fundamentals(database)

Page 1: Database fundamentals(database)

ISOM3260 Database Design & Administration

Dr. Muller Cheung (L1/L2)

Dr. James Thong (L3)

Office hour: By appointment

Email: [email protected]; [email protected] 2014

Page 2: Database fundamentals(database)

2

Today’s Agenda

• Introduction to course– ISOM3260 website: http://teaching.ust.hk/~isom3260– Midterms and Final exam– Group project

• Database Fundamentals (Chap. 1)

Page 3: Database fundamentals(database)

3

Conduct in Class

• Attend the lab you are enrolled in• Attend the lecture you are enrolled in• Be punctual for lab/lecture• Turn off mobile phones• Do not distract other students or instructor

by talking with your friends

Page 4: Database fundamentals(database)

4

How to study for ISOM3260

• Attend lectures– Questions in exams will only include topics

covered in lectures• Read the textbook for more information• Review lecture notes/textbook after lecture• Email questions to me or make appointment

Page 5: Database fundamentals(database)

5

What you will learn from this course• Database fundamentals

– introduction to database concepts• Database development process

– steps to develop a database• Conceptual data modeling

– entity-relationship (ER) diagram; enhanced ER• Logical database design

– transforming ER diagram into relations; normalization• Physical database design

– technical specifications of the database• Database implementation

– Structured Query Language (SQL), advanced SQL• Advanced topic

– data warehousing; data and database administration

Page 6: Database fundamentals(database)

Lecture 1: Database Fundamentals

ISOM3260, Spring 2014

Page 7: Database fundamentals(database)

7

Database Fundamentals

• Concepts• Disadvantages of file processing systems• The database approach• Advantages of database approach• Costs and risks of database approach• Range of database applications• Components of database environment• Evolution of database systems• Current development

Page 8: Database fundamentals(database)

8

Concepts• Data

– stored representations of meaningful objects and events– structured data: numbers, text, dates– unstructured data: images, video, documents

• Information– data processed to be useful in decision making– by putting data in a context or summarizing data

• Database– an organized collection of logically related data– e.g. automobile repair database contains data on customers,

automobiles, and repair history• Metadata

– data that describes properties of user data

Page 9: Database fundamentals(database)

9

Figure 1-1a: Data in Context

Page 10: Database fundamentals(database)

10

Figure 1-1b: Summarized data

Useful information that managers can use for decision making and interpretation

Page 11: Database fundamentals(database)

11

Table 1-1: Metadata for Class Roster

Descriptions of the properties or characteristics of the data, including data types, field sizes, allowable values, and data context

Page 12: Database fundamentals(database)

12

Disadvantages of File Processing• Program-data dependence

– all programs maintain metadata for each file they use– change to file structure requires changes to all programs that

access the file• Data redundancy (duplication of data)

– data changes in one file could cause inconsistencies, compromising data integrity

• Limited data sharing– no centralized control of data

• Lengthy development times– programmers must design their own file formats

• Excessive program maintenance– consume 80% of information systems budget

Page 13: Database fundamentals(database)

13

Figure 1-2: Old file processing systems at Pine Valley Furniture Company

Duplicate Data

Page 14: Database fundamentals(database)

14

The Database Approach

Database Management System (DBMS) manages data resources like an operating system manages hardware resources

Databasecontainingcentralizedshared data

Page 15: Database fundamentals(database)

15

Advantages of Database Approach

• Program-data independence – metadata not stored in programs, so programs do not need to

worry about changes to data formats– results in increased productivity of application development and

reduced program maintenance

• Minimal data redundancy– avoid wasted storage space– leads to increased data integrity/consistency

Page 16: Database fundamentals(database)

16

Advantages of Database Approach

• Improved data sharing– different users get different views of the data

• Enforcement of standards– naming conventions, data quality standards, and uniform

procedures for accessing, updating, and protecting data• Improved data quality

– constraints are business rules that cannot be violated by database users

– enforced by DBMS• Improved data accessibility and responsiveness

– use of structured query language (SQL)– end users without programming experience can easily retrieve

data

Page 17: Database fundamentals(database)

17

Costs and Risks of the Database Approach

• Requires new, specialized personnel• Installation and management cost and complexity

– requires new software and upgrades to hardware and data communications

– substantial annual maintenance and support costs• Conversion costs

– converting from legacy systems costs money and time• Need for explicit backup and recovery

– shared corporate database must be accurate and available at all times

• Organizational conflict– agreement on data definitions and ownership, responsibilities

for accurate data maintenance– need strong top management support to resolve

Page 18: Database fundamentals(database)

18

Figure 1-3: Segment from enterprise data model(shows the high-level entities and their relationships)

Page 19: Database fundamentals(database)

19

Figure 1-3: Segment from enterprise data model(shows the high-level entities and their relationships)

One customer places many orders, but each order is placed by a single customer

One-to-many relationship

Page 20: Database fundamentals(database)

20

Figure 1-3: Segment from enterprise data model(shows the high-level entities and their relationships)

One order contains many order lines; each order line is contained in a single order

One-to-many relationship

Page 21: Database fundamentals(database)

21

Figure 1-3: Segment from enterprise data model(shows the high-level entities and their relationships)

One product can be in many order lines, each order line refers to a single product

One-to-many relationship

Page 22: Database fundamentals(database)

22

Figure 1-3: Segment from enterprise data model(shows the high-level entities and their relationships)

Therefore, one order involves many products and one product is involved in many orders

Many-to-many relationship

Page 23: Database fundamentals(database)

23

Order, Order_Line, Customer, and Product tables

Relationships established in special columns that provide links between tables

Page 24: Database fundamentals(database)

24

Range of Database Applications

Page 25: Database fundamentals(database)

25

Typical data from a personal database on a PC, notebook, smartphone

Page 26: Database fundamentals(database)

26

Fig. 1-11: Two-Tier Database with Local Area Network

Page 27: Database fundamentals(database)

Enterprise Database Applications

• Enterprise Resource Planning (ERP)– business management system that integrates all

enterprise functions (e.g., manufacturing, finance, sales, marketing, inventory, accounting, human resources)

• Data Warehouse– an integrated decision support system derived

from various operational databases

27

Page 28: Database fundamentals(database)

28

An enterprise data warehouse

Page 29: Database fundamentals(database)

29

Components of the Database Environment

• Computer-Aided Software Engineering (CASE) Tools – automated tools used to design databases and application programs

• Repository – centralized storehouse of metadata• DBMS – software for managing the database• Database – storehouse of the data• Application Programs – software using the data• User Interface – text and graphical displays to users• Data/Database Administrators – personnel responsible for

maintaining the database• System Developers – personnel responsible for designing

application programs• End Users – people who use the applications and databases

Page 30: Database fundamentals(database)

30

Figure 1-5: Components of the database environment

Note: All interactions with the database must go through the DBMS

Page 31: Database fundamentals(database)

31

Evolution of Database Systems

Page 32: Database fundamentals(database)

32

Current Development

• Relational DBMS has > 80% market share

• Major Database Vendors– Oracle: Oracle 11g, Oracle 12c– IBM: DB2, Informix– Microsoft: MS SQL Server– SAP: Sybase– Teradata: Teradata

Page 33: Database fundamentals(database)

33

Current Development• Overall Market Share in 2012

– Oracle, IBM, and Microsoft dominate the market

Source: Gartner, March 2013

Oracle, 48.30%

IBM 19.30%

Microsoft 18.17%

Others 14.23%

RDBMS Market Share

OracleIBMMicrosoftOthers

Page 34: Database fundamentals(database)

34

Current Development• Oracle

– strong customer base on enterprise RDBMS market– industry recognition of Oracle 11g and 12c– strong penetration in Linux/Unix platforms

• IBM– DB2 dominates mainframe platforms

• Microsoft– strong penetration in Windows platform– getting popular particularly with Small and Medium Enterprises

• Teradata– emphasis on business intelligence and data warehousing

Page 35: Database fundamentals(database)

35

Review Questions

• Differences between data, database, information and metadata

• What are the disadvantages of file processing?• What is the database approach?• What are the advantages of the database approach?• What are the costs and risks of the database approach?• What are the range of database applications?• What are the components of the database environment?• What are the popular databases?