1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information...

124
1 Information Systems & Databases

Transcript of 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information...

Page 1: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

1

Information Systems &

Databases

Page 2: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

2

2.1 Information Systems

Look at Integrated Information Processes

Page 3: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

3

Store&

Retrieve

ProcessOrganiseAnalyseC

olle

ctD

isplay

Transmit&

Receive

Participant User

InformationProcesses

CDDisc

Memory

Processor

Sca

nner

Key

boar

dM

icro

phon

e

Printer

Monitor

Speaker

ModemNIC

Partici

pantUser

HardwareI/O

(Internal)

ProcessOrganiseAnalyseIn

put

Output

ExternalI/O

Partici

pantUser

Input&

Output

Understand the Problem

Page 4: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

4

Store&

Retrieve

Process

Organise

Analyse

Col

lect

Display

Transmit&

Receive

Enter data into WP

Save docum

ent

Print

Ret

riev

eRerieve &

Fax

Interdependenceof

Information ProcessesWord Processing

Format doc

Spellcheck

Understand the Problem/Make Decision

Page 5: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

5

Store&

RetrieveProcess

Organise

Analyse

Col

lect

Display

Transmit&

Receive

Enter data into WP

Save docum

ent

Print

Ret

riev

e

Rerieve &

Fax

Format doc

Spellcheck

WordProcessing

OCRWord

Process PrinterInputText

Outputdocument

Context Diagram

Data Flow Diagram

OCRWord

Process PrinterInputText

Printdocument

UserFolder

Save Retrieve Document

3rd level Data Flow Diagram

OCRCreate

docPrinterInput

TextThreeCopies

Editdoc

Printdoc

Memory

UserFolder

keystrokes sentences

Savedoc

Paragraphs

pagesDraft

Final Draft

Memory

Errors Corrections

CorrectedDraft

OCRCreate

docPrinterInput

TextThreeCopies

Editdoc

Printdoc

UserFolder

Savedoc

Paragraphs

pagesDraft

Final Draft

Detailed level Data Flow Diagram

CorrectedDraft

Design Solution

Page 6: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

6

Store&

RetrieveProcess

Organise

Analyse

Col

lect

Display

Transmit&

Receive

Enter data into WP

Save docum

ent

Print

Ret

riev

e

Rerieve &

Fax

Format docSpellcheck

3rd level Data Flow Diagram

OCRCreate

docPrinterInput

TextThreeCopies

Editdoc

Printdoc

Memory

UserFolder

keystrokes sentences

Savedoc

Paragraphs

pagesDraft

Final Draft

Memory

Errors Corrections

CorrectedDraft

Source

ScanOK?

No

Yes

CheckOCR

OCRFile

Scandoc

VDU

SaveInput

EditInput

OCRFile

DraftSaveDraft

Edit/saveDraftDraft

FinalDraft

PrintDocument

FinalDraft

3 copies

HowDisplay?

Print

FaxDocument

FinalDraft

Via type 3 modem

Fax

SystemFlowChart

Design Solution

Page 7: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

7

Store&

Retrieve

Process

Organise

Analyse

Col

lect

Display

Transmit&

Receive

Enter data into WP

Save docum

ent

Print

Ret

riev

eRerieve &

Fax

Interdependenceof

Information ProcessesWord Processing

Format doc

Spellcheck

Test & Evaluate

Page 8: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

8

Characteristics of an information system

•Organising, or •Analysing

•Sorting•Grouping•Indexing•Summarising•Listing•Tabling

•Synthesis•Hierarchy (priority listing)•Questioning (querying)•Reporting

Page 9: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

9

Types of information systems

• Process Transactions– Transaction Processing System (TPS)

• Information about an organisation– Management Information System (MIS)

• Help make decisions– Decision Support Systems (DSS)– Expert System

• Manage information within organisation– Office Automation Systems (OAS)– Multimedia System– Database Management System (DBMS)

Page 10: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

10

Comparing Systems

•Type •Purpose •Examples

•TPS •Process Transactions

•Banking; POS; Reservations; Loans

•MIS •Information about an organisation

•Accounting; ordering budgeting; production

•DSS •To help make decisions

•Expert system; data mining; risk analysis; modelling; simulations

•OAS •Office administration

•Personal productivity e.g MS Office; Groupware - email; calendars

Page 11: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

11

Representing an information system

Purpose• Who is it for?• Need(s) they have

Information System

PEOPLEParticipants UsersOthers

Data(Input)Information

(Output)

Information Technology

H/WS/W

Comms

Information ProcessesC O A S T/- P D

Page 12: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

12

2.2 Database information systems

Examples

Page 13: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

13

School Databases

• Represent a School Information System diagrammatically showing:– Purpose– Processes– People (Participants/Users)– Data/Information– Information Technology

• Describe the relationships between the participants, data/information and information technology

Page 14: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

14

School Administration Database

• Purpose– The efficient and effective operation of the school

• Processes– Timetabling; staff allocations; take roll; record marks; s

• People (Participants/Users)– Teachers; students; administrators; clerks; assistants; technical

support; gardeners; executive

• Data/Information– Classrooms; subjects; class rolls; equipment; books; timetable;

attendance; marks; names & addresses

• Information Technology– Keyboard, monitor, printer, network, file server– School Administration System; Library system

Page 15: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

15

RTA

VehicleInspectors

Cashiers

Managers

RegistrationClerksDrivers

Vehicles

DrivingInspectors

Police

RTASystem

Page 16: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

16

RTA ?

Purpose• Who is it for?• Need(s) they have

Information System

Participants & Users

Data/Information

Information Technology

Information Processes

Page 17: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

17

Video Store Database

Purpose• Who is it for?• Need(s) they have

Information System

Participants & Users

Data/Information

Information Technology

Information Processes

Page 18: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

18

Video Store System

Information ProcessesC: Scan video/card S: save database T: Transfer dataA: Search database A: Update data D: Display information

Participants Data/ InformationManager Member data Technology

Staff Video data H: Personal ComputerUSERS Barcodes H: Barcode reader

Customers S: DatabaseOTHERS Information C: PhoneSuppliers Receipt

Page 19: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

19

2.3 Organisational Methods

Page 20: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

20

Organising dataNon-computer methods

• Organising is the structuring of data

• Manual type databases– Telephone book– Dictionaries– Recipe card files– Encyclopedia– To do list

Page 21: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

21

Advantages & Disadvantagesof computer-based methods of organising data

Advantages• Faster• Data does not have to be in set order (flexible)• Data management is easier• Data exchange possible• Large storageDisadvantages• Need a computer• Training usually required• Security e.g during exchange of data

Page 22: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

22

Computer based methodsof organising data

• Flat file systems

• Database management systems

• Hypermedia

Page 23: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

23

Flat-file databases

• Data is stored and retrieved from a single Table

• Database file contains multiple records

• Each record contains many fields

• Each field contains data (characters)

• Ability to search the database using data in any field (key field or search field

Page 24: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

24

Flat-file database

File

•1

•2

•3 •John

•4

•5Records

Field

Page 25: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

25

Database

Page 26: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

26

Definitions

•Level •Description •Example •(Vet’s DB)

•File •Block of data •Dogs

•Record •Collection of data about one entity

•Spotty’s

•Field •Specific aspect, data of same type

•Spotty, Bassett, 1998, Y, N, Jan03

•Key field •Data unique to record •Year Born = 1998

•Character •Individual letter or number, etc

•Spotty

Page 27: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

27

DBMS

• Data Base Management System• Software package that allows users to

access a database• E.g Microsoft Access; Oracle 8i; Informix;

IBM DB2, SQL Server• The data is stored in a database• DBMS database can have different inherent

structures

Page 28: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

28

DataBase Structures

The common database structures

• Single Table– Flat-file

• Multiple Tables– Hierarchical– Network– Relational

Page 29: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

29

Hypermedia c.f. Databases

Hypermedia• Many files & types• Many locations

Flat-file DB• One file & type• One location• One ‘table’

DBMS• One file & type• One location• Many tables

Distributed DB• Many files• One file type• Many locations• Many tables• Synchronisation

Page 30: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

30

Hierarchical DB Structure

Organises data as a series of levels.

Top down structure (nodes and branches)

Each node can have many branches

Each lower level node (child) is linked to a single higher order node (parent)

Page 31: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

31

Network DB Structure

Data organised as series of nodes linked by branches

Each node can have many branches

Low-level node (child) can link to more than one high-level node (parent)

Page 32: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

32

Relational DB Structure

Organises data in a series of related tables

Relationships established between the tables to provide a flexible way of storing and accessing the data

Page 33: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

33

DB Structures Compared

Hierarchical Network Relational

Page 34: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

34

Type of DB KeysDB Key is a field that allows data to be accessed; any

field can be used a a key field• Single key - uses one field• Composite key - a group of two or more keys that can

be used to uniquely identify a record in the database• Primary key - data unique to DB• Secondary key - not necessarily unique (can be a

single key or a composite key• Foreign Key (relational databases) Primary key from

one table used to create relationship with another entity

Page 35: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

36

DB Keys

• Name, Address, Gender, Age

• J Smith, 3 River Rd, F, 20

• J Smith, 3 River Rd, M 30

• J Smith, 3 River Rd, M, 50

• J Smith, 16 East St, M, 30

Page 36: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

37

Relational Database Schemas

• SCHEMA– Organised plan of the entire database

– Shows how and where data is found

– Describes data and the data’s logical relationships

• DB Schema consists of:– Entities

– Attributes

– Relationships

Page 37: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

38

ERD

• Entity-Relationship Diagram

• Graphical method of identifying entities and showing relationships

• A type of Data Modelling Tool•CUSTOMER

•Customer_Number

•Customer_Type

•Customer_Name

•Street

•Suburb

•State

•PETS

•Pet_ID

•Customer_Number

•Pet_Name

•Animal_Type

•Breed

•DOB

Primary Key

Foreign Key

Page 38: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

39

DB Definitions

• Entity– Specific thing about which information is

collected e.g a company, a person, a product

• Attribute– Defined property of an entity e.g for a entity

(student) attributes could be Surname, Given Name, DOB, Courses Taken etc

– In a flat-file database an attribute equates to a field

• Relationship

Page 39: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

40

Data Dictionary

• Data dictionaries hold METADATA (data about data in the database)

• Used to design & manage the database• Known as a Data Modelling tool• Data Dictionary contains

– Table Name– Attribute Name (Field Name)– Field Data Type– Field Size– Field Description/example (purpose)

Page 40: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

41

DB Relationships

• The ways the entities in the database are related– One-to-one– One-to-many– Many-to-one– Many-to-many

Page 41: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

42

One-to-one

•CUSTOMER

•Customer_Number

•Customer_Type

•Customer_Name

•Street

•Suburb

•State

•PETS

•Pet_ID

•Customer_Number

•Pet_Name

•Animal_Type

•Breed

•DOB

Page 42: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

43

One-to-many

•PETS

•Pet_ID

•Customer_Number

•Pet_Name

•Animal_Type

•Breed

•DOB

•VISIT

•Pet_ID

•Visit_Date

•FollowUp_Date

•Amount

•VISIT

•Pet_ID

•Visit_Date

•FollowUp_Date

•Amount

Page 43: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

44

Many-to-many

Repairers

Repairer_IDNameStreetSuburbPostcode

Repairers

Repairer_IDNameStreetSuburbPostcode

Suppliers

Supplier_IDNameStreetSuburbPostcode

Suppliers

Supplier_IDNameStreetSuburbPostcode

Suppliers

Supplier_IDNameStreetSuburbPostcode

Page 44: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

45

Flat-file c.f Relational

•Feature •Flat-file DB •Relational DB

•# files •One •One

•# tables •One •Many

•Redundancy •High •Low

•Processing •Simple •Complex

•Relationships •Nil •Yes

Page 45: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

46

Which to choose?

• Consider a variety of scenarios

• What factors would influence you decision of choosing either a computer based or non-computer based method organise data?

Page 46: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

47

Which database?

• Flat-file for simple tasks, a single entity and when redundancy not an issue

• Relational for complex tasks or where multiple entities are involved

Page 47: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

48

Flat-file database

• A relatively simple database system in which each database is contained in a single table.

• In contrast, relational database systems can use multiple tables to store information, and each table can have a different record format.

Page 48: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

49

Database Management Systems• A collection of programs that enables you to store, modify, and extract

information from a database. There are many different types of DBMSs, ranging from small systems that run on personal computers to huge systems that run on mainframes. Examples of database applications: are

• computerized library systems • automated teller machines • flight reservation systems • computerized parts inventory systems

• The terms relational, network, flat, and hierarchical all refer to the way a DBMS organizes information internally. The internal organization can affect how quickly and flexibly you can extract information.

Page 49: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

50

Advantages/DisadvantagesDBMS

ADVANTAGES• Data independence• Reduced data

redundancy• Easier to maintain

data integrity• Easier data security• Economy of scale

DISADVANTAGES• Larger file sizes• Higher cost• More hardware

required• Higher impact of

failure

Page 50: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

54

Data Modelling Tools

• Data Modelling: the process of identifying entities, attributes and relationships

• Data Modelling Tools include:– Entity Relationship Diagrams (ERDs)– Data Dictionary

Page 51: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

55

Database Normalisation• Process of organising data to minimise data

redundancy• Relates to relational databases only• Involves dividing a database into two or

more tables and defining the relationship between the tables

• Objective: isolate data so that additions, deletions and modifications of a field can be made in juts one table and then propagated through the database via defined relationships

Page 52: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

56

‘Zero Normal Form - ZNF’

• Let’s say we want to store some personal bookmarks (URLs) for a number of users

• ZNF: because none of the rules of normalisation have been applied

• If we want more URLs, we would need to add more columns - this means data input screen would need to change everytime we added an extra URL

Page 53: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

57

First Normal Form

1. Eliminate repeating groups in individual tables. 2. Create a separate table for each set of related data. 3. Identify each set of related data with a primary key.

4.• We've solved the problem of url field limitation.

• But look at the headache we've now caused ourselves

• Every time we input a new record we've got to duplicate company and user name

• Our database grow much larger than we'd ever want it to

• We could easily begin corrupting our data by misspelling redundant information.

Page 54: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

58

Second Normal Form

1. Create separate tables for sets of values that apply to multiple records.

2. Relate these tables with a foreign key.

• We're in much better shape. • But what happens when we want to add another employee of company ABC? Or 200 employees?

• Now we've got company names and addresses duplicating themselves, a situation just rife for introducing errors into our data.

Page 55: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

59

Third Normal Form1. Eliminate fields that

do not depend on the key.

• Users and urls tables can grow without unnecessary duplication or corruption of data.

• Most developers will say the Third Normal Form is far enough - and in most cases they would be correct.

• But look at our url fields - do you notice the duplication of data?

Page 56: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

60

Fourth Normal Form1. In a many-to-many

relationship, independent entities can not be stored in the same table.

Page 57: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

61

Fifth Normal Form

• There is one more form of normalization which is sometimes applied, but it is indeed very esoteric and is in most cases probably not required to get the most functionality out of your data structure or application. It's tenet suggests:

• The benefit of applying this rule ensures you have not created any extraneous columns in your tables, and that all of the table structures you have created are only as large as they need to be. It's good practice to apply this rule, but unless you're dealing with a very large data schema you probably won't need it.

1. The original table must be reconstructed from the tables into which it has been broken down.

Page 58: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

62

Data Relationships• One-to-one, one-to-many, and many-to-many.• Look at the users table in the First Normal Form example above. For a

moment let's imagine we put the url fields in a separate table, and every time we input one record into the users table we would input one row into the urls table. We would then have a one-to-one relationship: each row in the users table would have exactly one corresponding row in the urls table. For the purposes of our application this would neither be useful nor normalized.

• Now look at the tables in the Second Normal Form example. Our tables allow one user to have many urls associated with his user record. This is a one-to-many relationship.

• The many-to-many relationship, however, is slightly more complex. Notice in our Third Normal Form example we have one user related to many urls. As mentioned, we want to change that structure to allow many users to be related to many urls, and thus we want a many-to-many relationship. Let's take a look at what that would do to our table structure before we discuss it:

Page 59: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

63

Database Keys

• In database management systems, a key is a field that you use to sort data.

• It can also be called a key field , sort key, index, or key word. • For example, if you sort records by age, then the age field is a key.• Most database management systems allow you to have more than

one key so that you can sort records in different ways. • One of the keys is designated the primary key, and must hold a

unique value for each record. • A key field that identifies records in a different table is called a

foreign key.

Page 60: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

64

Sorting & Searching

• SORT: ascending (A-Z; 0-9) or descending (Z-A; 9-0)– Multiple sorts

• SEARCH– Query the database using Structured Query

Language (SQL)

Page 61: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

65

SQL Syntax

Keyword Source Description Example

SELECT Field(s) What is to be displayed

Surname

FROM Table(s) Tables the fields are to come from

Students

WHERE Operator, data Search criteria Gender=‘F’

ORDER BY Field(s) Orders of display

Surname ASC

[]

Page 62: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

66

Operators

Type Operator Description Example

Relational >

<=

Contains

Greater than

Less or equal

Criteria

A>B

A<=B

Contains ‘a”

Logical AND

OR

NOT

Both must be true

Either true

false

A AND B

A OR B

A NOT B

Page 63: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

67

Sizing a database - Formal method

• 1) Determine how many fields there are on the schema.• 2) Next determine the datatypes of each of the fields -record the field length • 3) Next, locate information on these datatypes storage requirements.• 4) Identify any Diary fields and fields > 255 characters in length • 5) Add up the byte values for each field's storage requirements.• 6) Calculate for diary fields or fields > 255 characters in length, • 7) Determine if you have any indexes built on any fields in the schema. If so, multiply the value of

the storage length of that fields datatype by 1.5. • 8) Add up all the byte values you determined and recorded in the previous steps. This is the base

value for how much space each ticket or record requires in your database.• 9) Estimate how many tickets a day will be entered into the system. Multiply that number by the

value you got in step 8.• 10) Estimate how many days a year your system will be in production. Use a figure of 200 days for

normal businesses (taking into account holidays, weekends, etc.) or 365 if operating a 24x7 production system. Multiply this by the value you got in step 9.

• 11) Multiply the value in step 10 by 1.2 (this adds 20% to account for system "slop" and other miscellaneous overhead). This value now represents approximately how much disk storage it will take to accomodate this schema's growth for the next year. If you have multiple schemas - you will need to do all the previous calculations for each schema

Page 64: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

68

• 1) For every 20 or so fields on a schema use a base value of about 1K per ticket. For example,. if your schema has 60 fields, figure a base of 3K per ticket. If there are any indexed fields, add another 0.5k to the base value.

• 2) If there are large text fields or diary fields on the schema, add up all their lengths, multiply by 1/3 and add this to the base total obtained in step 1. For example, my schema has 60 fields, of which 5 are fields > 255 characters in length. I use a base of 3.5K per ticket (it has some indexes). I add up the lengths of each of the 5 large text fields (10K, 20K, 5K, 13K, 10K - total of 58K) and multiply by 1/3 for a total of about 19.3K. Add this to the base of 3.5K. This gives a value of about 22.8K per ticket or record.

• 3) Now multiply the value you obtained in step 2 by the number of tickets a day you expect to enter into the system.

• 4) Now multiply the value in step 3 by the number of business days per year (200 for normal businesses, 365 for 24x7 shops). This is the ballpark figure for how much disk space your schema will need to store a year's worth of data.

Sizing a database - Quick & Dirty Method

Page 65: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

69

References

• www.webopedia.com– Database, databases - normalization

• www.phpbuilder.com/columns/barry20000731.php3– Database Normalization And Design Techniques– Barry Wise (itcn.com)– INT Media Group - 2002

• support.microsoft.com• www.devshed.com

Page 66: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

70

2.5 Other Information Processes

DISPLAYING - for database information systems

Page 67: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

71

Database Views

• Data in a database can be viewed different ways for different purposes

View Purpose

Form view Record displayed as thought it were on paper. (can be used for delete, edit and add data)

Report view Displays only the information that will be printed.

Query view Shows on the data that answers the question

Page 68: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

72

Database Reports

• a program, usually part of a database management system, that extracts information from one or more files and presents the information in a specified format. Most report writers allow you to select records that meet certain conditions and to display selected fields in rows and columns. You can also format data into pie charts, bar charts, and other diagrams. Once you have created a format for a report, you can save the format specifications in a file and continue reusing it for new data.

Typically from a Report Generator

Page 69: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

73

Report Design Principles

• Consistency of styles, fonts, formatting

• Clear headings and lables

• Clean, simple page layouts

• Page numbers, dates, version details

Page 70: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

74

Database screens

• Design and create screens for interacting with selected parts of a database and justify their appropriateness– Form views

Page 71: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

75

Hypermedia/Hypertext

• Hypermedia: a combination of media whose locations are linked electronically to provide an easy way to navigate between information

• Information stored as separate documents (or files)

• Hypertext: a system that allows documents to be cross-linked.

Page 72: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

76

Nodes & Links

• Link (Hyperlink) usually indicated by a highlighted item. The author must specify the location of the item.

• Node: If another is the destination of the link, that computer is called a node.

Page 73: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

77

Characteristics of Hypermedia

•Characteristic •Definition •Example

•Hypertext •Text with links to other documents

•“Hot Spot”

•Link •Connection between elements

•Hypertext; Image Map

•Node •Location where data is stored

•Server on WWW

•URL •Internet resource address

•http:www…..

•Metadata •Data about data •HTML tags

Page 74: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

78

URLs

• Uniform Resource Locator = node

http://www.cambridge.edu.au/IPT/ipthsc.htm

Protocol

WorldWideWeb

Domainname

Country

DirectoryFolder

Webpage

Page 75: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

79

Metadata Tags

• Metadata: Data about data

• HTML uses tags to tell the browser how the following data is to be handled

• E.g <H1> Metadata</H1>

Page 76: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

80

Software for Hypermedia

• Microsoft Powerpoint and can be used to organise text, graphics and sounds for a presentation

Page 77: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

81 Wh

ich

sys

tem

to u

se?

Computer-based systems

Information task

Manual System

Flat-file DB

Relational DBMS

Hypermedia

Appointments Diary

Ideal for personal

use

Ideal for business

use

Too complex for task

Ideal for travellers

Product Catalogue

Ideal for letterbox

drops

Simple lists only

Suitable but generallly

too complex

Ideal for web access

Stock/Inventory

Too slow/ difficult to maintain

Simple lists only

Ideal for complex

tasks

Generally not suted

Personnel Records

Too slow/ difficult to maintain

Simple lists only

Ideal for complex

tasks

Generally not suitable

(privacy/security)

Class marks Test results

Ideal (private &

secure)

Ideal for porocessing

Too complex for task

Generally not suited (except HSC results)

Mail Merging Not suited IdealSuited but

genrally too compolex

Generally not suited

Phone/Address List

Ideal for personal

use

Ideal for larger lists

Suitable but generallly

too complex

Ideal for web access

Page 78: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

82

How to decide?

• Simple tasks, small amount of stable data (Manual System)

• Complex tasks, large amount of changing data (Computer-based System)– Basic processing, easy to learn/use, simple structures

(Flat-file database)– Complex processing, complex structures (Relational

Database)– Limited processing, needs wide distribution of data

(Hypermedia)

Page 79: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

83

2.4 Storage & Retrieval

Page 80: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

84

Data Access MethodsDirect (Random) & sequential

To go from point A to point Z in a sequential-access system, you must pass through all intervening points. In a random-access system, you can jump directly to point Z. Disks are random access media, whereas tapes are sequential access media.

Page 81: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

85

Storage

• ON-LINE: data processed under direct control of CPU e.g memory of Direct Access Disk (DAD)

• OFF-LINE: data controlled by the system e.g tape of disk

Device Description Storage method

Capacity Data access

Hard disk Fixed metal or glass platters

Magnetic Gigabytes Direct

CD-ROM, DVD

Plastic disk, metal coating

Optical (laser) 650MB-17GB Direct

Removable cartridge

External metal or plastic disk

Magnetic 100MB-2GB Direct

Tape Thin strip plastic on reels

Magnetic Gigabytes Sequential

Page 82: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

86

Distributed databases

• A database that consists of two or more data files located at different sites on a computer network.

• Because the database is distributed, different users can access it without interfering with one another.

• However, the DBMS must periodically synchronize the scattered databases to make sure that they all have consistent data.

Page 83: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

87

Encryption & Decryption

• Encryption: The translation of data by a secret code.

• The most effective way to achieve data security.

• To read an encrypted file, you must have access to a secret key or password that enables you to decrypt it.

• Unencrypted data is called plain text ; encrypted data is referred to as cipher text.

• There are two main types of encryption: asymmetric encryption (also called public-key encryption) and symmetric encryption.

Page 84: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

88

Symmetric & Asymmetric encryption

• Symmetric encryption: the same key is used to encrypt and decrypt the message.

• Asymmetric: A cryptographic system that uses two keys -- a public key known to everyone and a private or secret key known only to the recipient of the message. When John wants to send a secure message to Jane, he uses Jane's public key to encrypt the message. Jane then uses her private key to decrypt it.

Page 85: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

89

Hypermedia - search engines

PRELIMINARY  SEARCHING HINTS

1. Choose a search engine, directory or library in accordance with the kind of search you are doing and the kind of results you are seeking.

2. Consider: Are you looking for a Web site? Information that might be contained within Usenet? Academic articles that may only be retrievable with gopher?

3. Determine your aims: Do you want a specific hard-to-find document on an esoteric subject, or general information on a broader topic? Do you need to search the entire Web, or is what you are seeking likely to be found on a number of sites, or only the most popular sites?

4. In making your choice, determine whether the information you are looking for is likely to be in a page's title or first paragraph, or buried deeper within the document or site.

5. Use a search engine's advanced features, if available, and read the help files if you are unclear about its searching procedure.

SOURCE: http://www.windweaver.com/searchguide.htm#PRELIMINARY

Page 86: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

90

SEARCH TERMS AND SYNTAX

1. Enter synonyms, alternate spellings and alternate forms (e.g. dance, dancing, dances) for your search terms.

2. Enter all the singular or unique terms which are likely to be included in the document or site you are seeking.

3. Avoid using very common terms (e.g. Internet, people) which may lead to a preponderance of irrelevant search results.

4. Determine how your search engine uses capitals and plurals, and enter capitalized or plural forms of your search words if appropriate.

5. Use a phrase or proper name if possible to narrow your search and therefore retrieve more relevant results (unless you want a large number of results)

6. Use multiple operators (e.g. AND, NOT) if a search engine allows you to do so. 7. If you receive too many results, refine and improve your search. (After perusing the

results, you may become aware of how to use NOT - e.g. Boston AND hockey AND NOT Bruins)

8. Pay attention to proper spacing and punctuation in your search syntax (i.e. no space when using + means +term not +  term)

9. Words between quotations treated as ‘phrases”

Page 87: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

91

Web References

• Use full referencing– Author– Title– Publisher– Publications Date– PLUS– Date referenced– URL!

Page 88: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

92

World Wide Web

• Use a search engine to locate data on the World Wide Web

Page 89: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

93

How a search engine works

• http://www.howstuffworks.com/search-engine.htm

Type Advantages Disadvantages

“Crawler” Finds more pages

Less ‘dead links’

Less reliable page descriptions

Web directory Better page classification system

More reliable page descriptions

Many ‘dead links’

Fewer pages indexed

Page 90: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

94

Web crawler

Page 91: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

95

Web Directory

• People create the directory

Page 92: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

96

2.6 Issues

Related to information systems and databases

Page 93: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

97

Issues in handling data

• Privacy, security and accuracy

• What would be examples of inappropriate use of data in a school web site relating to the above issues?

Page 94: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

98

Acknowledgement of data sources

• Use full referencing– Author– Title– Publisher– Publications Date– PLUS– Date referenced– URL!

Page 95: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

99

Freedom of Information

In New South Wales, the Freedom of Information Act 1989 gives you the legal right to :

• Obtain access to information held as records by State Government Agencies, a Government Minister, local government and other public bodies;

• Request amendments to records of a personal nature that are inaccurate; and

• Appeal against a decision not to grant access to information or to amend personal records.

Page 96: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

100

Privacy

• The Privacy and Personal Information Protection Act was passed in 1998 and established the Office of the NSW Privacy Commissioner. The jurisdiction of the Act is generally limited to state and local government agencies.

• the Act introduces a set of privacy standards for the NSW public sector. These standards regulate the way public sector agencies deal with personal information.

Page 97: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

101

Data accuracy & reliability

• Data integrity - data should be validated and cross-checked– Check the source of the data– Check data against other sources– Use your own intelligence– Acknowledge bias

Page 98: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

102

Data Validation methods

• Range checks (dates, amounts)

• List check (against other known data e.g names)

• Type check (numerical, date, text)

• Check digit

Page 99: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

103

Access to data

• Who owns my data? Who should be able to access it?

• Should all information be free and available?

• What about medical and credit data?

Page 100: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

104

New Trends

• In the organisation, processing, storage and retrieval of data

• Syllabus refers to:– Data warehousing– Data mining

Page 101: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

105

Data Warehouse• A collection of data designed to support

management decision making. Data warehouses contain a wide variety of data that present a coherent picture of business conditions at a single point in time.

• Development of a data warehouse includes development of systems to extract data from operating systems plus installation of a warehouse database system that provides managers flexible access to the data.

• The term data warehousing generally refers to combine many different databases across an entire enterprise.

Page 102: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

106

An extra DB!

Page 103: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

107

Features of a Data Warehouse

• Strategic data (not operational)

• Temporal data (time periods)

• Summary data

• Read-only

Page 104: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

108

Data Mining• A hot buzzword for a class of database

applications that look for hidden patterns in a group of data.

• For example, data mining software can help retail companies find customers with common interests.

• The term is commonly misused to describe software that presents data in new ways.

• True data mining software doesn't just change the presentation, but actually discovers previously unknown relationships among the data.

Page 105: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

109

“Drill Down”• to move from summary information to detailed

data by focusing in on something. • To drill down through a series of folders, for

example, on a desktop means to go through the hierarchy of folders to find a specific file or to click through drop-down menus in a GUI.

• To drill down through a database is to access information by starting with a general category and moving through the hierarchy of field to file to record.

Page 106: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

110

Activity

• Identify and apply issues of ownership, accuracy, security and privacy of information.

Page 107: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

111

Activity

• Discuss issues of access to and control of information

• Validate information retrieved from the Internet

Page 108: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

112

Activity

• Design and generate reports from a database

Page 109: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

113

Activity

• Summarise, extrapolate and report on data retrieved from the Internet

Page 110: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

114

ActivityCreate a Database

• Create a simple relational database from a schematic diagram and data dictionary

• Populate a relational database with data

Page 111: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

115

ActivityCreate a Data Dictionary

• Create a data dictionary for a given set of data

Page 112: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

116

ActivityDocumentation

• Create documentation, including data modelling, to indicate how a relational database has been used to organise data

Page 113: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

117

Construct

• Construct a hypertext document from a storyboard

Page 114: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

118

ActivitySchema Changes

• Modify an existing schema to meet a change in user requirements

Page 115: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

119

Data Flow Diagram

• Diagrammatically represents the flow of information within an information system

Process

ExternalEntity

DataStore

Data Flow

Page 116: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

120

Activity

• Develop DFDs for a Library System

BorrowBook

BookBorrower

BorrowersFile

BorrowerDetails

Acquisitions

Book

OrdersFile

Book Details

Page 117: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

121

DFD - Video Store System

MovieTimeVideo

Member

VideoDatabase

Card, PIN, cash

Video, receipt

TransactionData

Member dataVideo data

Page 118: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

122

DFD - Appointment Diary

CheckDiary

Person1

Diary

Request appointment

Enter appointment

DiaryPerson

2

Enter appointment

Confirm appointment

Page 119: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

123

Backup & Security

• To copy files to a second medium (a disk or tape) as a precaution in case the first medium fails

• FIREWALL: hardware or software that prevents unauthorised access to a network

Page 120: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

124

Methods of data backupBackup Description Advantages Disadvantages

Full All data on system is backed up

Quick recovery

All data saved

Longer backup

Differential Only changed data since last full backup

Rapid backup

Minimal space

Recovery requires full backup and one incremental

Incremental Only changed data since last incremental backup

Fastest backup

Minimal space

Longer recovery - need full backup and number of incremental backups

Page 121: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

125

Design & develop a storyboard

• For each of the following situations, decide the most appropriate storyboard layout.

• Design a set of simple labelled sketches to illustrate– A. A very simple children’s educational

program on desert animals for young children– B. A glossary of medical terms for Doctors (in

alphabetical order).

Page 122: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

126

Storyboard

• A series of frames, each representing a different action or scene

• Storyboards are used to plan and organise hypermedia projects

• Usually drawn on paper; frequently edited• Contain navigation paths, content

information and graphic concepts• Simple to construct

Page 123: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

127

Storyboard Layout Options

• Linear

• Hierarchical

• Non-linear

• Combination

e.g “cartoon-like’

e.g simple web pages

e.g complex web site

Page 124: 1 Information Systems & Databases. 2 2.1 Information Systems Look at Integrated Information Processes.

128

Use Software

• Use software that links data, such as:

• HTML editors

• web page creation software or a hypertext package