Download - A Case for Object/Relational Persistence Tools

Transcript
Page 1: A Case for Object/Relational Persistence Tools

A Case for Object/Relational Persistence Tools

Scott McCrory, Bank One Corporation

Original Author: Donald Smith, Technology Director of Oracle Corporation

Columbus Association for Internet Professionals (AIP)

Page 2: A Case for Object/Relational Persistence Tools

J2EE Apps and Relational Data

J2EE is one of the leading technologies used for building n-tier, web applications

– J2EE based on object technology Relational databases are the most common

data source accessed by J2EE apps They are diverse technologies that need to be

used together This discussion identifies a few of the issues

to be considered

Page 3: A Case for Object/Relational Persistence Tools

Underestimation

Managing persistence related issues is the most underestimated challenge in

enterprise Java today – in terms of complexity, effort and maintenance

Page 4: A Case for Object/Relational Persistence Tools

J2EE Architectures

J2EE Architecture Options– Servlets– JSP– Session Beans– Message Driven Beans– Web Services

Bottom Line – Java application needs to access relational data. But how?…

Page 5: A Case for Object/Relational Persistence Tools

JDBC

Java standard for accessing databases

JDBC is simply the database connection utilities Java developers need to build upon

SQLrows

JDBC

Page 6: A Case for Object/Relational Persistence Tools

J2EE Access of Relational Data

Direct JDBC – Direct SQL calls, uses rows and ResultSets– Code can be very fast and efficient, but requires extensive

hand-coding and manipulation of typeless objects.– Approach lends itself to high repetition, low re-use,

blending of application tiers and DB connection leaks Object view

– Accessed as objects or components, transparent that the data is stored in relational database

– Need persistence layer in middle tier to handle the object-relational mapping and conversion

– Focus of this talk…

Page 7: A Case for Object/Relational Persistence Tools

Impedance Mismatch

The differences in relational and object technology is known as the “object-relational impedance mismatch”

Challenging problem to address because it requires a combination of relational database and object expertise

Page 8: A Case for Object/Relational Persistence Tools

Impedance MismatchFactor J2EE Relational Databases

Logical Data Representation

Objects, methods, inheritance

Tables, SQL, stored procedures

Scale Hundreds of megabytes Gigabytes, terabytes

Relationships Object references Foreign keys

Uniqueness Internal object id Primary keys

Key Skills Java development, object modeling

SQL, Stored Procedures, data management

Tools IDE, Source code management, Object Modeler

Schema designer, query manager, performance profilers, database configuration

Page 9: A Case for Object/Relational Persistence Tools

Object Level Options

Depends on what component architecture is used:

– 1. EJB Entity Beans BMP – Bean Managed Persistence

– 2. EJB Entity Beans CMP – Container Managed Persistence

– 3. Access Java objects via Persistence Layer

Page 10: A Case for Object/Relational Persistence Tools

ejbLoad() - “load yourself”ejbStore() - “store yourself”ejbCreate() - “create yourself”findBy…() - “find yourself”ejbRemove() - “remove yourself”

1. Entity Beans - BMP

In BMP, developers write the persistence code themselves – not trivial!…

Database reads and writes occur in specific methods defined for bean instances

The container callsthese methods - usually onmethod or transactionboundaries

Page 11: A Case for Object/Relational Persistence Tools

2. Entity Beans - CMP The Good: Persistence is based on information in the

deployment descriptors– More “automatic” persistence – managed by the Application

Server, can be faster than BMP– No special persistence code in the bean– Description of the persistence done with tools and XML files

The Bad: Less control, persistence capabilities are limited to the functionality provided

– Very difficult to customize or extend CMP features, since it is built-in to the container.

– You do have options to plug-in a 3rd party CMP solution on an app server but…

– You have to be careful of vendor/server/plugin tie-in

Page 12: A Case for Object/Relational Persistence Tools

3. Object Persistence Layer Abstracts persistence details from the application

layer, supports Java objects/Entity Beans

SQLrows

Objects

Persistence Layer

Objectsobject-level

querying and creationresults are objects

results arereturned as

raw data

API uses SQLor DATABASEspecific calls

object creation and updates through OBJECT-level API

J2EE & J2EE & Web Web

ServicesServices

JDBC

Page 13: A Case for Object/Relational Persistence Tools

General J2EE Persistence Interaction

Application business objects/components are modeled and mapped to relational data store

Java business objects/Entity Beans are created from relational data

Objects are edited, traversed, processed, created, deleted, cached, locked etc.

Store objects on the database Multiple concurrent clients sharing database

connections Contributing factors…

Page 14: A Case for Object/Relational Persistence Tools

J2EE Developer Bias

Data model should not constrain object model Don’t want database code in object/component code Accessing data should be fast Minimize calls to the database – they are expensive Want them to be object-based queries – not SQL Isolate J2EE app from schema changes Would like to be notified of changes to data occurring

at database

Page 15: A Case for Object/Relational Persistence Tools

DBA Bias

Adhere to rules of database (referential integrity, stored procedures, sequence numbers etc.)

Build the J2EE application but the schema might change

Let DBA influence/change database calls/SQL generated to optimize

Be able to log all SQL calls to database Leverage database features where appropriate (outer

joins, sub queries, specialized database functions)

Page 16: A Case for Object/Relational Persistence Tools

Differences

Desires are contradictory– “Insulate application from details of database but

let me leverage the full power of it”– Different skill sets– Different methodologies– Different tools

Technical differences must also be considered!

Page 17: A Case for Object/Relational Persistence Tools

Do You Want to Build or Buy?

A homebrewed Example…

Page 18: A Case for Object/Relational Persistence Tools

A Homebrewed Example

Corporate Security PASSPort– AbstractDAO, AreaDAO, BadgeDAO

Hard-coded mappings in several places DB-specific SQL (Informix) No lazy initialization – leads to fillCategoryInfo(),

getBadgesWithLimitedData(), etc. If that’s not enough…

Page 19: A Case for Object/Relational Persistence Tools

Basic J2EE Persistence Checklist Mappings Object traversal Queries Transactions Optimized database interaction Database Features Database Triggers and Cascade Deletes Caching Locking

Page 20: A Case for Object/Relational Persistence Tools

Mapping

Object model and Schema must be mapped– True for any persistence approach

Most contentious issue facing designers– Which classes map to which table(s)?– How are relationships mapped?– What data transformations are required?

Page 21: A Case for Object/Relational Persistence Tools

Good and Poor Mapping Support

Good mapping support:– Domain classes don’t have to be “tables”– References should be to objects, not foreign keys– Database changes (schema and version) easily handled

Poor mapping support:– Classes must exactly mirror tables– Middle tier needs to explicitly manage foreign keys– Classes are disjointed– Change in schema requires extensive application changes

Page 22: A Case for Object/Relational Persistence Tools

Mapping Tools

GUIs are nice, but the underlying mapping support is what’s important

Page 23: A Case for Object/Relational Persistence Tools

Data and Object Models

Rich, flexible mapping capabilities provide data and object models a degree of independence

Otherwise, business object model will force changes to the data schema or vice-versa

Often, J2EE component models are nothing more than mirror images of data model – NOT desirable – O/R tool must be able to abstract this

An example of the need to handle data model complexity…

Page 24: A Case for Object/Relational Persistence Tools

Simple Object Model

Customer

id: intname: StringcreditRating: int

Address

id: intcity: Stringzip: String

1:1 Relationship

Page 25: A Case for Object/Relational Persistence Tools

Typical 1-1 Relationship Schema

CUST

ID NAME A_IDC_RATINGADDR

ID CITY ZIP

“That’s how I’d do it – this is easy stuff…”

Page 26: A Case for Object/Relational Persistence Tools

Other possible Schemas…

CUST

ID NAME C_RATE C_ID

ADDR

ID CITY ZIP

CUST

ID NAME C_RATING C_ID

ADDR

ID CITY ZIP

A_ID

CUST_ADDR

C_ID

CUST

ID NAME CITY ZIPC_RATING

Page 27: A Case for Object/Relational Persistence Tools

Even More Schemas…

CUST

ID NAME

ADDR

ID CITY ZIP

CUST_CREDIT

ID C_RATING A_IDCC_ID

CUST

ID NAME

ADDR

ID CITY ZIP

CUST_CREDIT

ID C_RATING A_ID

CUST

ID NAME A_ID

ADDR

ID CITY ZIP

CUST_CREDIT

ID C_RATING

CUST

ID NAME

CUST_CREDIT

ID C_RATING

ADDR

ID CITY ZIP C_ID

CUST

ID NAME

CUST_CREDIT

ID C_RATING

ADDR

ID CITY ZIP C_ID

!!!

Page 28: A Case for Object/Relational Persistence Tools

Mapping Summary

Just showed nine valid ways a 1-1 relationship could be represented in a database

– Most persistence layers and application servers will only support one

Without good support, designs will be forced Imagine the flexibility needed for other

mappings like 1-M and M-M

Page 29: A Case for Object/Relational Persistence Tools

Object Traversal – Lazy Reads

J2EE applications work on the scale of a few hundreds of megabytes

Relational databases routinely manage gigabytes and terabytes of data

Persistence layer must be able to transparently fetch data “just in time,” usually called “lazy reads” or “lazy instantiation.”

Page 30: A Case for Object/Relational Persistence Tools

Just in Time Reading – Faulting Process

Customer

Order

Proxy

Accessing relationship for first time

Get related object based on FK

SQL if not cached

Check Cache

Plug result into Proxy

OrderOrder

Page 31: A Case for Object/Relational Persistence Tools

Object Traversals

Even with lazy reads, object traversal is not always ideal

– To find a phone number for the manufacturer of a product that a particular customer bought, may do several queries:Get customer in questionGet orders for customerGet parts for orderGet manufacturer for partGet address for manufacturer

– Very natural object traversal results in 5 queries to get data that can be done in 1

Page 32: A Case for Object/Relational Persistence Tools

N+1 Reads Problem

Many persistence layers and application servers have an N+1 reads problem

Causes N subsequent queries to fetch related data when a collection is queried for

This is usually a side effect of the impedance mismatch and poor mapping and querying support in persistence layers

“Nah, the client says we’ll never have to traverse Customers, Accounts, Transactions and Addresses…”

Page 33: A Case for Object/Relational Persistence Tools

N+1 Reads Problem

Pool of Created Objects or Beans

PersistenceLayer or EJB

Container

findByCity()

1findByCity()

2

For each CustomerFetch their Address

Address

46

4

n

5 5

3 Returns collection

n

If Address had related objects, they too may be

fetched 2n+1 Reads!

Container returns results

C C C C

Page 34: A Case for Object/Relational Persistence Tools

N+1 Reads

Must have solution to minimize queries Need flexibility to reduce to 1 query, 1+1

query or N+1 query where appropriate– 1 Query when displaying list of customers and

addresses – known as a “Join Read”– 1+1 Query when displaying list of customers and

user may click button to see addresses – known as a “Batch Read”

– N+1 Query when displaying list of customers but only want to see address for selected customer

Page 35: A Case for Object/Relational Persistence Tools

Queries

Java developers are not usually SQL experts– Maintenance and portability become a concern

when schema details hard-coded in application

Allow Java based queries that are translated to SQL and leverage database options

– EJB QL, object-based proprietary queries, query by example

Page 36: A Case for Object/Relational Persistence Tools

Queries Persistence layer handles object queries and converts

to SQL for that specific database engine SQL issued should be as efficient as written by hand

(or pretty close to it) Should utilize other features to optimize

– Parameter binding, cached statements

Some benefits to dynamically generated SQL :– Ability to create minimal update statements

Only save objects and fields that are changed– Simple query-by-example capabilities

Page 37: A Case for Object/Relational Persistence Tools

Query Requirements

Must be able to trace and tune SQL Must be able use ad hoc SQL where

necessary Must be able to leverage database abilities

– Outer joins– Nested queries– Stored Procedures– Oracle Hints & other vendor optimizations

Page 38: A Case for Object/Relational Persistence Tools

Caching for Speed Any application that caches data, now has to

deal with stale data When and how to refresh? Will constant refreshing overload the

database? Problem is compounded in a clustered

environment App server may want be notified of database

changes

“Caching and clustering? My code is so fast, it’ll never need it…”

Page 39: A Case for Object/Relational Persistence Tools

Caching

QuerySQL Query (if needed)

Results(s)

Does PK for row exist in cache?

YES – Get from Cache

NO – Build bean/object from results

Return object results

Page 40: A Case for Object/Relational Persistence Tools

Cascaded Deletes

Cascaded deletes done in the database have a real effect on what happens at J2EE layer

Middle tier app must:– Be aware a cascaded delete is occurring– Determine what the “root” object is– Configure persistence settings or application

logic to avoid deleting related objects already covered by cascaded delete

Page 41: A Case for Object/Relational Persistence Tools

Database Triggers

Database triggers will be completely transparent to the J2EE application

However, their effects must be clearly communicated and considered

Example: Data validation –> audit table– Objects mapped to an audit table that is only

updated through triggers, must be read-only on J2EE

Page 42: A Case for Object/Relational Persistence Tools

Database Triggers

More challenging when trigger updates data in the same row and the data is also mapped into an object

Example: Annual salary change automatically triggers update of life insurance premium payroll deduction

– J2EE app would need to re-read payroll data after salary update OR

– Duplicate business logic to update field to avoid re-read– Saves a DB call but now business logic in 2 places

“I’ll just never allow the DBAs to use triggers...”

Page 43: A Case for Object/Relational Persistence Tools

Referential Integrity

Java developers manipulate object model in a manner logical to the business domain

May result in ordering of INSERT, UPDATE and DELETE statements that violate database constraints

Persistence layer should automatically manage this and allow options for Java developer to influence order of statements

Page 44: A Case for Object/Relational Persistence Tools

Transaction Management

J2EE apps typically support many clients sharing small number of db connections

Ideally would like to minimize length of transaction on database

Tim

e

Begin Txn

Commit TxnBegin Txn

Commit Txn

Page 45: A Case for Object/Relational Persistence Tools

Locking

J2EE developers want to think of locking at the object level

Databases may need to manage locking across many applications

Persistence layer or application server must be able to respect and participate in locks at database level

“Not a problem, my app is the only one that will ever need to access this database…”

Page 46: A Case for Object/Relational Persistence Tools

Optimistic Locking

DBA may wish to use version, timestamp and/or last update field to represent an optimistic lock

– Java developer may not want this in their business model

– Persistence layer must be able to abstract this

Must be able to support using any fields including business domain

Page 47: A Case for Object/Relational Persistence Tools

Pessimistic Locking Requires careful attention as a JDBC

connection is required for duration of pessimistic lock

Should support SELECT FOR UPDATE [NOWAIT] semantics

Tim

e

Begin Txn

Commit Txn

Begin Txn

Commit Txn

Pess Lock

Page 48: A Case for Object/Relational Persistence Tools

Other Impacts

Use of special types– BLOB, Object Relational

Open Cursors Batch Writing Sequence number allocations Connection pooling Audit logging

Page 49: A Case for Object/Relational Persistence Tools

Still Want to Write Your Own?

Page 50: A Case for Object/Relational Persistence Tools

Some Front-Running O/R Tools

Oracle TopLink– Rich graphical mapper– Very strong feature set– Somewhat expensive & proprietary

Hibernate– Open-Source, free, mature and popular – current version = 2.1– Strong feature set, and a graphical mapper is available with an Eclipse add-on– Very well documented – book to be released late 2004– Recently adopted by JBoss as their non-CMP persistence core– Author Gavin King works with JBoss and is on the JDO 2.0 board

Apache OJB– Open-Source, free and popular– Very clean design & has mature Query by Example (QbE)– Immature M:M support and lacks many features due to “clean” design constraints– Not yet at a 1.0 release as of January 2004

Page 51: A Case for Object/Relational Persistence Tools

Some Front-RunningO/R Tools (Continued)

Sun JDO Implementations (I.e. Kodo)– JDO 1.0 is a ratified spec, but lacks even basic query and life-cycle support– Requires implementation of persistence interfaces and Java bytecode

enhancement– JDO 2.0 looks promising, but not due for 6-12 months

Castor JDO– Proprietary, moderately expensive and not “real” (Sun) JDO

Jakarta Torque– Like Jakarta Turbine, it is sophisticated, but complex & not well documented

Page 52: A Case for Object/Relational Persistence Tools

The Hibernate Nickel-Tour (1/5)

Create a database table…

Page 53: A Case for Object/Relational Persistence Tools

The Hibernate Nickel-Tour (2/5)

Create a plain old java object (“POJO”)…

Page 54: A Case for Object/Relational Persistence Tools

The Hibernate Nickel-Tour (3/5)

Configure Hibernate’s session factory…

Currently supports Oracle, DB2, MySQL, PostgreSQL, Sybase, SAP DB, HypersonicSQL, Microsoft SQL Server, Informix, FrontBase, Ingres, Progress, Mckoi SQL, Pointbase and Interbase.

Page 55: A Case for Object/Relational Persistence Tools

The Hibernate Nickel-Tour (4/5)

Define the class and field mappings…

Page 56: A Case for Object/Relational Persistence Tools

The Hibernate Nickel-Tour (5/5)

Creating new data…

Reading data…

Page 57: A Case for Object/Relational Persistence Tools

Some Final O/R Considerations

• No O/R tool is perfect, and the field is not yet fully mature, but several (including TopLink and Hibernate) offer features, stability and value worth implementing today.

• JDO 2.0 developments should be carefully watched, but not waited for (I.e. Sun JCP).

• EJB CMP, Web Service and Business Service architecture strategies should be clearly articulated before employing any large-scale persistence framework, but don’t let this hold you up for medium-sized applications.

Page 58: A Case for Object/Relational Persistence Tools

Q&Q U E S T I O N SQ U E S T I O N SA N S W E R SA N S W E R S