Data SLA in the public cloud

Post on 10-May-2015


Data SLA in the cloud

About Us

• ScaleBase is a new startup targeting the database-as-a-service market (DBaaS)

• We offer unlimited database scalability and availability using our Database Load Balancer

• We launch in September 2010. Stay tuned at our site.

Agenda

• The requirements for data SLA in public cloud environments

• Achieving data SLA with relational databases

• Achieving data SLA with NoSQL

THE REQUIREMENTS FOR DATA SLA IN PUBLIC CLOUD ENVIRONMENTS

What We Need

• Availability
• Consistency
• Scalability

Brewer's (CAP) Theorem

• It is impossible for a distributed computer system to simultaneously provide all three of the following guarantees:
– Consistency (all nodes see the same data at the same time)
– Availability (node failures do not prevent survivors from continuing to operate)
– Partition Tolerance (the system continues to operate despite arbitrary message loss)

http://en.wikipedia.org/wiki/CAP_theorem

What It Means

http://guyharrison.squarespace.com/blog/2010/6/13/consistency-models-in-non-relational-databases.html

Dealing With CAP

• Drop Partition Tolerance
– Run everything on one machine.
– This is, of course, not very scalable.

Dealing With CAP

• Drop Availability
– If a partition fails, everything waits until the data is consistent again.
– This can be very complex to handle over a large number of nodes.

Dealing With CAP

• Drop Consistency
– Welcome to the “Eventually Consistent” term.

• In the end, everything will work out just fine, and sometimes that is a good enough solution

– When no updates occur for a long period of time, eventually all updates will propagate through the system and all the nodes will be consistent

– For a given accepted update and a given node, eventually either the update reaches the node or the node is removed from service

– Known as BASE (Basically Available, Soft state, Eventual consistency), as opposed to ACID
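The BASE behavior above can be sketched in a few lines of Java. This is a toy model with hypothetical classes (`EventuallyConsistentStore`, `Replica`), not any real NoSQL client: a write is acknowledged after reaching one replica, reads may hit a stale replica, and an anti-entropy pass eventually makes all replicas agree.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical toy model of eventual consistency.
class Replica {
    final Map<String, String> data = new HashMap<>();
}

class EventuallyConsistentStore {
    private final List<Replica> replicas = new ArrayList<>();

    EventuallyConsistentStore(int n) {
        for (int i = 0; i < n; i++) replicas.add(new Replica());
    }

    // Write is acknowledged once it reaches a single replica
    // ("Basically Available").
    void put(String key, String value) {
        replicas.get(0).data.put(key, value);
    }

    // A read may hit any replica, so it can return stale data
    // ("Soft state").
    String get(String key, int replicaIndex) {
        return replicas.get(replicaIndex).data.get(key);
    }

    // Anti-entropy pass: once updates stop, all replicas converge
    // ("Eventual consistency").
    void propagate() {
        for (Replica r : replicas) r.data.putAll(replicas.get(0).data);
    }
}
```

Before `propagate()` runs, a read from a lagging replica returns nothing; afterwards every replica sees the update.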

Reading More On CAP

• This is an excellent read, and some of my samples are from this blog:
– http://www.julianbrowne.com/article/viewer/brewers-cap-theorem

ACHIEVING DATA SLA WITH RELATIONAL DATABASES

Databases And CAP

• ACID – Consistency
• Availability – tons of solutions, most of them not cloud oriented
– Oracle RAC
– MySQL Proxy
– Etc.
– Replication-based solutions can solve at least read availability and scalability (see Azure SQL)

Database Cloud Solutions

• Amazon RDS
• NaviSite Oracle RAC

• Not that popular
– Costs to cloud providers (complexity, not standard)

So Where Is The Problem?

• Partition Tolerance just doesn’t work
• Scaling problems (usually write, but also read)
• BigData problems

Scaling Up

• Issues with scaling up when the dataset is just too big
• RDBMS were not designed to be distributed
• Began to look at multi-node database solutions
• Known as ‘scaling out’ or ‘horizontal scaling’
• Different approaches include:
– Master-slave
– Sharding

Scaling RDBMS – Master/Slave

• Master-Slave
– All writes are written to the master. All reads performed against the replicated slave databases
– Critical reads may be incorrect as writes may not have been propagated down
– Large data sets can pose problems as the master needs to duplicate data to slaves
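The master/slave split above can be sketched as a small router. The `ReplicationRouter` class and connection URLs here are hypothetical; a real deployment would put this logic behind JDBC, a driver wrapper, or a proxy such as MySQL Proxy.

```java
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch of master/slave routing: writes always go to the
// master, reads are spread round-robin across the replicated slaves.
class ReplicationRouter {
    private final String masterUrl;
    private final List<String> slaveUrls;
    private final AtomicInteger next = new AtomicInteger();

    ReplicationRouter(String masterUrl, List<String> slaveUrls) {
        this.masterUrl = masterUrl;
        this.slaveUrls = slaveUrls;
    }

    // Writes (and "critical reads" that must see the latest data)
    // always hit the master.
    String urlForWrite() { return masterUrl; }

    // Ordinary reads rotate across slaves; they may lag the master
    // until replication catches up.
    String urlForRead() {
        int i = Math.floorMod(next.getAndIncrement(), slaveUrls.size());
        return slaveUrls.get(i);
    }
}
```

Note that `urlForRead()` is exactly where the "critical reads may be incorrect" caveat bites: a slave can serve data from before the latest write.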

Scaling RDBMS - Sharding

• Partition or sharding
– Scales well for both reads and writes
– Not transparent, application needs to be partition-aware
– Can no longer have relationships/joins across partitions
– Loss of referential integrity across shards
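A minimal sketch of what "partition-aware" means for the application, assuming simple hash-based sharding (the `ShardRouter` class and shard host names are hypothetical):

```java
// Hypothetical sketch of hash-based sharding: the application picks the
// partition itself from the shard key, which is exactly why sharding is
// not transparent -- every query must be partition-aware.
class ShardRouter {
    private final int shardCount;

    ShardRouter(int shardCount) { this.shardCount = shardCount; }

    // The same key always maps to the same shard; cross-shard joins are
    // impossible because related rows may live on different shards.
    int shardFor(String shardKey) {
        return Math.floorMod(shardKey.hashCode(), shardCount);
    }

    String jdbcUrlFor(String shardKey) {
        // "shard-N" host names are placeholders for real shard servers
        return "jdbc:mysql://shard-" + shardFor(shardKey) + "/appdb";
    }
}
```

Every data-access call must first go through `shardFor()`, which is the code-level cost of sharding.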

Other ways to scale RDBMS

• Multi-Master replication
• INSERT only, not UPDATEs/DELETEs
• No JOINs, thereby reducing query time
– This involves de-normalizing data
• In-memory databases
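The INSERT-only idea can be sketched as follows (the `AppendOnlyTable` class is hypothetical; a real system would use a versioned table on disk, not an in-memory list):

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the INSERT-only pattern: instead of UPDATE,
// each change appends a new version of the row and readers take the
// most recent one. Append-only writes replicate and scale more easily,
// at the cost of extra storage.
class AppendOnlyTable {
    static class Row {
        final String key; final String value; final long version;
        Row(String key, String value, long version) {
            this.key = key; this.value = value; this.version = version;
        }
    }

    private final List<Row> rows = new ArrayList<>();
    private long nextVersion = 0;

    // An "UPDATE" is really an INSERT of a newer version.
    void put(String key, String value) {
        rows.add(new Row(key, value, nextVersion++));
    }

    // Read the latest version of the key.
    String get(String key) {
        Row latest = null;
        for (Row r : rows)
            if (r.key.equals(key) && (latest == null || r.version > latest.version))
                latest = r;
        return latest == null ? null : latest.value;
    }
}
```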

ACHIEVING DATA SLA WITH NOSQL

NoSQL

• A term used to designate databases which differ from classic relational databases in some way. These data stores may not require fixed table schemas, and usually avoid join operations and typically scale horizontally. Academics and papers typically refer to these databases as structured storage, a term which would include classic relational databases as a subset.

http://en.wikipedia.org/wiki/NoSQL

NoSQL Types

• Key/Value
– A big hash table
– Examples: Voldemort, Amazon Dynamo

• Big Table
– Big table, column families
– Examples: HBase, Cassandra

• Document based
– Collections of collections
– Examples: CouchDB, MongoDB

• Graph databases
– Based on graph theory
– Examples: Neo4J

•Each solves a different problem
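For the key/value family, a hedged sketch of the consistent-hashing ring that Dynamo-style stores use to map keys to nodes (the `HashRing` class and node names are hypothetical, and real implementations use far more virtual points per node):

```java
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical sketch of a consistent-hashing ring: keys are spread
// across nodes, and most keys stay in place when a node joins or
// leaves, which is what makes the "big hash table" scale out.
class HashRing {
    private final SortedMap<Integer, String> ring = new TreeMap<>();

    void addNode(String node) {
        // A few virtual points per node smooth out the distribution.
        for (int i = 0; i < 3; i++)
            ring.put((node + "#" + i).hashCode(), node);
    }

    // Walk clockwise from the key's hash to the first node point.
    String nodeFor(String key) {
        SortedMap<Integer, String> tail = ring.tailMap(key.hashCode());
        int point = tail.isEmpty() ? ring.firstKey() : tail.firstKey();
        return ring.get(point);
    }
}
```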

NO-SQL

http://browsertoolkit.com/fault-tolerance.png

Pros/Cons

• Pros:
– Performance
– BigData
– Most solutions are open source
– Data is replicated to nodes and is therefore fault-tolerant (partitioning)
– Don't require a schema
– Can scale up and down

• Cons:
– Code change
– No framework support
– Not ACID
– Eco-system (BI, Backup)
– There is always a database at the backend
– Some APIs are just too simple

Amazon S3 Code Sample

AWSAuthConnection conn = new AWSAuthConnection(
    awsAccessKeyId, awsSecretAccessKey, secure, server, format);
Response response = conn.createBucket(bucketName, location, null);

final String text = "this is a test";
response = conn.put(bucketName, key,
    new S3Object(text.getBytes(), null), null);

Cassandra Code Sample

CassandraClient cl = pool.getClient();
KeySpace ks = cl.getKeySpace("Keyspace1");

// insert value
ColumnPath cp = new ColumnPath("Standard1", null,
    "testInsertAndGetAndRemove".getBytes("utf-8"));
for (int i = 0; i < 100; i++) {
    ks.insert("testInsertAndGetAndRemove_" + i, cp,
        ("testInsertAndGetAndRemove_value_" + i).getBytes("utf-8"));
}

// get value
for (int i = 0; i < 100; i++) {
    Column col = ks.getColumn("testInsertAndGetAndRemove_" + i, cp);
    String value = new String(col.getValue(), "utf-8");
}

// remove value
for (int i = 0; i < 100; i++) {
    ks.remove("testInsertAndGetAndRemove_" + i, cp);
}

Cassandra Code Sample – Cont’

try {
    ks.remove("testInsertAndGetAndRemove_not_exist", cp);
} catch (Exception e) {
    fail("removing a non-existent row should not throw exceptions");
}

// get already removed value
for (int i = 0; i < 100; i++) {
    try {
        Column col = ks.getColumn("testInsertAndGetAndRemove_" + i, cp);
        fail("the value should already be deleted");
    } catch (NotFoundException e) {
        // expected
    } catch (Exception e) {
        fail("threw another exception, should be NotFoundException: " + e.toString());
    }
}

pool.releaseClient(cl);
pool.close();

Cassandra Statistics

• Facebook Search
• MySQL, > 50 GB of data
– Writes average: ~300 ms
– Reads average: ~350 ms
• Rewritten with Cassandra, > 50 GB of data
– Writes average: 0.12 ms
– Reads average: 15 ms

MongoDB

Mongo m = new Mongo();
DB db = m.getDB("mydb");
Set<String> colls = db.getCollectionNames();
for (String s : colls) {
    System.out.println(s);
}

MongoDB – Cont’

// obtain the collection to insert into
DBCollection coll = db.getCollection("testCollection");

BasicDBObject doc = new BasicDBObject();
doc.put("name", "MongoDB");
doc.put("type", "database");
doc.put("count", 1);

BasicDBObject info = new BasicDBObject();
info.put("x", 203);
info.put("y", 102);
doc.put("info", info);

coll.insert(doc);

Neo4J

GraphDatabaseService graphDb = new EmbeddedGraphDatabase("var/base");
Transaction tx = graphDb.beginTx();
try {
    Node firstNode = graphDb.createNode();
    Node secondNode = graphDb.createNode();
    Relationship relationship = firstNode.createRelationshipTo(
        secondNode, MyRelationshipTypes.KNOWS);

    firstNode.setProperty("message", "Hello, ");
    secondNode.setProperty("message", "world!");
    relationship.setProperty("message", "brave Neo4j ");
    tx.success();

    System.out.print(firstNode.getProperty("message"));
    System.out.print(relationship.getProperty("message"));
    System.out.print(secondNode.getProperty("message"));
} finally {
    tx.finish();
    graphDb.shutdown();
}

THE BOTTOM LINE

Data SLA

• There is no golden hammer
• Choose your tool wisely, based on what you need
• Usually:
– Start with an RDBMS (shortest TTM, which is what we really care about)
– When scale issues occur, start moving to NoSQL based on your needs
• You can get Data SLA in the cloud. Just think before you code!