EXASOL Dual Data Center - Sphinx

24
EXASOL Dual Data Center Using a dual data center approach for Disaster Recovery

Transcript of EXASOL Dual Data Center - Sphinx

Page 1: EXASOL Dual Data Center - Sphinx

EXASOL Dual Data Center

Using a dual data center approach for Disaster Recovery

Page 2: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 2

Agenda

ARZ – Who are we?

Why Exasol with Dual Data Center?

How does it work?

Prerequisites

Configuration

Switchover

Tooling

Advantages

Page 3: EXASOL Dual Data Center - Sphinx

Financial solution provider in Tyrol

ARZ – Who are we?

Page 4: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 4

ARZ – Who are we?

• Computing centre for banks in Austria

• Owned by customers

• Mainly Volksbanken and Hypo but also private banks and health institutions

• ~ 500 employees

• Complete service

o IT acquisition

o Workstation Management

o Dev and enhancing core banking system

o SaaS

o ...

Page 5: EXASOL Dual Data Center - Sphinx

Origins of the Dual Data Center Approach

Why Exasol with Dual Data Center?

Page 6: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 6

A History of Data in ARZ

Data Warehousing at ARZ - 2000

• Used for Basel I reporting regulations

• Core banking system does not keep history of data

• Monthly snapshots

• Only database platform we had: DB2 zOS IBM Mainframe

• Development started in Cobol

• High Costs for this platform as primarily used for Mission critical OLTP systems

Start of DB2 for Unix as Reporting Database - 2003

• Creation of Data Marts for Reporting

• Setup of Business Objects (SAP BI) as Reporting Tool

Setup of Datastage as ETL Tool (now IBM Information Server) – 2011

Start of a redesign project for our data platform - 2015

Page 7: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 7

Evaluation Project for new Database Platform

• Started Feb 2015 with 7 Vendors

• Ended Juli 2015

• PoC with Netezza, Exasol and SAP HANA

• Main Focus was

o Ease of development

o Ease of administration

o Performance

o Dual Data Center Solution

Page 8: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 8

Why Exasol with Dual Data Center?

• ARZ owns 2 data centers

o 16 km line length

o 2x 20 Gb dedicated direct connections between the data centers

o Network latency 120 µs

• Every system connected to our regulation reporting process has to be clustered

spanning both data centers

o Max delay for reports to the national bank is within hours

o High penalties for failure to report

• Blackout of one data center once a year as a disaster test

o Varying scenarios every year

o This year: simulation of failure of cooling system

Page 9: EXASOL Dual Data Center - Sphinx

Planning vs. reality

What do we do with Exasol?

Page 10: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 10

Processing Chain

Unload Sources

•Unload from systems that we don’t have direct access to

Load Files / Direct Access

•Load unloaded files

•Load files from external systems (non ARZ systems)

•Load data from systems from within Exasol (using Exasol Connections)

RAW Layer

•Only Data Type checks (through Exasol import)

CDWH Layer

•Data Cleansing

•Reshape into new data model

Compute Cores

•Enrichment

•Analytics

•Master Data Management

Page 11: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 11

Planned Data Processing vs Reality

• After the evaluation project we planned for

o Complete redesign of our data model

3NF data model enhanced by surrogate keys (SHA-1 over business key)

we had a look at Data Vault modeling but didn’t go that way

o 90 GB of RAW Data per day

o 90 days of daily time slices

o Infinite amount of ultimo time slices

• Currently

o Data Model completely redesigned

o 600 GB of RAW Data per day compressed 100 GB

o Time for load + data cleansing and reshape into new data model 3 hours

Page 12: EXASOL Dual Data Center - Sphinx

Prerequisites

How does it work?

Page 13: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 13

Prerequisites

“Common” Exasol installation

Page 14: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 14

Prerequisites

9 active nodes per side

1 standby node per side

License server is a

VMWare Image

Can be switched to

secondary DC

Uses mirrored storage

Stretch Cluster Setup

Page 15: EXASOL Dual Data Center - Sphinx

Switchover

How does it work?

Page 16: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 16

Switchover

Switchover

• Check if storage is in synco Syslog monitor for segment recovery

o Problem might be network problems prior to switchover

• Move License-Server VMWare to secondary DC

• Stop database on primary site

• Start database on secondary site

• Stop nodes on primary siteo Otherwise the master segments will be accessed remotely

Switchback

• Bring up primary nodes

• Wait for segment recovery

• Move License-Server VMWare to primary DC

• Stop database on secondary site

• Start database on primary site

Page 17: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 17

Switchover Problems

• Hard power down (explosion)

o Storage volumes get locked on abnormal power failure

o Exasol Support has to unlock volumes

• When switching to secondary DC power down of primary DC nodes recommended

o Severe performance degradations when secondary database is accessing master data in primary

DC

• Connect string for apps contains active/passive site

o Longer connect time compared to regular installations

Page 18: EXASOL Dual Data Center - Sphinx

Tooling

How does it work?

Page 19: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 19

Tooling

• Custom build Python Script to manage whole process and more

o Used by Operations Department in ARZ

• Make it foolproof

o Operations has to Manage > 1000 Systems no time for complex checks

o Errors may lead to corrupted data ongoing segment recovery, ...

• Automatic Shutdown of Nodes if takeover to secondary site

• Automatic Startup and wait for seg recovery on takeback from secondary site

Page 20: EXASOL Dual Data Center - Sphinx

Advantages compared to other vendors

Advantages

Page 21: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 21

Advantages

• Cheap license

o No full license for secondary site required

o Only additional hardware in secondary DC

• Functionality build in

o No extra components required

• Save even during network outages

o Quorum build in through license server

• No Impact on Query Performance

• Low Impact on Load Performance

o Is dependent on your network connection (latency)

o In our Installation < 5 %

Page 22: EXASOL Dual Data Center - Sphinx

Questions

Page 23: EXASOL Dual Data Center - Sphinx

ARZ Allgemeines Rechenzentrum GmbH 23

DBeaver Open Source

• Open Source DB-Query Tool

o I build an Exasol Plugin for DBeaver (shipped with DBeaver)

• Some enhancements to EXAPlus

o Column Auto-Complete

o Direct Table Data Editor

o Multiple Table Import/Export

o Manage Database Sessions

o Support for Virtual Schemas (Exasol V6)

o Display Table Statistics in Details (Size, Compressed Size …)

o Security Browser (Roles, Users)

o “Explain” SQL Feature

• Constantly enhanced

• I’m open for feature requests, bug reports …

• Details https://goo.gl/w2F3Wk

Page 24: EXASOL Dual Data Center - Sphinx

KONTAKT

Karl Grießer

Data and Information Management

DB2 LUW, Oracle, Exasol, Imperva

[email protected]

+43 50400 91166