SSD Aware Scan Operation Optimization in PostgreSQL Database

A Study on SSD Aware Scan Operation Optimization in PostgreSQL Database

SSDs vs Traditional Spin Type HDDs

SSDs

- Silicon memory chips
- No moving parts
- No rotational delay
- Near-zero seek time

Random and sequential block access times are almost the same!

But ...

The cost models in RDBMSs are based on the characteristics of spin type HDDs.

- They assume random_block_access_time > sequential_block_access_time
- With SSDs this assumption is not valid
- Are there opportunities for improvement?

Background information

Scan operation

- SELECT * FROM table WHERE condition

Selectivity

Scan operation alternatives in PostgreSQL

- Heap scan (sequential scan)
- Bitmap index scan (BIS) + bitmap heap scan (BHS)
- Index scan

Our Hypothesis

An index scan on a secondary index can perform better than the other scan operations in databases that run on SSD storage media.

This is based on the fact that on SSDs the random block access cost is almost the same as the sequential block access cost.

Our Hypothesis (Continued)

SELECT * FROM table WHERE column = val

- column is indexed (secondary index, not primary)
- correlation between the primary index and the secondary index is zero

Methodology

- Kingston 8GB Data Traveler
- Dedicated PC running Ubuntu 12.04 (i5 2.3 GHz processor and 4GB system memory)
- PostgreSQL 9.3
- Table with 36 columns, 6,000,000 rows of data
- SELECT * FROM table_1 WHERE column_1 > val_1 AND column_1 < val_2
- 1.7 GB of data (with indexes)

Methodology (Continued)

- Numeric field “idx_column” indexed using a btree index
- Correlation between the primary index and the secondary index is 0.000000…
- Cardinality of the “idx_column” field is 933900
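The results that follow are reported against a logarithmic selectivity axis. The slides do not define that axis, so the sketch below rests on an assumption: that the value is log10 of the percentage of rows matched, which would make 2 mean 100% of the table. The function name `expected_rows` is made up for illustration; only the row count of 6,000,000 comes from the methodology.

```python
# Assumption (not stated in the slides): "Selectivity (log)" is
# log10 of the percentage of rows that the range predicate matches.
TOTAL_ROWS = 6_000_000  # table size from the methodology slide


def expected_rows(log_selectivity_pct, total_rows=TOTAL_ROWS):
    """Rows matched if 10**log_selectivity_pct percent of the table qualifies."""
    percent = 10 ** log_selectivity_pct
    return round(total_rows * percent / 100)


if __name__ == "__main__":
    for log_sel in range(-4, 3):
        print(f"selectivity (log) {log_sel:>2}: ~{expected_rows(log_sel):,} rows")
```

Under this reading, the lowest selectivity matches only a handful of rows and the highest matches the whole table.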

Running times by scan type (ms)

Selectivity (log)   Seq scan   BIS + BHS   Index scan
-4                     10594           0            0
-3                     10269           1            0
-2                     10255           9            4
-1                     10260          94           44
 0                     10278         644          457
 1                     10407        8794         4915
 2                     11600       16528        49395
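To read off which scan wins at each selectivity, the measurements above can be put into a small lookup. This is only a sketch over the reported numbers; the helper name `fastest_plans` is hypothetical, and ties are returned together rather than broken arbitrarily.

```python
# Running times (ms) copied from the results table, keyed by selectivity (log).
RUNNING_TIMES_MS = {
    -4: {"seq scan": 10594, "BIS + BHS": 0, "index scan": 0},
    -3: {"seq scan": 10269, "BIS + BHS": 1, "index scan": 0},
    -2: {"seq scan": 10255, "BIS + BHS": 9, "index scan": 4},
    -1: {"seq scan": 10260, "BIS + BHS": 94, "index scan": 44},
    0: {"seq scan": 10278, "BIS + BHS": 644, "index scan": 457},
    1: {"seq scan": 10407, "BIS + BHS": 8794, "index scan": 4915},
    2: {"seq scan": 11600, "BIS + BHS": 16528, "index scan": 49395},
}


def fastest_plans(times_ms):
    """Return the sorted names of all scan types that share the minimum time."""
    best = min(times_ms.values())
    return sorted(name for name, t in times_ms.items() if t == best)


if __name__ == "__main__":
    for log_sel, times in RUNNING_TIMES_MS.items():
        plans = ", ".join(fastest_plans(times))
        print(f"selectivity (log) {log_sel:>2}: fastest = {plans}")
```

In these measurements the index scan is never slower than BIS + BHS until the highest selectivity, where the sequential scan takes over.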

In PostgreSQL

- random_block_access_time = 4 * seq_block_access_time
- This assumes spin type HDDs

What is the relation on SSDs?

- random_block_access_time = seq_block_access_time ?
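A toy model may help show why this ratio matters to the planner. The sketch below is not PostgreSQL's actual cost formula: it assumes a sequential scan reads every page once, and that with zero correlation each matching row costs one random page read. The page count and row count are hypothetical. In PostgreSQL itself the two costs correspond to the `seq_page_cost` and `random_page_cost` planner settings.

```python
# Toy planner model, NOT PostgreSQL's real cost formulas.
# Assumption: zero correlation, so every matching row needs one random page read.

def seq_scan_cost(table_pages, seq_cost=1.0):
    """Cost of reading every page of the table sequentially."""
    return table_pages * seq_cost


def index_scan_cost(matching_rows, random_cost):
    """Cost of fetching each matching row with one random page read."""
    return matching_rows * random_cost


def cheaper_plan(table_pages, matching_rows, random_cost, seq_cost=1.0):
    """Pick the plan with the lower estimated cost under the toy model."""
    seq = seq_scan_cost(table_pages, seq_cost)
    idx = index_scan_cost(matching_rows, random_cost)
    return "index scan" if idx < seq else "seq scan"


if __name__ == "__main__":
    table_pages = 200_000    # hypothetical table size in pages
    matching_rows = 100_000  # hypothetical number of qualifying rows
    for random_cost in (4.0, 1.0):  # HDD-style ratio vs. SSD-style ratio
        plan = cheaper_plan(table_pages, matching_rows, random_cost)
        print(f"random cost = {random_cost} x seq cost -> {plan}")
```

With the HDD-style ratio of 4 the model prefers the sequential scan for this query, while a ratio of 1 moves the break-even point far enough that the index scan is estimated to be cheaper.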

Running times before and after optimization

Selectivity (log)   Before optimization (ms)   Optimum (ms)   After optimization (ms)   Cost reduction (ms)   Cost reduction (%)
-4                                         0              0                         0                     0                    -
-3                                         1              0                         0                     1                  100
-2                                         9              4                         4                     5                   56
-1                                        94             44                        44                    50                   53
 0                                       644            457                       457                   187                   29
 1                                      8794           4915                      4915                  3879                   44
 2                                     11600          11600                     11600                     0                    0
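The two cost reduction columns follow directly from the before and after running times. A minimal sketch of that arithmetic, using a hypothetical helper `cost_reduction` and rounding the percentage to the nearest integer as the table appears to do:

```python
# (before, after) running times in ms, copied from the table above.
BEFORE_AFTER_MS = {
    -4: (0, 0),
    -3: (1, 0),
    -2: (9, 4),
    -1: (94, 44),
    0: (644, 457),
    1: (8794, 4915),
    2: (11600, 11600),
}


def cost_reduction(before_ms, after_ms):
    """Return (saved ms, saved percent); percent is None when before is 0."""
    saved = before_ms - after_ms
    percent = None if before_ms == 0 else round(100 * saved / before_ms)
    return saved, percent


if __name__ == "__main__":
    for log_sel, (before, after) in BEFORE_AFTER_MS.items():
        saved, percent = cost_reduction(before, after)
        shown = "-" if percent is None else f"{percent}%"
        print(f"selectivity (log) {log_sel:>2}: saved {saved} ms ({shown})")
```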

Are we done?

We haven’t considered an important factor:

- the size of the table relative to the system memory

Observations

- Sequential scan remains consistent for all the system memory values. Why?
- Both BIS + BHS and index scan drastically underperform when system memory is reduced.
- BIS + BHS performs slightly better than index scan.

So the optimization works only in special conditions, where at least the majority of the table content can reside in main memory.

- Does this mean the optimization is of no use?
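A rough way to judge whether a setup meets that condition is to compare the data size with the available memory. The sketch below is a deliberate simplification: it ignores memory used by the operating system and by PostgreSQL itself, and the memory sizes in the loop are hypothetical, not the values used in the study. Only the 1.7 GB data size comes from the methodology.

```python
def cacheable_fraction(table_gb, memory_gb):
    """Upper bound on the share of the table that could sit in memory."""
    if table_gb <= 0:
        raise ValueError("table_gb must be positive")
    return min(1.0, memory_gb / table_gb)


def majority_fits(table_gb, memory_gb):
    """True when more than half of the table could reside in memory."""
    return cacheable_fraction(table_gb, memory_gb) > 0.5


if __name__ == "__main__":
    table_gb = 1.7  # data size (with indexes) from the methodology slide
    for memory_gb in (4.0, 2.0, 1.0, 0.5):  # hypothetical memory sizes
        share = cacheable_fraction(table_gb, memory_gb)
        print(f"{memory_gb} GB memory: up to {share:.0%} cacheable, "
              f"majority fits = {majority_fits(table_gb, memory_gb)}")
```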

Potential of this optimization

- Databases with small tables
- Embedded devices
- Mobile phones etc.

Questions?
