
Adrianna Diaz Siddhartha Kattoju

Exploring the Blockchain

The Ethereum blockchain

In February: 1 Ether ~ $800 USD. Now: 1 Ether ~ $400 USD.

● Decentralized
● Global distributed ledger
● Programmable via smart contracts
● Each state change requires a cryptographically verified transaction.

Data Preparation

● Blockchain data is stored on each node as Merkle trees in LevelDB files.
● Data can be queried using the Ethereum client API via a REST interface.

○ We expected this would be slower than reading the data from disk.

● We downloaded an Ethereum client called geth and synced the blockchain from the network.

○ ~5 million blocks, thousands of transactions per block
○ ~27,800 files, ~2.06 MB each

● Blockchain data was then exported into RLP-encoded binary files.
● We prepared binary files containing increasingly larger numbers of blocks to aid in developing our code.

○ 100, 200, 500, 1k, 2k, 5k, 10k, 20k, 50k, 100k, 200k ...

DataFrame

timestamp: long (nullable = false)

number: decimal(38,0) (nullable = true)

value: decimal(38,0) (nullable = true)

receiveAddress: binary (nullable = true)

sendAddress: binary (nullable = true)

hash: binary (nullable = true)
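For reference, a minimal Scala sketch expressing this schema with Spark SQL types; the field names and types come from the slide, everything else is illustrative:

```scala
import org.apache.spark.sql.types._

// The transaction schema above, expressed with Spark SQL types.
// DecimalType(38, 0) holds integers of up to 38 digits, which comfortably
// covers real-world Wei amounts, unlike a signed 64-bit long.
val txSchema = StructType(Seq(
  StructField("timestamp", LongType, nullable = false),
  StructField("number", DecimalType(38, 0), nullable = true),
  StructField("value", DecimalType(38, 0), nullable = true),
  StructField("receiveAddress", BinaryType, nullable = true),
  StructField("sendAddress", BinaryType, nullable = true),
  StructField("hash", BinaryType, nullable = true)
))
```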

Issues encountered: Integer Overflow

● The basic unit is Wei: 10^18 Wei = 1 Ether.
● Transaction value as a Java long: 64 bits, signed, -2^63 to 2^63 - 1. Since 2^63 - 1 ≈ 9.2 × 10^18 Wei, any transaction above about 9.2 Ether overflows.
● The internal representation is a 256-bit integer.

Solution:

● Replace longs with BigDecimal in the library used to read the blockchain data. We contacted the developers and they pushed a fix…
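As an illustration of the idea behind the fix (not the library's actual patch), a 256-bit big-endian value can be decoded without overflow via BigInteger and BigDecimal; rawValue below is a hypothetical byte array:

```scala
import java.math.{BigDecimal, BigInteger}

// Hypothetical raw transaction value: 100 Ether in Wei (10^20), as a
// big-endian byte array like the one found in the RLP data.
val rawValue: Array[Byte] =
  new BigInteger("100000000000000000000").toByteArray

// Signum 1 treats the bytes as an unsigned magnitude, so values with the
// high bit set are not misread as negative -- the trap a signed long falls into.
val wei = new BigDecimal(new BigInteger(1, rawValue))

val ether = wei.movePointLeft(18) // 1 Ether = 10^18 Wei
println(s"$wei Wei = $ether Ether")
```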

Issues encountered (2): Bouncy Castle

● Some fields had to be computed using the crypto library Bouncy Castle. Spark 2.2 shipped an older version of this library, which conflicted with the one used by the hadoopcryptoledger library.

Solutions:

● Create a “shaded jar” that included the classes needed under a different package name (see the sketch after this list).

● Migrate to Spark 2.3, but Compute Canada didn’t have it by default until mid-March (2.3 was released on February 28).
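A minimal build.sbt sketch of the shading approach, assuming the sbt-assembly plugin is used; the relocated package name is illustrative, and a Maven shade-plugin relocation would work the same way:

```scala
// build.sbt (fragment) -- relocate Bouncy Castle classes in the fat jar so
// our copy cannot clash with the older version bundled with Spark 2.2.
assemblyShadeRules in assembly := Seq(
  ShadeRule.rename("org.bouncycastle.**" -> "shaded.org.bouncycastle.@1").inAll
)
```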

Issues encountered (3): Intermittent issues with Slurm

● We ran a number of small-ish jobs (~1 hr, 2k blocks), and they were timing out.
● Ultimately we found out that the scheduler was sometimes down or too busy to serve our requests on the weekends.

Solution:

● Wait until it became available again.

Graph Analysis

Spark GraphFrames package

● A GraphFrame consists of vertices and edges.
● Vertices are Ethereum account addresses.
● Edges are the (sender, receiver) address pairs of each transaction.
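A minimal sketch of how such a graph can be assembled from the transaction DataFrame described earlier; the column handling (hex-encoding the binary addresses) is an assumption, not necessarily what we did:

```scala
import org.apache.spark.sql.DataFrame
import org.graphframes.GraphFrame

// txs: the transaction DataFrame from earlier. GraphFrames expects the
// vertex id column to be named "id" and the edge endpoints "src"/"dst".
def buildGraph(txs: DataFrame): GraphFrame = {
  val edges = txs.selectExpr(
    "hex(sendAddress) AS src",    // sender account address
    "hex(receiveAddress) AS dst", // receiver account address
    "value"                       // amount transferred, in Wei
  )
  val vertices = edges.selectExpr("src AS id")
    .union(edges.selectExpr("dst AS id"))
    .distinct()
  GraphFrame(vertices, edges)
}
```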

Algorithms applied

● Connected Components
● PageRank
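Both algorithms are one-liners in GraphFrames. A sketch, with standard default parameters (the slide does not state the ones we used) and a hypothetical checkpoint path:

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.desc
import org.graphframes.GraphFrame

def runAlgorithms(spark: SparkSession, g: GraphFrame): Unit = {
  // connectedComponents requires a checkpoint directory (path hypothetical).
  spark.sparkContext.setCheckpointDir("/tmp/graphframes-checkpoints")

  // Assigns each vertex a "component" id.
  val components = g.connectedComponents.run()

  // PageRank with the usual damping setup; vertices gain a "pagerank" column.
  val ranked = g.pageRank.resetProbability(0.15).maxIter(10).run()
  ranked.vertices.orderBy(desc("pagerank")).show(5)
}
```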

Connected Components among Transactions larger than 100 ETH

The Highest Page Rank in the Largest Connected Component was a Cryptocurrency Exchange

Connected Components among 0 ETH Transactions

The Highest Page Rank in the Largest Connected Component was an EOS Token Contract

Decreasing Running Time

BOTTLENECK:

● We noticed that a significant amount of processing time was spent reading from the binary files and populating our DataFrame.

SOLUTION:

● We decided to preprocess the data and write the relevant fields to a CSV file in a separate job (sketched below).

● This resulted in about a 6x improvement in the time it took to run our algorithms.
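A minimal sketch of that preprocessing job; loadTransactions and both paths are hypothetical stand-ins for the expensive binary-parsing step:

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("PreprocessTxs").getOrCreate()

// The expensive step: parse the RLP binary files (hypothetical helper).
val txs = loadTransactions(spark, "/data/blocks-rlp")

// Keep only the fields the graph algorithms need. CSV cannot store raw
// bytes, so the binary columns are hex-encoded.
txs.selectExpr(
    "timestamp", "value",
    "hex(sendAddress) AS sendAddress",
    "hex(receiveAddress) AS receiveAddress",
    "hex(hash) AS hash")
  .write
  .option("header", "true")
  .mode("overwrite")
  .csv("/data/transactions-csv")

// Later jobs read the cheap CSV instead of re-parsing the binaries.
val cached = spark.read.option("header", "true").csv("/data/transactions-csv")
```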

Performance seems linear in the number of blocks processed

Next Steps

● Apply the PageRank and Connected Components algorithms to a larger sample of the dataset
● If possible, process the entire ~56 GB dataset
● Explore changes over time