Why the Address Translation Scheme Matters

Jiaqing Du

Address Translation/Mapping

• Where is 0x1f344000?
• DRAM Devices
  – A multi-dimensional array
  – Inside a DIMM: Rank, Bank, Row, Column
  – Among DIMMs: Memory Controller, Channel
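To make these dimensions concrete, here is a purely illustrative decode of a physical address into DRAM coordinates. Every bit position below is an assumption; finding the real ones is the point of this talk.

```c
/* Purely illustrative: one possible decode of a physical address into
 * DRAM coordinates. All bit positions are assumed. */
#include <stdint.h>
#include <stdio.h>

struct dram_coord { unsigned channel, rank, bank, row, col; };

static struct dram_coord decode(uint64_t pa)
{
    struct dram_coord c;
    c.col     = (pa >> 3)  & 0x3ff;   /* bits 3..12  (assumed) */
    c.channel = (pa >> 13) & 0x1;     /* bit  13     (assumed) */
    c.bank    = (pa >> 14) & 0x3;     /* bits 14..15 (assumed) */
    c.rank    = (pa >> 16) & 0x1;     /* bit  16     (assumed) */
    c.row     = (pa >> 17) & 0x3fff;  /* bits 17..30 (assumed) */
    return c;
}

int main(void)
{
    struct dram_coord c = decode(0x1f344000);  /* "Where is 0x1f344000?" */
    printf("channel=%u rank=%u bank=%u row=%u col=%u\n",
           c.channel, c.rank, c.bank, c.row, c.col);
    return 0;
}
```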

Inside Memory Controller

• Accesses to Different Parts == High Parallelism == High Throughput

• Designed to exploit the locality property
  – Logically adjacent addresses map to physically distant parts

Agenda

• A Scalable Software Router
• Performance of a Commodity Server
• Memory Translation Disclosure
  – Experiment Design
  – Experiment Result
• Understanding the Imbalance
• Possible Solutions
• Conclusion

A Scalable Software Router

• A Valiant Load-Balanced Mesh
• Aggregate Throughput: N × R (bps)

[Figure: a Valiant load-balanced mesh of N nodes (1, 2, 3, 4, …, N); each node has an external line rate of R and full-mesh internal links of capacity 2R/N]
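As a worked example (numbers assumed, chosen to match the testbed below): N = 16 nodes at R = 1 Gbps give an aggregate of N × R = 16 Gbps, while each internal mesh link needs only 2R/N = 125 Mbps.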

Performance of A Commodity PC

• Experiment Environment
  – 2 Xeon 1.6GHz sockets, 4 cores per socket
  – Each pair of cores shares an 8MB L2 cache
  – 1GHz FSB, 8GB DDR2 667MHz
  – 2 memory controllers manage 4 channels
  – 4 quad-port 1Gbps NICs (16 ports)
  – Click 1.6.0 on Linux 2.6.19

• Simple “Point-to-Point” Forwarding
• A Chipset Monitoring Tool (Emon)

Performance of A Commodity PC

• Maximum Loss-Free Forwarding Rate
  – 16Gbps input

Performance of A Commodity PC

• Memory Load Distribution

• My work is to dig further
  – Explain the imbalance
  – But we don't know how an address is mapped :(

[Figure: memory load distribution for the stream benchmark, 1024B packets, and 64B packets]

Disclose Address Translation

• What We Want
  – Which bit selects the channel, rank, bank, …
  – What parallelism really gives us
• What We Have
  – Emon: tells us throughput and load distribution
• What We Need
  – Enough traffic to one single memory location
  – Enough traffic to two memory locations, e.g., 0x1f344000 and 0x1f34100

Disclose Address Translation

• Artificial Memory Access Patterns
  – One writing flow to ADDR1
  – Two flows: one to ADDR1, the other to ADDR1 + 2^b (b = 0, 1, …, 31)
• Utilize the Cache
  – Cache coherency protocol (MESI)
  – Bind two threads to two cores that don't share an L2, and force them to keep writing to one location
  – A write to an invalid cache line goes directly to memory
  – Two threads generate one writing flow
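A minimal sketch of generating one such writing flow on Linux. Assumptions, not measured facts: cores 0 and 4 sit on different L2 caches, lines are 64 bytes, and the real experiment would repeat this for ADDR1 + 2^b.

```c
/* Two threads pinned to cores without a shared L2 keep storing to one
 * cache line, forcing MESI invalidations so writes reach memory. */
#define _GNU_SOURCE
#include <pthread.h>
#include <sched.h>
#include <stdint.h>
#include <stdlib.h>

static volatile uint64_t *target;  /* stands in for ADDR1 */

static void *writer(void *arg)
{
    cpu_set_t set;
    CPU_ZERO(&set);
    CPU_SET((int)(long)arg, &set);
    /* Pin this thread so the two writers run on cores without a shared L2. */
    pthread_setaffinity_np(pthread_self(), sizeof(set), &set);

    /* Each store invalidates the other core's copy of the line (MESI),
     * so the line ping-pongs and writes keep going out to memory.
     * Run until killed while Emon samples the memory controllers. */
    for (;;)
        *target = 1;
    return NULL;
}

int main(void)
{
    /* One 64-byte-aligned cache line plays the role of ADDR1. */
    if (posix_memalign((void **)&target, 64, 64))
        return 1;

    pthread_t t1, t2;
    pthread_create(&t1, NULL, writer, (void *)0L);  /* core 0 (assumed L2 #0) */
    pthread_create(&t2, NULL, writer, (void *)4L);  /* core 4 (assumed L2 #1) */
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return 0;
}
```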

Disclose Address Translation

• Experiment Result
  – Flows to ADDR1 and ADDR1 + 2^b

Understand the Imbalance

• Memory Management
  – Pre-allocated 2KB socket buffers
  – Reclaimed and reallocated by the kernel
• A limited number of buffers serves all packets.
• A 2KB buffer spans the entire rank-bank grid.
• Large packets (1024B) cover at least half of the grid (high parallelism).
• Small packets (64B) hit only some grid entries w.h.p. (poor parallelism).
• In the real world, it is even worse.
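A small sketch of the coverage argument, under an assumed mapping: 256B of each buffer map to one entry of an 8-entry rank-bank grid (both numbers are illustrative, not the machine's real parameters).

```c
/* Which grid entries does a packet touch when it starts at the
 * beginning of a 2KB buffer? INTERLEAVE and GRID are assumptions. */
#include <stdio.h>

#define INTERLEAVE 256  /* assumed bytes mapped to each grid entry */
#define GRID       8    /* assumed number of rank-bank entries */

static void touched(unsigned pkt_len)
{
    unsigned hit[GRID] = {0};
    for (unsigned off = 0; off < pkt_len; off += 64)
        hit[(off / INTERLEAVE) % GRID] = 1;
    printf("%4uB packet touches entries:", pkt_len);
    for (unsigned i = 0; i < GRID; i++)
        if (hit[i])
            printf(" %u", i);
    printf("\n");
}

int main(void)
{
    touched(1024);  /* covers half the grid -> high parallelism */
    touched(64);    /* hits a single entry  -> poor parallelism */
    return 0;
}
```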

Understand the Imbalance

[Figure: 2KB buffers in the memory pool mapped onto the 8-entry rank-bank grid; a 1024B packet covers most entries, a 64B packet only a few]

What Can We Do?

• Hack the Network Adapter Driver
  – Introduce a random offset (a sketch follows the figure below)

[Figure: memory pool and mapped grid with a random offset; successive buffers start at different grid entries]
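A hypothetical sketch of the driver hack: before posting a 2KB receive buffer to the NIC, start the DMA target at a random cache-line-aligned offset. The helper name and constants are illustrative, not a real driver API.

```c
/* Randomize where packet data starts inside each 2KB buffer so that
 * small packets land on different entries of the rank-bank grid. */
#include <stdint.h>
#include <stdlib.h>

#define BUF_SIZE   2048  /* pre-allocated socket buffer */
#define LINE_SIZE  64    /* cache-line granularity for the offset */
#define MAX_OFFSET 1024  /* leave room for the packet itself */

static uint8_t *rx_buf_start(uint8_t *buf)
{
    size_t slots = MAX_OFFSET / LINE_SIZE;      /* 16 candidate offsets */
    size_t off = (rand() % slots) * LINE_SIZE;  /* random but aligned */
    return buf + off;
}
```

Spreading the start offsets makes a stream of 64B packets collectively cover the grid the way a single 1024B packet does on its own.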

What Can We Do?

• Hack the Slab Allocator and kmalloc()
  – Maintain a special slab
  – Provide access through kmalloc() (a sketch follows the figure below)

[Figure: memory pool and mapped grid with the special slab; buffer placement is rotated so consecutive allocations land on different grid entries]
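A kernel-style sketch of the idea, using the real slab API but with illustrative function names; note that kmem_cache_create()'s argument list varies across kernel versions, and the placement-staggering policy itself would live inside this dedicated cache.

```c
/* A dedicated slab cache for the 2KB packet buffers; the kmalloc()
 * path would be redirected here for buffer-sized requests. */
#include <linux/slab.h>
#include <linux/errno.h>

static struct kmem_cache *pkt_cache;

int pkt_pool_init(void)
{
    /* One slab object per 2KB socket buffer. */
    pkt_cache = kmem_cache_create("pkt_buf", 2048, 2048, 0, NULL);
    return pkt_cache ? 0 : -ENOMEM;
}

/* Hypothetical hook called from kmalloc() for packet-buffer requests. */
void *pkt_buf_alloc(gfp_t flags)
{
    return kmem_cache_alloc(pkt_cache, flags);
}
```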

What Can We Do?

• Maintain buffers of various sizes
  – The NIC supports multiple descriptor rings
  – A hardware feature (a sketch follows the figure below)

[Figure: memory pool with buffers of various sizes and the resulting mapped grid]
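A sketch of the idea only: the ring structure and pick_ring() are hypothetical stand-ins for the driver state and the hardware filter that steers frames between a multi-ring NIC's descriptor rings.

```c
/* Two receive rings backed by different buffer sizes, so small packets
 * stay densely packed instead of each wasting a 2KB buffer. */
#include <stddef.h>
#include <stdint.h>

struct rx_ring {
    size_t   buf_size;   /* size of every pre-allocated buffer here */
    uint8_t *bufs[256];  /* the ring's buffer pool */
};

static struct rx_ring small_ring = { .buf_size = 128 };   /* 64B packets */
static struct rx_ring large_ring = { .buf_size = 2048 };  /* large packets */

/* Pick the ring whose buffers best fit the incoming frame; in hardware
 * this steering is done by the NIC's filters, not by the driver. */
static struct rx_ring *pick_ring(size_t frame_len)
{
    return frame_len <= small_ring.buf_size ? &small_ring : &large_ring;
}
```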

Conclusion

• Figured out the memory address translation scheme
• Explained the memory load imbalance
• Proposed three possible solutions