Taurus: An Intelligent Data Plane · Tushar Swamy [email protected] 26. References [1] Why (and...

27
Taurus: An Intelligent Data Plane Tushar Swamy Alexander Rucker, Muhammad Shahbaz, Neeraja Yadwadkar, Yaqi Zhang, and Kunle Olukotun 1

Transcript of Taurus: An Intelligent Data Plane · Tushar Swamy [email protected] 26. References [1] Why (and...

Page 1: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Taurus: An Intelligent Data Plane

Tushar Swamy

Alexander Rucker, Muhammad Shahbaz,Neeraja Yadwadkar, Yaqi Zhang, and Kunle Olukotun

1

Page 2: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Managing networks is hard!

Cloud Computing

Internet of Things (IoT)

Augmented and VirtualReality (AR/VR)

2

Page 3: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Approaches to manage networks are …

Data PlaneSwitch or NIC

Fast yet dumb

Slow but intelligentControl PlaneIntelligence

3

OR

Page 4: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Approaches to manage networks are …

Data PlaneSwitch or NIC

Fast yet dumb

Examples:• Congestion control • Load balancing (ECMP, RSS) • Queue scheduling

Characteristics:• Operates on packet scale• Low latency: ns• High throughput: Tbps• Uses heuristics: hash, …

Packets In Packets Out

4

Page 5: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Approaches to manage networks are …

Data PlaneSwitch or NIC

Fast yet dumb

Slow but intelligentControl PlaneIntelligence

5

OR

Page 6: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Approaches to manage networks are …

Data PlaneSwitch or NIC

Slow but intelligentControl PlaneIntelligence

Examples:• Anomaly detection• Automation• Recommendation

Characteristics:• Can do machine learning• Operates on flows• Millisecond latency• Low throughput

Packets In Packets Out

Packet Digest

FlowRule

6

Page 7: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Approaches to manage networks are …

Data PlaneSwitch or NIC

Fast yet dumb

Slow but intelligentControl PlaneIntelligence

7

OR

Page 8: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Approaches to manage networks are …

Fast yet dumb

Slow but intelligentControl PlaneIntelligence

Data PlaneSwitch or NIC

8

OR

Page 9: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Network management should be …

Fast and intelligent

Control PlaneIntelligence

Data PlaneSwitch or NIC

9

Page 10: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

What does “Intelligence” mean?

• Networks are becoming autonomous (Self Driving Networks)

• Machine learning (ML) will play a key role in the future of networks [1,2,3]

10

Security Control Analytics

Page 11: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

ML Inference: Neural Networks

Inputs Outputs

Page 12: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

ML Inference: Neural Networks

Inputs Outputs

Page 13: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

ML Inference: Single Neuron

x1

x0

x2

x3

InputFeatures

W0

PerceptronWeights and Bias

*+

B

+ G(z)

Activation

W1

W2

W3

*

*

*+

+

13

Page 14: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Can match-action tables perform ML inference?

PacketParser

Match-ActionTables

TrafficManager

Packets In Packets Out

14

Page 15: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

ML Inference: Single Neuron

x1

x0

x2

x3

InputFeatures

W0

PerceptronWeights and Bias

*+

B

+ G(z)

Activation

W1

W2

W3

*

*

*+

+

Map: Element-wisemultiplication

Reduce: Addition using a tree 15

Page 16: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Taurus: An Intelligent Data Plane

PacketParser

Match-ActionTables

Map ReduceUnit

Match-ActionTables

TrafficManager

Packets In Packets Out

16

Page 17: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Building blocks of Taurus: Map-Reduce

CU MU CU

MU MUCU

CU MU CU

Data In

Data In

Data Out

Data Out

Compute Units Memory Units

17

Page 18: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Building blocks of Taurus: Map-Reduce

CU MU CU

MU MUCU

CU MU CU

Data In

Data In

Data Out

Data Out

18

Page 19: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Compute unit design

• Taurus CUs are array based:• Functional Units (FUs)• Pipeline Registers (PRs)

LANE 0

LANE 1

LANE 2

STAGE 0 STAGE 1 STAGE 2

FU PR FU PR FU

FU PR FU PR FU

FU PR FU PR FU

PR

PR

PR

FU PR FU PR FU PRLANE 3

19

CU

Page 20: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Reduction network condenses vectors to scalars

LANE 0

LANE 1

LANE 2

STAGE 0 STAGE 1 STAGE 2

FU PR FU PR FU

FU PR FU PR FU

FU PR FU PR FU

PR

PR

PR

FU PR FU PR FU PRLANE 3

20

Page 21: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Taurus: An Intelligent Data Plane

PacketParser

Match-ActionTables

Map ReduceUnit

Match-ActionTables

TrafficManager

Packets In Packets Out

21

Page 22: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Example: Anomaly DetectionPacketParser

Match-ActionTables

Map ReduceUnit

Match-ActionTables

Packets InPackets Out

Read local features(e.g., IP address)

Retrieve out ofnetwork events

(e.g., failedlogins per IP)

Apply learned anomaly detection

Select a port or action

(e.g., drop if score == 1)

Send packetto destination

TrafficManager

Packet

22

Page 23: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Evaluation: Anomaly Detection in Switches

• Taurus examines every packet at every switch

• Additional latency is less than port to port latency

*Overheads are calculated relative to a 300 mm2 chip with 4 reconfigurable pipelines, each drawing an estimated 25 W

Model Lat (ns)

Area Power

+% +%SVMDNN

68362

6.111.7

1.12.0

TP (GPkt/s)

11

23

Page 24: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Evaluation: Congestion Control at the NIC

• Indigo Congestion Control LSTM Network [7]

• Taurus updates every 12.5 ns (software updates every 10 ms)

*Overheads are calculated relative to a 300 mm2 chip with 4 reconfigurable pipelines, each drawing an estimated 25 W

Model Lat (ns)

Area Power

+% +%LSTM 380 23.6 4.1

TP (GPkt/s)0.08

24

Page 25: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Taurus: Fast and Intelligent

• Introduces map and reduce to PISA• Designed to run ML inference inside a data plane• Provides orders of magnitude improvement

Data PlaneSwitch or NIC

Intelligence

25

Page 26: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

Questions?

Tushar [email protected]

26

Page 27: Taurus: An Intelligent Data Plane · Tushar Swamy tswamy@stanford.edu 26. References [1] Why (and How) Networks Should Run Themselves [2] A Knowledge Plane for the Internet, SIGCOMM

References[1] Why (and How) Networks Should Run Themselves[2] A Knowledge Plane for the Internet, SIGCOMM 2003[3] Knowledge-Defined Networking, SIGCOMM CCR 2017[4] Plasticine: A Reconfigurable Architecture For Parallel Patterns, ISCA 2017[5] Deep Learning Approach for Network Intrusion Detection in Software Defined Networking, WINCOM 2016[6] SVM for Network Anomaly Detection Using ACO Feature Subset, iSMSC 2015[7] Pantheon: the training ground for Internet congestion-control research, ATC 2018[8] Forwarding Metamorphosis: Fast Programmable Match-Action Processing in Hardware for SDN , SIGCOMM 2013[9] Programmable Packet Scheduling at Line Rate , SIGCOMM 2016[10] Design Principles for Packet Parsers, ANCS 2013

27