Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf ·...

50
Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet Infrastructure and Security Group The Hong Kong Polytechnic University Xiamen University, 4 July 2013

Transcript of Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf ·...

Page 1: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Measuring Internet Performance: Challenges and Opportunities

Rocky K. C. Chang

Internet Infrastructure and Security Group

The Hong Kong Polytechnic University

Xiamen University, 4 July 2013

Page 2: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Internet Infrastructure and Security Laboratory

2

Page 3: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Measuring end-to-end performance

• Source: Akamai’s network performance comparison 3

Page 4: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

End-to-end network monitoring …

4

Page 5: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Why measuring network path? P

erf

orm

ance

met

rics

Latency

Delay variation (jitter)

Connectivity

Packet loss/reordering

Link/path capacity

Available Bandwidth

TCP throughput

Router hop (count)

Packet duplication

Ap

plic

atio

ns

Traffic engineering

• Network tomography

• Path fingerprinting

• Routing optimization

• QoS routing, admission control, channel assignment in WLAN

User profiling

• Network resource planning

• SLA verification

Application performance tuning

• Rate adaption for VoIP/video streaming apps

• Distance/location prediction for overlay networks, P2Ps, CDNs

5

Page 6: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

An unfinished business

• Much had been done in late 1990 and early 2000. • Very few measurement tools have made their way into

wide deployment. • The Internet is no longer friendly to measurement

probes. – Many unfriendly and intelligent middleboxes – Measurement Lab from Google, PlanetLab, …

• Measurement results may not reflect the experience of data packets.

• Continuous monitoring for inter-domain paths is hard without receiving complaints.

6

Page 7: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Active Path-Quality Measurement

Page 8: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Challenges to active measurement

• Measurement scalability – Measure many network paths

• Measurement reliability – Measurement will not be interfered or interrupted

• Measurement representativeness – Measurement traffic representing the traffic of interest

• Measurement accuracy – Measurement results are accurate statistically.

• Bi-directional measurement – Measure both directions

• Measuring multiple metrics

8

Page 9: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Challenges to active measurement

• Measurement scalability – Cooperative measurement paradigm (e.g., OWAMP) not

scalable

• Measurement reliability – Interference from various middleboxes and firewalls

• Measurement representativeness – Using control channel to measure data channel

• Measurement accuracy – Sampling rate and patterns

• Bi-directional measurement – Measure from both directions

• Measuring multiple metrics – Need multiple tools

9

Page 10: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

A sampling of measurement tools

10

Page 11: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Our approach to active measurement

• Measurement scalability – Non-cooperative measurement paradigm

• Measurement reliability – Use standard protocol and legitimate application data

• Measurement representativeness – Using data channel to measure data channel

• Measurement accuracy – Supporting different sampling rate and patterns

• Bi-directional measurement – Measure from only one direction

• Measuring multiple metrics – Obtain multiple metrics from one side

11

Page 12: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

HTTP/OneProbe • Use normal TCP data packet to measure data-path

quality. • Use normal and basic TCP data transmission

mechanisms specified in RFC 793. • Integrated into normal HTTP application sessions.

12

OneProbe (TCP)

HTTP

BitTo

rrent

RTM

P

… Data clocking

Path measure-ment

Page 13: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

What does HTTP/OneProbe offer?

• Continuous path monitoring in an HTTP session (stateful measurement)

• All in one: – Round-trip time – Loss rate (uni-directional) – Reordering rate (uni-directional) – Capacity (uni-directional) – Loss-pair analysis – …

• "Design and Implementation of TCP Data Probes for

Reliable and Metric-Rich Network Path Monitoring,“ Proc. USENIX Annual Tech. Conf., June 2009.

13

OneProbe

RTT

Forward Loss

Reverse Loss

Forward Reordering

Reverse Reordering

Forward Capacity

Reverse Capacity

Page 14: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

14

Page 15: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

15

Page 16: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

16

Page 17: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

The probe design • Send two back-to-back probe data packets.

– Capacity measurement based on packet-pair dispersion

– At least two packets for packet reordering

– Determine which packet is lost.

17

Page 18: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

The probe design (cont’d)

• Similarly for the response packets

• Each probe packet elicits a response packet. – Adv. Window = 2 and acknowledge only 1 packet.

18

Page 19: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Bootstrapping and continuous monitoring

19

Page 20: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Loss and reordering measurement via response diversity

20

Page 21: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

18 possible path events

21

Page 22: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Based on their response packets

22

Page 23: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Path event distinguishability

• All 18 cases can be distinguished except for

– A1. F1×R2 and F1×R3

– A2. F1×RR and F1×R1

– A3. F0×R3 and FR×R3

• Resolving the ambiguities

– A1 and A2: use RTT.

– A3: use TCP timestamping.

23

Page 24: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Our measurement methods

• Round-trip delay, asymmetric packet loss and packet reordering measurement

• "Design and Implementation of TCP Data Probes for Reliable and Metric-Rich Network Path Monitoring", Proc. USENIX Annual Tech. Conf., June 2009.

• Capacity measurement

• "TRIO: Measuring Asymmetric Capacity with Three Minimum Round-Trip Times", Proc. ACM CoNEXT Conf., Dec. 2011.

• "A Minimum-Delay-Difference Method for Mitigating Cross-Traffic Impact on Capacity Measurement", Proc. ACM CoNEXT, December 2009.

• Loss-pair measurement

• "Measurement of Loss Pairs in Network Paths", Proc. ACM/USENIX IMC, November 2010.

• Available bandwidth measurement

• "QDASH: A QoE-Aware DASH System", Proc. ACM Multimedia Systems Conf., Feb. 2012.

24

Page 25: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Source

Non-cooperative destination

The capacity measurement and loss-pair measurement

Design and analyze three packet-pair methods for sound network measurements

25

Incorporate all the methods into a non-cooperative measurement tool – HTTP/OneProbe [USENIX 08]

• MDDIF [CoNEXT 09], TRIO [CoNEXT 11], Loss pair [IMC 10]

• Fundamentals: decompose + recompose + recycle

Mitigate cross-traffic interference on path capacity measurement Eliminate measurement traffic interference on

asymmetric capacity measurement Recycle bad packet pairs to infer additional path properties

Page 26: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Network capacity

26

2 8 Mbits/s

6 5

3

15 Mbits/s

4 1 Source Destination

Forward path

Reverse path

Link capacity One-way (forward-path) capacity Reverse-path capacity Asymmetric capacity Sub-path capacity

Page 27: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Cross-traffic impact on packet pairs

• Existing techniques: Identify the unaffected packet pair/train 27

p1

p1 p2

p1

p2

p2

Cro

ss

tra

ffic

Compressed PPD

p3

p3 p4

p3

p4

p4

Cro

ss

tra

ffic

Expanded PPD

p5

p5 p6

p5

p6

p6

Correct PPD = S/Cb

Source

Destination

20 Mbits/s

8 Mbits/s

50 Mbits/s

Time

Round-trip capacity

Correct PPD Correct PPD

Page 28: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

• Third PPD = p6’s delay – p5’s delay.

Delay difference = PPD

28

p1

p1 p2

p1

p2

p2

Cro

ss

tra

ffic

Compressed PPD

p3

p3 p4

p3

p4

p4

Cro

ss

tra

ffic

Expanded PPD

p5

p5 p6

p5

p6

p6

Correct PPD = S/Cb

• The MDDIF method: Difference between first and second packets’ minimum delays (minDelays)

Source

Destination

20 Mbits/s

8 Mbits/s

50 Mbits/s

Time

Round-trip capacity

p3

p3 p2

p3

p2

p2

Page 29: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

rj-1 pj

dj-1 T

rj-1 pj

dj-1 T

pj rj

dj T

Source

Destination

Cr Cf pj pj-1 pj-1

pj rj

1-RTP (1,1)-TWP

rj-1 pj

TRIO: measuring asymmetric capacity with three minRTTs

• Exploit 1-RTP and (1,1)-TWP with Sf = Sr = S

29

• dj-1-dj-1 = S/Cf.

• dj-dj-1 = S/Cr.

• Avoid response interference!

pj-1

dj-1 R

T R

T T

• Reuse dj-1

• Avoid probe interference!

T

S/Cf

S/Cr

For self-diagnosis

dj R

Page 30: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

AsymProbe, CapProbe, PingPair

Taxonomy of capacity measurement techniques

30

Clink, Pathchar, Pchar ACCSIG Available tools: Nettimer (tailgating) Packet quartet BBScope Envelope, MultiQ Bprobe, Pathrate, Paśztor’s method,

PBM

MDDIF, TRIO DSLprobe, SProbe

Page 31: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Loss-pair measurement

• Packet pair with exactly one lost packet (defined by Liu & Crovella [liu01imw])

• Path queueing delay Θ – LP01: Θj-1 = dj-1 – minRTT. – LP10: Θj = dj – minRTT.

• Buffer size of congested hop h’ [liu01imw]: B = Θj x C(h’).

31

Source

Destination

dj

pj pj-1

LP10

dj-1

pj pj-1

LP01

Three questions:

1. Θj-1 = Θj? 2. Is B accurate?

3. Any additional info from Θj-1 and Θj?

Page 32: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Loss pairs

32

Forward Path

Reverse Path

Page 33: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Collaborative path-quality measurement

Page 34: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

HARNET measurement (since 1 Jan 2009)

• “Four-Year Experience of Monitoring and Troubleshooting Multi-domain Networks from a Local Federation” under review for a SI in the IEEE Commun. Mag.

34

Page 35: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Running OneProbe at the 8 Us

• 24x365 probing of the paths to 40+ websites

35

Page 36: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

36

On

ePro

be

@H

KU

On

ePro

be

@C

UH

K

On

ePro

be

@C

ityU

On

ePro

be

@Po

lyU

On

ePro

be

@B

U

On

ePro

be

@H

KU

ST

On

ePro

be

@H

KIE

D

On

ePro

be

@LU

40+ web servers selected by the JUCC

Planetopus, database, etc

HKU CUHK PolyU CityU BU HKUST LU HKIED

Mea

sure

me

nt

sid

e

Use

r si

de

Page 37: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

37

Page 38: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Major performance problems diagnosed

38

Page 39: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Application: Impact analysis of submarine cable faults

Page 40: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Eyjafjallajöekull volcano eruption

• “Could Ash Cloud or Deep-Sea Current Overwhelm the Internet?” Poster in HotDep 2010.

• "Non-cooperative Diagnosis of Submarine Cable Faults,” Proc. PAM 2011, March 2011.

40

Page 41: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Path-quality degradation for NOK (Finland) and ENG (in UK)

41

Page 42: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

42

Page 43: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Network congestion caused by the volcano ashes?

• The surges on packet loss and RTT occurred on 14 April 2010.

• But

– The onsets of the path congestion and air traffic disruption do not entirely match.

– Some of the peak loss rate and RTT occurred on weekends.

– Path congestion can still be observed at the end of the measurement period.

43

Page 44: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

A SEA-ME-WE 4 cable fault

• The SEA-ME-WE 4 cable encountered a shunt fault on the segment between Alexandria and Marseille on 14 April 2010.

• The repair was started on 25 April 2010, and it took four days to complete.

• During the repair, the service for the westbound traffic to Europe was not available.

44

Page 45: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

The SEA-ME-WE 4 cable

45

Page 46: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

A plausible explanation for the network congestion

• The congestion in the FLAG network was caused by taking on rerouted traffic from the faulty SEA-ME-WE 4 cable. – FLAG does not use the SEA-ME-WE 4 cable for Hong Kong NOKIA, ENG3, and BBC.

– FLAG uses FEA for Hong Kong NOKIA, ENG3, and BBC – TATA uses different cables between Mumbai and London.

46

Page 47: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Currently

• Server-side measurement methods • Induce data from clients for measurement. • Quality measurement without user intervention

• NetMagic/OpenWrt kernel implementation of measurement boxes • Supporting client-side and server-side measurement • “OMware: An Open Measurement Ware for Stable Residential

Broadband Measurement,” ACM SIGCOMM’13 poster/demo

• CERNET-2 measurement platform • Deploy a measurement platform on CERNET-2 • IPv6 measurement

• Residential broadband measurement platform • SLA measurement • Facilitate a social network for network diagnosis and monitoring

• Open measurement platform

47

Page 48: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

More research projects

• Network performance data analytics – What and when to induce for measurement? – What can we say from the measurement data with high confidence? – “Appraising the Delay Accuracy in Browser-based Network Measurement,”

under review for ACM/USENIX IMC’13

• Automating diagnosis and patch-up of network performance problems – Network tomography – Multiple-time-series analysis for anomaly detection – "MonoScope: Automated Network Faults Diagnosis Based on Active

Measurements," in Proc. IFIP/IEEE IM, May 2013.

• Adapting video bitrate for HTTP streaming – Using network measurement to estimate the available bandwidth – “IRate: Initial Video Bitrate Selection System for HTTP Streaming” under

review for ACM CoNEXT’13

• Adaptive network monitoring • QoE measurement • …

48

Page 49: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

Conclusions

• Develop a suite of atomic path-quality measurement methods. – Atomic => application specific, e.g., video, cloud services – Path quality => QoE – Client side => server side

• Network data research – Mining network data – Designing measurement “experiments” to facilitate network data

mining – Towards a science of network research

• Operational experience informs research; research underpins network operations – Unearthing important problems and questions from operations – Putting research output into practice.

49

Page 50: Measuring Internet Performance: Challenges and Opportunitiescsrchang/Xiamen-talk-new.pdf · Measuring Internet Performance: Challenges and Opportunities Rocky K. C. Chang Internet

50

Thanks