
1

An Integrated Approach to Improving Web Performance

Lili Qiu

Cornell University

B-exam, December 2000

2

Acknowledgements
Robbert van Renesse, George Varghese, Ken Birman, Zygmunt Haas, Eva Tardos
Venkata N. Padmanabhan, Geoff Voelker, Yin Zhang, Srinivasan Keshav

3

Outline
Motivation & Open Issues
Solutions
  Study the workload of a busy Web server
  Properly provision content distribution networks
  Optimize TCP performance for Web transfers
Summary & Other Work

4

Motivation
The Web is the dominant source of traffic in the Internet today
  Accounts for over 70% of wide-area traffic
Web performance is often unsatisfactory
  WWW – World Wide Wait
  Causes: network congestion, overloaded Web servers
Consequence: losing potential customers!

5

Challenges in Providing Highly Efficient Web Services
Workload characterization
  The workload of busy Web sites is not well understood
Infrastructure provisioning
  Current trend: content distribution networks (CDNs)
  Problem: where to place replicas?
Protocol inefficiency
  Mismatch between Web transfers and the TCP protocol

6

Our Solutions
Web workload characterization
  Study the workload of a busy Web server
Provision Web infrastructure
  Develop placement algorithms for content distribution networks (CDNs)
Improve protocol efficiency
  Optimize TCP startup performance for Web transfers

7

Part I: Web Workload Characterization

The Content and Access Dynamics of a Busy Web Site: Findings and Implications. Proceedings of ACM SIGCOMM 2000, Stockholm, Sweden, August 2000. (Joint work with V. N. Padmanabhan)

8

Motivation
A solid understanding of Web workload is critical for designing robust and scalable systems
Each Web component provides a unique perspective on the functioning of the Web
[Diagram: clients accessing Web servers through proxies and replicas across the Internet]

9

Motivation (Cont.)
Distinguishing features of our work
  Study the MSNBC Web site: a large news server consistently ranked among the busiest sites on the Web
  Study content & access dynamics
    The dynamics of file modification and creation
    The dynamics of user accesses

10

Overview
MSNBC server site
  A large news site; server cluster with 40 nodes
  25 million accesses a day (HTML content alone)
  Period studied: Aug.–Oct. 1999 & the Dec. 17, 1998 flash crowd
Server logs
  HTTP access logs
  Content Replication System (CRS) logs
  HTML content logs
Data analysis
  Content dynamics
  Access dynamics

11

Content Dynamics
Period studied: 10/1/99 – 10/28/99
Predictive power of modification history
  Modification history is a rough predictor of the future modification interval
Extent of change upon file modification
  Most file modifications are minimal => delta encoding can be very useful

12

Predictive Power of Modification History
Has significant bearing on cache consistency algorithms, such as adaptive TTL
Prediction algorithm studied
  Estimate the future modification interval as the mean of the past x samples
Performance metrics
  Correlation coefficient between the predicted and actual values
  Error in prediction
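(Illustration, not from the slides: a minimal Python sketch of this moving-average predictor and the two metrics, assuming mod_times is a list of modification timestamps for one file.)

    import numpy as np

    def predict_intervals(mod_times, window):
        # Predict each modification interval as the mean of the previous `window` intervals.
        intervals = np.diff(np.sort(np.asarray(mod_times, dtype=float)))
        preds, actuals = [], []
        for i in range(window, len(intervals)):
            preds.append(intervals[i - window:i].mean())
            actuals.append(intervals[i])
        return np.array(preds), np.array(actuals)

    def evaluate(preds, actuals):
        corr = np.corrcoef(preds, actuals)[0, 1]           # correlation coefficient
        pct_err = np.abs(preds - actuals) / actuals * 100  # percentage error per prediction
        return corr, pct_err.mean(), np.median(pct_err)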

13

Correlation Coefficient

A larger averaging window size helps to predict the future modification interval up to a certain point.

[Plot: correlation coefficient vs. averaging window size (# samples)]

14

Error in Prediction

[Histogram: percentage error in predicting the file modification interval; averaging window = 16 samples, mean error 226%, median error 45%]

Modification history yields only a rough predictor => need an alternative mechanism (e.g., callback-based invalidation) as backup

15

Extent of Change Upon File Modification
Compute the delta between successive versions v1, v2 using the vdelta algorithm
Metric: delta ratio = |vdelta(v1,v2)| / ((|v1| + |v2|) / 2)
Results
  In 77% of cases the ratio is <= 1%
  In 96% of cases the ratio is <= 10%
Modification between successive versions is small
  => Delta encoding can be very useful
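(Illustration, not from the paper: a rough Python stand-in for this metric. The study uses the vdelta encoder; here a unified text diff approximates the delta, so the ratios it yields will be looser than vdelta's.)

    import difflib

    def delta_ratio(v1: bytes, v2: bytes) -> float:
        # Size of a textual delta between two versions, normalized by their mean size.
        diff_lines = difflib.unified_diff(
            v1.decode(errors="replace").splitlines(),
            v2.decode(errors="replace").splitlines(),
            lineterm="")
        delta = "\n".join(diff_lines).encode()
        return len(delta) / ((len(v1) + len(v2)) / 2)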

16

Access Dynamics
Spatial locality in client accesses
  Domain membership is significant except when there is a "hot" event of global interest
Temporal stability of file popularity
  The set of popular documents mostly remains stable over a timescale of days
Distribution of file popularity
  Zipf-like distribution, but with a much larger exponent (alpha) than at proxies

17

Temporal Stability of File Popularity
Methodology
  Consider the traces from a pair of days
  Pick the top n popular documents from each day
  Compute the overlap
Results
  One day apart: significant overlap (~80%)
  Two months apart: smaller overlap (20-80%)
  Ten months apart: very small overlap (mostly below 20%)

[Plot: extent of overlap vs. # popular documents picked, for the day pairs 17DEC98-18OCT99, 01AUG99-18OCT99, and 17OCT99-18OCT99]

The set of popular documents remains stable for days
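(Illustration, not from the slides: a minimal Python sketch of the overlap computation, assuming urls_day1 and urls_day2 are the requested URLs from two days' access logs.)

    from collections import Counter

    def topn_overlap(urls_day1, urls_day2, n):
        # Fraction of day 1's top-n documents that also appear in day 2's top-n.
        top1 = {u for u, _ in Counter(urls_day1).most_common(n)}
        top2 = {u for u, _ in Counter(urls_day2).most_common(n)}
        return len(top1 & top2) / n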

18

Spatial Locality in Client Accesses
[Plots: fraction of requests shared vs. domain ID, for a normal day and for Dec. 17, 1998, comparing the trace against a random assignment]
Domain membership is significant except when there is a "hot" event of global interest

19

Spatial Distribution of Client Accesses
Cluster clients using network-aware clustering [KW00]
  IP addresses with the same address prefix belong to the same cluster
The top 10, 100, 1000, and 3000 clusters account for about 24%, 45%, 78%, and 94% of the requests, respectively
A small number of client clusters contribute most of the requests.
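(Illustration, not from the slides: a Python sketch of prefix-based clustering using the simple 24-bit heuristic mentioned later in the talk; [KW00] instead derives prefixes from BGP routing tables.)

    import ipaddress
    from collections import Counter

    def cluster_24bit(client_ips):
        # Group requesting IPs by their /24 prefix and count requests per cluster.
        counts = Counter()
        for ip in client_ips:
            counts[str(ipaddress.ip_network(f"{ip}/24", strict=False))] += 1
        return counts

    def top_k_share(counts, k):
        # Share of all requests contributed by the k busiest clusters.
        return sum(c for _, c in counts.most_common(k)) / sum(counts.values())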

20

The Applicability of Zipf's Law to Web Requests
Web requests follow a Zipf-like distribution
  Request frequency is proportional to 1/i^alpha, where i is a document's popularity rank
The value of alpha is much larger in the MSNBC traces
  1.4 - 1.8 in the MSNBC traces
  Smaller than or close to 1 in proxy traces
  Close to 1 in small departmental server logs [ABC+96]
  Highest when there is a hot event
[Bar chart: Zipf exponent alpha for MSNBC, proxies, and less popular servers]
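(Illustration, not from the slides: one simple way to estimate the Zipf exponent from an access log, via a least-squares fit in log-log space.)

    import numpy as np
    from collections import Counter

    def zipf_alpha(urls):
        # Fit log(frequency) against log(rank); frequency ~ rank^(-alpha).
        freqs = np.array(sorted(Counter(urls).values(), reverse=True), dtype=float)
        ranks = np.arange(1, len(freqs) + 1)
        slope, _intercept = np.polyfit(np.log(ranks), np.log(freqs), 1)
        return -slope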

21

Impact of Larger alpha
Accesses in the MSNBC traces are much more concentrated: 90% of the accesses are accounted for by
  the top 2-4% of files in the MSNBC traces
  the top 36% of files in proxy traces (Microsoft proxies and the proxies studied in [BCF+99])
  the top 10% of files in the small departmental server logs reported in [AW96]
Popular news sites like MSNBC see much more concentrated accesses
  => Reverse caching and replication can be very effective!
[Plot: percentage of requests vs. percentage of documents (sorted by popularity), for the 12/17/98 and 08/01/99 server traces and the 10/06/99 proxy traces]

22

Summary of Results & Implications
Fact: Past modification history, when averaged over a sufficiently large window, yields a rough predictor
Implication: A guide for setting TTLs, but an alternative mechanism (e.g., callback-based invalidation) is needed as backup
Fact: Modification between successive versions is small
Implication: Delta encoding can be very useful

23

Summary of Results & Implications (Cont.)
Fact: The set of popular documents remains stable over a timescale of days
Implication: Prefetch/push previously popular files that have undergone modification
Fact: File popularity follows a Zipf-like distribution, but with a much larger alpha than at proxies
Implication: Potential for reverse caching & replication

24

Part II: Provisioning Content Distribution Networks (CDNs)

On the Placement of Web Server Replicas. To appear in INFOCOM'2001. (Joint work with V. N. Padmanabhan and G. M. Voelker)

25

Introduction to CDNs
Content providers want to offer better service to their clients at lower cost
Increasing deployment of content distribution networks (CDNs)
  Akamai, Digital Island, Exodus, ...
Idea: a network of servers
Features:
  Outsourced infrastructure
  Improved performance by moving content closer to end users
  Flash crowd protection
[Diagram: content providers pushing content to CDN servers, which serve the clients]

26

Placement of CDN Servers
Goal
  Minimize users' latency or bandwidth usage
Minimum K-median problem
  Select K centers to minimize the sum of assignment costs
  Cost can be latency, bandwidth, or any other metric we want to optimize
  NP-hard problem
[Diagram: content providers, CDN servers, and clients]
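(For concreteness, the standard minimum K-median objective in LaTeX; the notation here is mine, not from the slides. Given candidate sites S, a client set C with request load d_j for client j, and a cost c_{jr} from client j to site r, choose the replica set R:)

    \min_{R \subseteq S,\; |R| = K} \;\; \sum_{j \in C} d_j \cdot \min_{r \in R} c_{jr}

Each client is assigned to its cheapest chosen replica, and the load-weighted assignment costs are summed.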

27

Placement Algorithms
Tree-based algorithm [LGG+99]
  Assumes the underlying topology is a tree and models placement as a dynamic programming problem
  O(N^3 M^2) for choosing M replicas among N potential sites
Random
  Pick the best among several random assignments
Hot spot
  Place replicas near the clients that generate the largest load

28

Placement Algorithms (Cont.)
Greedy algorithm (see the Python sketch after this slide)
  Greedy(N, M):
    for i = 1 .. M:
      for each remaining candidate site R:
        cost[R] = total cost after placing an additional replica at R
      select the candidate R with the lowest cost
Super-optimal algorithm
  Lagrangian relaxation + subgradient method
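(A runnable Python sketch of the greedy placement above; the data structures and names are illustrative. cost[c][s] is the cost from client c to candidate site s, and load[c] is that client's request load.)

    def greedy_placement(candidates, clients, cost, load, M):
        # In each of M iterations, add the candidate site that yields the lowest
        # total load-weighted cost from each client to its nearest chosen replica.
        chosen = []
        best = {c: float("inf") for c in clients}

        def total_cost_with(site):
            return sum(load[c] * min(best[c], cost[c][site]) for c in clients)

        for _ in range(M):
            site = min((s for s in candidates if s not in chosen), key=total_cost_with)
            chosen.append(site)
            for c in clients:
                best[c] = min(best[c], cost[c][site])
        return chosen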

29

Simulation Methodology
Network topology
  Randomly generated topologies, using the GT-ITM Internet topology generator
  Real Internet topology: AS-level topology obtained from BGP routing data collected at seven geographically dispersed BGP peers
Web workload
  Real server traces: MSNBC, ClarkNet, NASA Kennedy Space Center
Performance metric
  Relative performance: cost_practical / cost_super-optimal

30

Simulation Results in Random Tree Topologies

31

Simulation Results in Random Graph Topologies

32

Simulation Results in Real Internet Topologies

33

Effects of Imperfect Knowledge about Input Data
Predict load using a moving window average
(a) Perfect knowledge of the topology
(b) Knowledge of the topology accurate only to within a factor of 2

34

Summary
First experimental study of replica placement for CDNs
Knowledge of client workload and topology is crucial for provisioning CDNs
The greedy algorithm performs best
  Within a factor of 1.1 - 1.5 of super-optimal
The greedy algorithm is insensitive to noise
  Stays within a factor of 2 of super-optimal even when the salted error is a factor of 4
The hot spot algorithm performs nearly as well
  Within a factor of 1.6 - 2 of super-optimal
How to obtain the inputs
  Moving window average for load prediction
  BGP routing data for topology information

35

Part III: Transport-Layer Optimization for the Web
Speeding Up Short Data Transfers: Theory, Architectural Support, and Simulation Results. Proceedings of NOSSDAV 2000. (Joint work with Yin Zhang and Srinivasan Keshav)

36

Motivation
Characteristics of Web data transfers
  Short & bursty [Mah97]
  Use TCP
Problem: short data transfers interact poorly with TCP!

37

TCP/Reno Basics
Slow start
  Exponential growth of the congestion window
  Slow: log(n) round trips to deliver n segments
Congestion avoidance
  Linear probing of available bandwidth
Fast retransmission
  Triggered by 3 duplicate ACKs

38

Related Work
P-HTTP [PM94]
  Reuses a single TCP connection for multiple Web transfers, but still pays the slow start penalty
T/TCP [Bra94]
  Caches connection count and RTT
TCP Control Block Interdependence [Tou97]
  Caches cwnd, but large bursts cause losses
Rate-Based Pacing [VH97]
4K Initial Window [AFP98]
Fast Start [PK98, Pad98]
  Needs router support to ensure TCP friendliness

39

Our Approach
Directly enter congestion avoidance
Choose the optimal initial congestion window
  A geometry problem: fit a block to the service rate curve to minimize completion time

40

Optimal Initial cwnd
Minimize completion time by having the transfer end at an epoch boundary.

41

Shift Optimization
Minimize the initial cwnd while keeping the same integer number of RTTs
  Before optimization: cwnd = 9
  After optimization: cwnd = 5
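(Illustration only: a toy Python model of the idea. It ignores the service-rate-curve geometry the paper actually uses and simply assumes the window grows by one segment per RTT once the sender enters congestion avoidance, so it will not reproduce the exact 9 -> 5 example above.)

    def rtts_needed(n_segments, w):
        # RTT rounds to deliver n_segments starting in congestion avoidance at window w,
        # with the window growing by one segment per RTT (toy model).
        sent, rounds = 0, 0
        while sent < n_segments:
            sent += w + rounds
            rounds += 1
        return rounds

    def shift_optimize(n_segments, w_opt):
        # Smallest initial window that still completes in the same number of RTTs as w_opt.
        k = rtts_needed(n_segments, w_opt)
        w = w_opt
        while w > 1 and rtts_needed(n_segments, w - 1) == k:
            w -= 1
        return w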

42

Effect of Shift Optimization

43

TCP/SPAND
Estimate the network state by sharing performance information
  SPAND: Shared PAssive Network Discovery [SSK97]
Directly enter congestion avoidance, starting with the optimal initial cwnd
Avoid large bursts by pacing
[Diagram: Web servers and a performance server sharing network-state information across the Internet]

44

Implementation Issues
Scope for sharing and aggregation
  24-bit heuristic or network-aware clustering [KW00]
Collecting performance information
  Performance reports, a new TCP option, Windmill's approach, ...
Information aggregation
  Sliding window average
Retrieving estimates of network state
  Explicit query, active push, ...
Pacing
  Leaky-bucket based pacing
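(Illustration, not the TCP/SPAND implementation: a user-level Python sketch of leaky-bucket pacing. Real pacing would live in the kernel and be limited by timer granularity, which is why the friendliness results later distinguish 50-ms and 200-ms timers.)

    import time

    class LeakyBucketPacer:
        # Spaces segment transmissions at a fixed byte rate so that a large initial
        # cwnd is not released as one back-to-back burst.
        def __init__(self, rate_bytes_per_sec):
            self.rate = rate_bytes_per_sec
            self.next_slot = time.monotonic()

        def send(self, segment, transmit=lambda seg: None):
            now = time.monotonic()
            if now < self.next_slot:
                time.sleep(self.next_slot - now)   # wait for the next transmission slot
            transmit(segment)
            self.next_slot = max(now, self.next_slot) + len(segment) / self.rate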

45

Opportunity for Sharing
MSNBC: 90% of requests arrive within 5 minutes of the most recent request from the same client network (using the 24-bit heuristic)

46

Cost of Sharing
MSNBC: 15,000 - 25,000 distinct client networks in a 5-minute interval during peak hours (using the 24-bit heuristic)

47

Simulation Results
Methodology
  Download files in rounds
Performance metric
  Average completion time
TCP flavors considered
  reno-ssr: Reno with slow start restart
  reno-nssr: Reno without slow start restart
  newreno-ssr: NewReno with slow start restart
  newreno-nssr: NewReno without slow start restart

48

Simulation Topologies

49

T1 Terrestrial WAN Link with Single Bottleneck

50

T1 Terrestrial WAN Link with Multiple Bottlenecks

51

T1 Terrestrial WAN Link with Multiple Bottlenecks and Heavy Congestion

52

TCP Friendliness (I): Against reno-ssr with a 50-ms Timer

53

TCP Friendliness (II): Against reno-ssr with a 200-ms Timer

54

Summary
TCP/SPAND significantly reduces latency for short data transfers
  35-65% compared to reno-ssr / newreno-ssr
  20-50% compared to reno-nssr / newreno-nssr
  Even higher for fatter pipes
TCP/SPAND is TCP-friendly
TCP/SPAND is incrementally deployable
  Server-side modification only; no modification at the client side

55

Contributions
Workload characterization
  Study the workload of the MSNBC Web site
Infrastructure provisioning
  Develop placement algorithms for content distribution networks
Protocol efficiency
  Optimize TCP startup performance for Web transfers

56

Other Work
Available at http://www.cs.cornell.edu/lqiu/papers/papers.html
  Fast Firewall Implementations for Software and Hardware-based Routers. Submitted to ACM SIGMETRICS 2001.
  Integrating Packet FEC into Adaptive Voice Playout Buffer Algorithms on the Internet. Proceedings of IEEE INFOCOM 2000, Tel-Aviv, Israel, March 2000.
  On Individual and Aggregate TCP Performance. 7th International Conference on Network Protocols (ICNP'99), Toronto, Canada, October 1999.

57

Contributions
Study the workload of a busy Web server
Develop placement algorithms for content distribution networks
Optimize TCP startup performance for short Web transfers

58

Integrating Packet FEC into Adaptive Voice Playout Buffer Algorithms
Internet telephony is subject to
  Variable loss rate
  Variable delay
Previous work has addressed the two problems separately
  FEC for loss recovery
  Playout buffer adaptation for delay jitter compensation

59

Integrating Packet FEC into Adaptive Voice Playout Buffer Algorithms (Cont.)
Our work
  Demonstrate the interaction between the playout algorithm and FEC
    The playout algorithm should depend on the FEC scheme as well as on network loss and jitter conditions
  Propose several playout algorithms that provide this coupling
  Demonstrate the effectiveness of the algorithms through simulations

60

On Individual and Aggregate TCP Performance
Motivation
  TCP behavior under many competing TCP connections has not been sufficiently explored
Our work
  Use extensive simulations to investigate the individual and aggregate TCP performance of many concurrent connections

61

On Individual and Aggregate TCP Performance (Cont.)
Major findings
  When all connections have the same RTT:
    Wc > 3*Conn => global synchronization
    Conn < Wc < 3*Conn => local synchronization
    Wc < Conn => some connections are shut off
  Adding random processing time makes synchronization and consistent discrimination less pronounced
  Derive a general characterization of overall throughput, goodput, and loss probability
  Quantify the round-trip bias for connections with different RTTs

62

Understanding the End-to-End Performance Impact of RED in a Heterogeneous Environment
Motivation
  The IETF recommends widespread deployment of RED in routers
  Most previous work studies RED in a relatively homogeneous environment
Our work
  Investigate the interaction of RED with five types of heterogeneity

63

Understanding the End-to-End Performance Impact of RED in a Heterogeneous Environment (Cont.)
Major findings
  Mix of short and long TCP connections: short TCP connections get higher goodput with RED than with Drop Tail
  Mix of TCP and UDP: bursty UDP tends to see a lower loss rate with RED than with Drop Tail
  Mix of ECN and non-ECN capable traffic: ECN-capable TCP connections get higher goodput than non-ECN-capable TCP connections
  Effect of different RTTs: RED reduces the bias against long-RTT bulk transfers
  Effect of two-way traffic: when the ACK path is congested, TCP gets higher goodput with RED than with Drop Tail

64

Effects of Imperfect Knowledge about Input Data

65

Effects of Imperfect Knowledge about Input Data (Cont.)
The effect of imperfect topology information
  Randomly remove from 0 up to 50% of the edges in the AS topology derived from the BGP routing tables
  The greedy algorithm is insensitive to edge removal
    Performs within a factor of 2.6 of super-optimal even when 50% of the edges are removed

66

Why is the Web so slow?
Application layer
  Web servers are overloaded ...
Transport layer
  Web transfers are short and bursty, and interact poorly with TCP
Network layer
  Routers are not fast enough
  Network congestion
  Route flaps and routing instabilities
  ...
Inefficiency in any layer of the protocol stack can slow down the Web!

67

Challenges in Providing Highly Efficient Web Services
Workload characterization
  The workload of busy Web sites is not well understood
Infrastructure provisioning
  Current trend: building efficient Web services through replication (content distribution networks)
  Problem: where to place replicas?
Protocol inefficiency
  Mismatch between Web transfers and the TCP protocol

68

Introduction
A solid understanding of Web workload is critical for designing robust and scalable systems
The workload of popular Web servers is not well understood
Study the content and access dynamics of the MSNBC Web site
  A large news server, one of the busiest sites on the Web
  25 million accesses a day (HTML content alone)
  Period studied: Aug.–Oct. 1999 & the Dec. 17, 1998 flash crowd

69

Content Dynamics
Period studied: 10/1/99 – 10/28/99
CDF of modification intervals
  Distinct knees in the CDF at one hour and one day
Predictive power of modification history
  Modification history is a rough predictor of the future modification interval
Extent of change upon file modification
  Most file modifications are minimal => delta encoding can be very useful

70

CDF of Modification Intervals
[Plot: CDF of modification intervals, in seconds, on a log scale from 10^1 to 10^7]
Distinct knees in the CDF at one hour and one day

71

Impact of Age on Popularity
For most documents, accesses are concentrated soon after creation
[Plot: document ID (sorted in decreasing order of popularity) vs. time elapsed since creation (seconds)]

72

Causes of First-time Misses
Up to 40% of cache misses are due to first-time misses [VDA+99]
Date          New files (%)   Old files (%)
Oct. 8, 99    23.16           76.84
Oct. 9, 99    13.22           86.78
Oct. 10, 99   13.25           86.75
Oct. 11, 99   18.75           81.28
Accesses to old documents account for most first-time misses => hard to anticipate such accesses & eliminate first-time misses