
Performance Optimization on

Huawei Public and Private Cloud

Jinsong Liu <liu.jinsong@huawei.com>

Lei Gong <arei.gonglei@huawei.com>


Agenda

• Optimization for LHP

• Balance scheduling

• RTC optimization


LHP (Lock Holder Preemption)

• More pronounced in virtualized environments

  – vCPU scheduling

  – Task preemption

• Consequences

  – Potentially blocks the progress of other vCPUs waiting to acquire the same lock

  – Increases synchronization latency

  – Degrades performance


LHP (Lock Holder Preemption)

• How to solve or alleviate it?

– PLE (Pause Loop Exiting)

– DLHS (Delay LH scheduling)

– Co-scheduling

– Balance scheduling


PLE

• Hardware support

  – VMCS configuration (see the sketch below)

• Optimization for lock waiters

  – The waiter VM-exits proactively

  – Avoids wasting vCPU cycles on futile spinning

• Pros.

  – Supported by upstream

• Cons.

  – Setting appropriate values for ple_gap and ple_window is difficult

    • Requires workload-specific adjustment

  – Finding an appropriate vCPU to yield to is hard
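For concreteness, here is a minimal sketch of the VMCS configuration step, using Intel SDM / Linux KVM-style names (vmcs_read32/vmcs_write32, the control-field constants, kernel-style u32); the default knob values shown are assumptions and vary by kernel version:

/* Sketch: enabling Pause-Loop Exiting in a VMX-based hypervisor. */

#define SECONDARY_EXEC_PAUSE_LOOP_EXITING  0x00000400  /* secondary exec control, bit 10 */

/* Module-parameter-style knobs, as in kvm_intel (defaults are illustrative). */
static unsigned int ple_gap    = 128;    /* max TSC delta between PAUSEs to count as one spin loop */
static unsigned int ple_window = 4096;   /* spin duration (TSC ticks) before forcing a VM exit */

static void enable_ple(void)
{
    u32 secondary = vmcs_read32(SECONDARY_VM_EXEC_CONTROL);

    secondary |= SECONDARY_EXEC_PAUSE_LOOP_EXITING;
    vmcs_write32(SECONDARY_VM_EXEC_CONTROL, secondary);

    /* A waiter spinning on PAUSE for longer than ple_window causes
     * EXIT_REASON_PAUSE_INSTRUCTION, letting the host yield its pCPU
     * to another vCPU (ideally the lock holder). */
    vmcs_write32(PLE_GAP, ple_gap);
    vmcs_write32(PLE_WINDOW, ple_window);
}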


DLHS (Delay Lock Holder Scheduling)

• Background & preconditions

  – Lock holders usually run in interrupt-disabled contexts

  – The period for which a lock is held is normally short

  – Hardware support (e.g. Intel VT-x)

    • Interrupt-window exiting

  – Software support

    • hrtimer, …


DLHS (Delay Lock Holder Scheduling)

• Solution– Set a grace period for LH before scheduling

– If one vCPU is LH • Start one hrtimer, and

• Set interrupt window exiting for VMCS

• If the hrtimer expire– Clear interrupt window

– Continue to schedule for vCPU

– Judge the vCPU release the lock• PLE happened

• Interrupt window exiting happen

• then– Cancel hrtimer

– Release grace period

– Schedule the vCPU immediately
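DLHS is not in upstream KVM; the sketch below only illustrates the grace-period flow above, with hypothetical helper names (dlhs_*, set/clear_intr_window_exiting, deschedule_now, and the grace-period length) around standard kernel hrtimer calls:

/* Sketch of the DLHS grace period (hypothetical helpers; not upstream KVM). */

#define DLHS_GRACE_NS  (50 * 1000)  /* illustrative grace period, 50 us */

static enum hrtimer_restart dlhs_grace_expired(struct hrtimer *t)
{
    struct vcpu *v = container_of(t, struct vcpu, dlhs_timer);

    /* Grace period over: stop watching and let the scheduler proceed. */
    clear_intr_window_exiting(v);   /* clear CPU_BASED_INTR_WINDOW_EXITING */
    allow_deschedule(v);
    return HRTIMER_NORESTART;
}

static void dlhs_on_deschedule(struct vcpu *v)
{
    if (!vcpu_looks_like_lock_holder(v))  /* e.g. guest interrupts disabled */
        return;

    /* Delay the context switch and arm both release-detection paths. */
    hrtimer_start(&v->dlhs_timer, ns_to_ktime(DLHS_GRACE_NS),
                  HRTIMER_MODE_REL);
    set_intr_window_exiting(v);     /* set CPU_BASED_INTR_WINDOW_EXITING in VMCS */
}

/* Called from the PLE handler or the interrupt-window exit handler:
 * either event suggests the guest has left its critical section. */
static void dlhs_lock_released(struct vcpu *v)
{
    hrtimer_cancel(&v->dlhs_timer);
    clear_intr_window_exiting(v);
    deschedule_now(v);              /* schedule the vCPU out immediately */
}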


DLHS – performance

[Figure: Hackbench results across 30 runs under 1:3 CPU overcommit (unit: sec, y-axis 0–300; lower is better). Series: VM1/VM2/VM3 patched vs. VM1/VM2/VM3 unpatched.]


Co-scheduling & Balance scheduling

[Figure: Co-scheduling vs. balance scheduling. Co-scheduling runs all of a guest's vCPUs on pCPUs at the same time X; balance scheduling disperses all of a guest's vCPUs across pCPUs as much as possible, without requiring that they run simultaneously.]


Co-scheduling

• CPU fragmentation

– Reduces CPU utilization

– Delays vCPU execution

• Priority inversion

– Degrades I/O performance

[Figure: timeline T0–T4 on pCPU0/pCPU1. vCPU1's I/O completion is delayed because co-scheduling forces it to wait until vCPU0 can run at the same time.]


Balance scheduling

• Balances vCPU siblings on pCPUs

– without precisely scheduling the vCPUs simultaneously

• How to?

– Uses a bitmap to record all pCPUs in use by the VM

– Scheduler adjustments (see the sketch after this list)

  • Enqueue & dequeue

  • Migration / find_idle_cpu / select_task_rq, etc.
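A sketch of the idea, with hypothetical names (struct vm, vm->used_pcpus, the balance_* and select_fallback_cpu helpers) on top of standard kernel cpumask primitives; the real patch hooks enqueue/dequeue and the placement paths listed above:

/* Sketch of balance scheduling's pCPU choice (hypothetical helpers).
 * Each VM keeps a bitmap of pCPUs currently running one of its vCPUs;
 * a waking vCPU prefers a pCPU not already occupied by a sibling. */

static int balance_select_pcpu(struct vm *vm, struct vcpu *v)
{
    int cpu;

    /* First pass: idle pCPUs with no sibling vCPU of this VM. */
    for_each_cpu_and(cpu, cpu_online_mask, &v->cpus_allowed) {
        if (!cpumask_test_cpu(cpu, &vm->used_pcpus) && idle_cpu(cpu))
            return cpu;
    }

    /* Fallback: any allowed pCPU; let the normal load balancer decide. */
    return select_fallback_cpu(v);
}

/* Maintained at enqueue/dequeue so the bitmap tracks running siblings. */
static void balance_enqueue(struct vm *vm, int cpu)
{
    cpumask_set_cpu(cpu, &vm->used_pcpus);
}

static void balance_dequeue(struct vm *vm, int cpu)
{
    cpumask_clear_cpu(cpu, &vm->used_pcpus);
}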


Performance evaluation

• Workload:

– Pushserver in Huawei Private Cloud

– Continuous testing for 24 hours

• Results

Proportion of building chains completed within 10ms (higher is better):

  with balance sched       93.50%

  without balance sched    70%

  1:1 vcpupin              95.30%

[Figure: bar chart of the same proportions, 0%–120% scale.]


RTC on KVM

• Windows uses the RTC as its clock event device

• The RTC is emulated in QEMU using three timers (the guest programs the device through ports 0x70/0x71, as sketched below)

  – rtc_periodic_timer

    • Generates periodic interrupts

    • Programmable to fire at the configured interrupt rate

  – rtc_update_timer

    • Generates alarm interrupts

    • Fires from once per second to once per day

  – rtc_coalesced_timer

    • Generates compensation interrupts

    • Slews ticks lost for various reasons

• Tick coalescing gets worse and worse as VM density increases
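For context, this is how a guest typically programs the MC146818 RTC periodic interrupt through I/O ports 0x70/0x71; every one of these port accesses traps into the emulator, which is the traffic the optimizations below try to reduce (kernel-style types; outb/inb are the usual x86 port I/O helpers from asm/io.h):

/* Sketch: guest-side programming of the RTC periodic interrupt. */

#define RTC_INDEX_PORT  0x70
#define RTC_DATA_PORT   0x71
#define RTC_REG_A       0x0A    /* rate select in the low 4 bits */
#define RTC_REG_B       0x0B    /* bit 6 (PIE) enables periodic interrupts */

static u8 cmos_read(u8 reg)
{
    outb(reg, RTC_INDEX_PORT);
    return inb(RTC_DATA_PORT);
}

static void cmos_write(u8 reg, u8 val)
{
    outb(reg, RTC_INDEX_PORT);
    outb(val, RTC_DATA_PORT);
}

static void rtc_set_periodic(u8 rate)   /* frequency = 32768 >> (rate - 1) Hz */
{
    cmos_write(RTC_REG_A, (cmos_read(RTC_REG_A) & 0xF0) | (rate & 0x0F));
    cmos_write(RTC_REG_B, cmos_read(RTC_REG_B) | 0x40);   /* set PIE */
}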

• Pain points

  – Some operations need to hold the BQL (big QEMU lock)

  – Context switches between user space and kernel space

  – Interrupt injection from user space

  – Performance degradation

    • Latency increases

    • Windows guest density decreases


RTC optimizations on KVM

• Minimize the influence of the BQL

  – Place the RTC memory region outside the BQL

• Use irqfd to inject interrupts (see the sketch after this list)

• Hyper-V features

  – Hyper-V clock, …

  – Decrease ioport reads/writes

    • Fewer accesses to ports 0x70/0x71

• Move RTC emulation into the hypervisor

  – Inject interrupts in KVM

  – Reduces context switching

  – But:

    • Larger attack surface

    • Incompatible with new features such as split irqchip

• Optimize the RTC compensation solution (next slide)
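A minimal userspace sketch of the irqfd path, using the real KVM_IRQFD ioctl; the RTC's GSI and the surrounding VM setup are assumed, and error handling is omitted:

/* Sketch: wiring an eventfd to a guest GSI with KVM's irqfd mechanism,
 * so an interrupt can be injected by writing the eventfd instead of
 * taking the slower userspace injection path. */

#include <linux/kvm.h>
#include <stdint.h>
#include <sys/eventfd.h>
#include <sys/ioctl.h>
#include <unistd.h>

int setup_rtc_irqfd(int vm_fd, uint32_t gsi)   /* e.g. GSI 8 for the RTC */
{
    int efd = eventfd(0, EFD_CLOEXEC);

    struct kvm_irqfd irqfd = {
        .fd  = (uint32_t)efd,
        .gsi = gsi,
    };
    ioctl(vm_fd, KVM_IRQFD, &irqfd);           /* KVM now listens on efd */
    return efd;
}

void inject_irq(int efd)
{
    uint64_t one = 1;
    /* Kicking the eventfd injects the interrupt entirely in the kernel. */
    write(efd, &one, sizeof(one));
}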


RTC compensation solution

• Slew RTC ticks directly in the hypervisor

• Count the coalesced interrupts

  – Increment the count when an RTC interrupt injection fails

  – Adjust the count when the RTC interrupt rate changes

• Inject coalesced interrupts after the EOI handler, if any exist (sketched below)

  – No separate timer needed

  – More timely

  – Throttle the rate if too many interrupts have been coalesced

• Live migration support

  – Save the coalesced-interrupt count on the source side

  – Restore it on the destination side

  – Both KVM and QEMU need to be patched
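A sketch of the compensation flow with hypothetical fields and helpers (struct rtc_state, irq_delivered, inject_irq_now); the throttle constant is an assumption. For live migration, the coalesced count is simply part of the state saved on the source and restored on the destination:

/* Sketch of in-kernel RTC tick compensation (hypothetical names).
 * Failed injections are counted; after the guest EOIs an RTC interrupt,
 * one of the coalesced ticks is reinjected, subject to a throttle. */

#define RTC_REINJECT_MAX_BURST  8   /* throttle if too many ticks piled up */

static void rtc_inject(struct rtc_state *s)
{
    if (!irq_delivered(s->gsi))     /* interrupt still pending in the guest */
        s->coalesced++;             /* remember the lost tick */
}

/* Called from the EOI handler for the RTC's GSI. */
static void rtc_on_eoi(struct rtc_state *s)
{
    if (s->coalesced > 0 && s->reinjected < RTC_REINJECT_MAX_BURST) {
        s->coalesced--;
        s->reinjected++;
        inject_irq_now(s->gsi);     /* reinject right after the EOI */
    }
}

/* When the guest reprograms the interrupt rate, stale ticks are meaningless. */
static void rtc_rate_changed(struct rtc_state *s)
{
    s->coalesced = 0;
    s->reinjected = 0;
}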


Optimization evaluation

[Figures: results before optimization vs. after optimization.]

Thank You!