Data Compression Conference 2013 Chenggang Yan, Yongdong Zhang, Feng Dai and Liang Li 1.

HIGHLY PARALLEL FRAMEWORK FOR HEVC MOTION ESTIMATION ON MANY-

CORE PLATFORM

Data Compression Conference 2013

Chenggang Yan, Yongdong Zhang, Feng Dai and Liang Li

Outline Introduction Related Work Proposed Method Experimental Results Conclusion

Introduction(1/2)

HEVC coding tree unit (CTU)

Introduction(2/2)

Local parallel method (LPM) Maximum parallelism of LMP is equal or less than 8. independent Pus (IPUs)

Directed acyclic graph (DAG)

Related Work(1/2)

Local parallel method (LPM) [16] Motion estimate region (MER)

[16] Minhua Zhou, “AHG10: Configurable and CU-group level parallel merge/skip,” JCTVC-H0082, Feb. 2012

Related Work(2/2)

Local parallel method (LPM)

M = 16 or 8

Proposed Method A. Data Dependency Analysis

B. DAG for CTUs

C. Highly Parallel Framework

Proposed Method.A(1/3)

Independent PUs (IPUs) The IPU’s left boundary and MER’s left boundary do not

overlap. The IPU’s upper boundary and MER’s upper boundary do not

overlap.

Neighboring CTUs left upper upper-left upper-right

B. DAG for CTUs

Proposed Method.B(1/4)

Generate a DAG to capture the dependency relationships of CTUs.

DAG consists of a set of vertices V and edges E. data dependency <=> an edge. Processed <=> remove

Condition matrix (CM)

B. DAG for CTUs

Proposed Method.C(1/5)

Step1 : Initialize DQ and CM. DQ is a waiting queue. CM is

designed to record the number of related CTUs for each CTU. Step2 :

When some values in the CM become zero, get the corresponding coordinates and push them into DQ.

Step3 :Get coordinates from DQ and process corresponding

CTUs in parallel on many-core platform. Step4 :

Update CM. When a CTU with coordinate (i, j) in CM is processed, the values of coordinates (i+1, j), (i+1, j-1), (i,j+1) and (i+1,j+1) in CM will minus one operation.

Step5 :Repeat above steps 2~4 until each frame is over.

Maximum parallelism of CTU

Maximum parallelism of highly parallel framework

Average parallelism of highly parallel framework

Experimental Results(1/5)

Conclusion(1/1)

Highly parallel framework provide sufficient parallelism for many-core platforms.

Use the DAG-based order to parallelize CTUs.

Data Compression Conference 2013 Chenggang Yan, Yongdong Zhang, Feng Dai and Liang Li 1.

Documents

Transcript of Data Compression Conference 2013 Chenggang Yan, Yongdong Zhang, Feng Dai and Liang Li 1.

LIANG CHI COOLING TOWER.pdf

Lily pad lotus. maMa Liang Ma Liang and the Magic Brush.

Dissertation Liang Chen

Chenggang Xu Chinese Institutions Select Pages

CHUNG YAO LIANG - math.usm.mymath.usm.my/rgcft/PDF/Theses/Master final copy.pdfCHUNG YAO LIANG - math.usm.my

Copyright by Liang Liang 2009

ROSEANNE LIANG PRESS KIT

12 Liang v. People

Liang-shin Hahn

Political Economy of Economic Policy Last Lecture Privatization & Corporate Governance (Based on Gan, Guo and Xu, 2008) Chenggang Xu copyright@Chenggang.

EFFICIENT PARALLEL FRAMEWORK FOR H.264 AVC DEBLOCKING FILTER ON MANY-CORE PLATFORM Yongdong Zhang, Member, IEEE, Chenggang Yan, Feng Dai, and Yike Ma.

Liang 2014

Liang (2001)

v12a128-liang pgmkr

Butterfly Lovers Liang Zhu

Poster 2014 IUS_Rongjie Liang

Tiantian Liang - MTNA

Zhichao Liang - export.arxiv.org

Liang Liang and Mark Schwartz U. Wisconsin Milwaukee

Liang vs. People