BRKRST-2008

68

Transcript of BRKRST-2008

Page 1: BRKRST-2008
Page 2: BRKRST-2008

Advanced VPLS attachment technology options BRKRST-2008

Patrice Bellagamba

Distinguished SE

Cisco Europe

Page 3: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Problem statement

Problem to solve:

“Ethernet service Attachment-Circuit redundancy should not imply end to end Spanning-tree usage”

STP is struggling with

– Topology diameter

– Fault isolation between sites

– Resilience to WAN type connections

Position Architecture: STP domain isolation thru dual homing

– Carrier-Ethernet

– Data-Center Interconnect

– Ethernet Services

Page 4: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Access Multi-homing Solutions Summary

Highlights

Multi-chassis LAG Simple solution for spoke-and-hub topology, works for both bridging and non-bridging access device

Standard based solution by using 802.3ad

Sub-second convergence

Phase 1 implement is active/standby mode.

Phase 2 is per VLAN load balancing

Phase 3 is per flow load balancing with E-VPN

REP /REP access gateway

Sub 200msec convergence

Good access ring isolation

Now standard based G.8032 (XR4.1 release)

Spoke-and-hub and ring topology, not works well for mesh network

MST/PVST access gateway Standard based solution as long as access network support MST/PVST

Works for any access network topology

Good access domain isolation

Work with 802.1ah PBB

Convergence time depends on access network STP

Node clustering VSS (Catalyst / Cisco 7600-Sup2T)

Nv cluster (ASR9k)

One control-plane for two chassis

Easiness, Active/Active

Page 5: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

L2 node peering solutions

Making usage of an “Inter-Chassis Communication Protocol”

Service Type L2VPN

Transport Enabler

Access Redundancy ICCP based

Protocol / Feature

E-LINE VPWS Hub and Spoke (Active /

Backup) mLACP + 2-way PW Red. (coupled mode)

E-LAN VPLS Hub and Spoke (Active /

Backup) mLACP + 2-way PW Red. (decoupled

mode)

E-LAN H-VPLS Hub and Spoke (Active /

Backup) mLACP + 2-way PW Red. (decoupled

mode)

DCI VPLS CPE service (Active per

Service) mLACP or pseudo-mLACP

DCI EVPN PE service (Active per Flow) mLACP + BGP

Don’t be scared of all acronyms, they will be explained along the presentation

Page 6: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Agenda

Technology introduction

– Inter-Chassis Communication Protocol (ICCP)

– Multi-Chassis Link Aggregation Group (MC-LAG)

Implementing MC-LAG

– E-LAN dual-homing

– E-LINE dual-homing

– Data-Center Interconnect dual-homing

Active/Active MC-LAG or ICCP-SM

Flow-based E-VPN

7

Page 7: BRKRST-2008

Inter-Chassis Communication Protocol for L2VPN PE Redundancy (ICCP) draft-martini-pwe3-iccp-12

Page 8: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Inter-Chassis Communication Protocol

ICCP allows two or more devices to form a ‘Redundancy Group’

ICCP provides a control channel for synchronizing state between devices

ICCP uses TCP/IP as the underlying transport

–ICCP rides on targeted LDP session, but MPLS need not be enabled

Various redundancy applications can use ICCP:

–mLACP

–Pseudowire redundancy

Under standardization in IETF:

–draft-ietf-pwe3-iccp-12.txt

RG

ICCP over Dedicated Link

RG

ICCP over Shared Network

Page 9: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

10

Inter-Chassis Communication Protocol

• ICC Protocol Transport Requirements

– Reliable Message exchange

– In-order Delivery

– Sequence Numbers

– Timeouts/Retransmissions

use widely deployed LDP protocol.

• Extend LDP with a small set of new messages:

– RG Connect Message

– RG Disconnect Message

– RG Notification Message

– RG Application Data Message

• Use LDP Capability to bootstrap ICCP.

• Application layer specific TLVs.

draft-ietf-pwe3-iccp-12.txt

Page 10: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

11

Two mechanisms:

• BFD – Bi-Directional Forward Detection

• Detection upon loss of BFD keepalives

• Requires nodes to be co-located, with a direct link connection

• No split-brain protection, mandates link to be port-channel dispatched over two

different line cards

• /32 IP Route-watch

• Detection upon loss of IP routing adjacency

• Geo localization of nodes

• Split-brain tie-break via MPLS network

• Depends on IGP timers

OSPF/ISIS fast convergence tuning is required

• Ensure not to have less specific route for detection to work

• “ip routing protocol purge interface”

This is requiring to avoid route-watch flapping on peer link failure

Default in IOS-XR & IOS Rls15

RG

ICCP over Dedicated Link

RG

ICCP over Shared Network

Node Failure Detection

Page 11: BRKRST-2008

Multi-Chassis Link Aggregation Group(ASR 9K - MC-LAG) & Multi-Chassis Link Aggregation control Protocol(Cisco 7600 - mLACP)

Page 12: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Background: Link Aggregation Control Protocol System attributes:

–System MAC address: MAC address that uniquely identifies the switch

–System priority: determines which switch’s Port Priority values win

Aggregator (bundle) attributes:

–Aggregator key: identifies a bundle within a switch (per node significance)

–Maximum links per bundle: maximum number of forwarding links in bundle – used for Hot Standby configuration

–Minimum links per bundle: minimum number of forwarding links in bundle, when threshold is crossed the bundle is disabled

Port attributes:

–Port key: defines which ports can be bundled together (per node significance)

–Port priority: specifies which ports have precedence to join a bundle when the candidate ports exceed the Maximum Links per Bundle value

–Port number: uniquely identifies a port in the switch (per node significance)

Agg1

Agg2

Agg3

Agg4

Port #: 1, Priority: 10

Port #: 2, Priority: 11

Port #: 3, Priority: 5

Port #: 4, Priority: 6

Key: 10

Key: 20

Key: 10

Key: 35

System Priority: 5

System MAC: M1

System Priority: 3

System MAC: M2

Page 13: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Extending LACP Across Multi-Chassis: mLACP

mLACP uses ICCP to synchronize LACP configuration & operational state between PoAs, to provide DHD the perception of being connected to a single switch

All PoAs use the same System MAC Address & System Priority when communicating with DHD

–Configurable or automatically synchronized via ICCP

Every PoA in the RG is configured with a unique Node ID (value 0 to 7). Node ID + 8 forms the most significant nibble of the Port Number

For a given bundle, all links on the same PoA must have the same Port Priority

ICCP

DHD = Dual-Homed-Device

PoA1

LACP PoA2

Node ID: 1

Node ID: 2

Port #: 0x9001, Port Priority 1

Port #:0xA001, Port Priority 2

System MAC:

aaaa.bbbb.cccc

System Priority: 1

PoA = Point of Aggregation

DHD

Page 14: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Operational Variants

Each PoA is configured to limit the maximum number of links per bundle

–Limit must be set to L, where L is the minimum number of links from DHD to any single PoA

DHD max link should be set > L

– In order to insure that it is slave of the POA

– This will allow faster convergence

Selection of active/standby links is the responsibility of the PoAs

Advantages: Faster switchover times compared to other variants, and Minimum Link policy on PoA can be flexible

Disadvantage: If ICCP transport is lost, Split Brain condition could occur

PoA-Based Control

ICCP DHD

Standby PoA

Active PoA LACP

Max Links per

Bundle: L

L = 2

Max Links per

Bundle: L

This is the most used variant

Page 15: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Protected Failure Points

mLACP Offers Protection Against 5 Failure Points:

A: DHD Port Failure

B: DHD Uplink Failure

C: Active PoA Port Failure

D: Active PoA Node Failure

E: Active PoA Isolation from Core Network

A B C D E

Active PoA

DHD

Standby PoA

Page 16: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Failover Operation

Step 1 – For port/link failures (A,B,C), active PoA evaluates number of surviving in bundle:

If >= M, then no action

If < M, then trigger failover to standby PoA

Step 2 – Active PoA signals failover to standby PoA over ICCP

Step 3 – Failover is triggered on DHD by one of:

Dynamic Port Priority Mechanism: real-time change of LACP Port Priority on active PoA to cause the standby PoA links to gain precedence

Links are either Hot-Standby or Up

Brute-force Mechanism: change the state of the surviving links on active PoA to admin down

Links are either Err-disabled or Up

Step 4 – Standby PoA and DHD bring up standby links per regular LACP procedures

Port/Link Failures

ICCP DHD

Standby PoA

Active PoA

• Max Links per

Bundle: L (=2)

• Min Links per

Bundle: M (=2)

A B

C

2. Signal failover over ICCP

3. Dynamic Port Priority or

Brute-force failover

3. LACP Exchanges

Page 17: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Failover Operation

Step 1A – Standby PoA detects failure of Active PoA via one of:

–IP Route-watch: loss of IP routing adjacency

–BFD: loss of BFD keepalives

Step 1B – DHD detects failure of all its uplinks to previously active PoA

Step 2 – Both Standby PoA and DHD activate their Standby links per regular LACP procedures

Node Failure

ICCP DHD

Standby PoA

Active PoA

1A. IP Route-

Watch or BFD

timeout

D 1B. Links Down

2. LACP Exchanges

Page 18: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Failover Operation

Step 1 – Active PoA detects all designated core interfaces are down interchassis group 21 backbone interface TenGigabitEthernet4/1 backbone interface TenGigabitEthernet1/4

Really useful if no direct connection between POA or using one only module toward core

Step 2A – Active PoA signals standby PoA over ICCP to trigger failover

Step 2B – Active PoA uses either Dynamic Port Priority or Brute-force Mechanism to signal DHD of failover

Step 3 – Standby PoA and DHD bring up standby links per regular LACP procedures

PoA Isolation from Core

ICCP DHD

Standby PoA

Active PoA

E

1. Detect core

isolation

2A. Signal failover

over ICCP

2B. Dynamic Port

Priority or Brute-

force failover

3. LACP Exchanges

Page 19: BRKRST-2008

E-LAN Availability Models Active/Backup Access Node Redundancy (mLACP) Works similarly with H-VPLS (MPLS Access)

Page 20: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

E-LAN availability model Active / Backup Access Node Redundancy (mLACP)

VFI

VFI

A A

A A

A

A

A

A

LACP ICCP ICCP

For VPLS Decoupled Mode, VFI’s PWs always advertised in Active state, regardless of AC state

VFI

VFI

Forwarding EoMPLS PW Non-Forwarding EoMPLS PW

FA

FB

FC

LACP

Events

I Initial state

FA-C Port / Link Failures

2

Standby link brought up per LACP proc.

Port / Link Failures

1B

2

1A 1B Failover triggered on DHD

1A

Active PoA detects failure and signals failover over ICCP

3

Standby PoA flushes MAC table and triggers LDP MAC add. withdrawal to remote peers

4

Remote PEs flush MAC addresses

3

4 4

4

VPLS

PW - PseudoWire

VFI – Virtual Forwarding Instance

Page 21: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

E-LAN availability model Active / Backup Access Node Redundancy (mLACP)

VFI

VFI

A A

A A

A

A

A

A

LACP ICCP

VFI

VFI

Forwarding EoMPLS PW Non-Forwarding EoMPLS PW

FA

FB

FC

LACP

Events Port / Link Failures (cont.)

E End State

ICCP

VPLS

4

Remote PEs flush MAC addresses

Page 22: BRKRST-2008

E-LINE Availability Models Active/Backup Access Node Redundancy (mLACP)

Page 23: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Pseudowire Redundancy with LDP

Designate Pseudowires as either primary or backup

–Primary Pseudowire used for traffic forwarding, and backup takes over in case of failure

Signaling Redundant Pseudowires in targeted LDP session

–Cold Redundancy: Backup PWs were not signaled until required to take over

–Warm Redundancy: Backup PWs were signaled up in the control-plane but held down in the data-plane. Use AC Fault code-point in LDP Status Message to indicate a backup PW

–Hot Redundancy: Use PW Preferential Forwarding Status Bits

Background

PW 2

PW 1

MPLS

PE1

CE

<IP1, VCID1>

<IP2, VCID2>

Page 24: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Two-Way Pseudowire Redundancy

Allows dual-homing of two local PEs to two remote PEs

Four pseudowires: 1 primary & 3 backup provide redundancy for a dual-homed device

Overview

PW 1

MPLS

uPE2 uPE4

uPE1

CE

uPE3 PW 2 PW 3

PW 4

Page 25: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Pseudowire Redundancy with LDP

–0x00000000 - Pseudowire forwarding (clear all failures)

–0x00000001 - Pseudowire Not Forwarding

–0x00000002 - Local Attachment Circuit (ingress) Receive Fault

–0x00000004 - Local Attachment Circuit (egress) Transmit Fault

–0x00000008 - Local PSN-facing PW (ingress) Receive Fault

–0x00000010 - Local PSN-facing PW (egress) Transmit Fault

–0x00000020 - PW Forwarding Standby

–0x00000040 – PW Request to go Active

PW Status Signaling

RFC 4447

draft-ietf-pwe3-redundancy-bit

Only this bit is required/used (with help of ICCP)

PW

tLDP

PE1

CE

PE2

Page 26: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Two-Way Pseudowire Redundancy

Every PE decides the local status of the PW: Active or Standby

A PW is selected as primary for forwarding if it is active on both local & remote PEs

A PW is considered as backup if it is declared as Backup by either local or remote PE

Independent Operation Mode

PW

PE1 PE2

PE1 PE2 Active Active

Active Standby

MPLS

PE1 PE2 Standby Standby

PE1 PE2 Standby Active

Page 27: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Two-Way Pseudowire Redundancy

VPWS / H-VPLS – two-way coupled:

–When AC changes state to Active1, both PWs will advertise Active

–When AC changes state to Standby1, both PWs will advertise Standby

H-VPLS – two-way decoupled:

–Regardless from AC state, Primary PW and Backup PWs will advertise Active state

–ASR9K implementation

For H-VPLS, all PWs in VFI (at nPE) are Active simultaneously, for both access & core PWs

Determining Pseudowire State

Active Active

Active

Standby Standby

Standby

pseudowire-class <class name>

encapsulation mpls

status peer topology dual-homed

Active Active

Active

Standby Active

Active

pseudowire-class <class name>

encapsulation mpls

status peer topology dual-homed

status decoupled (1) Active / Standby AC states determined for example by mLACP

Page 28: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

E-LINE availability model Active / Backup Access Node Redundancy (mLACP)

S S

A A

LACP LACP ICCP ICCP

1A

2

1B

S

3

A

4

Events

I Initial state

FA-C Port / Link Failures

1A

Active PoA detects failure and signals failover over ICCP

1B Failover triggered on DHD

2 Standby link brought up per LACP proc.

3 Active PoA advertises “Standby” state on its PWs

4 Standby PoA advertises “Active” state on its PWs

For VPWS Coupled Mode, attachment circuit (AC) state (Active/Standby) drives PW state advertised to remote peers

Forwarding EoMPLS PW Non-Forwarding EoMPLS PW

Port / Link Failures

VPWS

FA

FB

FC

Page 29: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

LACP

E-LINE availability model Active / Backup Access Node Redundancy (mLACP)

S

A

LACP ICCP ICCP

S

A

FA

FB

FC

Events

I Initial state

FA-C Port / Link Failures

1A

Active PoA detects failure and signals failover over ICCP

1B Failover triggered on DHD

2 Standby link brought up per LACP proc.

3 Active PoA advertises “Standby” state on its PWs

4 Standby PoA advertises “Active” state on its PWs

E End State Local site access failure does not trigger LACP failover at

remote site (i.e. control-plane separation between sites)

Forwarding EoMPLS PW Non-Forwarding EoMPLS PW

Port / Link Failures (cont.)

VPWS

Page 30: BRKRST-2008

Implementing MC-LAG

Page 31: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Multi-Chassis LACP synchronization:

LACP BPDUs (01:80:C2:00:00:00) are exchanged on each Link

System Attributes: Priority + bundle MAC Address

Port Attributes: Key + Priority + Number + State

MPLS

Terminology:

mLACP : Multi-Chassis Link Aggregation Control Protocol

MC-LAG : Multi-Chassis Link Aggregation Group

DHD : Dual Homed Device (Customer Edge)

DHN : Dual Homed Network (Customer Edge)

POA : Point of Attachment (Provider Edge)

redundancy

iccp

group <ig-id>

mlacp node <node id>

mlacp system mac <system mac>

mlacp system priority <sys_prio>

member

neighbor <mpls device>

interface <bundle>

mlacp iccp-group <ig-id>

mlacp port-priority <port prio>

interface <physical interface>

bundle id <bundle id> mode active

DHD

Active POA

Standby POA

Redundancy

Group

ICCP

Attachment Circuit Using mLACP/MC-LAG

Page 32: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Additional Features for IOS-XR

Configurable ‘wait-while’ timer

–Timer used to bulk member aggregation by delaying initial link state transitions within a bundle (standards set a time of 2 s)

Configurable flap suppression timer

–Timer used to allow a bundle time to move links from Standby into Active state (i.e. bring in replacement links) before determining whether the bundle has gone down

Configurable delay for revertive switchbacks

–Allows other applications time to complete a sync of their data

–Useful for ‘Revertive’ behavior which would normally make the highest priority device Active as soon as possible

LACP level bundle shutdown

–Method for costing out a bundle while allowing a quick bring up

–LACP continues to be performed but...

– Links are kept in Standby state

– The bundle is Down to other applications

Page 33: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Active/Standby Deployment Options

ICCP VFI VFI

MPLS Core

Cluster node

as a DHD

ICCP VFI VFI

MPLS Core

Cluster node

as a DHD

LACP Hot-Standby

Active PE Standby PE

Traffic may take sub-optimal path via the

DHD inter chassis link

Optimal path by bundling 4 links in vPC

Direct path exist between each N7K and active PE

ASR9K’s as

POA’s ASR9K’s as

POA’s

Active

PE

Standby PE

Page 34: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

SiSi

SiSi

MPLS

core

1 2

3 4

5

6

7

8

Only error 2/3/4 are leading to ICCP convergence

Rem: 2 & 4 are dual errors

• 500 VLAN Unicast: Link error sub-1s & Node error sub-2s

• 1200 VLAN unicast: Link error sub-2s & Node error sub-4s

MC-LAG to VPLS Testing http://www.cisco.com/en/US/docs/solutions/Enterprise/Data_Center/DCI/vpls/vpls_asr9k.html

Page 35: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

MC-LAG Not-Supported Service Example Local Connect

Local connect service between bundle port and bundle port, or between bundle port and regular port are configurable. And it works in the normal condition

However, if bundle failover happen, then it won’t work. For example, for CE2, if it failover to the bottom POA, then there is no local connection to CE 1 or CE3

In summary, although you can configure the local connect service for the MC-LAG bundle port, but it could cause problem. So it’s not supported

Local bridging is supported assume there is L2 link between two POA

ICCP

Standby POA

Active POA

CE1

CE2

CE3

ICCP

Standby POA

Active POA

CE1

CE2

CE3

Local Bridging Local Connect

Page 36: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Feature Cisco Catalyst

3750-ME Cisco ME 3600X /

3800X Cisco ASR

901/903 Cisco 7600

Cisco ASR 9000

One-Way PW Redundancy X X X X X

Two-Way PW Redundancy X X

LDP MAC Address Withdrawal X X

LACP X X X X X

mLACP X (903) X X

ICCP X (903) X X

mLACP Platform Support

Page 37: BRKRST-2008

Pseudo mLACP ICCP-SM - Service Multi-Homing

Page 38: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Pseudo MLACP (P-mLACP) concept P-mLACP provides an Active / Active dual homing redundancy

mechanism with per-VLAN load balancing

– higher bandwidth utilization

Backup time does not require link state changes

– Improve scale and convergence time

Supports today Dual Homed Device (DHD)

– Will support Dual Homed Network (DHN) in future

New extensions to Inter-Chassis Communication Protocol (ICCP) used for communication to control the failover process

Available in Cisco 7600 with 15.1(3)S release mandates ES / ES+ linecards

Available in ASR 9000 with 4.3.1 released in Q2 CY13

Page 39: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Pseudo MLACP (P-mLACP) concept (cont.) P-mLACP provides VLAN based redundancy by allowing one to configure one primary and one secondary interface pair for each member VLAN

Dual Homed Device (DHD) configured with two separate port-channels aggregating to one LAG on Point of Attachment (PoAs)

– Pseudo-LACP application on PoAs ensure that two port-channels on DHD will get bundled to one logical port-channel on PoA

Pseudo mLACP

DHD

POA1

POA2

Allow: VLAN 1-10

Block: VLAN 11-20

Block: VLAN 1-10

Allow: VLAN 11-20

Allow: VLAN 1-20

Allow: VLAN 1-20

ICCP Aggregation / Core

Page 40: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Attachment Circuit options

DHD

ICCP

DHD

ICCP

DHN

Mono-Device Dual Homed Device(DHD)

Multi-Devices Dual Homed Device (DHD)

The one analyzed in this presentation

Dual Homed Network (DHN)

• 7600 & ASR9K do support MVRP

• it would require DHN to support MVRP to flush local

MAC

• ASR9K supports STP TCN to flush edge MAC

• 4K VLAN Scale (no multi-chassis etherchannel)

Page 41: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Active path failures

DHD

ICCP

A B

C D E

A = Nexus port failure (shut)

B = Attachment-Circuit failure

C = Dual AC failure

D = PE crash

E = MPLS link failure

N= DHD member crash

N

Page 42: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Failure condition and MAC flush consideration MAC learning on nominal mode

Mr

LAG PW Forwarding

Ml

Ml

Ml

Mr

Mr ICCP

Blocking

Mr

Mr

Page 43: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Failure condition and MAC flush consideration MAC learning on PE failure

Mr

LAG PW Forwarding

Blocking

Ml

Ml

Ml

Mr

Mr

Mr

Mr ICCP

Mr

Mr

Mr

Ml

Ml

Page 44: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Failure condition and MAC flush consideration MAC learning on PE up time

Mr

LAG PW Forwarding

Ml

Ml

Ml

Mr

Mr ICCP

Blocking

Mr

Mr

Mr

Mr

Mr

Ml

Ml

Automatic

reversion

-

Manual

procedure

No MVRP on Nexus 7000

Solved with ASR9K using a TCN instead

Page 45: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Active path failures measurement

Failure type Traffic

Down Up

Traffic

Down Up

Comment

A DHD link

shut

0.5s-1s 1.5s <0.1s 1.8s

C PE link

shut

0.8s 1.8s <0.1s 1.8s

D PE crash 0.8s 1.7s 0.8s 1.7s

E MPLS link

down

0.1s <0.1s 0.1s <0.1s

N DHD crash

0.3s <0.1s 0.3s <0.1s

Medium scale = 1000 VLAN per vPC *2

Page 46: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

ICCP Based Service Multihoming

5-tuple hashing is used for load balancing among bundle member links, using the following parameters:

– IP source address

– IP destination address

– Router ID

– Layer 4 source port

– Layer 4 destination port

Applications running on the two POAs (mLACP, IGMP snooping, DHCP snooping or ANCP (Access Node Control Protocol)) synchronize their state using ICCP.

Ethernet CFM on Link Aggregation Group (LAG) interfaces (Ethernet bundle interfaces), Ethernet and bundle subinterfaces, and LAG member (bundle member) interfaces.

(ICCP-SM) = pseudo mlacp for ASR9000

47

Page 47: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

iccp group 100

redundancy

iccp

group 100

member

neighbor 222.222.222.222

!

backbone

interface GigabitEthernet0/0/0/10

interface GigabitEthernet0/0/0/11

interface GigabitEthernet0/0/0/12

!

isolation recovery-delay 30

!

iccp group 100

redundancy

iccp

group 100

member

neighbor 111.111.111.111

!

backbone

interface GigabitEthernet0/0/0/10

interface GigabitEthernet0/0/0/11

interface GigabitEthernet0/0/0/12

!

isolation recovery-delay 30

!

PoA1 Config PoA2 Config

Scale Numbers - 30 ICCP Groups, 240 Bundle Interfaces, 4K VLANs, 4K Sub-Interfaces per Bundle

Page 48: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

l2vpn

nsr

redundancy

iccp group 100

multi-homing node-id 1

interface Bundle-Ether100

primary vlan 1-100

secondary vlan 101-200

recovery delay 60

bridge group bg100

bridge-domain bd100-1

mac

withdraw state-down

limit

maximum 512000

!

!

interface Bundle-Ether100.1

!

vfi v100-1

neighbor 55.55.55.55 pw-id 1001

!

neighbor 66.66.66.66 pw-id 1001

!

l2vpn

nsr

redundancy

iccp group 100

multi-homing node-id 2

interface Bundle-Ether100

primary vlan 101-200

secondary vlan 1-100

recovery delay 60

bridge group bg100

bridge-domain bd100-1

mac

withdraw state-down

limit

maximum 512000

!

!

interface Bundle-Ether100.1

!

vfi v100-1

neighbor 55.55.55.55 pw-id 1001

!

neighbor 66.66.66.66 pw-id 1001

!

PoA1 Config PoA2 Config

Page 49: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

RP/0/RSP0/CPU0:ICCPSM1#show l2vpn iccp-sm group 100 ICCP-based Service Multi-Homing Group ID: 100, State: Synchronized with Peer Local Node ID: 1, Remote Node ID: 2, ICCP State: Transport Up, Member Up Interface Name: Bundle-Ether100 MAC flushing: MVRP Recovery Delay: 60 (Timer not running) Local State: Operational Remote State: Operational Local VLAN IDs State ----------------------------------------------------------------------------- Primary: 1-100 Forwarding Secondary: 101-200 Blocked Remote VLAN IDs State ----------------------------------------------------------------------------- Primary: 101-200 Forwarding Secondary: 1-100 Blocked

Normal Operation – No Failures

Page 50: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

RP/0/RSP0/CPU0:ICCPSM1#show l2vpn iccp-sm group 100 ICCP-based Service Multi-Homing Group ID: 100, State: Provisioned Local Node ID: 1, Remote Node ID: 2, ICCP State: Transport Up, Member Down Interface Name: Bundle-Ether100 MAC flushing: MVRP Recovery Delay: 60 (Timer not running) Local State: Operational Remote State: unknown Local VLAN IDs State ----------------------------------------------------------------------------- Primary: 1-100 Forwarding Secondary: 101-200 Forwarding Remote VLAN IDs State ----------------------------------------------------------------------------- Primary: 101-200 Blocked Secondary: 1-100 Blocked

Remote PoA is DOWN (D) Failure

Page 51: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Trouble Shooting - Info to collect:

Running-config

"show l2vpn iccp-sm private”

"show iccp group <>”

"show iccp trace”

"show l2vpn trace”

“show l2vpn forwarding mstp detail location 0/x/CPU0”

“show uidb data location 0/x/CPU0 bundle-ether x.y ingress | incSTP”

Troubleshooting Information

Page 52: BRKRST-2008

Attachment Circuit A/A for Ethernet-VPN

Page 53: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

• Next generation L2 VPN to extend Layer 2 connectivity across MPLS

• Standard based approach. IETF drafts by L2VPN Work Group

E-VPN – draft-ietf-l2vpn-evpn

PBB-EVPN – draft-ietf-l2vpn-pbb-evpn

• E-VPN improves upon VPLS for high scalability, and high availability

Full-Mesh of PW no longer required

• PBB-EVPN combines Provider Backbone Bridge (PBB) 802.1ah with

E-VPN to simplify core transport and enhance scalability.

BGP

PE PE

PE PE

E-VPN and PBB-EVPN (aka Routed VPLS)

Main Principles XR4.3.2

Control-Plane Distribution of Customer MAC-Addresses using BGP

Page 54: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Supported Access Topologies

Physical Interfaces

Bundle Interfaces (shown)

55

PE1

MPLS Core

PE2

Bundle-Eth25 CE2

Bundle-Eth25 CE1

MST-AG / REP-AG

MST on N-PE

G.8032

P-mLACP

Dual Home Network (DHN)

Active / Active Per-Service LB

PE1

CE1

MPLS Core

PE2

Bundle-Eth25

Bundle-Eth25

Dual Home Device (DHD) Active / Active Per-Flow LB

VID X

VID X

PE1

CE1

MPLS Core

PE2

Bundle-Eth25

Bundle-Eth25

Dual Home Device (DHD)

Active / Active Per-Service LB

VID X

VID Y

PE1

CE1 MPLS Core

Bundle-Eth1

Single Home Device (SHD) Single Home Network (SHN)

VID X

VID X

Bundle Interface Physical Interfaces

Bundle Interfaces

(shown)

Physical Interfaces

Bundle Interfaces

(shown)

Page 55: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

AAPF – Imposition Active / Active Per Flow load balancing

PE1-A (DF:Isid 100)

CE1 MPLS Core

PE1-B (non DF:Isid 100)

PE2

CE2

• CE1 does load balancing of flows based on 5 tuples

• (SMAC/DMAC/VLAN/SIP/DIP/TCP ports / UDP ports)

• CE1 will split the customer traffic between members of same bundle

• Both PE1-A and PE1-B forward all imposition traffic

Page 56: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

PE1-A (DF)

CE1 MPLS Core

PE1-B (non DF)

PE2

CE2

DROP FLOOD to avoid Duplicates

FWD Unicast

AAPF – Disposition Active / Active Per Flow load balancing

Page 57: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

ASR9k Internal PBB Topology

Customer Network Port

(CNP)

Provider backbone

Network Port (PNP)

Core BD

Edge BD-1

Edge BD-n

VIP

CBP

EFP-1

EFP-2

EFP-x

system internal virtual port

Edge BD-2

PW

I-component

B-component VIP

VIP

PW

EFP-m

EVPN

Page 58: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Configuration: EdgeBD / I-component

Customer Network Port

(CNP)

Edge BD-1 EFP-1

BE 10.1

I-component

l2vpn

bridge group PBB

bridge-domain PBB-EDGE

interface GigabitEthernet0/0/0/38.100

interface Bundle-ether10.100

neighbor 10.10.10.1 pw-id 1010

!

pbb edge i-sid 1200 core-bridge PBB-CORE

!

PW

Page 59: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Configuration: Core BD/ B-Component

Provider backbone

Network Port (PNP)

Core BD

CBP

B-component

l2vpn

bridge group PBB

bridge-domain PBB-CORE

interface G0/5/0/10.100

!

pbb core

evpn

vpn-id 100

!

router bgp 200

nsr

timers bgp 20 60

bgp router-id 10.10.10.10

address-family l2vpn evpn

!

neighbor 20.20.20.20

remote-as 200

update-source Loopback0

address-family l2vpn evpn

EVPN Core

Page 60: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Active / Active per-FLOW Load Balancing

61

PE1

CE1

MPLS Core

PE2

Bundle-Eth25.1

Bundle-Eth25.1

ICCP in singleton mode. No

need for peer neighbor

configuration

PE 2 should use different mlacp

node id

PE2 should use same mlacp

system mac and system priority

MINIMAL Configuration

Auto ESI

Auto B-MAC SA

A/A Per-flow LB (default)

Auto RT for EVI

Auto RD for EVI

Auto RD for Segment Route

Note: MPLS / LDP configuration

required on core-facing interfaces

(not shown)

PE1

redundancy

iccp

group 66

mlacp node 1

mlacp system mac 0aaa.0bbb.0ccc

mlacp system priority 1

mode singleton

interface Bundle-Ether25

mlacp iccp-group 66

interface Bundle-Ether25.1 l2transport

encapsulation dot1q 777

Page 61: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

C-MAC table

C-MAC table is learnt on Linecard hardware when customer traffic flows

Similar to VPLS/ PBB-VPLS.

1) Edge-BD

RP/0/RSP0/CPU0:PE2#sh l2vpn forwarding bridge-domain PBB:PBB_EDGE_1011 mac-address location

(CMAC Entry learnt from edge-bd)

00bb.0100.00aa dynamic Gi0/6/0/19.101 0/6/CPU0 0d 0h 0m 17s N/A

(CMAC Entry learnt from core-bd)

00aa.0100.0000 dynamic BD id: 3 0/6/CPU0 0d 0h 0m 8s 0011.1111.0000

Page 62: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

B-MAC table

B-MAC table is learnt by BGP, via control plane. Static Entries created in MAC table.

This is new in PBB-EVPN.

2) Core-BD

RP/0/RSP0/CPU0:PE2#sh l2vpn forwarding bridge-domain PBB:PBB_CORE1 mac-address

Mac Address Type Learned from/Filtered on LC learned Resync Age Mapped to

--------------------------------------------------------------------------------

0011.0000.0000 BMAC BD id: 0 N/A N/A N/A STATIC ENTRIES (Built by control plane, does not

age out)

0011.1111.0000 BMAC BD id: 0 N/A N/A N/A

0011.1111.1111 BMAC BD id: 0 N/A N/A N/A

0022.0000.0000 BMAC BD id: 0 N/A N/A N/A

Page 63: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

PBB EVPN Scale Limits

• Max 128K Bundle EFP per system

• Max 4K EFP per bundle

• Max 64K Edge-BD per system

• Max 200 Core-BD per system

• Max 2K BMACs per system

• Max 200 PE per core-BD

Page 64: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Ethernet spoke-and-hub (MC-LAG)

L2 Ethernet Ring (MST/REP-AG, G.8032)

IP/Service Edge

IP/MPLS

L3 Router dual-homing (L3 ECMP)

Conclusion Service Multi-Homing Building blocks

Point to point L2 Service PW redundancy Multi-point L2 Service MC-LAG

Page 65: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Changing the Redundancy Paradigm Virtual Cluster and Satellite for BNG

Home

Aggregation

(MPLS)

Cluster ASR 9000

Distribution

Home

Cluster & Satellite

Core

BNG

BNG

ASR 9000

Virtual

Cluster

BNG

BNG

ASR 9000

Virtual Cluster

Satellite

Aggregation

(MPLS)

ASR 9000

Distribution

Core

• Geo Redundant Dual Homing

• High Availability

• Single-Chassis-like look&feel and

Management of cluster Members

• Stateful Failover between chassis

• Active/active LAG user facing

+ • Huge 1GE Fan-out

• Satellites appear like ASR 9000 Linecards

• Simplified topology, No Spanning tree or

other L2 redundancy protocols needed

Page 66: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Call to Action…

Visit the Cisco Campus at the World of Solutions to experience the following demos/solutions in action: Speaker to add relevant demos/areas to visit from the campus demos list

Get hands-on experience with the following Walk-in Labs Speaker to add the relevant walk in labs from the list

Meet the Engineer Speaker to specify when will they be available for ad-hoc meetings at the MTE village, and provide other recommended names…

Discuss your project’s challenges at the Technical Solutions Clinics

Attend one of the Lunch Time Table Topics, held in the main Catering Hall

Recommended Reading: For reading material and further resources for this session, please visit www.pearson-books.com/CLMilan2014

CL365 -Visit us online after the event for updated PDFs and on-demand session videos. www.CiscoLiveEU.com

67

Page 67: BRKRST-2008

© 2014 Cisco and/or its affiliates. All rights reserved. BRKRST-2008 Cisco Public

Complete your online session evaluation

Complete four session evaluations and the overall conference evaluation to receive your Cisco Live T-shirt

Complete Your Online Session Evaluation

68

Page 68: BRKRST-2008