NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% •...

28
NANOG 58 June 4 th , 2013

Transcript of NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% •...

Page 1: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

NANOG  58  June  4th,  2013  

Page 2: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

Internet2  Architecture    

Chris  Spears  

Sr.  Network  Planning  Architect,  Internet2  

Page 3: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  NaAonal  Research  &  EducaAon  Network  (NREN)  •  Formed  in  1996  by  34  research  universiAes  

–  Need  for  a  network  focused  on  needs  of  researchers    •  What  is  a  Research  Network?  

–  hSp://www.nanog.org/meeAngs/nanog52/presentaAons/Monday/Oberman-­‐NANOG-­‐Research%20Networks-­‐Final.pdf  

•  Different  set  of  needs  among  researchers  

•  “Big  Data”  –  and  moreover  “Big  Science”  –  as  driver  

•  More:  internet2.edu/about  

3  –  6/4/13,  ©  2012  Internet2  

Internet2  Background  

Page 4: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  220  U.S.  universiAes  (over  4.5M  enrollment)  •  60  leading  corporaAons  

•  70  government  agencies  

•  38  regional  and  state  educaAon  networks  (sponsored  parAcipants,  K-­‐12,  etc)  

•  >  100  R&E  partners,  represenAng  more  than  50  countries  

4  –  6/4/13,  ©  2012  Internet2  

Internet2  Community  Makeup  

Page 5: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

5  –  6/4/13,  ©  2012  Internet2  

Internet2  Network  EvoluAon  

Routed IP ServicesIPv4, IPv6, Multicast

Layer-0/1 Services(ESnet, NOAA,Sun Corridor, et al)

2006 20111999 2004

SDN Services

Future

IP Services: IPv4, IPv6, Multicast, MPLS

Layer-0/1 Dedicated Network Services

SDN Services

Evolution of Services and Architecture at Internet2

SDNIPLayer-0/1

10GEOC-192OC-48 20GE 30GE

ION

CoreDirectorDynamic Circuit Network (OC-192)

Advanced Layer-2 Service OpenFlow at 100G

NDDI OpenFlow testbed (10G)

Peering Service

TR-CPS

LHC-Open Network Environment

GENI

XSEDE

HOPI

Ciena ActivFlex 6500Infinera DTN

Page 6: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  IP  Services  –  mulAcast,  IPv6,  long  ago…  •  1999  OC-­‐48,    partnership  with  Qwest    

•  2004  OC-­‐192c  

•  2006  10G+,  partnership  with  Level(3),  Infinera  DTN  network  –  100G  (10x10)  of  capacity  allowed  growth  outside  of  IP  services  –  IP  conAnued  to  grow,  upto  30G  inter-­‐node  capacity  –  New  topologies  to  support  research  and  experimentaAon  

–  Peering  service  •  2010  NTIA  BTOP  award  

–  2011  began  building  current  opAcal  infrastructure  –  Supports  even  greater  scale  of  services,  networks,  and  applica&ons  

6  –  6/4/13,  ©  2012  Internet2  

Internet2  Network  EvoluAon:  IP  Services  

Page 7: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  15,717  route  miles  of  dark  fiber  (predominantly  Level3,  Zayo)  –  51  opAcal  add/drop  sites  (and  growing)  –  341  opAcal  faciliAes  across  U.S.A.  

•  Ciena  AcAvFlex  6500  planorm  –  50GHz  ITU  grid  spaced,  88-­‐channels,  DIA  (direcAonless)  in  metros  

•  Partnered  with  ESnet  at  Layer-­‐1  •  100G  penetraAon:  

–  172  100G  coherent  DP-­‐QPSK  transponders  deployed  –  Core  L2/L3  interfaces:  >70  (sAll  adding  nodes  &  links)  

•  144  40G  transponders  –  solely  for  OTU  mulAplexing  of  10GE  •  100GE  Firsts  

–  TransconAnental  (N.A.)  -­‐  October  2011  –  TransatlanAc  –  June  2013  

7  –  6/4/13,  ©  2012  Internet2  

Internet2  OpAcal  Network  Today  

Page 8: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Custom  Network  Infrastructure  supporAng  science  &  research  –  ESnet    -­‐  hSp://es.net/  –  NOAA  -­‐  hSp://noc.nwave.noaa.gov/  –  GENI  -­‐  hSp://www.geni.net/  –  LHC-­‐Open  Network  Environment  -­‐  hSp://lhcone.net/  

–  XSEDE  -­‐  hSps://www.xsede.org/  •  Shared  Infrastructure  partnerships  

–  Dark  fiber,  spectrum,  OTU  mulAplexers  

•  Dedicated  Infrastructure  for  Internet2  networks  

8  –  6/4/13,  ©  2012  Internet2  

Internet2  Network  EvoluAon:  Layer-­‐0/1  

Page 9: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  History  of  experimenAng  with  new  technologies  •  Dynamic  network  services,  driven  by  sorware…..  a.k.a.  SDN  

–  Hybrid  OpAcal-­‐Packet  Infrastructure  (HOPI)  –  GENI    -­‐  Slice-­‐able,  experimental  network  substrate  

–  Dynamic  Circuit  Network  (DCN)  –  •  22  Node  OC-­‐192  network,  Ciena  CoreDirector  

–  ION  –  Internet2-­‐ON  demand  circuits  •  Dynamic  pseudowires,  speaks  OSCARS  IDC  protocol  

–  NDDI  OpenFlow  testbed    –  Advanced  Layer-­‐2  Service  (discussed  later  in  this  presentaAon)  

•  100GE  backbone,  18  nodes;  25  by  end  of  summer  2013  

•  Built  as  an  Open  Exchange  

9  –  6/4/13,  ©  2012  Internet2  

Internet2  Network  EvoluAon:  SDN  

Page 10: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

10  –  6/4/13,  ©  2011  Internet2`  

Page 11: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

11  –  6/4/13,  ©  2012  Internet2  

International Network Connections

20-30G (Nx10GE) IP/MPLS backbone10GE Peering Service Network

100GE AL2S backbone

Advanced Layer-2 Service NodeAdvanced Layer-3 Service NodeCommodity Peering NodeInternational Peering Point

ESnet

ESnet

Europe(GÉANT)

30G

50G

WIX

MAN-LAN

Starlight

Canada(Canarie)Europe

Trans-Atlantic100GE Trial

Canada(Canarie)

AMPath

ESnet

PacificWave

KAREN

CERNET

LondonOpenExchange

Europe(GÉANT)

AARNet

TransPacPacificWave

South &Central America

AtlanticWave

Asia-PacificAsia-Pacific

Tulsa

Dallas

Houston#2

Houston#1

Denver

Salt Lake City

Portland

Seattle

Columbia

Kansas City

Atlanta

Jacksonville

WashingtonDC

Ashburn

Raleigh

ChicagoCleveland

AlbanyBoston

NYC

Phoenix

Sunnyvale

El Paso

Internet2 AL2S/AL3S/TRCPS/International 2013

Los Angeles

Jackson

Minneapolis

Page 12: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

SDN  at  Internet2  

Edward  Balas  

Manager,  Sorware  Engineering,  Indiana  University  GlobalNOC  

Page 13: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Historic  projects  have  pushed  for  programmaAc  network  control  –  HOPI    –  ION    –  NDDI    –  AL2S    

•  MoAvated  by  desire  to  more  quickly  create  new  virtual  networks  –  Give  members  ability  to  directly  create    

–  Remove  unneeded  provisioning  delays  

–  Concerned  about  quality  and  control  •  The  historic  use  case  =  NaAonal  Exchange  Fabric  

13  –  6/4/13,  ©  2012  Internet2  

Origins  of  SDN  at  Internet2  

Page 14: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Key  Ingredients  –  Big  Pipes(100g)  with  minimal  aggregaAon  

–  Open  the  network  stack  for  non-­‐vendor  driven  innovaAon  –  Domain  expert  involvement  in  developing  new  services  

–  Means  to  separate  experiment  and  producAon  

•  Goal  –  Create  an  improved  experience  for  R&E  Users  

–  We  want  to  find  applicaAons  that  beSer  fill  the  pipes  with  science  

–  Make  it  easier  to  move  data  so  folks  can  focus  on  discovery  

•  Enabler  –  OpenFlow  1.0  today,  1.3  someday  

–  Any  cross  planorm  SDN  techniques  we  can  find  in  future    

14  –  6/4/13,  ©  2012  Internet2  

Internet2  InnovaAon  Planorm  

Page 15: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  TestLab  –  Mixed  vendor  8  switches  and  6  test  PCs  –  MEMS  switch  to  control  layer1  topology  –  Jenkins  based  test  automaAon  system  

•  NDDI  –  5  NEC  PF5820  switches  –  10GE  core  –  Ring  Topology  

•  AL2S  –  15  Brocade  MXLe-­‐16,  3  Juniper  MX960  –  100GE  core  –  ParAal  mesh  topology  –  OESS  used  to  provide  point  and  click  provisioning  

15  –  6/4/13,  ©  2012  Internet2  

InnovaAon  Planorm  

Page 16: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

16  –  6/4/13,  ©  2012  Internet2  

Page 17: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Just  a  playground  –  We  do  encourage  responsible  experimentaAon  

–  It  is  an  involved  process  to  get  into  the  AL2S  network  •  Just  a  testbed  

–  We  do  at-­‐scale  operaAon  of  OpenFlow  Apps  •  Some  are  experimental  

•  Others  are  considered  producAon  grade  –  There  are  risks  that  experiments  will  interfere  with  producAon  traffic  

–  We  try  to  manage  risk  with  technology  and  policy  

17  –  6/4/13,  ©  2012  Internet2  

What  the  InnovaAon  Planorm  is  NOT  

Page 18: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Running  2  separate  producAon  and  research  planorms  too  costly  •  Goal  

–  Run  a  producAon  planorm  with  a  virtual  SDN  net  built  on  top  –  Support  mulAple  simultaneous  applicaAons  /  controllers  –  Minimize  trust  placed  in  applicaAons  

•  Approach  –  Separate  flow  control  by    Switch  /  Port  /  Vlan  Tag  –  Use  FlowVisor  etc  to  logically  “slice”  or  parAAon  the  network  –  Each  app  gets  a  limited  and  non-­‐overlapping  “flowspace”  –  Customers  define  which  apps  can  control  their  port’s  flow  space  –  Traffic  Engineering  a  concern  in  some  cases  

•  ImplementaAon  –  EvaluaAng  FlowVisor  –  Exploring  other  opAons  including  use  of  overlay  networks  

18  –  6/4/13,  ©  2012  Internet2  

MulA-­‐Tenancy  a  key  feature    

Page 19: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  hSp://www.internet2.edu/network/innovaAve-­‐applicaAon-­‐awards.html  

•  Goal  –  Encourage  development  of  SDN  applicaAons  

–  Improve  scienAfic  data  movement  at  100G  

–  Engage  *.edu  to  developing  network  scale  applicaAons  •  Sponsored  by  Juniper,  Ciena  and  Brocade  

•  Modest  (up  to  10k)  cash  prize  to  support  effort  •  Apps  must  work  on  AL2S  and  be  licensed  modified  Berkeley  

19  –  6/4/13,  ©  2012  Internet2  

Internet2  InnovaAve  ApplicaAon  Award  

Page 20: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

20  –  6/4/13,  ©  2012  Internet2  

OpenFlow  Issues  and  Lessons  Learned  

Page 21: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  For  last  6  months,  *including*  maintenance  windows  –  99.69%  for  circuits  –  99.25%  for  nodes  

•  Single  worst  node  event  was  25  hr  outage  –  Bug  in  controller  related  to  corner  case  –  Only  alarm  triggered  was  ISIS  adjacency  alarm  

–  Prolonged  by  iniAal  miss-­‐diagnosis  

•  Circuit  availability  issues  –  Having  100G  opAc  issues  with  some  vendors  

–  Non-­‐trivial  number  of  opAcal  system  upgrades  during  this  period  

21  –  6/4/13,  ©  2012  Internet2  

Availability  for  last  6  months  

Page 22: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  ParAal  support  for  specificaAon  –  Match  and  act  on  both  layer2  and  layer3  –  Proper  barrier  support  –  Support  for  acAons  in  hardware  

•  Stability  problems  –  Various  issues  

•  Performance  issues  –  Port  down  event  generaAon  

•  >  1.5  sec  for  some!  

–  Modify-­‐State  processing  speed  •  ~100  /  sec  

–  Total  number  of  supported  rules  •  ~2,000  

22  –  6/4/13,  ©  2012  Internet2  

Vendor  Issues  

Page 23: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  OpenFlow  1.0  is  not  the  best  protocol  –  Too  much  ler  to  vendor  interpretaAon  

•  Inherent  DoS  risks,  if  you  don’t  trust  your  north  bound    –  No  rate  limits  on  packet  in  –  No  rate  limits  on  packet  out  –  Table  space  exhausAon  

•  Feature  set  lacking  to  replicate  exisAng  services  –  No  viable  QoS  –  No  TTL  decrement  –  No  push  /  pop  VLAN  or  MPLS  tags  

•  ReacAng  to  network  events  requires  controller  round  trip  –  Fast  Failover  Port  groups  in  1.3  should  be  a  win  

23  –  6/4/13,  ©  2012  Internet2  

Protocol  Issues  

Page 24: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Vendor  interacAon  sAll  fairly  intense  •  Perform  full  system  tesAng  when  we  get  a  new  code  revs  

–  Vendor  code    •  3  vendors  ,  6  total  releases  •  20  –  50  hours  per  test  

–  ApplicaAon  Code  •  1  vendor  (us),  4  releases  •  30  -­‐  40  hours  per  test  

–  Hypervisor/slicer  code    •  1  vendor,  2  releases  •  20  –  50  hours  per  test  

•  More  than  50%  of  lab  Ame  is  spent  helping  vendors  •  At  least  50%  of  an  FTE  

24  –  6/4/13,  ©  2012  Internet2  

TesAng  effort  for  last  6  months  

Page 25: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Today  use  central  controller  cluster  over  dedicated  management  network  –  side  band  on  the  OSC  channel    –  Limited  bandwidth    

•  Management  network  disrupAons  impact  OpenFlow  operaAon  –  If  shared  fiber  plant,  OpenFlow  restoraAon  blocks  on  management  

network  restoraAon  

–  Traffic  conAnues  to  flow,  just  black  hole  on  failed  link  

–  Distributed  controller  architecture  can  help    •  Requires  you  mimic  a  rouAng  protocol  to  avoid  dependency  

–  Port  groups  in  1.3  can  also  help  

25  –  6/4/13,  ©  2012  Internet2  

Management  Network  

Page 26: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Robust  WAN  capable  apps  are  hard  •  There  is  a  reason  for  separaAng  IGP  from  EGP  

•  Do  WAN  apps  need  to  control  the  interior  path?  –  If  yes,  do  you  trust  to  developer  to  perform  TE  

–  If  no,    how  do  you  constrain  bandwidth  •  ConsideraAons  

–  Ability  to  funcAon  with  parAal  management  network  disrupAon  

–  RestoraAon  performance  

–  System  complexity  and  cost  of  operaAon  /  tesAng  

26  –  6/4/13,  ©  2012  Internet2  

WAN  OpenFlow  ApplicaAon  Architecture  

Page 27: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

•  Working  together  to  develop  beSer  tesAng  regiments  •  MigraAng  to  1.3  to  get  sought  arer  features  

•  Developing  beSer  sw  ecosystem  –  Truly  distributed  controllers  –  Standard  north  bound  interfaces  

•  Refining  our  operaAons  capability  –  BeSer  monitoring  and  troubleshooAng  

•  What  is  the  crar  interface  to  an  OpenFlow  device  or  app?  

–  OperaAons  support  team  structure  

–  With  WAN  mulA-­‐tenancy,  where  to  Engineer  Traffic?    

27  –  6/4/13,  ©  2012  Internet2  

Future  Challenges  

Page 28: NANOG58 June4 th,2013...• IP%Services%–mulAcast,%IPv6,%long%ago…% • 1999%OCX48,%%partnership%with%Qwest% • 2004OC192c • 2006%10G+,%partnership%with%Level(3),%Infinera%DTN

Ques<ons?  

28  –  6/4/13,  ©  2012  Internet2