Purchase the MEF-CECP Exam Today!
Home

Login/Register

Recent Members

Online Users

  • Daniel Bar-Lev
  • Hieu Hieu
  • Donamark Mirabel
  • Ayal Lior
  • Eng. Anuradha Udunuwara
5 user(s) and 870 guest(s) online | Show All
Today
Ayal Lior added a new wall post in the group, Operations, Administration & Maintenance 04:43 AM
Yesterday
Ravi Prakash Vaish added a new wall post in the group, Operations, Administration & Maintenance 04:10 PM
Francesco Fucelli and Richard Strike are now friends 02:33 PM
 
Follow us on Twitter
Ethernet Service Protection Print E-mail
(16 votes, average: 4.31 out of 5)
Papers - Ethernet Academy Articles
Monday, 17 January 2011 00:00

Ethernet Service Protection

Over External Interfaces

Written by Zehavit Alon

 

Content Disclaimer

1 Preface

1.1 Abstract

Over the last few years, packet switching technologies, such as Ethernet and MPLS, have been widely adopted by the industry as the technologies of choice to replace existing networks based on circuit-switching technologies, such as SONET/SDH and ATM.

Circuit-switched network technologies provide mechanisms that guarantee sub-50ms service protection for their subscribers. A similar protection level should be provided by the different packet technologies when the identical set of capabilities formerly granted to the subscriber is required, in order for packet technologies to seamlessly replace circuit technologies.

The various packet technologies define different ways of protecting a service within the network. However, all the currently defined local protection methods are intra-network only and there is no standard method for protecting services against node failures over external interfaces, i.e. for protecting connectivity between attached networks.

Figure 1 illustrates a theoretical packet based network where one of the services that span several service providers is emphasized.

 
The goal of this paper is to introduce a mechanism aiming at providing a robust solution for the missing protection capabilities. This paper describes the following: (i) Overview of the current state, (ii) required functionality that does not exist as yet and (iii) a proposed mechanism for protection between domains, the Inter Network Service Protection (INSP)

1.2 Abbreviations

  • EI - External Interfaces
  • ENNI - External Network to Network Interface
  • INSP - Inter Network Service Protection 
  • IZ – Interconnection Zone
  • LAG – Link Aggregation 
  • SG - Service Gateway 
  • UNI - User to Network Interface
  • VLAN – Virtual LAN


1.3 Definitions

This section describes the definitions used throughout this paper (Some definitions are accompanied by a figure to clarify the definition).

  • Border Node – A node that resides at the edge of the network and connects it to another network. MEF external interfaces connect border nodes of adjacent networks.

Figure 2: Border Nodes

  • Subscriber- The organization purchasing and/or using Ethernet Services
  • Domain -A collection of network elements within a common realm of address space which are operated as separate administrative network  and are managed by a single Operator
  • ENNI, ENNI-Nx– ENNI is a reference point representing the boundary between two operator networks that are managed as separate administrative domains. ENNI-Nx is a separate, administered, functional component supporting the ENNI between a network X and a second network. 
  • External Interfaces (EI) – Interfaces between domains (ENNIs)  and interfaces between a domain  and a subscriber premise (UNIs) , 
  • External Link – A link connecting the border nodes of attached domains. Can be physical or logical i.e. can be a LAG

Figure 3: External Links

  • Interconnection Zone (IZ) – A realm that includes the border nodes of two connected domains and all the links connecting these border nodes (internal and external) that participate in the service protection mechanism

Figure 4: Interconnection Zone

  • Internal Link – A link that connects two border nodes within the same service portal. Can be physical or logical i.e. can be a LAG

Figure 5: Internal Links

  • Inter Network Service Protection (INSP) – A mechanism for providing service protection over external interfaces
  • Network – A number of domains connected to each other 
  • Operator– An entity  that offers connectivity and transport services to service providers for their subscribers
  • Service – Ethernet MAC oriented connectivity and the delivery of Ethernet PDUs presented across internal and external interfaces. The service is designated by a VLAN. The VLAN at the UNI is C-VLAN, and at the ENNI is S-VLAN (The protection mechanism defined in this document can support frames with B-VLAN). For the sake of simplicity, this term is used throughout this document to represent both a single VLAN and a group of VLANs that are handled together and not as individual VLANs. 
  • Service Portal – A logical entity that groups all the border nodes inside a domain which protects a specific service. A service portal in one domain is connected to exactly one other service portal in another domain. Portal – A is a logical entity that gathers all the service portals of a domain facing the same IZ, i.e. a portal includes all the border nodes facing the same IZ.

Figure 6: Service portal and Portal

  • Working path - A path configured to transport user traffic under normal conditions when there are no failures disrupting the traffic flow. The working path is a configuration parameter and, as such, it continues remains the working path even when traffic is switched to another path. The status of the working path may be active – traffic flows through the path, or standby –traffic does not flow through the path.
  • Protection path –A path configured to transport user traffic when the working path fails. When the working path is active, the status of the protection path is standby, and when the protection path is active, the status of the working path is standby. Only one path is active at a time.
  • UNI, UNI-C, UNI-N – Physical interface/demarcation point between the service provider and the subscriber. It also represents the service start/end points. UNI-C is the subscriber side of the UNI, while UNI-N is the network side of the UNI owned by an operator.

 
1.4 Introduction

As the communication infrastructure nowadays has a global reach, a service may extend across a network built of several domains, each of which may employ a different technology and a different protection mechanism.

Domains are connected by external interfaces (EIs). The MEF has defined two major types of EIs – User to Network Interface (UNI) and External Network to Network Interface (ENNI). Service frames transit both UNI and ENNI interfaces. To enable end-to-end service protection capable of protecting against multiple failures, each domain should protect the service locally. In addition, the service should be protected in the area between the domains, that is, in the interconnection zone where the networks are connected. I.e. a mechanism for protecting services over the UNIs between the subscriber premises and the service provider domain, and over the ENNIs where different service provider domains are attached to each other is needed.

While different local protection mechanisms exist and provide protection within domains, no similar standard mechanism exists to protect services between EIs.

It is desirable that any protection mechanism for the interconnection zone limits the effects of component failure or performance degradation with the result that remote parts of networks remain unaffected by these events.


1.5 External Interface protection state as of 2H 2010

Currently, link protection is the only External Interface protection type the MEF explicitly specified in its technical specifications and its implementation agreements (for ENNI - MEF 26 and for UNI - MEF 20). Other mechanisms, although not precluded, are not explicitly defined.

The Link Aggregation Group (LAG) is the link protection mechanism defined by the MEF for External Interfaces. LAG is an IEEE standard, initially published as 802.3ad in 2000, and re-published as 802.1AX in 2008. It enables two nodes (referred to as partners) to connect with several links, which are treated as a single logical link, and to provide (a) load sharing and (b) protection of individual links. Traffic is distributed between all available links of the LAG. In the event of link failure, the traffic from the failed link is moved to one of the other available links of the LAG.

For External Interface protection, the standardized MEF definition support only 2 links allowing a LAG to be used in an active standby configuration and not in a load-sharing configuration. This is because the MEF stipulates that service data and service monitoring shall share the same fate, while in the load-sharing LAG configuration, it is possible that service monitoring shall use a different link than the one used by the service data.


2 Service protection over external interfaces

Different technologies employ different techniques in order to provide intra-domain protection; however, there is no standard way to protect the area between domains. As a mechanism is required for service protection in the interconnection zone, it is necessary to analyze the criteria that such a mechanism should comply with, and define its properties.

2.1 Criteria

Any mechanism providing service protection over external interfaces should stand up with the following criteria:

  • It should protect against any single failure or degradation of a link or a node in the IZ. 
  • The protection time should be less than 50ms (preferably, this time includes failure detection). As the de facto standard of the industry for protection time is sub 50ms, this protection mechanism should not be regarded as an exception and it should comply with this value. 
  • It should operate at the service level so that the resources used for protection can be efficiently used. The mechanism should be able to protect the following services (1) subscriber services (for UNI), (2) provider services (for ENNI) and (3) backbone services (for future support of backbone services in the ENNI).   
  • It should ensure that a service enters and leaves a domain via one border node to enable MAC learning and to prevent frame duplications.
  • It should ensure that the service is co-routed in the interconnection zone, i.e. that it uses the same route in both directions Traffic belonging to each service should use the same route to ensure that a service will not be split between interfaces.
  • It should isolate a link failure when possible (i.e. when an alternate route between the two border nodes connected by this link exists), and prevent propagation of a failure to the attached domains. 
  • It should provide a clear indication of the protection state of each service. The indication should show which node and links the service uses at any given moment, and whether a protection switching event occurred.
  • It should be an independent mechanism, i.e. it should not depend on the domain capabilities and there should be no need to modify a protocol or any protocol configuration running inside each of the interconnected domains in order to enable protection in the IZ.
  • It should maintain an agnostic approach regarding the domain technology running in each of the interconnected domains, and the protection mechanism deployed by each of the interconnected domains.
  •  

2.2 Properties

As the External Interfaces defined by the MEF are Ethernet interfaces, it is extremely important that the protraction mechanism complies with the basic attributes defined for Ethernet. IEEE defines a list of properties that each frame must comply with in order to enable the correct operation of the Ethernet technology. These properties are listed in IEEE802.1Q section 6.5.

The following parameters, which are a sub list of the one defined by IEEE, should be taken into consideration when defining the protection mechanism. (The properties that are part of the original IEEE and are not part of the sub list has little relevance to the protection mechanism and thus are not listed below):

1. Service availability – Service availability is measured as the fraction of the total time during which the MAC service is provided. Service availability is decreased when there is a failure along the data path, and is increased when this failure is overcome. One way to increase service availability, in addition to the existing methods, is to introduce a mechanism to overcome failures in the IZ.

2. Frame loss – The MAC Service does not guarantee the delivery of Service Data Units. It is highly probable that frames transmitted by a source station will reach the destination station in an uncorrupted state. Any mechanism that increases service availability in the IZ should not cause a significantly increase in frame loss.

3. Frame misordering – The MAC Service permits a negligible rate of reordering of frames with a defined priority for a specific combination of destination addresses and source addresses, transmitted on a defined VLAN. Any mechanism that increases service availability in the IZ should not introduce additional frame misordering.

4. Frame duplication – The MAC service permits a negligible rate of frame duplication. The operation of bridges introduces a negligible rate of duplication of user data frames. Any mechanism that increases service availability in the IZ should prevent additional frame duplication, and should ensure that a frame is received only once by any of its destinations.

5. Transit delay – The MAC service introduces frame transit delay that is dependent on the particular media and MAC method employed. Frame transit delay is the time that elapses between transmitting a frame and receiving it. Any mechanism that increases service availability in the IZ must not significantly increase the transit delay.

6. Maximum Service Data Unit Size – The Maximum Service Data Unit Size varies according to the MAC method and its associated parameters (speed, electrical characteristics, etc.). The Maximum Service Data Unit Size, supported by a bridge between two LANs, is the smallest size supported by the LANs. Any mechanism that increases service availability in the IZ should refrain from increasing the Service Data Unit Size.

7. Priority – The MAC service includes priority as a Quality of Service parameter. Frames with high priority may be given precedence over other frames which were received earlier. Any mechanism that increases the service availability in the IZ should handle the frames according to their original priorities and should not modify them.

 

3 Protection Mechnism

A protection mechanism for the IZ should operate on nodes in the border of each domain and on links connecting these border nodes. It should be able to detect failures to overcome these failures. Such a mechanism should also comply with the criteria and Properties defined in‎2.1and ‎2.2.

The protection mechanism should be able to operate in any External Interface and hence should be agnostic to the type of VLAN (C-VLAN, S-VLAN)


3.1 Failure detection

Failures in connectivity and degradations in service performance can be detected by physical means and by OAM methods defined by IEEE 802.1ag and ITU-T Y.1731.


3.2 Interconnection Zone structure

The composition of the IZ, i.e. the number of border nodes, internal links, and external links, and the way border nodes are connected to each other inside the service portal and between service portals of attached domains may vary in different domain configurations.

The border nodes protecting a specific service in the domain, facing a specific attached domain, are grouped in a logical entity called the service portal. (One or more service portals can be contained within one interconnection zone). The number of nodes in each service portal greatly influences the level of protection and the quality that a service is granted. A service portal containing a single node can provide only link protection and can not protect against node failure. A service portal containing two border nodes can provide better protection, as it can provide both node and link protection. Additional nodes may be added to a service portal. However, the author of this document is of the opinion that this is unnecessary, since two border nodes can provide the required functionality.

Theoretically, any topology can be used for the IZ. In some cases, one topology may be more advantageous than others, while in other cases one of the other topologies may better suit the provider's needs. The topology of choice for an operator varies according to the services provided, the network span, the connectivity between domains and the distance between significant elements in the network. Different parameters influence the number of border nodes in the service portal and the number of nodes influences the number of links in the IZ, as border nodes in each service portal may or may not be connected to each other by internal links within the service portal, and may be connected in many ways to border nodes in the service portal of the attached domain.

In addition, the distance between border nodes in the same service portal influences the length of internal links, while the distance between border nodes in adjacent service portals influences the length of external links. All of these variables have a meaningful impact on the chosen topology.

The selected topology affects the level of protection a service is granted i.e. the delay and the propagation of faults occurring in the IZ.


In the next sections different topologies, built of service portals comprising one or two border nodes in each domain are described. The generalization to service portals comprising more border nodes seams intuitive


3.2.1 Ring topology

In a ring topology, each border node is connected to no more than two other border nodes;
Depending on the needs, the ring may include different number of nodes. It may include one border node in one domain and two border nodes in the attached domain, creating a three-node ring, or two border nodes in each domain, forming a four-node ring. The three-node ring provides link protection and partial node protection, while the four-node ring provides link protection and full node protection. Figure 7depicts a three-node ring connecting two adjacent domains where the protection is partial. Figure 8depicts a four-node ring connecting two adjacent domains where the protection is

3.2.2 Hourglass topology

Hourglass topology comprises border nodes connected by external links only. (The border nodes inside the service portal are not connected by an internal link). In an hourglass topology, each border node of a service portal in one domain is connected to all border nodes of the service portal in the attached domain. This connectivity provides full protection in the four nodes construct; however, every instance of component failure (including external link failure) followed by protection switching is propagated to the attached domain as it causes a change to the nodes that handle the service.

Figure 9 depicts three-node, hourglass connectivity providing partial protection. Figure 10 depicts four-node, hourglass connectivity connecting two adjacent domains which provides full protection.

Figure 9: Hourglass topology with three border nodes

Figure 10: Hourglass topology with two border nodes in each domain

3.2.3 Full-mesh topology

In a full-mesh topology, each border node is connected to all other border nodes in the IZ. Every border node connected to all border nodes in the attached service portal by external links, and to all border nodes within the service portal by internal links. This topology is capable of isolating a link failure ensuring that it is handled within the IZ and is not propagated outside of it. Figure 11depicts a configuration with three-node, full-mesh connectivity where two adjacent domains are connected with partial protection. Figure 12 depicts four-node, full-mesh connectivity where two adjacent domains are connected with full protection.

Figure 11: Full-mesh topology with three border nodes

Figure 12: Full-mesh topology with two border nodes in each domain

 

4 Inter Network Service Protection mechanism

One possible way in which the required protection capabilities can be provided to all external interfaces is to use the proposed Inter Network Service protection (INSP) mechanism (explained below). The proposed mechanism ensures that service traffic will always have a path through the IZ and will be able to reach the attached domain while complying with the criteria and needed properties. The proposed INSP mechanism supports all the constructs mentioned in the previous sections.

The following sections and figures describe the terms and the functionality of the different entities comprising the INSP mechanism.

4.1 INSP principle of operations

The INSP is responsible for ensuring that service frames are safely delivered between attached domains and it complies with the Ethernet characteristics such as in-ordered delivery and no duplications.

Each domain allocates a service portal to each service that should be safely delivered between connected domains. When a service portal include more than one border node, node protection can be achieved in addition to link protection. Connectivity interruption is minimized and frame loss will be the least possible.

At any point in time, only one of the border nodes in a service portal conveys traffic from the domain to the IZ and from the IZ to the domain. This border node is called the service gateway. The SG is dynamically elected by the service portal according to the status of the domain and the IZ to which it belongs. Since the domain is dynamic, the SG may change when entities inside the service portal fail and recover. As the service portal holds only one SG at any given moment it, it is guaranteed that service frames enter from the domain to the IZ and from the IZ to the domain by a single border node. As Ethernet aims to prevent frame duplication as much as possible, it is very important to keep this goal in sight when protecting an EI. Frames that are received by all border nodes other than the SG are dropped, regardless of whether they were received from the IZ or from the domain. Figure 13 depicts an example of a service and its SGs in Operator A and in Operator B.

Figure 13: Service Gateway

The SG delivers the service to the IZ and receives the service from the IZ on one link only at any given time. Operating with one SG and one link ensure that service frames are not duplicated. To ensure that service traffic is co-routed inside the IZ (to enable learning), one of the two SGs is responsible for selecting the link over which traffic is conveyed to the attached domain (This SG is referred as the initiating SG).  If both SGs were to select the link, the path could diverge and transmitted and received traffic would not use the same path in the IZ. The second domain expects to receive service frames over a specific link and does not need to select a link for that service.

If the selected link directly connects the SGs of the attached domains, traffic received by the SG from the IZ will be forwarded directly by the SG to its domain. Figure 14 shows an example of two attached domains where the SG in one domain is connected directly to the SG of the attached domain.

Figure 14: Directly connected SGs

If the selected link is connected to a border node that is not an SG, this border node will redirect received traffic to the SG in its service portal, and the SG will transmit it to the domain. Figure 15 and Figure 16 depict examples of attached domains where the SG in one domain is not directly connected to the SG of the attached domain. These examples show the SG connected through an intermediate border node via an internal link to the SG of the attached domain.

Figure 15: SGs in mesh topology not directly connected

 

Figure 16: SGs in ring topology not directly connected

As the INSP operates per service, it is possible to define specific border nodes as the initiating SGs (select the active link) for some services, and as the SGs that accept the selected link for other services. Load sharing can be achieved by defining different border nodes as SGs for different services.

The health of the IZ components is monitored by an OAM messages sent frequently enough to comply with the protection time restriction (Property ‎5 and ‎1 above). When a failure or degradation is detected, the protection mechanism is responsible for selecting an alternate link for the affected services. If the failed component is not the initiating SG, the initiating SG selects another link to transport the interrupted service traffic. If the initiating SG fails, a new SG is elected by the service portal and the new elected initiating SG selects a link. A failed SG is overcome by electing an alternate SG. A failed link is overcome by selecting an alternate link.

As Ethernet aims to prevent frame missordering as much as possible (Property ‎3 above), it is very important to keep this goal in sight when protecting an EI. The SG (which is the only border node that handles a service) ensures that only one port is allowed to send and receive service frames at any given moment from the IZ and to the IZ.  The order of operation when selecting the link over which the service will be conveyed is as follows: first disable the port that currently receives the service frame and only then enable the new port. With this paradigm of "break-before-make", frame ordering is preserved as frames are handled from one queue at a time.  This method is rapid (since each border node acts independently from other border nodes and closes its active port prior to opening a new port for the service).

One of the goals protection mechanism should fulfill is to minimize the effects of a failure in the IZ on the attached domains. This goal can be fulfilled for link failures when the IZ contains internal links that can be used to bypass a failed link without changing the SG, and as the SG is not changed, the attached domain is not affected.

Figure 17 depicts a scenario where the topology is full mesh. The defined path in this example is A1, A1-B2, B2 (meaning border node A1 – which is the SG of operator A's service portal, the external link directly connecting A1 to B2, and B2 which is the SG of operator B's service portal). When the external link A1-B1 fails, a new path is established A1, A1-B1, B1-B2, and B2.  A1 can only use the alternate, external link connecting A1 to B1. Traffic is therefore sent by A1 on this link. Traffic sent by A1 is received by B1, which is not the SG in its service portal; therefore B1 seamlessly relays the traffic over the internal link connecting B1 with B2 to B2, which is the SG of the service portal in this domain. Although the path before and after the failure differs, the SGs in both service portals were preserved and not changed.  The new route that replaces the failed route uses another external link, an additional border node and an additional internal link, but there is no change regarding the domains, as the SG in both domains remains unchanged. This functionality is beneficial as it isolates the link failure in the IZ in a way that prevents the propagation of failure effects outside the IZ. On the other hand, this functionality is only supported when internal links exist between border nodes in the service portal and can relay received traffic to the SG in the service portal.

Figure 17: Bypass of external link failure in full mesh topology

 Figure 18 also depicts a scenario where an external link failed, but the SG is preserved in ring topology.

 Figure 18: Bypass of external link failure in ring topology

 When possible, a bypass inside a service domain is created by the service portal. The attached service domain plays no part in this functionality. Each domain determines by itself, whether and when to use the internal links for bypass functionality. (This differs from other protection mechanisms where usage of an alternate path is determined by the edge nodes and not by the intermediates nodes.)

The initiating SG selects the link that will be used for the services it supports. As the main task is to transfer traffic between domains, and one of the requirements from an EI protection mechanism is to minimize the additional delay introduced by the protection mechanism (Property ‎5 above), the preferred link is one of the external links which directly connects the two SGs of the attached domain.  If no such external link is available, another external link is selected. If there is no available external link an internal link is selected, assuming that the other border node in the service portal is connected to the attached domain and will be able to transfer the service to the attached domain, where it will eventually reach the SG.

The INSP mechanism follows all the Ethernet functionality as defined by 802.1Q and its amendments. It does not modify the frames size in order to perform the protection switching (Property ‎6 above), and its queuing functionality is also not modified so it continues to handle priority as defined (Property ‎7 above).


4.2 INSP advantages

1. Provides a full solution for protection of external interfaces and does not rely on non standard functionality

2. Stand alone mechanism, as intra-domain protocols, mechanisms and configurations should not be modified to support the INSP. All protocols work as usual and do not require modification.

3. Minimize frame duplication

4. Minimize frame misordering

5. Guarantees the prevention of  fault propagation beyond the IZ in all topologies that support this functionality 

6. Scalable, as all the protected services are monitored by one message and the protection state is coordinated by one message for all aggregated services.  

7. All services can be supported by one dedicated, internal link without encapsulation. When the internal link is shared by the IZ and the domain, i.e. the domain uses this link for relaying traffic inside the domain, traffic belonging to the IZ need to be distinguished from the intra-domain traffic. All the IZ traffic can be tunneled in one tunnel and do not have to have a dedicated tunnel per service on the internal link (physical or logical) for each and every service. 

8. One protocol is used on all links – external and internal and the intention is to make it standard. As the same (standard) protocol is used inside and outside the portal, the mechanism allows interoperability, since it is capable of operating in a mixed environment both inside the portal and between portals i.e. it does not assume that a single vendor will be used.

9. Supports topologies such as mesh (full and partial) and ring

10. Supports physical ports as well as LAG, so that service BW can be increased as the service grows

11. Provides simple network expansion through the addition of a domain as service protection using INSP only requires the configuration of border nodes

12. Provides simple indication on the protection state


5 Suffix


5.1 Current standardization status

The IEEE802.1 working group has been discussing this issue since the middle of 2009. It is highly probable that a new project dealing with EI protection will be initiated in the first half of 2011.

The MEF is working on defining requirements for protecting ENNI; this project is currently in the ballot phase.


5.2 Summary

It is evident to the industry that a protection mechanism is required in the IZ. The IEEE is currently discussing possible solutions that will provide a standard mechanism. The MEF is also working on defining requirements for external interfaces protection. Such a mechanism will make networks more robust and ensure that the degree of availability defined in the service level agreement is maintained. A standard protection mechanism will enable service providers to purchase equipment from several vendors and install it in the network boundaries; providers will therefore be better placed to negotiate with the vendors. In addition, it will enable gradual upgrades and replacements of single border nodes to be performed, and the uniform management of all installed equipment.

5.3 References

  1. IEEE  802.1Q-2005- Virtual Bridged Local Area Networks
  2. IEEE 802.1ag - Connectivity Fault Management
  3. ITU-T Y.1731 - OAM functions and mechanisms for Ethernet based networks
  4. MEF 20 - User Network Interface (UNI) Type 2 Implementation Agreement, July 2008
  5. MEF 26 - External Network Network Interface (ENNI) – Phase 1, January 2010
  6. MEF 10.2 - Ethernet Services Attributes Phase 2

 

Written by:
Zehavit Alon
 
Trackback(0)
Comments (0)add
You must be logged in to a comment. Please register if you do not have an account yet.

busy
Last Updated on Thursday, 20 January 2011 10:27
 
MEF Accredited Training Providers