Académique Documents
Professionnel Documents
Culture Documents
Issue 01
Date 2011-04-12
and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.
All other trademarks and trade names mentioned in this document are the property of their respective
holders.
Notice
The purchased products, services and features are stipulated by the contract made between Huawei and
the customer. All or part of the products, services and features described in this document may not be
within the purchase scope or the usage scope. Unless otherwise specified in the contract, all statements,
information, and recommendations in this document are provided "AS IS" without warranties, guarantees or
representations of any kind, either express or implied.
The information in this document is subject to change without notice. Every effort has been made in the
preparation of this document to ensure accuracy of the contents, but all statements, information, and
recommendations in this document do not constitute a warranty of any kind, express or implied.
Contents
4 Experience..................................................................................................................................... 25
4.1 IP&OTN Synergy Solution Test ..................................................................................................................... 25
4.2 Global Application ......................................................................................................................................... 25
4.3 Success Stories ............................................................................................................................................... 26
5 Conclusion .................................................................................................................................... 27
6 Acronyms and Abbreviations ................................................................................................... 28
Figures
Figure 3-13 IP&OTN alarm correlation and root cause analysis ........................................................................ 23
Figure 4-1 IP&OTN synergy solution test performed by EANTC ...................................................................... 25
Figure 4-2 Application of Huawei GMPLS/ASON in WDM/OTN fields........................................................... 26
Figure 4-3 IP&OTN synergy solution in Netherlands education network .......................................................... 26
Tables
1 Executive Summary
As enterprise branches are located in different regions and a lot of information needs to be
processed, the enterprises rely on WAN and backbone networks to implement department and
service collaboration. Therefore, the WAN and backbone networks must be reliable and easy
to maintain. A fault on the WAN or backbone network may interrupt enterprises' services or
even cause a disaster to the enterprises.
The WAN and backbone network involve the Internet protocol (IP) layer and optical transport
layer. Network maintenance is difficult and fault location is inefficient.
IP&OTN synergy is an important part of the Huawei "Integration, Easy, Clouding" solution. It
uses the intelligent optical transport network (OTN) and IP/MPLS technology to provide
protection synergy and operation, administration, and management (OAM) synergy. IP&OTN
synergy is a valuable solution for enterprises to implement communication and
informatization.
Huawei IP&OTN synergy solution has the following features:
High availability
Improves network reliability and service security by planning multi-layer networks and
using protection and fault recovery measures.
Easy to use
Manages network resources by using a unified network management system,
automatically discovers the topology of two layers, and creates end-to-end services
easily.
Easy to manage
Displays the topology of two layers on a screen, analyzes fault causes, deletes
unnecessary traps, helping to locate faults quickly.
Support for various new services
Provides differentiated Service Level Agreements (SLAs) according to service priorities.
In legacy network solutions, the IP layer and optical layer are separated from each other and
each layer uses individual protection and maintenance plans. The IP&OTN synergy solution
provides collaborative and efficient protection and maintenance for different layers.
2 Introduction
Challenges
Although the technologies on the OTNs and routers develop, legacy networks are facing the
following problems:
Hard to live: The IP layer plan does not consider the paths at the optical layer. Faults at
the optical layer may cause failures in the protection paths at the IP layer may cause
faults at the optical layer. As a result, the protection measures will fail to function and
networks face breakdown.
High costs on logical links: If the IP layer and optical layer are designed separately, the
costs on the optical layer will not be considered in the IP layer design. Therefore, the
optimal paths at the IP layer may cause a high cost at the optical layer.
Failed cooperation between protection mechanisms When a fiber is faulty, both the
optical layer and IP layer enable their protection mechanisms, causing frequently service
flapping. Faults may occur.
Hard to locate faults: When a fiber is faulty, both the optical layer and IP layer will
generate a lot of traps. Fault location is difficult and inefficient because there is a lack of
trap association and cause analysis mechanisms.
To solve the preceding problems, routers at the IP layer must collaborate with OTN devices to
provide multi-layer network plan, collaborative protection, and collaborative maintenance.
3 Solution
3.1 Overview
The Huawei IP&OTN synergy solution is applied to enterprises' backbone networks. In this
solution, the optical transport network uses the ASON technique to dynamically adjust
bandwidth according to Ethernet traffic volume and support long distance transmission. The
routers at the IP layer provide differentiated quality according to the IP precedence or MPLS
EXP priorities in service packets. The optical layer and IP layer collaborate with each other to
provide high quality, high reliability, and easy operation and maintenance.
The IP layer and optical layer collaborate in two modes:
Manual mode
The network planning engineers manually plan the IP layer and optical layer with their
professional knowledge and experience. Protection is implemented by static Shared Risk
Link Group (SRLG) and uniform protection switchover parameters. Huawei network
management system U2000 manages the topology of the two layers and locates faults.
Dynamic mode
In this mode, network plan is implemented by the intelligent multi-layer network
planning tool and protection is implemented by the GMPLS-UNI-based dynamic SRLG
and multi-layer path computation elements (PCEs).
Multi-Layer PCE
GMPLS-UNI
PCECP ASON
Multi-Layer Network
Planning Tool
The IP&OTN synergy solution provides the following functions for the enterprise's WAN and
backbone networks:
Optical layer protection
IP/MPLS layer protection
IP&OTN protection synergy
IP&OTN OAM synergy
Each optical line uses two pairs of fibers. One pair functions as the working path to transmit
service signals. The other pair functions as the protection path to transmit service signals
when a fiber break occurs on the working path or signal attenuation is too large. Figure 3-2
shows the diagram of optical line protection.
Mesh Networking
Mesh networking is a widely used networking type of ASON, and is flexible and easy to
extend. Compared with WDM networking, mesh networking supports more recovery paths,
which improve network security and reduce network resource waste.
In addition to the traditional protection measures (such as 1+1 protection) and shared
protection measures, the mesh networking can also use the rerouting mechanism to protect
services. Using all the preceding measures, the mesh networking is capable of restoring
services in any situations.
As shown in Figure 3-6, if the link between device C and device G is interrupted, a route from
device D to device H is generated. Services are restored through a newly generated LSP.
Figure 3-6 Service protection and restoration using the mesh networking
Dynamic Rerouting
Rerouting recovers services when network faults occur. In non-revertive mode, the first node
on an interrupted LSP calculates the optimal path, and then sets up a new LSP using signaling
messages. Services are transmitted over the new LSP. The interrupted LSP is deleted after the
new LSP takes effect.
Rerouting, as a key technology of GMPLS/ASON, protects services without a waste of
resources. It is also a revolutionary improvement for traditional protection measures.
Rerouting protects services even if fibers are interrupted frequently.
As shown in Figure 3-7, an LSP passes devices A, D, G, and K. When the link between
devices D and G is interrupted, the rerouting process is as follows:
The FIU (for optical layer) or OUT (for electrical layer) of device D detects an alarm,
and then reports the alarm to the GMPLS module.
The GMPLS module on device D checks the affected intelligent services and sends a
Notify message to device A.
After receiving the Notify message, the GMPLS module of device A calculates an
end-to-end protection path and sends a PATH message along the new path. A reverse
cross-connected path destined for device K is set up.
After receiving the PATH message, the GMPLS module of device K returns a RESV
message along the new path to set up a cross-connected path destined for device A.
After receiving the RESV message, device A enables the alarm function and sends a
PATH message to request the downstream devices to enable the alarm function. The
downstream devices enable the alarm function for the new path.
After all devices on the LSP enable the alarm function, the old LSP is deleted if the
non-revertive mode is used. The rerouting process is complete.
D
fy
Noti G
A
PA
TH
K
C F
PA
TH
H
B PAT
H
E
Service Association
Two LSPs are associated. When an LSP is performing rerouting or optimization, this LSP is
separated from the other one. The two LSPs do not overlap each other. Service association is
applicable to the services having two access points (dual homing).
As shown in Figure 3-9, the two LSPs D-E-I and A-B-G-H are associated. If the link between
devices B and G is broken, the LSP A-B-G-H performs rerouting and the LSP D-E-I is not
affected.
1. Diamond service
Diamond service has the best protection ability. When there are enough resources on the
network, diamond service provides permanent 1+1 protection for paths such as ODUk paths.
Diamond services are applicable to voice and data services, VIP private line, such as banking,
security, and aviation.
A diamond service provides 1+1 protection from the source node to the sink node. It is also
called a 1+1 service. There are two LSPs available between the source node and the sink node.
The two LSPs are separated. One is the working LSP and the other is the protection LSP. The
same service is transmitted to the working LSP and the protection LSP at the same time.
When the working LSP is normal, the sink node receives services from the working LSP;
otherwise, the working LSP receives services from the protection LSP.
Figure 3-10 shows the network diagram of diamond service.
3. Bronze service
The bronze services are seldom used. Generally, temporary services, such as the abrupt
services in holidays, use the bronze service. The paths of bronze service include OCh paths,
ODUk paths, and Client paths.
The bronze service means no protection. If an LSP fails, rerouting is not triggered and
services are interrupted.
BFD
BFD is an interactive detection mechanism that rapidly detects communication faults between
systems and reports the detected faults to upper-layer applications.
BFD has the following functions:
MPLS OAM
MPLS OAM is a rapid detection mechanism that checks MPLS LSP connectivity by allowing
nodes along an LSP to exchange OAM packets.
MPLS OAM provides the following functions, independent of upper-layer or lower-layer
protocols:
Detects, identifies, and locates MPLS user-plane faults efficiently.
Evaluates network usage and performance.
Performs protection switching in the event of a link defect or fault to provide services
according to the Service Level Agreements (SLAs).
For more information about MPLS OAM, see ITU-T Recommendation Y.1710 and Y.1711.
IP FRR
On legacy IP networks, it takes the routing system several seconds to complete route
convergence after a fault is detected. This convergence speed cannot meet requirements of the
services that are sensitive to packet delay and packet loss. For example, Voice over Internet
Protocol (VoIP) services are tolerant of millisecond-level interruption.
IP FRR allows the forwarding system to rapidly detect faults and take measures to restore
services as soon as possible. The IP FRR implementation principles are as follows:
When the primary link is available, you can configure IP FRR by using a routing policy
to provide the backup route information for the forwarding engine.
When the forwarding engine finds that the primary link fails, it uses the backup link to
forward traffic before the routes converge on the control plane.
BGP FRR
IGP/LDP FRR can rapidly switch traffic to another link when a link fault occurs. However,
when a fault occurs on a BGP node, routes need to converge on the BGP control plane and
then be delivered to the forwarding table. The route convergence time may reach the second
level. The BGP indirect next hop technique speeds up route convergence on the control plane,
but it still cannot ensure carrier-class reliability.
In BGP FRR, the LDP label or BGP label of a sub-optimal route is installed into the
forwarding table as a backup routing entry. When a rapid fault detection mechanism such as
BFD detects that the optimal route becomes unavailable, services are switched to the backup
route. This implements fast service switchover.
LDP FRR
LDP FRR allows a device to install both the optimal route and sub-optimal route that
functions as the backup route into the forwarding table. When the next hop of the optimal
route fails, traffic is forwarded using the backup route or label.
LDP FRR can work with BFD to rapidly detect faults in the next hop of the optimal route and
implement route convergence within 50 ms.
There are some shortcomings in LDP FRR. For example, on a ring network, the next hop of
the sub-optimal route may send packets to the local node, causing a loop.
Compared with RSVP TE FRR, LDP FRR provides only single-point protection but not
end-to-end protection.
MPLS TE FRR
MPLS TE FRR protects links and nodes in MPLS TE. When an LSP link or a node fails,
traffic can be forwarded along the tunnel of the protected link or protected node. This ensures
uninterrupted traffic forwarding. In addition, the ingress can continue re-establishing the
primary path without affecting data transmission.
In MPLS TE FRR, an LSP is established to protect one or more LSPs. This LSP is called the
FRR LSP and the protected LSP is called the primary LSP. When a link or node fails, MPLS
TE FRR uses the FRR LSP to transmit traffic; therefore, the primary LSP is protected. All the
nodes in the MPLS TE system need to participate in the establishment of the FRR LSP and
primary LSP.
MPLS TE FRR is implemented based on RSVP TE and complies with RFC 4090.
VPN FRR
MPLS TE FRR protects services in the case of a link or node failure between two PEs at both
ends of a TE tunnel; however, MPLS TE FRR cannot protect services in the case of a PE
failure.
Once a PE fails, services can only be restored by means of end-to-end route convergence and
LSP convergence. The service convergence time depends on the quantities of MPLS VPN
routes and hops on a bearer network. The convergence time is usually 5s on a typical network,
which is longer than 1s required for end-to-end service convergence.
VPN FRR solves the preceding problem. In VPN FRR, primary and backup forwarding
entries with the primary PE and backup PE as their respective destinations are preconfigured
on the remote PE. Rapid PE failure detection is also used so that the end-to-end service
convergence is within 1s on an MPLS VPN where a CE is dual homed to two PEs. The
recovery time is independent of the quantity of VPN routes.
Unlike layer-by-layer network planning tools, a multi-layer network planning tool improves
resource utilization and network reliability by planning the IP layer and transport layer
together. This tool has the following advantages:
Allocates bandwidth for the two layers based on traffic volume so that traffic is loaded
evenly, improving utilization of network resources.
Isolates faults on the IP layer and transport layer to prevent a fault from triggering
repeated protection at the two layers. This ensures effective protection and improves
network reliability, laying a foundation for intelligent synergy between the IP layer and
transport layer of a backbone network.
3.4.2 SRLG
An SRLG is a group of links with the same reliability risks. For example, multiple links on a
router involve the same transport path. If the transport path fails, both the working and
protection links on the router will also fail. To prevent this problem, links in the same SRLG
are not assigned to a pair of working and protection paths during path computation. This
improves reliability on the IP layer because a link failure will not cause both the working and
protection paths to fail.
Static SRLG
Static SRLG requires the IP network administrators to manually configure SRLG information
on routers after confirming the information with the transport network administrators.
Static SRLG is easy to implement and does not require configuration of other parameters.
However, static SRLG has the following disadvantages:
The administrators of the IP network and transport network have to exchange and
configure a large amount of detailed information, which is labor-consuming and prone to
errors.
When links on the transport layer are re-planned or adjusted, the transport network
administrators must notify the IP network administrators, and the IP network
administrators modify configurations on the IP layer.
If the GMPLS ASON technology is used at the transport layer, the transport paths may
change automatically. The IP network administrators cannot be notified of the changes in
real time.
Dynamic SRLG
Huawei presents the dynamic SRLG solution to overcome problems of static SRLG.
Transport devices transfer SRLG information to routers through extended GMPLS-UNIs
between them. Dynamic SRLG has the following advantages:
The SRLG information is transmitted from the transport layer to the IP layer
automatically and no manual operation is required, reducing workload in maintenance
and preventing configuration errors.
Transport devices update SRLG information when transport links are adjusted, saving
network administrators' workload in modifying configurations.
When the GMPLS ASON re-computes routes, transport devices notify routers of SRLG
information update.
Transport devices send SRLG information to routers, including information specific to each
layer such as OTN layer, optical layer, and fiber layer. Each router calculates and updates
links on the working and protection paths according to the SRLG information received from
the transport layer to ensure that the working and protection paths do not contain links in the
same SRLG. Figure 3-12 shows dynamic SRLG implementation.
L-S2 L-S3
L-S1
L-S5 WDM/lambda L-S4
L-S4
F-S2
F-S1 F-S3 F-S4
F-S1
F-S3 Fiber
F-S4
GMPLS-UNI
The GMPLS-UNI technology defined by IETF is a key technology to enhance information
exchange between the IP layer and transport layer. Routers on the IP layer send messages to
request transport devices to set up or delete paths through GMPLS-UNIs.
After a router sets up a link, it sends GMPLS-UNI signaling messages to notify transport
devices of the source node, destination node, and attributes (such as bandwidth and protection
attributes) of the link. Transport devices then set up a transport path according to the link
information.
PCE
On a large network, constraint-based path computation is complex, and devices participating
in path computation must have high calculation capabilities. If distributed path computation is
performed on the network, each node must have high calculation capabilities, causing high
costs on network construction. If the network is divided into multiple domains, the topology
of each domain is hidden to other domains. Therefore, devices on the network must cooperate
to compute the optimal end-to-end path.
The PCE technology is introduced to solve the path computation problem. A PCE has high
path computation capabilities and is deployed on a network device or an external server. A
PCE is responsible for path computation in a domain. All path computation requests in a
domain are sent to the PCE in this domain. After completing path computation, the PCE sends
the computation result to the path computation clients (PCCs) that sent the path computation
requests. PCEs in multiple domains work together to compute the optimal path.
Over 80% traffic from the IP network is carried over wavelengths. When services on a
router are interrupted, it is difficult to quickly identify whether the fault occurred on the
IP network or on a WDM device, let alone to isolate the fault.
When a fault occurs on a transport device, the transport network administrators do not
know whether this fault affects IP links and which IP links are affected.
Device connections on the IP network are complex, making OAM on IP networks
difficult. Network administrators usually have to open many pages on the NMS to
configure a service.
The OAM synergy solution is introduced to reduce workload on network management and
make network OAM easy. It solves the preceding problems implementing unified
management on the IP network and OTN and visualized service maintenance.
Unified NE Management
The U2000 manages transport devices, access devices, and IP devices uniformly. It manages
devices such as routers, switches, DSLAMs, and firewalls, and services such as MSTP, WDM,
OTN, microwave, PTN, MSAN, and FTTx.
IP fault locating
The SQM system uses Huawei's IP fault locating techniques to locate faults on the IP
network. After the source IP address/port and destination IP address/port are entered, the
SQM system can locate the fault in 5 minutes.
4 Experience
R2-VenderX
R1-Huawei
R3-Huawei
N3-VenderY N4-VenderY
N1-Huawei N2-Huawei
Huawei provides cutting-edge routing techniques and is serving 36 of top-50 carriers in the
world. Huawei has deployed over 120 IP/MPLS networks and over 620 metro networks in
102 countries and regions, and provides services for the most users (1 billion) among all
telecommunications device vendors.
Regional
OSN1800
POP
Regional Core POP Site B
ring (NE40E+ S5300
OSN6800)
S5300
Gigabit Site A
External metro ring
network DWDM core network
S5300
S5300
Core POP Core POP
(NE40E+ Site C
OSN6800)
5 Conclusion
The IP&OTN synergy solution enhances reliability and security of the GMPLS/ASON
intelligent and provides a multi-layer network planning tool to improve network planning
efficiency. It takes advantages of high flexibility and reliability of the IP/MPLS network and
uses the U2000 unified OAM platform to improve reliability and maintenance efficiency of
WAN and backbone network. This solution provides high-quality cloud networks to help
enterprise implement informatization and create value for customers.