Page 57 - Proceedings of the 2018 ITU Kaleidoscope
P. 57

Machine learning for a 5G future




           isolation. The different  management areas, although   can be used to visualize the  whole profiling, anomaly
           operating on different time scales and on different managed   detection and diagnosis process.
           objects, need to be able to share knowledge and  make a
           holistic picture of the whole system, for example:     The collaborative self-healing with a QoE-driven backhaul
                                                              SON system was evaluated in a test network, where a ground
               •  Quality of Service (QoS) and Quality of     truth could be established. Delay of different network
                  Experience   (QoE)   driven  management:    segments as well as radio KPIs such as the Channel Quality
                  Optimizing the end-to-end customer experience at  Indicator (CQI) were  monitored in both the anomaly
                  the application and individual subscriber level  detection and diagnosis. The testbed consisted of two cells.
               •  Network Management (NM): Management
                  automation aggregated on a (Virtual) Network  Test runs consisted of three-day cycles, where the first day
                  Function (V)NF level                        the  network  was in a normal state, the second a radio
               •  VNF and Service Orchestration: Orchestration of  attenuation condition was present and on a third a Software
                  cloud resources, CPU cores, memory, storage, links  Defined Network (SDN) misconfiguration was introduced in
                  etc.                                        the radio backhaul  network. The three-day cycle  was
                                                              repeated four  times, while constantly increasing background
           To give an example, consider the corrective actions done by   traffic (video streaming). When the background traffic
           a QoE-driven self-healing function in a transport network   volume  was low, the  self-healing  mechanisms of the
           SON system described in [11]. It can re-route traffic past   backhaul SON solution could mitigate the problems caused
           problematic links at a very fast pace. If there are underlying   by the radio attenuation and the SDN misconfiguration so
           issues necessitating such re-routing, however, which are not   that the end user experience  was  not affected. However,
           corrected, it  may eventually  happen that the QoE   when the background traffic was increased, on the last three-
           management system is no longer able to fulfill the customer   day cycle the available resources become insufficient for the
           expectation with the available resources. The network will   backhaul SON to correct the degradations. This is shown in
           fail and the danger is that because it is failing at a later and   Figure 5.  The horizontal axis shows the aggregated
           potentially more escalated stage of the problem, the failure   throughput demand of a cell in Mbps and the vertical the
           will be more catastrophic and more difficult to troubleshoot.   actual throughput. The shaded area is normal correlation
           If you are failing, it is often better to fail fast.    between the demand and the actual throughput, as profiled
                                                              by the SEF.  As can be expected, normally the actual
           Our solution  for a  more holistic view, enabling early   throughput follows the demand. The points in the scatter plot
           detection of any issues, is to have the NM-level self-healing   represent the observations after profiling. We can see that a
           function monitoring the corrective actions performed by the   set of points transgress the profile boundary, indicating an
           QoE-driven, application and user centric backhaul  SON   anomaly,  where the cell cannot provide the throughput
           system. The actions are  modelled and aggregated as   required.
           additional KPIs that are used as input features for the NM
           SON self-healing, in addition to the normal NM-level alarms   Figure 6 shows the anomaly-level and anomalies detected
           and performance KPIs. Several corrective QoE-driven   by SEF during the last  four days in the test run. SEF is
           actions  may be an indication of an emerging problem,   monitoring both the NM KPIs and KPIs indicating the
           especially if occurring together with other indicators for a   actions taken by the backhaul SON. The shaded areas are the
           network performance degradation. This may be further   anomalous  timeframes  detected   by    SEF.
           extended to cover also VNF orchestration, for example. The
           different  management areas can collaboratively  monitor
           each other’s corrective actions or determine the best course
           of action  in a cooperative  manner. This also enables
           coordination of the actions taken by different management
           agents on different management areas.

           Therefore, mechanisms and KPIs to communicate manage
           decisions and actions between  management areas are
           required.  The management functions  may be created by
           different vendors, which may raise a need for standardization.

                7   SON EXPERIMENTAL FRAMEWORK

           The presented concept was demonstrated in a tool called the
           SON Experimental Framework (SEF) [12]. It is a framework
           implemented in the  R-language that can be integrated to
           different data sources. For evaluation, data collected from
           real operator networks was used, as well as live integrations
           to networks and testbeds. The system provides a web UI that   Figure 5 – Service degradation as detected by SEF





                                                           – 41 –
   52   53   54   55   56   57   58   59   60   61   62