[1]齐小刚,牛红曼,刘兴成,等.多层信息网络故障定位综述[J].智能系统学报,2019,14(01):44-56.[doi:10.11992/tis.201804062]
 QI Xiaogang,NIU Hongman,LIU Xingcheng,et al.Survey of fault localization in multilayer information networks[J].CAAI Transactions on Intelligent Systems,2019,14(01):44-56.[doi:10.11992/tis.201804062]
点击复制

多层信息网络故障定位综述(/HTML)
分享到:

《智能系统学报》[ISSN:1673-4785/CN:23-1538/TP]

卷:
第14卷
期数:
2019年01期
页码:
44-56
栏目:
出版日期:
2019-01-05

文章信息/Info

Title:
Survey of fault localization in multilayer information networks
作者:
齐小刚1 牛红曼1 刘兴成1 王晓琳1 刘立芳2
1. 西安电子科技大学 数学与统计学院, 陕西 西安 710126;
2. 西安电子科技大学 计算机学院, 陕西 西安 710071
Author(s):
QI Xiaogang1 NIU Hongman1 LIU Xingcheng1 WANG Xiaolin1 LIU Lifang2
1. School of Mathematics and Statistics, Xidian University, Xi’an 710126, China;
2. School of Computer Science and Technology, Xidian University, Xi’an 710071, China
关键词:
多层网络故障管理故障诊断故障定位故障传播模型节点故障链路故障虚拟网络覆盖网
Keywords:
multilayer networkfault managementfault diagnosisfault localizationfault propagation modelnode faultlink faultvirtual networkoverlay network
分类号:
TP393
DOI:
10.11992/tis.201804062
摘要:
本文对多层网络(覆盖网、虚拟网等)故障定位问题进行了分析和总结。讨论了多层网络探测故障信息获取策略和故障定位模型的发展状况,即介绍了被动监测、主动探测、主被动结合探测和终端用户观察等探测信息获取策略,以及基于图论故障传播模型、依赖矩阵模型、症状-故障-行动模型等故障传播模型的原理以及其优缺点。从故障定位模型、探测信息获取策略、故障定位计算、网络异构性、运行效率与成本多个方面重点综述了多层网络故障定位方法,讨论了每种方法的优点和局限性。最后,对多层网络故障定位研究的不足和亟待进一步研究解决的问题进行了探讨。
Abstract:
This study analyzes and summarizes the problems of fault localization in multilayer networks (e.g., overlay network and virtual network). First, the latest developments in fault detection information acquisition technologies and fault localization models for multilayer network are discussed. The detection information acquisition technologies for passive monitoring, active detection, active-passive detection, and end-user observation are introduced, as well as the fault localization models such as dependency matrix model, graph-based propagation model, and symptom-fault-action model. The principles, advantages, and disadvantages of these technologies and models are presented as well. The methods of multilayer network fault localization are summarized, considering fault localization strategy model, fault detection calculation technique, network heterogeneity, operational efficiency, and cost, and then the merits and demerits of each method are highlighted. Finally, some pressing issues that need further study are discussed.

参考文献/References:

[1] DUSIA A, SETHI A S. Recent advances in fault localization in computer networks[J]. IEEE communications surveys and tutorials, 2016, 18(4):3030-3051.
[2] KOZAT U C, LIANG Guanfeng, KÖKTEN K, et al. On optimal topology verification and failure localization for software defined networks[J]. IEEE/ACM transactions on networking, 2016, 24(5):2899-2912.
[3] KATZELA I, SCHWARTZ M. Schemes for fault identification in communication networks[J]. IEEE/ACM transactions on networking, 1995, 3(6):753-764.
[4] HAN Y, HYUN J, HONG J W K. Graph abstraction based Virtual Network management framework for SDN[C]//Proceedings of 2016 IEEE Conference on Computer Communications Workshops. San Francisco, USA, 2016:884-885.
[5] GU Lin, TAO Sheng, ZENG Deze, et al. Communication cost efficient virtualized network function placement for big data processing[C]//Proceedings of 2016 IEEE Conference on Computer Communications Workshops. San Francisco, USA, 2016:604-609.
[6] NATU M, SETHI A S, LLOYD E L. Efficient probe selection algorithms for fault diagnosis[J]. Telecommunication systems, 2008, 37(1/2/3):109-125.
[7] STEINDER M, SETHI A S. End-to-end service failure diagnosis using belief networks[C]//Proceedings of 2002 IEEE/IFIP Network Operations and Management Symposium. Management Solutions for the New Communications World. Florence, Italy, 2002:375-390.
[8] WU Bin, HO P H, TAPOLCAI J, et al. Optimal allocation of monitoring trails for fast SRLG failure localization in all-optical networks[C]//Proceedings of 2010 IEEE Global Telecommunications Conference. Miami, USA, 2010:1-5.
[9] BRODIE M, RISH I, MA Sheng, et al. Active probing strategies for problem diagnosis in distributed systems[C]//Proceedings of the 18th International Joint Conference on Artificial Intelligence. Acapulco, Mexico, 2003:1337-1338.
[10] BABARCZI P, TAPOLCAI J, HO P H. Adjacent link failure localization with monitoring trails in all-optical mesh networks[J]. IEEE/ACM transactions on networking, 2011, 19(3):907-920.
[11] XUAN Ying, SHEN Yilin, NGUYEN N P, et al. Efficient Multi-link failure localization schemes in all-optical networks[J]. IEEE transactions on communications, 2013, 61(3):1144-1151.
[12] TAPOLCAI J, HO P H, RONYAI L, et al. Failure localization for shared risk link groups in all-optical mesh networks using monitoring trails[J]. Journal of lightwave technology, 2011, 29(10):1597-1606.
[13] ALI M L, HO P H, TAPOLCAI J. SRLG failure localization using nested m-trails and their application to adaptive probing[J]. Networks, 2015, 66(4):347-363.
[14] BAI Linda, ROY S. A two-stage approach for network monitoring[J]. Journal of network and systems management, 2013, 21(2):238-263.
[15] TANG Yongning, AL-SHAER E S, BOUTABA R. Active integrated fault localization in communication networks[C]//Proceedings of the 9th IFIP/IEEE International Symposium on Integrated Network Management. Nice, France, 2005:543-556.
[16] TANG Y, AL-SHAER E. Towards collaborative user-level overlay fault diagnosis[C]//Proceedings of the 27th IEEE Conference on Computer Communications. Phoenix, AZ, USA, 2008:2476-2484.
[17] PAN Yalian, QIU Xuesong, ZHANG Shuili. Fault diagnosis in network virtualization environment[C]//Proceedings of the 18th International Conference on Telecommunications. Ayia Napa, Cyprus, 2011:517-522.
[18] TANG Yongning, AL-SHAER E, JOSHI K. Reasoning under uncertainty for overlay fault diagnosis[J]. IEEE transactions on network and service management, 2012, 9(1):34-47.
[19] WANG Hao, WANG Ying, QIU Xuesong, et al. Fault diagnosis based on evidences screening in virtual network[C]//Proceedings of 2015 IFIP/IEEE International Symposium on Integrated Network Management. Ottawa, Canada, 2015:802-805.
[20] GILLANI S F, DEMIRCI M, AL-SHAER E, et al. Problem localization and quantification using formal evidential reasoning for virtual networks[J]. IEEE transactions on network and service management, 2014, 11(3):307-320.
[21] NATU M, SETHI A S. Application of adaptive probing for fault diagnosis in computer networks[C]//Proceedings of 2008 IEEE Network Operations and Management Symposium. Salvador, Brazil, 2008:1055-1060.
[22] OGINO N, KITAHARA T, ARAKAWA S, et al. Decentralized boolean network tomography based on network partitioning[C]//Proceedings of 2016 IEEE/IFIP Network Operations and Management Symposium. Istanbul, Terkey, 2016:162-170.
[23] MA Liang, HE Ting, SWAMI A, et al. Network capability in localizing node failures via end-to-end path measurements[J]. IEEE/ACM transactions on networking, 2017, 25(1):434-450.
[24] PAN Shengli, ZHANG zhiyong, ZHOU Yingjie, et al. Identify congested links based on enlarged state space[J]. Journal of computer science and technology, 2016, 31(2):350-358.
[25] TABATABAⅡ H S A, RABIEE H R, ROHBAN M H, et al. Incorporating betweenness centrality in compressive sensing for congestion detection[C]//Proceedings of 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. Vancouver, Canada, 2013:4519-4523.
[26] CHEN Jinbiao, QI Xiao, WANG Yongcai. An efficient solution to locate sparsely congested links by network tomography[C]//Proceedings of 2014 IEEE International Conference on Communications. Sydney, Australia, 2014:1278-1283.
[27] GAO Yi, DONG Wei, CHEN Chun, et al. Accurate per-packet delay tomography in wireless Ad Hoc networks[J]. IEEE/ACM transactions on networking, 2017, 25(1):480-491.
[28] AGARWAL M K, APPLEBY K, GUPTA M, et al. Problem determination using dependency graphs and run-time behavior models[C]//Proceedings of the 15th IFIP/IEEE International Workshop on Distributed Systems:Operations and Management. Davis, USA, 2004:171-182.
[29] APPLEBY K, FAIK J, KAR G, et al. Threshold management for problem determination in transaction based e-commerce systems[C]//Proceedings of the 9th IFIP/IEEE International Symposium on Integrated Network Management. Nice, France, 2005:733-746.
[30] BENNACER L, AMIRAT Y, CHIBANI A, et al. Self-diagnosis technique for virtual private networks combining Bayesian networks and case-based reasoning[J]. IEEE transactions on automation science and engineering, 2015, 12(1):354-366.
[31] STEINDER M, SETHI A S. Probabilistic event-driven fault diagnosis through incremental hypothesis updating[C]//Proceedings of the 8th IFIP/IEEE International Symposium on Integrated Network Management. Colorado Springs, USA, 2003:635-648.
[32] NATU M, SETHI A S. Probabilistic fault diagnosis using adaptive probing[C]//Proceedings of the 18th IFIP/IEEE International Workshop on Distributed Systems:Operations and Management. San José, USA, 2007:38-49.
[33] JIN Ruofan, WANG Bing, WEI Wei, et al. Detecting node failures in mobile wireless networks:a probabilistic approach[J]. IEEE transactions on mobile computing, 2016, 15(7):1647-1660.
[34] RISH I, BRODIE M, MA Sheng, et al. Adaptive diagnosis in distributed systems[J]. IEEE transactions on neural networks, 2005, 16(5):1088-1109.
[35] BOYEN X, KOLLER D. Tractable inference for complex stochastic processes[C]//Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence. Madison, Wisconsin, 1998:33-42.
[36] LI Zhiqing, CHENG Lu, QIU Xuesong, et al. Fault diagnosis for large-scale IP networks based on dynamic Bayesian model[C]//Proceedings of the 5th International Conference on Natural Computation. Tianjin, China, 2009:67-71.
[37] TANG Yongning, AL-SHAER E, BOUTABA R. Efficient fault diagnosis using incremental alarm correlation and active investigation for internet and overlay networks[J]. IEEE transactions on network and service management, 2008, 5(1):36-49.
[38] DEMIRCI M, LO S, SEETHARAMAN S, et al. Multi-layer monitoring of overlay networks[C]//Proceedings of the 10th International Conference on Passive and Active Network Measurement. Seoul, Korea, 2009:77-86.
[39] YAN Congxian, WANG Ying, QIU Xuesong, et al. Multi-layer fault diagnosis method in the Network Virtualization Environment[C]//Proceedings of the 16th Asia-Pacific Network Operations and Management Symposium. Hsinchu, China, 2014:1-6.
[40] 刘娜, 张顺利, 王向东. 基于信任评估的虚拟网故障诊断算法[J]. 电视技术, 2016, 40(4):80-84 LIU Na, ZHANG Shunli, WANG Xiangdong. Virtual network fault diagnosis using trust evaluation[J]. Video engineering, 2016, 40(4):80-84
[41] 张顺利. 网络虚拟化环境下的网络资源分配与故障诊断技术[D]. 北京:北京邮电大学, 2012:9-11. ZHANG Shunli. The technology of network resources allocation and fault diagnosis for network virtualization environment[D]. Beijing:Beijing University of Posts and Telecommunications, 2012:9-11.
[42] DE MOURA L, BJ?RNER N. Satisfiability modulo theories:introduction and applications[J]. Communications of the ACM, 2011, 54(9):69-77.
[43] DEMIRCI M, GILLANI F, AMMAR M, et al. Overlay network placement for diagnosability[C]//Proceedings of 2013 IEEE Global Communications Conference. Atlanta, USA, 2013:2236-2242.
[44] RAHMAN M R, BOUTABA R. SVNE:survivable virtual network embedding algorithms for network virtualization[J]. IEEE transactions on network and service management, 2013, 10(2):105-118.

备注/Memo

备注/Memo:
收稿日期:2018-04-28。
基金项目:国家自然科学基金项目(61572435,61472305,61473222);宁波市自然科学基金项目(2016A610035,2017A610119);复杂电子系统仿真重点实验室基础研究基金项目(DXZT-JC-ZZ-2015-015).
作者简介:齐小刚,男,1973年生,教授,博士生导师,主要研究方向为复杂系统网络建模与仿真,网络优化与算法设计;牛红曼,女,1991年生,硕士研究生,主要研究方向为面向信息网络的快速故障诊断方法;刘兴成,男,1992年生,硕士研究生,主要研究方向为组合探测。
通讯作者:牛红曼.E-mail:1450772363@qq.com
更新日期/Last Update: 1900-01-01