论文:2020,Vol:38,Issue(5):1030-1037
引用本文:
王小玲, 张小芳, 李宁, 韩承枫, 袁祝平, 高环宇. 一种基于综合调优的数据库性能趋势预测方法[J]. 西北工业大学学报
WANG Xiaoling, ZHANG Xiaofang, LI Ning, HAN Chengfeng, YUAN Zhuping, GAO Huanyu. A Time Series Prediction Approach Based on Hybrid Tuning for Database Performance Indicator in AIOps[J]. Northwestern polytechnical university

一种基于综合调优的数据库性能趋势预测方法
王小玲1, 张小芳1,2, 李宁1,2, 韩承枫1,2, 袁祝平3, 高环宇3
1. 西北工业大学 计算机学院, 陕西 西安 710027;
2. 大数据存储与管理工业和信息化部重点实验室, 陕西 西安 710072;
3. 交通银行股份有限公司, 上海 310115
摘要:
云数据库智能运维中的重要应用场景之一是对监控采集的大量性能时序数据进行趋势预测。提出一种基于Prophet模型和ARIMA模型的综合调优智能趋势预测方法AutoPA4DB (auto prophet and ARIMA for database)。该方法根据数据库性能监控数据的特征,进行了原始监控数据的预处理、预测模型自动调参和模型优化。采用加权的时序预测准确性度量WMC (weighted MAPE coverage),基于多个企业级数据库实例(包含10种性能指标)进行了实验验证。实验对比了5种不同时序模型的预测效果,结果表明在单调变化模式(如磁盘使用量)的数据中,文中提出的AutoPA4DB方法时序预测准确性最高;然而在震荡模式的数据中,预测效果不太稳定,例如内存使用率趋势预测效果较好,但数据库连接数趋势预测效果不理想。
关键词:    智能运维    时序数据    Prophet模型    ARIMA模型    数据库性能监控   
A Time Series Prediction Approach Based on Hybrid Tuning for Database Performance Indicator in AIOps
WANG Xiaoling1, ZHANG Xiaofang1,2, LI Ning1,2, HAN Chengfeng1,2, YUAN Zhuping3, GAO Huanyu3
1. School of Computer, Northwestern Polytechnical University, Xi'an 710072, China;
2. Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Xi'an 710072, China;
3. Bank of Communication, Shanghai 310115, China
Abstract:
One of the most important applications of the intelligent operation and maintenance of a cloud database is its trend prediction of key performance indicators (KPI), such as disk use, memory use, etc. We propose a method named AutoPA4DB (Auto Prophet and ARIMA for Database) to predict the trend of the KPIs of the cloud database based on the Prophet model and the ARIMA model. Our AutoPA4DB method includes data preprocessing, model building, parameter tuning and optimization. We employ the weighted MAPE coverage to measure its accuracy and use 6 industrial datasets including 10 KPIs to compare the AutoPA4DB method with other three time-series trend prediction algorithms. The experimental results show that our AutoPA4DB method performs best in predicting monotonic variation data, e.g.disk use trend prediction. But it is unstable in predicting oscillatory variation data; for example, it is acceptable in memory use trend prediction but has poor accuracy in predicting the number of database connection trends.
Key words:    intelligent operation and maintenance    time series    prophet    ARIMA    database performance monitor   
收稿日期: 2019-11-05     修回日期:
DOI: 10.1051/jnwpu/20203851030
基金项目: 国家自然科学基金(61732014,61672434)资助
通讯作者: 李宁(1978-),女,西北工业大学博士、副教授,主要从事软件库挖掘、数据库研究。     Email:
作者简介: 王小玲(1994-),女,西北工业大学硕士研究生,主要从事数据库智能运维研究。
相关功能
PDF(2002KB) Free
打印本文
把本文推荐给朋友
作者相关文章
王小玲  在本刊中的所有文章
张小芳  在本刊中的所有文章
李宁  在本刊中的所有文章
韩承枫  在本刊中的所有文章
袁祝平  在本刊中的所有文章
高环宇  在本刊中的所有文章

参考文献:
[1] DANG Y, LIN Q, HUANG P. AIOps:Real-World Challenges and Research Innovations[C]//Proceedings of the 41st International Conference on Software Engineering:Companion Proceedings, 2019
[2] 裴丹,张圣林,裴昶华, 等. 基于机器学习的智能运维[J]. 中国计算机学会通讯, 2017, 13(12):68-72 PEI Dan, ZHANG Shenglin, PEI Changhua, et al. Intelligent Operation and Maintenance Based on Machine Learning[J]. Communications of the CCF, 2017, 13(12):68-72(in Chinese)
[3] 林子雨,赖永炫,林琛,等. 云数据库研究[J]. 软件学报, 2012, 23(5):1148-1166 LIN Ziyu, LAI Yongxuan, LIN Chen, et al. Research on Cloud Databases[J]. Journal of Software, 2012, 23(5):1148-1166(in Chinese)
[4] 原继东,王志海. 时间序列的表示与分类算法综述[J]. 计算机科学, 2015, 42(3):1-7 YUAN Jidong, WANG Zhihai. Review of Time Series Representation and Classification Techniques[J]. Computer Science, 2015, 42(3):1-7(in Chinese)
[5] JAN G. DE Gooijer, ROB J. Hyndman. 25 Years of Time Series Forecasting[J]. International Journal of Forecasting, 2006, 22(3):443-473
[6] 杨海民,潘志松,白玮. 时间序列预测方法综述[J]. 计算机科学, 2019, 46(1):21-28 YANG Haimin, PAN Zhisong, BAI Wei. Review of Time Series Prediction Methods[J]. Computer Science, 2019,46(1):21-28(in Chinese)
[7] TAYLOR S J, LETHAM B. Forecasting at Scale[J]. The American Statistician, 2018, 72(1):37-45
[8] 韩东明,郭方舟,潘嘉铖, 等. 面向时序数据异常检测的可视分析综述[J]. 计算机研究与发展, 2018, 55(9):1843-1852 HAN Dongming, GUO Fangzhou, Pan Jiacheng, et al. Visual Analysis for Anomaly Detection in Time-Series:a Survey[J]. Computer Research and Development, 2018, 55(9):1843-1852(in Chinese)
[9] 崔佳旭,杨博. 贝叶斯优化方法和应用综述[J]. 软件学报,2018,29(10):3068-3090 CUI Jiaxu, YANG Bo. Survey on Bayesian Aptimization Methodology and Applications[J]. Journal of Software, 2018, 29(10):3068-3090(in Chinese)
[10] ROB J H, ANNE B K, RALPH D S, et al. A State Space Framework for Automatic Forecasting Using Exponential Smoothing Methods[J]. International Journal of Forecasting, 2002,18(3):439-454