Information Technology of Optimization Parameters of the Assembly Models of Artificial Intelligence for Forecasting the Presence Precipitations by Meteorological Monitoring

Authors

  • M. V. Dratovanyi Vinnytsia National Technical University
  • O. M. Kozachko Vinnytsia National Technical University
  • O. L. Melnyk Vinnytsia National Technical University
  • I. V. Varchuk Vinnytsia National Technical University

DOI:

https://doi.org/10.31649/1997-9266-2020-153-6-76-83

Keywords:

information technology, artificial intelligence models, precipitation forecasting, informative features

Abstract

Data forecasting is a trivial task of systems analysis, there are different types of forecasts and predictions. One of them is a binary forecast that answers the question of whether an event will occur or not. One of the issues of meteorology is the issue of forecasting precipitation, as well as what accuracy will be in such a forecast.

The paper considers the problem of forecasting the presence of precipitation according to meteorological monitoring and proposes information technology to optimize the parameters of the ensemble of such models of machine learning as models of gradient boosting and logistic regression, built on a set of informative features. The proposed information technology allows you to perform intelligence analysis of input data and determine the optimal set of informative features, and due to the algorithm, which at each step determines the optimal one, two, three,… -element sets of features that maximize forecasting accuracy. Graphs of influence of signs on accuracy of the used models of machine learning are constructed. Each type of model has its own set of features. To provide information technology, the data provided by the Vinnytsia Center for Hydrometeorology were selected. These are the data of atmospheric monitoring of Vinnytsia for the last 10 years, which include: air temperature, humidity, dew point, cloudiness and wind speed.

To increase the accuracy of forecasting, a mathematical model is proposed, which is based on the optimal determination of the weights of the ensemble of models of gradient boosting and logistic regression. Experiments were performed that showed a fairly accurate result. The accuracy of the proposed information technology showed 80%. This confirmed the reliability of the proposed technology.

Author Biographies

M. V. Dratovanyi, Vinnytsia National Technical University

Assistant of the Chair of System Analysis and Information Technologies

O. M. Kozachko, Vinnytsia National Technical University

Cand. Sc. (Eng.), Associate Professor, Associate Professor of the Chair of System Analysis and Information Technologies

O. L. Melnyk, Vinnytsia National Technical University

Student of the Department of Computer Systems and Automation

I. V. Varchuk, Vinnytsia National Technical University

Cand. Sc. (Eng.), Associate Professor, Associate Professor of the Chair of System Analysis and Information Technologies

References

A. Bezerra, I. Silva, L. A. Guedes, D. Silva, G. Leitão, and K. Saito, “Extracting Value from Industrial Alarms and Events: A Data-Driven Approach Based on Exploratory Data Analysis,” Sensors, 2019, no. 19, issue 12, pp. 11-32.

Як роблять прогнози погоди і чому вони іноді не збуваються? Прогноз. [Електронний ресурс]. Режим доступу: https://www.bbc.com/ukrainian/features-51545290 . Назва з екрана.

Прогнози погоди і клімату та притаманні їм обмеження. [Електронний ресурс]. Режим доступу: http://prima.franko.lviv.ua/faculty/geology/phis_geo/fourman/E-books-FVV/Intera ctive%20books/Meteorology/Weather%20Forecasting/Weather%20Ukraine/Meteo-forecasting/Analyze-forecast%20of%20limits%20climate.htm . Назва з екрана.

Прогнозування погоди. [Електронний ресурс]. Режим доступу: http://prima.franko.lviv.ua/faculty/geology/phis_geo/fourman/E-books-FVV/Intera ctive%20books/Meteorology/ForecaseM.htm. Назва з екрана.

Guolin Ke, et. al.,“LightGBM: A Highly Efficient Gradient Boosting Decision Tree,” Advances in Neural Information Processing Systems 30 (NIPS 2017), pp. 3149-3157.

E. Bauer, and R. Kohavi, “An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants,” Machine Learning, 1999, pp. 105-139.

Module pandas_profiling. [Electronic resource]. Available: https://pandas-profiling.github.io/pandas-profiling/docs/ .

Matplotlib API Overview. [Electronic resource]. Available: https://matplotlib.org/api/index.html .

A new correlation coefficient between categorical, ordinal and interval variables with Pearson characteristics. [Electronic resource]. Available: https://arxiv.org/abs/1811.11440 .

XGBoost Documentation. [Electronic resource]. Available: https://xgboost.readthedocs.io/en/latest/ .

LightGBM Documentation. [Electronic resource]. Available: https://lightgbm.readthedocs.io/en/latest/ .

Downloads

Abstract views: 178

Published

2020-12-25

How to Cite

[1]
M. V. . Dratovanyi, O. M. . Kozachko, O. L. . Melnyk, and I. V. . Varchuk, “Information Technology of Optimization Parameters of the Assembly Models of Artificial Intelligence for Forecasting the Presence Precipitations by Meteorological Monitoring”, Вісник ВПІ, no. 6, pp. 76–83, Dec. 2020.

Issue

Section

Information technologies and computer sciences

Metrics

Downloads

Download data is not yet available.

Most read articles by the same author(s)

1 2 > >>