ARTÍCULO
TITULO

Data shift monitoring in machine learning models

Dmitry Namiot    
Eugene Ilyushin    

Resumen

The fundamental moment of the operation of machine learning systems is that the models are trained on some selected training data set. Accordingly, the generalizations obtained at the training stage are due to the characteristics of some subset of the general population. If the characteristics of the data change during the operation of the system, then generalizations of the model become, generally speaking, untenable. At the same time, such a change in data should be considered the rule rather than the exception. This change in data characteristics is called data shift. This, in turn, means that any machine learning system that claims to be industrial must track the possible data shift. The presence of such a shift reduces the confidence in the results of the work or even makes the system unsuitable for further operation. Taking into account (overcoming) such a data shift is a separate task, simple retraining can be a big problem for critical applications, for example. But in any case, the first task is to determine the fact of data shift. The data shift itself is divided into several types, the most serious of which is a change in the relationship between dependent and independent variables. Naturally, the definition of data offset for streams is of particular interest, since this is directly related to critical applications.

 Artículos similares

       
 
Chenglei Lv, Qiushi Sun, Huifang Chen and Lei Xie    
Due to the relative motion between transmitters and receivers and the multipath characteristic of wideband underwater acoustic channels, Doppler and channel estimations are of great significance for an underwater acoustic (UWA) communication system. In t... ver más

 
Dongye Lv, Hanbing Liu, Qiang Miao, Wensheng Wang, Guojin Tan, Chengwei Shi and Hanjun Li    
The passivation behavior of steel reinforcements in concrete is significantly influenced by the environment, concrete pore solution, and the passive film formed on the steel surface. The present study used electrochemical methods to successfully characte... ver más
Revista: Applied Sciences

 
Jih-Ching Chiu, Guan-Yi Lee, Chih-Yang Hsieh and Qing-You Lin    
In computer vision and image processing, the shift from traditional cameras to emerging sensing tools, such as gesture recognition and object detection, addresses privacy concerns. This study navigates the Integrated Sensing and Communication (ISAC) era,... ver más

 
R. J. Roosien, M. N. A. Lim, S. M. Petermeijer and W. F. Lammen    
To reduce the carbon footprint of transport, policymakers are simultaneously stimulating cleaner vehicles and more sustainable mobility choices, such as a shift to rail for short-haul flights within Europe. The purpose of this study is to determine the c... ver más
Revista: Aerospace

 
Yansong Li, Yaning Chen, Yapeng Chen, Weili Duan, Jiayou Wang and Xu Wang    
Global changes in drought and wetness and their future trends in arid regions have recently become a major focus of research attention. The Tarim River Basin (TRB) in Xinjiang, China, is among the most climate-sensitive regions in the world. This study u... ver más
Revista: Water