Inicio  /  Hydrology  /  Vol: 1 Par: 1 (2014)  /  Artículo
ARTÍCULO
TITULO

Training of Artificial Neural Networks Using Information-Rich Data

Shailesh Kumar Singh    
Sharad K. Jain and András Bárdossy    

Resumen

Artificial Neural Networks (ANNs) are classified as a data-driven technique, which implies that their learning improves as more and more training data are presented. This observation is based on the premise that a longer time series of training samples will contain more events of different types, and hence, the generalization ability of the ANN will improve. However, a longer time series need not necessarily contain more information. If there is considerable repetition of the same type of information, the ANN may not become ?wiser?, and one may be just wasting computational effort and time. This study assumes that there are segments in a long time series that contain a large quantum of information. The reason behind this assumption is that the information contained in any hydrological series is not uniformly distributed, and it may be cyclic in nature. If an ANN is trained using these segments rather than the whole series, the training would be the same or better based on the information contained in the series. A pre-processing can be used to select information-rich data for training. However, most of the conventional pre-processing methods do not perform well due to large variation in magnitude, scale and many zeros in the data series. Therefore, it is not very easy to identify these information-rich segments in long time series with large variation in magnitude and many zeros. In this study, the data depth function was used as a tool for the identification of critical (information) segments in a time series, which does not depend on large variation in magnitude, scale or the presence of many zeros in data. Data from two gauging sites were used to compare the performance of ANN trained on the whole data set and just the data from critical events. Selection of data for critical events was done by two methods, using the depth function (identification of critical events (ICE) algorithm) and using random selection. Inter-comparison of the performance of the ANNs trained using the complete data sets and the pruned data sets shows that the ANN trained using the data from critical events, i.e., information-rich data (whose length could be one third to half of the series), gave similar results as the ANN trained using the complete data set. However, if the data set is pruned randomly, the performance of the ANN degrades significantly. The concept of this paper may be very useful for training data-driven models where the training time series is incomplete.

 Artículos similares

       
 
William Villegas-Ch, Joselin García-Ortiz and Angel Jaramillo-Alcazar    
This paper investigated the importance of explainability in artificial intelligence models and its application in the context of prediction in Formula (1). A step-by-step analysis was carried out, including collecting and preparing data from previous rac... ver más

 
Nikolaos Makrakis, Prodromos N. Psarropoulos and Yiannis Tsompanakis    
Large-scale lifelines in seismic-prone regions very frequently cross areas that are characterized by active tectonic faulting, as complete avoidance might be techno-economically unfeasible. The resulting Permanent Ground Displacements (PGDs) constitute a... ver más
Revista: Infrastructures

 
Mohamed Gad, Aissam Gaagai, Mohamed Hamdy Eid, Péter Szucs, Hend Hussein, Osama Elsherbiny, Salah Elsayed, Moataz M. Khalifa, Farahat S. Moghanm, Moustapha E. Moustapha, Dina A. Tolan and Hekmat Ibrahim    
The assessment and prediction of water quality are important aspects of water resource management. Therefore, the groundwater (GW) quality of the Nubian Sandstone Aquifer (NSSA) in El Kharga Oasis was evaluated using indexing approaches, such as the drin... ver más
Revista: Water

 
Sarra Bel Haj Salem, Aissam Gaagai, Imed Ben Slimene, Amor Ben Moussa, Kamel Zouari, Krishna Kumar Yadav, Mohamed Hamdy Eid, Mostafa R. Abukhadra, Ahmed M. El-Sherbeeny, Mohamed Gad, Mohamed Farouk, Osama Elsherbiny, Salah Elsayed, Stefano Bellucci and Hekmat Ibrahim    
In the Zeroud basin, a diverse array of methodologies were employed to assess, simulate, and predict the quality of groundwater intended for irrigation. These methodologies included the irrigation water quality indices (IWQIs); intricate statistical anal... ver más
Revista: Water

 
Renner de Assis Garcia Sobrinho, Franklin Piauhy Neto and Henrique Fernandes    
The use of technology, such as artificial intelligence (AI), in production processes has been optimizing several industrial realities. In civil construction, AI can be used in different applications, one of which is building inspection. One of the diffic... ver más
Revista: Buildings