Inicio  /  IoT  /  Vol: 2 Par: 2 (2021)  /  Artículo
ARTÍCULO
TITULO

ThriftyNets: Convolutional Neural Networks with Tiny Parameter Budget

Guillaume Coiffier    
Ghouthi Boukli Hacene and Vincent Gripon    

Resumen

Deep Neural Networks are state-of-the-art in a large number of challenges in machine learning. However, to reach the best performance they require a huge pool of parameters. Indeed, typical deep convolutional architectures present an increasing number of feature maps as we go deeper in the network, whereas spatial resolution of inputs is decreased through downsampling operations. This means that most of the parameters lay in the final layers, while a large portion of the computations are performed by a small fraction of the total parameters in the first layers. In an effort to use every parameter of a network at its maximum, we propose a new convolutional neural network architecture, called ThriftyNet. In ThriftyNet, only one convolutional layer is defined and used recursively, leading to a maximal parameter factorization. In complement, normalization, non-linearities, downsamplings and shortcut ensure sufficient expressivity of the model. ThriftyNet achieves competitive performance on a tiny parameters budget, exceeding 91% accuracy on CIFAR-10 with less than 40 k parameters in total, 74.3% on CIFAR-100 with less than 600 k parameters, and 67.1% On ImageNet ILSVRC 2012 with no more than 4.15 M parameters. However, the proposed method typically requires more computations than existing counterparts.

 Artículos similares

       
 
Yuting Chen, Pengjun Zhao, Yi Lin, Yushi Sun, Rui Chen, Ling Yu and Yu Liu    
Precise identification of spatial unit functional features in the city is a pre-condition for urban planning and policy-making. However, inferring unknown attributes of urban spatial units from data mining of spatial interaction remains a challenge in ge... ver más

 
Ching-Lung Fan    
The emergence of deep learning-based classification methods has led to considerable advancements and remarkable performance in image recognition. This study introduces the Multiscale Feature Convolutional Neural Network (MSFCNN) for the extraction of com... ver más

 
Jui-Fa Chen, Yu-Ting Liao and Po-Chun Wang    
Climate change has exacerbated severe rainfall events, leading to rapid and unpredictable fluctuations in river water levels. This environment necessitates the development of real-time, automated systems for water level detection. Due to degradation, tra... ver más
Revista: Water

 
Jiahui Zhao, Zhibin Li, Pan Liu, Mingye Zhang     Pág. 115 - 142
Demand prediction plays a critical role in traffic research. The key challenge of traffic demand prediction lies in modeling the complex spatial dependencies and temporal dynamics. However, there is no mature and widely accepted concept to support the so... ver más

 
Joaquim Miguel, Pedro Mendonça, Agnelo Quelhas, João M. L. P. Caldeira and Vasco N. G. J. Soares    
Hiking and cycling have become popular activities for promoting well-being and physical activity. Portugal has been investing in hiking and cycling trail infrastructures to boost sustainable tourism. However, the lack of reliable data on the use of these... ver más
Revista: Future Internet