Home » Publication » 28581

Dettaglio pubblicazione

2024, SOFTWARE IMPACTS, Pages - (volume: 20)

OptDNN: Automatic deep neural networks optimizer for edge computing (01a Articolo in rivista)

Giovannesi Luca, Mattia Gabriele Proietti, Beraldi Roberto

DNNs are widely used for complex tasks like image and signal processing, and they are in increasing demand for implementation on Internet of Things (IoT) devices. For these devices, optimizing DNN models is a necessary task. Generally, standard optimization approaches require specialists to manually fine-tune hyper-parameters to find a good trade-off between efficiency and accuracy. In this paper, we propose OptDNN, a software that employs innovative and automatic approaches to determine optimal hyper-parameters for pruning, clustering, and quantization. The models optimized by OptDNN have a smaller memory footprint, faster inference time, and a similar accuracy to the original models.
keywords
© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma