Efficient CNN Inference Acceleration on FPGAs: Pruning-Driven Approach

Defense type
Thesis
Starting date
End date
Location
IRISA Rennes
Room
Salle Métivier
Speaker
Léo PRADELS (Equipe TARAN)
Main department
Theme

CNN-based deep learning models provide state-of-the-art performance in image and video processing tasks, particularly for image enhancement or classification. However, these models are computationally and memory-intensive, making them unsuitable for real-time constraints on embedded FPGA systems. As a result, compressing these CNNs and designing accelerator architectures for inference that integrate compression in a hardware-software co-design approach is essential. While software optimizations like pruning have been proposed, they often lack the structured approach needed for effective accelerator integration.
To address these limitations, this thesis focuses on accelerating CNNs on FPGAs while complying with real-time constraints on embedded systems. This is achieved through several key contributions. First, it introduces pattern pruning, which imposes structure on network sparsity, enabling efficient hardware acceleration with minimal accuracy loss due to compression. Second, a scalable accelerator for CNN inference is presented, which adapts its architecture based on input performance criteria, FPGA specifications, and target CNN model architecture. An efficient method for integrating pattern pruning within the accelerator and a complete flow for CNN acceleration are proposed. Finally, improvements in network compression are explored through Shift&Add quantization, which modifies FPGA computation methods while maintaining baseline network accuracy.

Composition of the jury
Reviewers:
Alberto BOSIO Professeur des Universités, École Centrale de Lyon
Martin KUMM Full Professor, Université de Sciences Appliquées de Fulda, Allemagne

Jury:
Lilia ZAOURAR Ingénieur Experte, CEA Saclay
Frédéric PETROT Professeur des Universités, Grenoble INP/Ensimag

Supervisors:
Silviu Ioan FILIP Chargé de Recherche à INRIA
Olivier SENTIEYS Professeur des Universités, Université de Rennes
Thibaut LE CALLOCH Ingénieur Expert Safran
Daniel CHILLET Professeur des Universités, Université de Rennes