A statistical approach for neural network pruning with application to internet of things
Abstract Pruning shows great potential for compressing and accelerating deep neural networks by eliminating redundant parameters. As more terminal chips for internet of things (IoT) devices are integrated with AI accelerators, structured pruning is gaining popularity in edge computing research. Different from filter pruning and gro