The method includes the steps of (a) performing an intra-convolution kernel pruning of the DNN model wherein a fixed number of weights are pruned in each convolution kernel of the DNN model to generate sparse convolution patterns; (b) performing inter-convolution kernel pruning of the DNN ...