GET THE APP

..

International Journal of Sensor Networks and Data Communications

ISSN: 2090-4886

Open Access

Zhengang Li

Department of Electrical and Computer Engineering, Northeastern University, Boston, United States

Publications
  • Review Article   
    A Compiler-Aware Framework of Network Pruning and Architecture Search for Mobile Acceleration
    Author(s): Zhengang Li*

    With the increasing demand to efficiently deploy DNNs on mobile edge devices, it becomes much more important to reduce unnecessary computation and increase the execution speed. Prior methods towards this goal, including model compression and network architecture search (NAS), are largely performed independently and do not fully consider compiler-level optimization which is a must-do for mobile acceleration. In this work, we propose NPAS, a compiler-aware unified network pruning and architecture search and the corresponding comprehensive compiler optimizations supporting different DNNs and different pruning schemes, which bridge the gap of weight pruning and NAS. Our framework achieves 6.7 ms, 5.9 ms, and 3.9 ms ImageNet inference times with 78%, 75% (MobileNet-V3 level), and 71% (MobileNet-V2 level) Top-1 accuracy respectively on an off-the-shelf mobile phone, consistently outperformi.. Read More»
    DOI: 10.37421/2090-4886.2021.10.138

    Abstract HTML PDF

Google Scholar citation report
Citations: 343

International Journal of Sensor Networks and Data Communications received 343 citations as per Google Scholar report

International Journal of Sensor Networks and Data Communications peer review process verified at publons

Indexed In

 
arrow_upward arrow_upward