Title | Optimization based layer-wise magnitude-based pruning for DNN compression |
Authors | |
Publication date | 2018
|
ISSN | 1045-0823
|
Proceedings title | |
Volume | 2018-July
|
Pages | 2383-2389
|
Conference location | Stockholm, Sweden
|
Publisher | |
Abstract | Layer-wise magnitude-based pruning (LMP) is a very popular method for deep neural network (DNN) compression. However, tuning the layer-specific thresholds is a difficult task, since the space of threshold candidates is exponentially large and the evaluation is very expensive. Previous methods are mainly tuned by hand and require expertise. In this paper, we propose an automatic tuning approach based on optimization, named OLMP. The idea is to transform the threshold tuning problem into a constrained optimization problem (i.e., minimizing the size of the pruned model subject to a constraint on the accuracy loss), and then use powerful derivative-free optimization algorithms to solve it. To compress a trained DNN, OLMP is conducted within a new iterative pruning and adjusting pipeline. Empirical results show that OLMP can achieve the best pruning ratio on LeNet-style models (i.e., 114 times for LeNet-300-100 and 298 times for LeNet-5) compared with some state-of-the-art DNN pruning methods, and can reduce the size of an AlexNet-style network up to 82 times without accuracy loss. © 2018 International Joint Conferences on Artificial Intelligence. All rights reserved. |
University attribution | Other
|
Indexed by | |
Funding | [2017YFB1003102]
; [ZDSYS201703031748284]
; National Natural Science Foundation of China[61672478]
; National Natural Science Foundation of China[61603367]
; [2016QNRC001]
|
EI accession number | 20184406016535
|
EI subject terms | Artificial intelligence
; Arts computing
; Constrained optimization
; Iterative methods
|
EI classification codes | Data Processing and Image Processing:723.2
; Artificial Intelligence:723.4
; Numerical Methods:921.6
; Systems Science:961
|
Source database | EV Compendex
|
Output type | Conference paper |
Item identifier | http://sustech.caswiz.com/handle/2SGJ60CL/50983 |
Collection | College of Engineering_Department of Computer Science and Engineering |
Author affiliations | 1.Anhui Province Key Lab of Big Data Analysis and Application, University of Science and Technology of China, Hefei; 230027, China 2.CERCIA, School of Computer Science, University of Birmingham, Birmingham; B15 2TT, United Kingdom 3.Shenzhen Key Lab of Computational Intelligence, Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen; 518055, China |
Recommended citation (GB/T 7714) |
Li, Guiying,Qian, Chao,Jiang, Chunhui,et al. Optimization based layer-wise magnitude-based pruning for DNN compression[C]:International Joint Conferences on Artificial Intelligence,2018:2383-2389.
|
Files in this item | No files are associated with this item. |
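
The abstract frames threshold tuning as a constrained optimization problem: minimize the size of the pruned model subject to a bound on the accuracy loss. The following is a minimal sketch of that framing only, not the authors' OLMP implementation. It assumes each layer is a NumPy weight array and that the caller supplies an evaluation function; `accuracy_fn`, `max_loss`, and every function name here are hypothetical. Plain random search stands in for the derivative-free optimizer, since the abstract does not say which algorithm is used.

```python
import numpy as np


def prune_layer(weights, threshold):
    """Zero every weight whose magnitude is below the layer's threshold."""
    mask = np.abs(weights) >= threshold
    return weights * mask


def prune_model(layers, thresholds):
    """Apply one threshold per layer (layer-wise magnitude-based pruning)."""
    return [prune_layer(w, t) for w, t in zip(layers, thresholds)]


def nonzero_count(layers):
    """Model size, measured as the number of remaining nonzero weights."""
    return sum(int(np.count_nonzero(w)) for w in layers)


def random_search(layers, accuracy_fn, max_loss, n_trials=200, seed=0):
    """Search for thresholds that minimize size while keeping the
    accuracy drop within max_loss. Random search is an assumed
    stand-in for a derivative-free optimizer."""
    rng = np.random.default_rng(seed)
    base_acc = accuracy_fn(layers)
    # Start from the unpruned model: threshold 0 keeps every weight.
    best_thresholds = np.zeros(len(layers))
    best_size = nonzero_count(layers)
    # Thresholds above the largest magnitude would prune a whole layer.
    upper = np.array([np.abs(w).max() for w in layers])
    for _ in range(n_trials):
        candidate = rng.uniform(0.0, upper)
        pruned = prune_model(layers, candidate)
        if base_acc - accuracy_fn(pruned) > max_loss:
            continue  # violates the accuracy-loss constraint
        size = nonzero_count(pruned)
        if size < best_size:
            best_size, best_thresholds = size, candidate
    return best_thresholds, best_size
```

Each trial calls `accuracy_fn` on the pruned model, which is the expensive evaluation the abstract refers to. The iterative pruning and adjusting pipeline mentioned in the abstract is not modeled in this sketch.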
|