南方科技大学知识苑(SUSTech KC): FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations

题名	FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations
作者	Tao, Chaofan 1; Lin, Rui 1; Chen, Quan2 ; Zhang, Zhaoyang 3; Luo, Ping 4; Wong, Ngai 1
通讯作者	Tao, Chaofan
发表日期	2024-02
DOI	10.1109/TNNLS.2022.3190607
发表期刊	IEEE Transactions on Neural Networks and Learning Systems 影响因子和分区
ISSN	2162-237X
EISSN	2162-2388
卷号	PP 期号:99页码:1-15
摘要	Learning low-bitwidth convolutional neural networks (CNNs) is challenging because performance may drop significantly after quantization. Prior arts often quantize the network weights by carefully tuning hyperparameters such as nonuniform stepsize and layerwise bitwidths, which are complicated since the full-and low-precision representations have large discrepancies. This work presents a novel quantization pipeline, named frequency-aware transformation (FAT), that features important benefits: 1) instead of designing complicated quantizers, FAT learns to transform network weights in the frequency domain to remove redundant information before quantization, making them amenable to training in low bitwidth with simple quantizers; 2) FAT readily embeds CNNs in low bitwidths using standard quantizers without tedious hyperparameter tuning and theoretical analyses show that FAT minimizes the quantization errors in both uniform and nonuniform quantizations; and 3) FAT can be easily plugged into various CNN architectures. Using FAT with a simple uniform/logarithmic quantizer can achieve the state-of-the-art performance in different bitwidths on various model architectures. Consequently, FAT serves to provide a novel frequency-based perspective for model quantization.
关键词	Quantization (Signal) Fats Transforms Training Adaptation Models Standards Frequency-domain Analysis Efficient Neural Network Model Compression Quantization Representation Learning
相关链接	[来源记录]
收录类别	SCI ; EI
语种	英语
学校署名	其他
资助项目	General Research Fund (GRF)[
WOS研究方向	Computer Science ; Engineering
WOS类目	Computer Science, Artificial Intelligence ; Computer Science, Hardware & Architecture ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic
WOS记录号	WOS:000833054400001
出版者	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
来源库	Web of Science
全文链接	https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9837828
引用统计	被引频次[WOS]：4
成果类型	期刊论文
条目标识符	http://sustech.caswiz.com/handle/2SGJ60CL/364975
专题	工学院_深港微电子学院
作者单位	1.Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Peoples R China 2.Southern Univ Sci & Technol, Sch Microelect, Shenzhen 518055, Peoples R China 3.Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China 4.Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
推荐引用方式 GB/T 7714	Tao, Chaofan,Lin, Rui,Chen, Quan,et al. FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations[J]. IEEE Transactions on Neural Networks and Learning Systems,2024,PP(99):1-15.
APA	Tao, Chaofan,Lin, Rui,Chen, Quan,Zhang, Zhaoyang,Luo, Ping,&Wong, Ngai.(2024).FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations.IEEE Transactions on Neural Networks and Learning Systems,PP(99),1-15.
MLA	Tao, Chaofan,et al."FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations".IEEE Transactions on Neural Networks and Learning Systems PP.99(2024):1-15.