
Design and Implementation of a High-Throughput Dual-Mode Floating-Point Reconfigurable FFT Processor


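Xing WEI1, 2,
Zhihong HUANG1,
Haigang YANG1, 2
1. Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China
2. University of Chinese Academy of Sciences, Beijing 100190, China
Funding: National Natural Science Foundation of China (61704173, 61474120), Major Program of Beijing Science and Technology (Z171100000117019)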

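Details
About the authors: Xing WEI: male, born in 1991, Ph.D.; research interests include hardware acceleration of algorithms and reconfigurable computing chip architecture design
Zhihong HUANG: male, born in 1984, assistant researcher; research interests include programmable logic architecture design and the development of novel convolutional neural network chip architectures
Haigang YANG: male, born in 1960, researcher; research interests include mixed-signal integrated circuit design and very-large-scale integrated circuit design
Corresponding author: Haigang YANG, yanghg@mail.ie.ac.cn
Chinese Library Classification number: TN47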

Publication history

Received: 2018-02-08
Revised: 2018-07-05
Published online: 2018-07-24
Issue date: 2018-12-01

High Throughput Dual-mode Reconfigurable Floating-point FFT Processor

Xing WEI1, 2,
Zhihong HUANG1,
Haigang YANG1, 2
1. Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China
2. University of Chinese Academy of Sciences, Beijing 100190, China
Funds: National Natural Science Foundation of China (61704173, 61474120), Major Program of Beijing Science and Technology (Z171100000117019)


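Abstract
Abstract: A high-throughput, flexibly reconfigurable floating-point Fast Fourier Transform (FFT) processor can meet the needs of applications such as real-time imaging in advanced radar systems and high-precision scientific computing. Compared with fixed-point FFT, floating-point arithmetic is more complex, which sharpens the conflict between the throughput of a floating-point FFT and its implementation area and power consumption. To reduce computational complexity, large-point FFTs are first decomposed into several cascaded small-point radix-2^k sub-stages, and optimized mixed-radix algorithms are proposed for 128/256/512/1024/2048-point FFTs. In addition, building on proposed novel fused add-subtract and dot-product units that support both a single-channel single-precision mode and a dual-channel half-precision mode, a high-throughput dual-mode floating-point variable-length FFT processor architecture is proposed for the first time, and is designed and implemented in a 28 nm standard CMOS process. Experimental results show that the throughput and average output Signal-to-Quantization-Noise Ratio (SQNR) are 3.478 GSample/s and 135 dB in single-channel single-precision mode, and 6.957 GSample/s and 60 dB in dual-channel half-precision mode. The normalized throughput-to-area ratio is about 12 times higher than that of other existing floating-point FFT implementations.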
Keywords: Fast Fourier Transform (FFT)/
Dual-mode floating point/
Mixed radix/
Fused arithmetic unit
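
The decomposition of a long FFT into cascaded shorter stages can be illustrated with a generic Cooley-Tukey mixed-radix sketch. This is not the paper's optimized radix-2^k algorithm or its hardware datapath; the function names and the stage plan `(8, 8, 2)` for 128 points are assumptions chosen only for illustration, since the abstract does not give the actual stage partition.

```python
import cmath


def dft(x):
    """Direct O(N^2) DFT, used as the last stage and as a reference."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * t * k / n) for t in range(n))
        for k in range(n)
    ]


def mixed_radix_fft(x, radices):
    """Decimation-in-time FFT with one radix per cascaded stage.

    `radices` is a hypothetical stage plan whose product must equal len(x),
    e.g. (8, 8, 2) for a 128-point transform.
    """
    n = len(x)
    r = radices[0]
    if len(radices) == 1:
        if r != n:
            raise ValueError("product of radices must equal len(x)")
        return dft(x)
    if n % r != 0:
        raise ValueError("radix does not divide the transform length")
    m = n // r
    # Split into r interleaved sub-sequences of length m and transform each
    # with the remaining stages.
    subs = [mixed_radix_fft(x[j::r], radices[1:]) for j in range(r)]
    # Recombine: X[k] = sum_j W_N^(j*k) * S_j[k mod m]
    return [
        sum(cmath.exp(-2j * cmath.pi * j * k / n) * subs[j][k % m]
            for j in range(r))
        for k in range(n)
    ]


# Compare the staged transform against the direct DFT on a 128-point input.
signal = [complex(i % 7, -(i % 3)) for i in range(128)]
reference = dft(signal)
staged = mixed_radix_fft(signal, (8, 8, 2))
max_error = max(abs(a - b) for a, b in zip(reference, staged))
```

Changing the stage plan, for example to `(4, 4, 4, 2)`, gives the same result with a different split of work between stages, which is the degree of freedom a mixed-radix design explores.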
Abstract: High-throughput, reconfigurable Floating-Point (FP) FFT accelerators are important for advanced applications such as real-time radar imaging and high-precision scientific computing. Because FP operations are more complex than their fixed-point counterparts, achieving high FP FFT throughput at low area and power cost is a greater challenge. To address these issues, a series of mixed-radix algorithms for 128/256/512/1024/2048-point FFT is proposed, which decompose a long FFT into short FFTs built from cascaded radix-2^k stages so that the complexity of the multiplications is significantly reduced. In addition, two novel fused FP add-subtract and dot-product units with dual-mode functionality are proposed, which can operate either on one pair of single-precision operands or on two pairs of half-precision operands in parallel. On this basis, a high-throughput dual-mode floating-point variable-length FFT processor is designed and implemented in SMIC 28 nm CMOS technology. Simulation results show that the throughput and Signal-to-Quantization-Noise Ratio (SQNR) are 3.478 GSample/s and 135 dB in single-channel single-precision mode, and 6.957 GSample/s and 60 dB in dual-channel half-precision mode. Compared with other FP FFT implementations, the processor improves the normalized throughput-area ratio by about 12 times.
Key words: Fast Fourier Transform (FFT)/
Dual-mode floating point/
Mixed-radix/
Fused arithmetic unit
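
The SQNR figures quoted for the two precision modes can be put in context with a small sketch of how such a ratio is computed. It only rounds a test signal to IEEE 754 half and single precision and does not model the processor's datapath, so it does not reproduce the 135 dB and 60 dB results; the helper names and the test signal are assumptions made for illustration.

```python
import math
import struct


def round_to(fmt, value):
    """Round a Python float to half ('<e') or single ('<f') precision."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


def sqnr_db(reference, quantized):
    """Signal-to-quantization-noise ratio in dB: 10*log10(P_signal / P_noise)."""
    signal_power = sum(r * r for r in reference)
    noise_power = sum((r - q) ** 2 for r, q in zip(reference, quantized))
    if noise_power == 0.0:
        return math.inf
    return 10.0 * math.log10(signal_power / noise_power)


# Hypothetical two-tone test signal, 64 samples, double-precision reference.
samples = [
    math.sin(2 * math.pi * 3 * t / 64) + 0.5 * math.cos(2 * math.pi * 5 * t / 64)
    for t in range(64)
]
half = [round_to("<e", s) for s in samples]
single = [round_to("<f", s) for s in samples]

sqnr_half = sqnr_db(samples, half)
sqnr_single = sqnr_db(samples, single)
```

Half precision keeps fewer significand bits than single precision, so `sqnr_half` comes out lower than `sqnr_single`, which matches the direction of the accuracy trade-off reported for the dual-channel mode.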



PDF full-text download:

https://jeit.ac.cn/article/exportPdf?id=62118d14-0ce1-468d-901c-2a8d75cbc3a6