Education, Science, Technology, Innovation and Life
Open Access
Sign In

A Lookup Table of Mixed-radix FFT Implementation and Optimization on Sunway Embedded Platform

Download as PDF

DOI: 10.23977/CNCI2020046

Author(s)

Youcai Luo, Hongsheng Wang and Lin Han

Corresponding Author

Youcai Luo

ABSTRACT

The Fast Fourier Transform is one of the most fundamental algorithms used in digital signal processing. It is of great significance to implement an efficient Fast Fourier Transform algorithm on Sunway embedded platform. In this paper, based on the radix 2-radix 4 mixed-radix algorithm implemented in DSPLIB Library of Texas Instruments, it takes the lookup table to replace the reversal computation function, which improves the performance by 1.69 times. In addition, the algorithm is manually optimized for the Sunway embedded processor by using loop unrolling optimization, SIMD vectorization, and data prefetch optimization with the studying of Sunway processor chip architecture. Comparing the initial state, the optimal average performance of the algorithm has been improved by about three times on Sunway.

KEYWORDS

Sunway processor; fast fourier transform; lookup table; loop unrolling; vectorization; data prefetching

All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.