# DDFS based on Modified Taylor Series Approximation with Piecewise Linearization

S. Nusrath Nasreen Fathima Department of ECE Thiagarajar College of Engineering Madurai, India

B. Raja

Department of ECE Thiagarajar College of Engineering Madurai, India

*Abstract* — The ultimate choice for signal generation is direct digital frequency synthesis (DDFS) which is gaining more popularity among modern communication systems. DDFS is capable of generating a digitally controllable and digitally tunable signal frequency. The existing conventional DDFS architectures requires large read only memory (ROM) and more hardware overhead, which ultimately increases the power consumption of the system. This paper proposes a DDFS architecture with smaller read only memory and minimum hardware overhead. It is based on the modification of the Taylor series approximation based on Piecewise Linearization approach. The proposed method has the advantage of high spectral purity, low power consumption and high speed. The proposed DDFS architecture is able to achieve a spectral purity of about 170 dBc.

Keywords— direct digital frequency synthesis; read only memory; Taylor series; Piecewise Linearization; spectral purity; DDFS architecture

### I. INTRODUCTION

Generating a digitally controllable and tunable waveform is of a great concern, because in many areas, the ability to generate waveform with high spectral purity, minimum noise level and low spurious signal content is needed. Also, the communication systems demand maximum frequency tuning resolution with minimum quantization error. The cost, chip area, power consumption and memory size are the deciding factors for preferring one design over the other.

The method of generating desired waveform from a fixed frequency reference signal is called frequency synthesizers. There are various possibilities of this method such as analog frequency synthesizers or digital frequency synthesizers. But the most convenient method which fits the constrain is the direct digital frequency synthesis. A Direct Digital Frequency Synthesizer (DDFS) produces an analog waveform by generating a time-varying signal in digital form and then performing a digital-to-analog (D/A) conversion. The most common analog waveform generated by this approach is the sine waveform, since according to Fourier synthesis, any waveform can be decomposed into sum of pure sinusoids. M.R. Satheesh Kumar Department of ECE Thiagarajar College of Engineering Madurai, India

Dr. K. Hariharan Department of ECE Thiagarajar College of Engineering Madurai, India

This digital approach of generating a sinusoidal signal has an advantage of fast switching between output frequencies, fine frequency resolution, fast settling time, low phase noise and spurious noise and operation over a broad spectrum of frequencies.

ROM based DDFS [3] uses Look-up Table (LUT) of sampled sine values of a complete cycle and accessing it depending on the output frequency required. But this method has several drawbacks including large chip area and high power consumption. Common methods employed to reduce the size of the ROM are quarter wave symmetry [9]. Algorithmic methods such as CORDIC can be used as a replacement for the ROM by calculating the phase to amplitude conversion in digital circuitry [6]. However, the additional digital circuits increase the chip area and power consumption [10].

To overcome the problems associated with look-up table and algorithm based systems, the sine and cosine functions are approximated using extended Taylor series approximation [14]. However, computation of the expansion polynomial requires considerable multiplier hardware which increases the complexity of the system. To reduce the complexity, a piecewise linearization method of approximation is proposed. The spectral purity of the system is greatly improved in addition to the reduction in hardware. However, the reduction of multiplication is at the cost of extra memory to store the polynomial coefficients, as well as the function values of the expansion points. Thus, the system has less hardware, high operating speed and memory is also reduced, thus reducing the power consumption of the system.

# II. PROPOSED DDFS ALGORITHM

The proposed DDFS design provides minimizing the size of the ROM lookup table by using an approximation algorithm with a judicious choice of polynomial [8]. The primary objective for using such an approach is that precision can be achieved with relatively less hardware overhead when compared to other techniques of sine wave generation. The resulting low-power architecture meets the specifications of low power communication standards such as Bluetooth [2]. The performance of the system depends on the size of the DDFS architecture.

# A. Trigonometric and Extended Taylor Series approximation using Piecewise Linearization Algorithm

### 1) Sine $(\gamma_s)$ and Cosine $(\gamma_c)$ Accumulator

In the Taylor series expansion, the Sine and Cosine functions can be implemented at a point  $\gamma$  (constant) given by neglecting higher order terms,

$$\sin(\theta) \sim \sin(\gamma) + (\theta - \gamma) * \cos(\gamma) \tag{1}$$
  

$$\cos(\theta) \sim \cos(\gamma) - (\theta - \gamma) * \sin(\gamma) \tag{2}$$

Another form of Taylor series expansion for sine and cosine function at a point  $\gamma$  is given by neglecting higher order terms,

By applying Equation (3) and (4) in Equation (1),

$$\sin(\theta) \approx \left(1 - (\gamma^2/2!)\right) * \theta \tag{5}$$

$$\gamma_{\rm s} = \sqrt{\left(2 * \left(1 - (\sin(\theta) / \theta)\right)\right)} \tag{6}$$

Similarly, by applying Equation (3) and (4) in Equation (2),

$$\cos(\theta) \approx \left(1 + (\gamma * \gamma/2)\right) - (\theta * \gamma) \tag{7}$$
$$\gamma_c^2 - 2(\theta * \gamma_c) - 2(\cos(\theta) - 1) = 0 \tag{8}$$

where,  $\theta$  ranges from 0° to 45°. The minimum of the two roots from equation (8) is used for the computation.

## 2) Linear Approximation

To reduce the complexity of sine and cosine accumulator, the equation approximated by means of a linear equation which can be constructed easily using an accumulator and a comparator.

The linear equation for sine accumulator is given by,

$$\gamma_s = (\gamma_{s,max} / \gamma_{s,sten}) * \theta$$
 (for  $\theta$  varies from 0 to  $\pi/4$ ) (9)

Where,  $\gamma_{s,max} = \gamma_s (\text{for } \theta = \pi/4)$  $\gamma_{s,step} = \gamma_{s,max} / (\pi/4 * 2^N)$ 

Similarly, the linear equation for cosine accumulator is given by,

$$\gamma_c = (\gamma_{c,max} / \gamma_{c,step}) * \theta$$
 (for  $\theta$  varies from 0 to  $\pi/4$ ) (10)

Where,  $\gamma_{c,max} = \gamma_s$  (for  $\theta = \pi/4$ )  $\gamma_{c,step} = \gamma_{c,max} / (\pi/4 * 2^N)$ 



Fig. 1. Deviation in theoretical  $\gamma$  values and linear increment of  $\gamma$  values for sine



Fig. 2. Deviation in theoretical  $\gamma$  values and linear increment of  $\gamma$  values for cosine

Fig. 1 shows the comparison between theoretical and the linear increment values of  $\gamma_s$ . It is inferred from the graph that the linear increment  $\gamma_s$  values are in line with the theoretical  $\gamma_s$  values. But, the variation between theoretical  $\gamma_c$  and linear increment  $\gamma_c$  is slightly larger due to the strong dependency of cosine function with  $\gamma$ . This can be observed from Fig. 2.

#### 3) Piecewise Linearization

The slight deviation between theoretical  $\gamma_c$  and linear increment  $\gamma_c$  is reduced by using piecewise linearization. Piecewise linearization of theoretical  $\gamma_c$  is done by dividing the region into several subintervals. In turns, each subinterval carries out its own linear approximation. The accuracy of the output signal depends on the degree of the reconstructing polynomial, the number of subsections, the word length of the truncated phase accumulator output, as well as the word length of the DDFS system output. However, this leads to an increase in the spectral purity of the generated signal, but with some hardware overhead. The purity of the signal varies in direct proportion to the number of linear segments. Fig. 3 shows piecewise linearization using four linear segments. It is observed that the spectral purity of the signal gets saturated to almost constant beyond four linear segments and hence four segment based piecewise linear approximation is proposed.

73

Let, points = 4 and i = 1,2,3,4

The piecewise linear equation is given by,

$$\gamma_{c,i} = ((\gamma_{end,i} - \gamma_{start,i}) / \gamma_{step,i}) * \theta$$
(11)  
(for  $\theta$  varies from 0 to  $\pi/4$ )

Where,  $\gamma_{max} = \gamma_c$  (for  $\theta = \pi/4$ )  $\gamma_{start,i} = (\gamma_{max} * 2^N) * ((i-1) / points)$   $\gamma_{step,i} = (\gamma_{max} * 2^N) / points$  $\gamma_{end,i} = (\gamma_{max} * 2^N) * (i / points)$ 



Fig. 3. Piecewise linearization of theoretical  $\gamma_{\rm c}$  values with four uniform segments

### III. PROPOSED DDFS ARCHITECTURE

Fig. 4 shows the block diagram of the proposed DDFS architecture. It consists of phase generator, quadrant selector, sine/cosine generator, digital multiplexers and a DAC in the output stage. The phase generator (PG) of the proposed DDFS receives an input as a frequency control word (FCW) from the system in which the proposed DDFS is integrated. The phase generator output is then fed to quadrant selector which selects the phase corresponding to the quadrant requirement. The output of the quadrant selector is fed to sine/cosine generator and any one of these output is selected and given to the input of the generators either from sine generator or cosine generator to complete a full cycle of sinusoidal as discussed. The full cycle is generated by exploiting the symmetry of sine wave with respect to zero crossings over one full period.





# *A. γ Accumulator sine and cosine based on piecewise linearization approach*

Fig. 5 shows the block diagram of the  $\gamma_s$  accumulator employed for generating  $\gamma s$  values. It consists of an adder, latch, comparator, subtractor and a multiplexer. Also the module comprises of two registers, one register to store the  $\gamma$  value ( $\gamma_s$  value corresponding to ( $\pi/4 \ *2^N$ )th clock cycle) and another register to store the  $\gamma$  step value (linear incremental step size of the  $\gamma$ ) which is a product of frequency control word (FCW) and  $\gamma_{max}/(\pi/4 \ *2^N)$ .

For each clock input, the adder with the latch generates an accumulated  $\gamma$  value with the step size of( $\gamma_{max}$ / ( $\pi$ /4 \*2N))\*FCW. This accumulated  $\gamma$  value is fed to a comparator and a subtractor.

The comparator compares accumulated  $\gamma$  value with the  $\gamma_{max}$  value for every instance of clock. When the accumulated  $\gamma$  value reaches to  $\gamma$ max, the latch is cleared to zero and  $\gamma$  accumulation is started from initial value.

The subtractor subtracts accumulated  $\gamma$  value with the  $\gamma_{max}$  value for every instance of clock. The subtracted value of the  $\gamma$  accumulator is fed to multiplexer and the accumulated  $\gamma$  value is also given to the multiplexer. One of the inputs of the multiplexer is selected by the LSB of the counter in quadrant selector and given to sine/cosine generator.

Thus,  $\gamma$  accumulator is employed to generate  $\gamma_s$  values between 0 to  $\pi/4$ . Similar hardware architecture is used to generate  $\gamma_c$  values. Fig.6 shows the block diagram of  $\gamma_c$ accumulator. The primary difference between the architectures employed to generate  $\gamma_s$  and  $\gamma_c$  values lies in the  $\gamma_{max}$  register content. The appropriate values to be stored in the  $\gamma_{max}$ registers are determined from the above equations.



Fig. 5. Block Diagram of sine y Accumulator



Fig. 6. Block Diagram of cosine y Accumulator

### IV. RESULTS AND DISCUSSIONS

### A. DDFS using LUT Based y Values

The computed  $\gamma$  values for sine ranging from 0.0003 to 0.4465 and cosine functions ranging from 0.0005 to 0.6091 for 12 bit resolution are stored in a LUT. For each increment of phase accumulator output, the corresponding  $\gamma$  values can be accessed from the ROM for computation. The performance of the DDFS using the LUT based  $\gamma$  values is nearly 230 dBc. Fig. 7 shows the spectrum obtained using these values. However, storing all  $\gamma$ s and  $\gamma$ c values in a lookup table greatly increases the hardware overhead and power consumption.

### B. DDFS using y Accumulator

Instead of using look up table for storing  $\gamma$  values, combinational circuits like adder and multiplier are used for the computation of  $\gamma$  values thereby reducing the computational time. Fig.5 shows the architecture of  $\gamma$  accumulator. A simple combinational circuitry can automatically generate the  $\gamma$  values if one knows the range of  $\gamma$  values that is the maximum and minimum values of  $\gamma$ , and the linear equation. Thus, using the linear increment  $\gamma$  values, the ROM size is drastically reduced.

 $\gamma$ s is varied from 0 to  $\gamma$ max i.e., 0.4465 in steps of  $\gamma$  max/( $\pi/4*2^{N}$ ). Similarly  $\gamma$ c is varied from 0 to  $\gamma$ max i.e., 0.6091 in steps of  $\gamma$  max/( $\pi/4*2^{N}$ ).Fig. 8 shows the spectrum for DDFS using  $\gamma$  accumulator. As it is seen from the graph, the SFDR performance of the DDFS using  $\gamma$ s and  $\gamma$ c accumulator over the range 0 -  $\pi/4$  is about 135.6 dBc as compared to LUT  $\gamma$  values set of 230 dBc.



Fig. 7. Spectrum of the proposed DDFS using LUT for y values



Fig.8. Spectrum of the proposed DDFS using  $\gamma$  accumulator

Fig. 9 shows the spectrum obtained using piecewise linearization with four linear segments. It is clear that the spectral purity of the signal is increased towards the theoretical  $\gamma$  value of 230 dBc. The SFDR value obtained with four uniform segments is about 169.4 dBc. The proposed method is compared with previous DDFS techniques as shown in Table I. The SFDR performance with various linear segments is shown in Table II.



Fig. 9. Spectrum of the proposed DDFS using piecewise linearization with four uniform segments

TABLE I. Comparison with existing DDFS techniques

| Ref.             | Bit<br>resolution | SFDR<br>(dBc) | SNR<br>(dBc) | Hardware<br>Overhead /<br>Comments                                        |
|------------------|-------------------|---------------|--------------|---------------------------------------------------------------------------|
| [2]              | 16                | 84.4          | 90.2         | Multiplier Less,<br>Pipelined Data Path                                   |
| [6]              | 16                | 90.3          | 92           | Polynomial<br>Interpolation<br>Methods                                    |
| [14]             | 14                | 90            | 75           | Two Segment Fourth<br>Order Parabolic<br>Approximation<br>Technique       |
| [15]             | 20                | 138           | 122          | Two squares, four<br>general multipliers<br>and three 24-bit<br>adders.   |
| Proposed<br>work | 12                | 169.4         | 69           | Modified Taylor<br>series approximation<br>and piecewise<br>linearization |

TABLE II. SFDR performance of the proposed method

| No. of uniform<br>Segments | SFDR<br>(dBc) |
|----------------------------|---------------|
| 3                          | 168.4         |
| 4                          | 169.4         |
| 5                          | 170.1         |
| 6                          | 170.3         |
| 7                          | 170.7         |
| 8                          | 170.8         |
| 9                          | 170.9         |
| 20                         | 171.3         |

### V. CONCLUSION

In this paper, a novel low complexity DDFS architecture using modified Taylor Series approximation based on piecewise linearization has been proposed. It can be noted that the measured SFDR is 169.4dBc using piecewise linearization (four linear segments) with only 12 bit resolution, as compared to SFDR of 90.3 dBc with 16- bit resolution attainable with Polynomial Interpolation methods. Also, the piecewise linearization maximizes the SFDR for a given number of uniform segmentation. Since, linear increment of  $\gamma$  is used, the requirement of ROM has been reduced to minimal, thus reducing the hardware requirement.

### ACKNOWLEDGMENT

The authors would like to express their deepest gratitude to K. Hariharan, Department of Electronics and Communication Engineering, for his continuous guidance, stupendous help, encouragement throughout this research.

#### REFERENCES

- D. De Caro, N. Petra, and A. Strollo, "Digital synthesizer/mixer with hybrid CORDIC-multiplier architecture: Error analysis and optimization," IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 56, no.2, pp. 364-373, Feb. 2009.
- [2] Tze-Yun Sung, Lyu-Ting Ko and Hsi-Chin Hsin, "Low-Power and High-SFDR Direct Digital Frequency Synthesizer Based on Hybrid CORDIC Algorithm", IEEE International Symposium on Circuits and Systems, pp. 249-252 (2009).
- [3] Ming-gang Gan, Jie Chen, Lan Cheng, Ting Yu, "The Design of Direct Digital Frequency Synthesis based on ROM Lookup Table", International Conference on Information Engineering and Computer Science, pp. 1-3,dec 2009.
- [4] A. Ashrafi, "On the SFDR upperbound in DDFS utilizing polynomial interpolation methods," IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 59, no.5, pp.307-311, May 2012.
- [5] Jianfeng Zhang, Hengzhu Liu, "A low-power and high-SFDR Direct Digital Frequency Synthesizer based on adaptive recoding CORDIC", IEEE 16<sup>th</sup> Annual, pp.1-3, 2015.
- [6] Ashkan Ashrafi, "Optimization of the Quantized Coefficients for DDFS Utilizing Polynomial Interpolation Methods", IEEE Transactions on Circuits and Systems II: Express Briefs, vol.61, Issue: 2, pp. 105-109, 2014.
- [7] Ru Xin, Xiao-tang Zhang, Han Li, Qin Wang and Zhang-cai Li, "An Area Optimized Direct Digital Frequency Synthesizer Based on Improved Hybrid CORDIC Algorithm", IEEE Proceedings of International Workshop on Signal Design and its Applications in communications, pp. 243-246 (2007).
- [8] Salah Hasan Alkurwy, Sawal Hamid Md Ali, Shabiul Islam, "Implementation of low power compressed ROM for direct digital frequency synthesizer", IEEE International Conference, pp.309-312, 2014.
- [9] Davor Petrinovic, Marko Brezovi, "Spline-Based High-Accuracy Piecewise-Polynomial Phase-to-Sinusoid Amplitude Converters", IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control, vol. 58, pp. 711-729, Issue 4, 2011.
- [10] Chunxiang Li, Jinhua Li, Peng Dong, "Spline Interpolation based FFT simulation Algorithm", Fifth International Joint Conference, pp., 1737-1742, 2009.
- [11] A. Ashrafi, R. Adhami, and A. Milenkovic, "A direct digital frequency synthesizer based on the quasi-linear interpolation method," IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 57, no.2, pp.t. I, Reg. Papers, vol. 57, no.2, pp. 863-872, Apr.2010.
- [12] S.S Jeng, "A novel ROM compression architecture for DDFS utilizing the parabolic approximation of equi-section division", IEEE Trans. Ultrason., Feroelectr., Freq. Control, vol. 59, no. 12, pp. 2603-2612, Dec.2012.
- [13] Xiaojin Li, Linhui Lai, Ao Lei and Zongsheng Lai, "A Direct Digital Frequency Synthesizer Based on Two Segment Fourth-Order Parabolic Approximation", IEEE Transactions on Consumer Electronics, Vol. 55, No. 2 May 2009."
- [14] Davide De Caro, Nicola Petra, Antonio Giuseppe Maria Strollo, "Direct digital frequency synthesizer using non-uniform piecewiselinear approximation", IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 58, pp. 2409 – 2419, Issue:10, 2011.