



Available online at www.sciencedirect.com

CrossMark Journal of Applied Research and Technology



www.jart.ccadet.unam.mx

Journal of Applied Research and Technology 15 (2017) 211-222 Original

Digital counter cell design using carbon nanotube FETs

Mehdi Bagherizadeh<sup>a,\*</sup>, Mohammad Hossein Moaiyeri<sup>b</sup>, Mohammad Eshghi<sup>b</sup>

<sup>a</sup> Department of Computer Engineering, Rafsanjan Branch, Islamic Azad University, Rafsanjan, Iran <sup>b</sup> Faculty of Electrical Engineering, Shahid Beheshti University, G.C., Tehran, Iran

> Received 24 July 2015; accepted 9 December 2016 Available online 22 April 2017

## Abstract

Compressor and counter cells are the basic blocks used to accumulate the partial products in a multiplication process. In this paper, novel high speed and low power carbon nanotube counter cells are suggested. The efficiency of circuits is improved by using carbon nanotube field-effect transistors (CNFETs). The proposed designs are 4-to-3, 5-to-3, 6-to-3, and 7-to-3 counters. Using HSPICE, these proposed designs are simulated at different conditions. Simulation results confirm that the proposed designs are the fastest counters with lowest PDP in different working circumstance. © 2017 Universidad Nacional Autónoma de México, Centro de Ciencias Aplicadas y Desarrollo Tecnológico. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Keywords: Compressor; Counter cell; Carbon nanotube transistor; Multiplier

## 1. Introduction

Among the most basic blocks in computer arithmetic are multipliers, which are commonly used in different digital signal processors. In different applications of computing systems there is growing demands for high-speed multipliers, such as computer graphics, image processing, scientific calculation, and so on (Azarderakhsh & Reyhani-Masoleh, 2013; Bagherizadeh, Gerami, & Eshghi, 2014; Rodríguez-Reséndiz, Gutiérrez-Villalobos, Duarte-Correa, Mendiola-Santibañez, & Santillán-Méndez, 2012). The speed of a multiplier specifies how fast a processor will run. Designers are now more focused on low power consumption and high speed processors. A multiplication operation conventionally involves three operational stages: generation of partial products, reduction of partial products to two additive operands, and finally, carry propagation addition. The partial product reduction stage is respectful for a main part of the total multiplication delay, power and area (Waters & Swartzlander, 2010). Hence, in order to acquire partial products, compressor and counter cells usually implement this stage because they contribute to the reduction of the par-

\* Corresponding author.

E-mail address: m.bagherizadeh@srbiau.ac.ir (M. Bagherizadeh).

tial products and the critical path (Jaberipur & Kaivani, 2009). Counters are much faster than conventional adders because they could act without carry signal along their digital stages. Counters are the basic blocks, which are used to acquire the partial products in the multiplication process (Deng & Chen, 2013). Thus, improving the power performance of these structures could lead to remarkable saving of the power used by the entire multiplier. Because of the miniaturization of the transistors, it is possible to increase the number used in each circuit. This nano level reduction will cause some acute problems such as short channel effects, remarkable gate control degradation and high leakage power consumption. A solution to these problems is using carbon nanotube field effect transistors (CNFETs) (Bagherizadeh & Eshghi, 2011a, 2011b; Perri & Corsonello, 2012). One of the most considerable properties of this type of transistor is the ability of having optional threshold voltage by changing the diameters of the nanotubes. This property makes CNFETs suitable for designing circuits with multiple threshold voltage ranges (Bagherizadeh & Eshghi, 2011a, 2011b).

In this paper, four novel high speed and low PDP CNFETbased counters are proposed. These counters are capable of adding four, five, six or seven bits per decade, and generate 3 outputs. CNFETs with different threshold voltages are used to design our proposed counters.

In the remainder of this paper, in Section 2, a brief review of the CNFET technology is presented. In the next section related

http://dx.doi.org/10.1016/j.jart.2016.12.005

Peer Review under the responsibility of Universidad Nacional Autónoma de México.

<sup>1665-6423/© 2017</sup> Universidad Nacional Autónoma de México, Centro de Ciencias Aplicadas y Desarrollo Tecnológico. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

works are discussed. The proposed CNFET-based counter cells are presented in Section 4. Experimental results, analyses and comparisons are presented in Section 5, and finally, Section 6 concludes the paper.

## 2. Carbon nanotube field effect transistors (CNFETs)

A carbon nanotube (CNT) is an allotrope of carbon with a tubal nanostructure. Transistors, which have carbon nanotubes as their channel, are called carbon nanotube field-effect transistors (Mehrabani & Eshghi, 2015). CNTs have particular properties that make them promising to be used in the field of integrated circuits. The most important and remarkable feature of CNFETs is their spectacular capability in current driving or current carrying, and experiments have shown that CNFETs are the best for this aim (Bagherizadeh & Eshghi, 2011a, 2011b; Lin, Kim, & Lombardi, 2009). Without any extra power overhead, CNFETs could operate five times faster than CMOS in the best case. Another fundamental feature of CNTFETs is their varied behavior in manipulating the threshold voltage (Vth) by adopting an appropriate diameter for CNTs (Bagherizadeh & Eshghi, 2011a, 2011b).

Multi-walled nanotubes (MWNT) consist of multiple rolled layers (concentric tubes) of graphite. A single-wall CNT consists of a tube-shaped wall which is made of graphite with the diameter of 1–2 nm. A multi-walled CNT has a thinner wall. The walls of the tubes are 34 nm each. The outer wall diameter of the multi-walled CNT is 2–25 nm (Mehrabi, Mirzaee, Moaiyeri, Navi, & Hashemipour, 2013).

A CNT is described by its chiral vector. The chirality vector is defined by an ordered pair  $(n_1, n_2)$  that characterize many electrical and physical properties of the carbon nanotube (Bagherizadeh & Eshghi, 2011a, 2011b; Mehrabi, Mirzaee, Zamanzadeh, Navi, & Hashemipour, 2013; Mehrabi, Navi, &

Hashemipour, 2013; Mehrabi, Mirzaee, Moaiyeri, et al., 2013). It is calculated using Eq. (1):

$$C_h = n_1 \hat{a}_1 + n_2 \hat{a}_2 \tag{1}$$

where  $[\hat{a}_1, \hat{a}_2]$  are the lattice unit vectors and  $n_1, n_2$  are positive integers which specify the tube's structure. The single-walled carbon nanotubes (SWCNT) could be a metal or a semiconductor, if  $n_1 - n_2 \neq 3k(k \in \mathbb{Z})$ , then the SWCNT is a semiconductor; otherwise it is a metal.

The diameter of the CNT is calculated using Eq. (2) (Alkaldy, Navi, & Sharifi, 2014; Bagherizadeh & Eshghi, 2011a, 2011b; Mehrabani & Eshghi, 2015).

$$D_{CNT} = 0.0783 \sqrt{n_1^2 + n_2^2 + n_1 n_2}$$
(2)

The threshold voltage of a CNT channel is approximated as the inverse function of diameter and could be computed as Eq. (3). A special threshold voltage of a CNFET could be obtained by a suitable diameter (Alkaldy et al., 2014).

$$V_{th} \approx \frac{E_g}{2e} = \frac{\sqrt{3}}{3} \frac{aV_{\pi}}{eD_{CNT}} \approx \frac{0.43}{D_{CNT}}$$
(3)

where parameter  $a \ (\approx 0.249 \text{ nm})$  is the carbon-to-carbon atom distance,  $V_{\pi} (\cong 3.033 \text{ eV})$  is the carbon  $\pi - \pi$  bond energy in the tight bonding model, and e is the unit electron charge,  $E_g$  is the band gap, and  $D_{CNT}$  is the CNT diameter.

There are three kinds of CNFET: The first kind is the MOSFET-like CNFET (which contains the p-type CNFET and n-type CNFET), which operates in a unipolar mode; the second kind is the Schottky barrier (SB) CNFET, which shows ambipolar characteristics, but because of the SB element, it is not appropriate for high performance applications; the third kind of CNFET, suitable for low power applications, is the band-toband tunneling CNFET that has obviously low current in its active mode (Alkaldy et al., 2014). Fig. 1 shows three kinds of



Fig. 1. (a) SB-CNFET, (b) T-CNFET, and (c) MOSFET-like CNFET.

CNFETs. A CNFET has relatively low off-current while the transistor is inactive; in other words, when the CNFET is off, leakage power consumption is low. CNFETs have electrical specifications and structure similar to MOSFETs, which facilitates the exploitation and reuse of previous MOSFET-based architectures and manufacturing processes (Bagherizadeh & Eshghi, 2011a, 2011b).

# 3. Compresssor and counter cells in the literarure

In this section, some of the compressor and counter cells which are presented in the literature are discussed.

#### 3.1. Compressor cells

For different compressor cells, different designs of high performance and low power have been presented. In this section, some of these compressors are reviewed.

A 3-to-2 compressor applies three inputs, *X1*, *X2*, and *X3* and produces two outputs, *SUM* and *CARRY*. This cell could also be considered as a full adder cell when the third input is the carry input from the previous compressor block, or X3 = Cin. In the literature, there are different implementations of a full adder optimized for one or more figures of merits, delay, power dissipation, and power-delay product (PDP) (Bagherizadeh & Eshghi, 2011a, 2011b; Mehrabi, Mirzaee, Zamanzadeh, et al., 2013; Mehrabi, Navi, et al., 2013; Mehrabi, Mirzaee, Moaiyeri, et al., 2013).



Fig. 2. (a) 4-to-3, (b) 5-to-3, (c) 6-to-3, and (d) 7-to-3 counter.

A 4-to-2 compressor compresses five partial products bits into three. This compressor has four inputs X1, X2, X3 and X4 and two outputs, SUM and CARRY along with a CARRY-IN (CIN) and a CARRY-OUT (COUT). The CIN input is the output from the previous lower significant compressor. The COUT is the output to the compressor in the next significant stage. Using two 3-to-2 compressors in series, a 4-to-2 compressor could be set up as well that involves a critical path delay of four XORs. Different alternative implementations using XOR gates and MUX are described in Pishvaie, Jaberipur, and Jahanian (2012), Bahrepour and Sharifi (2013).

A 5-to-2 compressor block has five inputs X1, X2, X3, X4, X5, and two outputs, *SUM* and *CARRY*, along with two input carry bits ( $C_{IN1}$ ,  $C_{IN2}$ ) and two output carry bits ( $C_{OUT1}$ ,  $C_{OUT2}$ ). The input carry bits are the outputs from the previous less significant compressor block. The output carries are passed on to the next high significant compressor block. In addition, a 5-to-2 compressor could be set up using 3-to-2 compressors. It consists of three 3-to-2 compressors (full adders) in series (Chang, Gu, & Zhang, 2004; Najafi, Timarchi, & Najafi, 2014). This architecture has a critical path delay of six XOR gates.

In Qi, Kim, and Choi (2012), in order to reduce the power delay product of a multiplier, different types of the MUX-based compressor cells are presented. These designs are for 4-to-2, 5-to-2, and 7-to-2 compressors. The MUX-based compressor cells use much less transistors than the full adder based compressors.

# 3.2. Counter cells

One of the main components to construct high performance DSP units is the parallel counter. As the multiplier size increases, designing optimized wider counters are critical for its performance (Bagherizadeh et al., 2014; Parandeh-Afshar, Verma, Brisk, & Ienne, 2010). A counter is the hardware that counts the number of logic '1' of inputted bits. Counters are different from compressors. The essential difference between a counter and a compressor is that the counter has no inter-stage carries and no inter-stage interconnects. On the other hand, the compressor has a number of carries that come from or go to the neighboring cells in the same stage. Counter cells are greatly used in many different scopes such as triggering in multichannel high energy spectrometers, digital nervous networks, parallel multipliers and multiple input adders (Bahrami & Sadeghiyan, 2000; Mohd, Abed, & Alouneh, 2013).

A 4-to-3 counter is a network with 4 inputs and 3 outputs, where the outputs show the count the 1's inputs. As shown in Fig. 2(a), a 4-to-3 counter is contained of a full adder and two half adders in series. Another structure for 4-to-3 counter has been designed in Mehrabi, Mirzaee, Zamanzadeh, et al. (2013) (Fig. 3). The actual efficiency of these counter cells extremely depends on the underlying technology which hosts the implementation of the primary blocks (i.e., MUX, XOR, and FA). Based on the CMOS implementation of these primary blocks, a total of 80 transistors are required for the 4-to-3 counter design of Fig. 3.



Fig. 3. The 4-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al. (2013), Mehrabi, Navi, et al. (2013); Mehrabi, Mirzaee, Moaiyeri, et al. (2013).

A 5-to-3 counter will compress five partial products into three outputs. As shown in Fig. 2(b), this counter is constructed using two full adders and one half adder in series. The counting limit of this counter is zero to five. The conventional 5-to-3 counter actually has five XOR gates to generate the output. Because of its importance, researchers have proposed different methods by rearranging the XOR gates to reduce the delay in this counter. A 5-to-3 counter has been designed in Chowdhury, Banerjee, Roy, and Saha (2008a, 2008b) (Fig. 4). It is a hybrid structure of 2-input XORs, MUXs, and a 2-input AND gate. In comparison with its conventional 5-to-3 counter, it benefits from a shorter critical path. A total of 80 transistors are required to design this cell using a CMOS implementation of basic blocks.

A 6-to-3 counter compress six partial products into three outputs. The structure of this counter is composed of three full adders and one half adder (Fig. 2(c)). This kind of counter could be used in a high speed multiplier to reduce the number of partial products. A 6-to-3 counter has been designed in Mehrabi, Mirzaee, Zamanzadeh, et al. (2013), Mehrabi, Navi, et al. (2013); Mehrabi, Mirzaee, Moaiyeri, et al. (2013). It is a



Fig. 4. The 5-to-3 counter in Chowdhury et al. (2008a, 2008b).

hybrid structure of 2-input XORs, MUXs, and a 2-input AND gate. A total of 112 transistors are required to design the cell in Fig. 5, using a CMOS implementation of basic blocks.

A 7-to-3 counter includes seven inputs and three outputs called SUM, COUT1 and COUT2. The architecture of a conventional 7-to-3 counter is shown in Fig. 2(d). This counter is composed of four full adders. In Ghasemzadeh, Mohabbatian, Akbari, Hadidi, and Khoei (2014), a novel CMOS structure for the 7-to-3 counter is designed. In this design, the outputs are based on the number of bits with value '1' in truth table. Another design for the 7-to-3 counter has been proposed in Mohd et al. (2013) (Fig. 6). The actual performance of these counter cells heavily depends on the underlined technology as well, that hosts the implementation of the basic blocks (i.e., FA, XOR and MUX). Based on the CMOS implementation of these basic blocks, a total of 180 transistors are required for the 7-to-3 counter design of Fig. 6.

In addition, in Dandapat, Ghosal, Sarkar, and Mukhopadhyay (2010) different counter logics are designed upon the concept of the counter of a full adder. It could be defined as a single-bit adder circuit, which has four/five/six/seven inputs and three outputs. In Joshy, Prasath, Ravi, and Karman (2012), different CNFET



Fig. 5. The 6-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al. (2013), Mehrabi, Navi, et al. (2013); Mehrabi, Mirzaee, Moaiyeri, et al. (2013).



Fig. 6. The 7-to-3 counter in Mohd et al. (2013).



Fig. 7. Proposed 4-to-3 counter.

counter cells using full adder cells are designed. However, the design of counter cells including full adder trees has a relatively high delay and consumes more power. These counter structures also grow quickly with the input vector size in terms of the needed number of full adder cells.

In Mehrabi, Navi, et al. (2013) two different structures of low power and high speed 6-to-3 and 7-to-3 compressors are proposed. Counter cells in this paper include XOR/MUX-based of full adder cells and a XOR/MUX-based structure.

## 4. Proposed designs for counter cells

In this section, four designs of 4-to-3, 5-to-3, 6-to-3 and 7-to-3 counters are proposed. These are composed of a *capacitor* network and a *transistor* network. In these designs, CNFETs with different threshold voltages are used to determine the required switching thresholds. According to Eq. (3), the diameter of the nanotubes controls the threshold voltage of the CNFET, which is used in different circuit designs. In the remainder of this section, a design proposal for different counter cells is presented. To design this counter, proper transistors with different threshold voltages (low, normal, and high) and appropriate NOT gates containing an NMOS with Vt = vt and a PMOS with |Vtp| = Vdd - vt are used. In the simulation by changing  $n_1$  and  $n_2$  according

to Eq. (2), appropriate  $D_{CNT}$  is reached. Therefore, based on Eq. (3), appropriate  $V_{th}$  is reached.

#### 4.1. Proposed 4-to-3 counter

The proposed 4-to-3 counter is designed based on the simplified truth table in Table 1. This circuit includes a single array of capacitors and a CNFET-based threshold detector network. As indicated in Table 1, the *SUM* output is '1', if the sum of four inputs is equal to 1 or 3; otherwise, it is equal to '0'. *COUT1* is equal to '1', if sum of four inputs is equal to 2 or 3. The *COUT2* is '1' only when the sum of the four inputs is equal to 4.

Fig. 7 shows the details of this design for *SUM*, *COUT1*, and *COUT2*. To design this counter, proper transistors with different threshold voltages (low, normal, and high) and

Table 1 Simplified truth table of a 4-to-3 compressor.

| $\Sigma_{in}$ | COUT2 | COUT1 | SUM |
|---------------|-------|-------|-----|
| 0             | 0     | 0     | 0   |
| 1             | 0     | 0     | 1   |
| 2             | 0     | 1     | 0   |
| 3             | 0     | 1     | 1   |
| 4             | 1     | 0     | 0   |

appropriate NOT gates containing an NMOS with Vt = vt and a PMOS with |Vtp| = Vdd - vt are used. In this design, a NOT gate containing an NMOS transistor with Vt = 3.5M and a PMOS with |Vtp| = Vdd - 3.5M is used, where 'M' is equal to Vdd/4. When the sum of the inputs is "four", then the output of this NOT is "0"; otherwise, it is "1". Another NOT gate which is used in this design contains an NMOS transistor with Vt = 1.5M and a PMOS with |Vtp| = Vdd - 1.5M. When the sum of the inputs is "zero" or "one", then the output of this NOT is "1"; otherwise it is "0". When the sum of the inputs is "zero", the NMOS pull down transistor with low threshold of Vt=0.5M=0.1125 volts is "off"; otherwise, it is "on". In this state, the PMOS pull up transistor with |Vtp| = 0.5M is "on"; otherwise it is "off". Both a PMOS and an NMOS with |Vt| = 2.5M are used in this counter. When the sum of the inputs is "three" or "four", then these transistors are "off" and "on", and in the other states they are "on" and "off", respectively. Other transistors have normal thresholds (Vt = Vdd/2).

The same analysis can be done for the other two *COUT1* outputs. The COUT2 output can be implemented only with two NOT gates. Table 2 shows the state of all transistors for different

|     |    | • |  |
|-----|----|---|--|
| lab | le | 2 |  |

| State | of all | transistors | for | different | values | of | $\Sigma_{in}$ |
|-------|--------|-------------|-----|-----------|--------|----|---------------|
| oune  | or un  | umbibibibib | 101 | uniterent | raiaco | 01 |               |

| $\overline{\Sigma_{in}}$ | ON transistors | SUM |
|--------------------------|----------------|-----|
| 0                        | 1, 3, 5, 7     | 0   |
| 1                        | 2, 3, 5, 7     | 1   |
| 2                        | 2, 4, 5, 7     | 0   |
| 3                        | 2, 4, 6, 7     | 1   |
| 4                        | 2, 4, 6, 8     | 0   |

values of the sum of the inputs. Using this technique, a standard and symmetric circuit is obtained.

#### 4.2. Proposed 5-to-3 counter

A 5-to-3 compressor has 5 inputs, *X1*, *X2*, *X3*, *X4*, *X5* and three outputs, *SUM*, *COUT1*, and *COUT2*. A simplified truth table of this circuit is shown in Table 3. This truth table is similar to Table 1, but the one row for the sum of inputs equals 5. Therefore, the circuit for *COUT1* and *COUT2* remains unchanged, with respect to the 4-to-3 compressor, and only the circuit for the *SUM* 



Fig. 8. Proposed 5-to-3 counter.



Fig. 9. Proposed 6-to-3 counter.

needs a slight modification. However, the threshold voltage of all the transistors are changed because M = Vdd/5. Fig. 8 shows a complete circuit for the 5-to-3 counter proposed.

## 4.3. Proposed 6-to-3 counter

A 6-to-3 counter has 6 inputs, *X1*, *X2*, *X3*, *X4*, *X5*, *X6* and three outputs, *SUM*, *COUT1*, and *COUT2*. The simplified truth table of this circuit is similar to Table 3, but the one row for the sum of inputs equals 6. As a result, the circuit for *COUT2* remains unchanged, with respect to the 5-to-3 compressor and

| Table 3                                    |   |
|--------------------------------------------|---|
| Simplified truth table of a 5-to-3 counter | 1 |

| $\overline{\Sigma_{in}}$ | COUT2 | COUTI | SUM |
|--------------------------|-------|-------|-----|
| 0                        | 0     | 0     | 0   |
| 1                        | 0     | 0     | 1   |
| 2                        | 0     | 1     | 0   |
| 3                        | 0     | 1     | 1   |
| 4                        | 1     | 0     | 0   |
| 5                        | 1     | 0     | 1   |
|                          |       |       |     |

the circuit for the *COUT1* and *SUM* needs slight modifications. The threshold voltages of all the transistors are changed because M = Vdd/6. Fig. 9 shows the complete circuit for the 6-to-3 counter proposed.

## 4.4. Proposed 7-to-3 counter

A 7-to-3 counter has 7 inputs, *X1*, *X2*, *X3*, *X4*, *X5*, *X6*, *X7* and three outputs, *SUM*, *COUT1*, and *COUT2*. The simplified truth table of this circuit is similar to the simplified truth table of the 6-to-3 counter, but the one row for the sum of inputs equals 7. Hence, the circuit for *COUT1* and *COUT2* remains unchanged with respect to the 6-to-3 counter, and only the circuit for the *SUM* needs a slight change. The threshold voltages of all the transistors are changed because M = Vdd/7. Fig. 10 shows the complete circuit for the 7-to-3 counter proposed.

# 5. Simulation results

All the designs proposed for the counters based on CNFET described in Section 4 are simulated using HSPICE. In these



Fig. 10. Proposed 7-to-3 counter.

simulations, the compact model, presented in Deng and Wong (2007a, 2007b) is employed for the CNFET circuits. This standard model has been designed for unipolar and MOSFET-like CNFET devices. Each transistor in this model may consist of one or more CNTs. The 32 nm PTM CMOS technology has been used for simulating CMOS circuits. These simulation results

contain three common figures of merits: delay, power dissipation, and power-delay product (PDP). In all of these simulations, the room temperature is considered. Average power consumption during a long period of time is considered as the power consumption parameter. The delay is calculated from 50% voltage level of the input to 50% voltage level of the output. PDP

Table 4

Simulation results for counters @ 250 MHz with 2.1 fF load in 0.65 V.

| Design                                                                                                                                       | Delay (ps)       | Power (µW)         | PDP (fJ)            |
|----------------------------------------------------------------------------------------------------------------------------------------------|------------------|--------------------|---------------------|
| 4-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al.<br>(2013), Mehrabi, Navi, et al. (2013); Mehrabi,<br>Mirzaee, Moaiyeri, et al. (2013) | 1341             | 0.077831           | 0.10437             |
| Proposed 4-to-3 counter                                                                                                                      | 53.194           | 0.31318            | 0.016659            |
| 5-to-3 counter in Chowdhury et al. (2008a, 2008b)                                                                                            | 1564.1           | 0.069825           | 0.10921             |
| Proposed 5-to-3 counter                                                                                                                      | 80.477           | 0.58199            | 0.046837            |
| 6-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al.<br>(2013), Mehrabi, Navi, et al. (2013); Mehrabi,<br>Mirzaee, Moaiyeri, et al. (2013) | 1849.1           | 0.10424            | 0.19276             |
| Proposed 6-to-3 counter                                                                                                                      | 228.35           | 0.52115            | 0.11900             |
| 7-to-3 counter in Mohd et al. (2013)<br>Proposed 7-to-3 counter                                                                              | 1892.5<br>92.127 | 0.22321<br>0.90202 | 0.42241<br>0.083101 |
| <u>r</u>                                                                                                                                     |                  |                    |                     |

Table 5

Simulation results for counters @ 250 MHz with 2.1 fF load in 0.5 V.

| Design                                                                                                                                       | Delay (ps) | Power (µW) | PDP (fJ) |
|----------------------------------------------------------------------------------------------------------------------------------------------|------------|------------|----------|
| 4-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al.<br>(2013), Mehrabi, Navi, et al. (2013); Mehrabi,<br>Mirzaee, Moaiyeri, et al. (2013) | 473.22     | 0.13005    | 0.061543 |
| Proposed 4-to-3 counter                                                                                                                      | 43.334     | 0.43245    | 0.018740 |
| 5-to-3 counter in Chowdhury et al. (2008a, 2008b)                                                                                            | 501.21     | 0.16305    | 0.081723 |
| Proposed 5-to-3 counter                                                                                                                      | 66.239     | 0.56647    | 0.037523 |
| 6-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al.<br>(2013), Mehrabi, Navi, et al. (2013); Mehrabi,<br>Mirzaee, Moaiyeri, et al. (2013) | 593.63     | 0.15557    | 0.092352 |
| Proposed 6-to-3 counter                                                                                                                      | 81.223     | 0.68219    | 0.055410 |
| 7-to-3 counter in Mohd et al. (2013)                                                                                                         | 682.04     | 0.15518    | 0.10584  |
| Proposed 7-to-3 counter                                                                                                                      | 88.150     | 87.850     | 0.077440 |

#### Table 6

Simulation results for counters @ 1000 MHz with 2.1 fF load in 0.65 V.

| Design                                                                                                                                       | Delay (pS) | Power (µW) | $PDP \times E - 15 (fJ)$ |
|----------------------------------------------------------------------------------------------------------------------------------------------|------------|------------|--------------------------|
| 4-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al.<br>(2013), Mehrabi, Navi, et al. (2013); Mehrabi,                                     | 965.14     | 0.1361     | 0.13142                  |
| Mirzaee, Moaiyeri, et al. (2013)<br>Proposed 4-to-3 counter                                                                                  | 54 374     | 0 50930    | 0.027692                 |
| 5-to-3 counter in Chowdhury et al. (2008a, 2008b)                                                                                            | 985.25     | 0.1381     | 0.13616                  |
| Proposed 5-to-3 counter                                                                                                                      | 80.634     | 0.81624    | 0.065817                 |
| 6-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al.<br>(2013), Mehrabi, Navi, et al. (2013); Mehrabi,<br>Mirzaee, Moaiyeri, et al. (2013) | 1423.6     | 0.2346     | 0.33410                  |
| Proposed 6-to-3 counter                                                                                                                      | 227.7      | 0.75479    | 0.17193                  |
| 7-to-3 counter in Mohd et al. (2013)                                                                                                         | 1894.2     | 0.49788    | 0.94308                  |
| Proposed 7-to-3 counter                                                                                                                      | 429.07     | 1.0805     | 0.46361                  |

is the multiplication of the average power consumption and the maximum delay. To measure the cell delay, all the possible transitions have been generated via test patterns and it is applied for all the structures.

The architectures proposed are compared to the best existing counter architectures in terms of power, delay, and PDP. For thorough evaluation of functionality of the designs in various conditions, simulations were performed at various supply voltages and room temperature, loads and frequencies. All the designs have been simulated at room temperature at 0.65, and 0.5 supply voltages, operating frequencies equal to 100 MHz and 250 MHz, and different load capacitances.

For comparisons, the best previous works on different counter cells were selected (Chowdhury et al., 2008a, 2008b; Mohd et al., 2013). First, the circuits are simulated at 0.65 V and 0.5 V supply voltages and at 250 MHz operating frequency with 2.1 fF output load capacitance. The results of this simulation are revealed in Tables 4 and 5, respectively. According to the experimental results, designs proposed have the lowest delay and PDP compared to the other designs at all supply voltages.

In the second experimentation, all the circuits were simulated at the discussed 0.65 supply voltage and 100 MHz frequency using 2.1 fF output capacitor. It is comprehended from Table 6. The designs proposed have the best PDP and power in comparison with the other circuits.



Fig. 11. PDP versus different temperatures.

Another important feature of the circuits which should be considered is their immunity to ambient temperature variations. Thus, for investigating their sensitivity to temperature variation and noises, the circuits proposed were simulated in a vast range of temperatures, from  $0^{\circ}$ C up to  $70^{\circ}$ C at 0.65 V supply voltage and 250 MHz operating frequency with 2.1 fF output load capacitance. The results are plotted in Fig. 11. This figure shows that the proposed design operates correctly with low energy consumption variation at different ambient temperatures.

PDP versus load capacitor in the counter cells(Vdd=0.65, @250MHZ)



Fig. 12. PDP versus different capacitors.

 Table 7

 The number of elements in different counter cells.

| Design                                                                                                                                 | Number of devices |
|----------------------------------------------------------------------------------------------------------------------------------------|-------------------|
| 4-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al. (2013), Mehrabi, Navi, et al. (2013); Mehrabi, Mirzaee, Moaiyeri, et al. (2013) | 60                |
| Proposed 4-to-3 counter                                                                                                                | 26                |
| 5-to-3 counter in Chowdhury et al. (2008a, 2008b)                                                                                      | 72                |
| Proposed 5-to-3 counter                                                                                                                | 29                |
| 6-to-3 counter in Mehrabi, Mirzaee, Zamanzadeh, et al. (2013), Mehrabi, Navi, et al. (2013); Mehrabi, Mirzaee, Moaiyeri, et al. (2013) | 100               |
| Proposed 6-to-3 counter                                                                                                                | 36                |
| 7-to-3 counter in Mohd et al. (2013)                                                                                                   | 192               |
| Proposed 7-to-3 counter                                                                                                                | 39                |

Driving capability is a very important metric for the logic gates. Thus, in this work the driving capability of the proposed structures is also scrutinized. The designs proposed are simulated using a variety of output load capacitors, ranging from 1 up to 6 fF, at 0.65 V supply voltage and 250 MHz frequency. The PDP of the different counter designs, against the load capacitor variation, are schemed in Fig. 12.

Table 7 indicates the number of elements used in different structures. As shown in this table, the designs proposed have a lower transistor in comparison to other designs.

# 6. Conclusion

The carbon nanotube field-effect transistor (CNFET) is one of the new devices to design low power and high performance circuits. In a multiplication process, to accumulate partial products, compressor and counter cells are used. 3-to-2, 4-to-2, and 5-to-2 compressors do not have enough capability to design high-bit size multipliers. In this paper, four novel high speed and low power carbon nanotube counter cells were proposed. These counters were capable to add 4 or 5 or 6 or 7 bits per decade, and generate 3 outputs. CNFETs with different threshold voltages were used to design our proposed counters. The counters proposed were simulated using HSPICE. The simulation results contained three common figures of merits: delay, power dissipation, and power-delay product (PDP). The simulation results indicate that the designs proposed consume low power and work properly. Our proposed counters are more effective for high order multiplications.

# **Conflict of interest**

The authors have no conflicts of interest to declare.

### References

- Alkaldy, E., Navi, K., & Sharifi, F. (2014). A novel design approach for multi-input XOR gate using multi-input majority function. Arabian Journal for Science and Engineering, 39(11), 7923–7932. http://dx.doi.org/10.1007/s13369-014-1387-x
- Azarderakhsh, R., & Reyhani-Masoleh, A. (2013). Low-complexity multiplier architectures for single and hybrid-double multiplications in Gaussian normal bases. *IEEE Transactions on Computers*, 62(4), 744–757. http://dx.doi.org/10.1109/TC.2012.22

- Bagherizadeh, M., & Eshghi, M. (2011a). A low power & high speed carbon nanotube 5 to 3 compressor. *Faible Tension Faible Consommation*, 115–118. http://dx.doi.org/10.1109/FTFC.2011.5948902
- Bagherizadeh, M., & Eshghi, M. (2011b). Two novel low-power and high-speed dynamic carbon nanotube full-adder cells. *Nanoscale Research Letters*, 6(1), 519. http://dx.doi.org/10.1186/1556-276X-6-519
- Bagherizadeh, M., Gerami, M., & Eshghi, M. (2014). Design an ASIP for edit distance algorithm in pattern recognition. In *The 22nd Iranian Conference on Electrical Engineering (ICEE)* (pp. 971–975). http://dx.doi.org/10.1109/IranianCEE.2014.6999676
- Bahrami, M., & Sadeghiyan, B. (2000). "Efficient modulo 2" +1 multiplication schemes for IDEA. *IEEE International Symposium on Circuits and Systems*, 4, 653–656.
- Bahrepour, D., & Sharifi, M. J. (2013). A novel high speed full adder based on linear threshold gate and its application to a 4-2 compressor. Arabian Journal for Science and Engineering, 38(11), 3041–3050. http://dx.doi.org/10.1007/s13369-013-0615-0
- Chang, C. H., Gu, J., & Zhang, M. (2004). Ultra low-voltage low-power CMOS 4-2 and 5-2 compressors for fast arithmetic circuits. *IEEE Transactions on Circuits and Systems I: Regular Papers*, 51(10), 1985–1997.
- Chowdhury, S. R., Banerjee, A., Roy, A., & Saha, H. (2008). Design, simulation and testing of a high speed low power 15-4 compressor for high speed multiplication applications. In *First International Conference on Paper Presented* at the Emerging Trends in Engineering and Technology, 2008. ICETET'08.
- Chowdhury, S. R., Banerjee, A., Roy, A., & Saha, H. (2008 July). Design, simulation and testing of a high speed low power 15-4 compressor for high speed multiplication applications. In *IEEE First International Conference on Emerging Trends in Engineering and Technology 2008. ICETET'08* (pp. 434–438). http://dx.doi.org/10.1109/ICETET.2008.151
- Dandapat, A., Ghosal, S., Sarkar, P., & Mukhopadhyay, D. (2010). A 1.2-ns 16 × 16-bit binary multiplier using high speed compressors. *International Journal of Electrical and Electronics Engineering*, 4(3), 234–239.
- Deng, G., & Chen, C. (2013). Binary multiplication using hybrid MOS and multi-gate single-electron transistors. *IEEE Transactions on Very Large Scale Integration (VLSI)*, 21(9), 1573–1582. http://dx.doi.org/10.1109/TVLSI.2012.2217993
- Deng, J., & Wong, H. S. P. (2007a). A compact SPICE model for carbon-nanotube field-effect transistors including nonidealities and its application—Part I: Model of the intrinsic channel region. *IEEE Transactions on Electron Devices*, 54(12), 3186–3194. http://dx.doi.org/10.1109/TED.2007.909030
- Deng, J., & Wong, H. S. P. (2007b). A compact spice model for carbon-nanotube field-effect transistors including nonidealities and its application—Part II: Full device model and circuit performance benchmarking. *IEEE Transactions on Electron Devices*, 54(12), 3195–3205. http://dx.doi.org/10.1109/TED.2007.909043
- Ghasemzadeh, M., Mohabbatian, N., Akbari, A., Hadidi, K., & Khoei, A. (2014). A novel fast glitchless 7-3 counter with a new structure. In *IEEE Proceedings of the 21st International Conference Mixed Design of Integrated Circuits & Systems (MIXDES), 2014* (pp. 131–134). http://dx.doi.org/10.1109/MIXDES.2014.6872171
- Jaberipur, G., & Kaivani, A. (2009). Improving the speed of parallel decimal multiplication. *IEEE Transactions on Computers*, 58(11), 1539–11552. http://dx.doi.org/10.1109/TC.2009.110

- Joshy, V. Z., Prasath, R. A., Ravi, T., & Karman, V. (2012). Design and analysis of compressors using CNTFET. In *IEEE 2012 International Conference on Emerging Trends in Science, Engineering and Technology (INCOSET)* (pp. 323–328).
- Lin, S., Kim, Y. B., & Lombardi, F. (2009). A novel CNTFET-based ternary logic gate design. In 52nd IEEE Int. Midwest Symposium on Circuits and Systems (pp. 435–438). http://dx.doi.org/10.1109/MWSCAS.2009.5236063
- Mehrabani, Y. S., & Eshghi, M. (2015). A symmetric, multi-threshold, highspeed and efficient-energy 1-bit full adder cell design using CNFET technology. *Circuits, Systems, and Signal Processing*, 34(3), 739–759. http://dx.doi.org/10.1007/s00034-014-9887-1
- Mehrabi, S., Mirzaee, R. F., Zamanzadeh, S., Navi, K., & Hashemipour, O. (2013). Design, analysis, and implementation of partial product reduction phase by using wide m:3 ( $4 \le m \le 10$ ) compressors. *International Journal of High Performance Systems Architecture*, 4(4), 231–241. http://dx.doi.org/10.1504/IJHPSA.2013.058986
- Mehrabi, S., Navi, K., & Hashemipour, O. (2013). Performance analysis and simulation of two different architectures of (6: 3) and (7: 3) compressors based on carbon nano-tube field effect transistors. In *IEEE 5th International Paper Presented at the Nanoelectronics Conference (INEC)*, 2013.
- Mehrabi, S., Mirzaee, R. F., Moaiyeri, M. H., Navi, K., & Hashemipour, O. (2013). CNFET-based design of energy-efficient symmetric three-input XOR and full adder circuits. *Arabian Journal for Science and Engineering*, 38(12), 3367–3382. http://dx.doi.org/10.1007/s13369-013-0627-9
- Mohd, B. J., Abed, S. E., & Alouneh, S. (2013). Carry-based reduction parallel counter design. *International Journal of Electronics*, 100(11), 1510–1528. http://dx.doi.org/10.1080/00207217.2012.751320
- Najafi, A., Timarchi, S., & Najafi, A. (2014). High-speed energy-efficient 5:2 compressor. In *IEEE 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)* (pp. 80–84).
- Parandeh-Afshar, H., Verma, A. K., Brisk, P., & Ienne, P. (2010). Improving FPGA performance for carry-save arithmetic. *IEEE Transactions* on Very Large Scale Integration (VLSI) Systems, 18(4), 578–590. http://dx.doi.org/10.1109/TVLSI.2009.2014380
- Perri, S., & Corsonello, P. (2012). New methodology for the design of efficient binary addition circuits in QCA. *IEEE Transactions on Nanotechnology*, 11(6), 1192–1200.
- Pishvaie, A., Jaberipur, G., & Jahanian, A. (2012). Improved CMOS (4,2) compressor designs for parallel multipliers. *Computers & Electrical Engineering*, 38(6), 1703–1716. http://dx.doi.org/10.1016/ j.compeleceng.2012.07.015
- Qi, H., Kim, Y. B., & Choi, M. (2012). A high speed low power modulo 2n + 1 multiplier design using carbon-nanotube technology. In *IEEE 55th International Midwest Symposium on Circuits and Systems (MWSCAS)*, 2012 (pp. 406–409).
- Rodríguez-Reséndiz, J., Gutiérrez-Villalobos, J. M., Duarte-Correa, D., Mendiola-Santibañez, J. D., & Santillán-Méndez, I. M. (2012). Design and implementation of an adjustable speed drive for motion control applications. *Journal of Applied Research and Technology*, 10(2), 180–194.
- Waters, R. S., & Swartzlander, E. E. (2010). A reduced complexity wallace multiplier reduction. *IEEE Transactions on Computers*, 59(8), 1134–1137. http://dx.doi.org/10.1109/TC.2010.103