10T SRAM Using Half-V Precharge and Row-Wise Dynamically Powered Read Port For Low Switching Power and Ultralow RBL Leakage
10T SRAM Using Half-V Precharge and Row-Wise Dynamically Powered Read Port For Low Switching Power and Ultralow RBL Leakage
Abstract— We present, in this paper, a new 10T static will also increase the yield and improve the SoC reliability.
random access memory cell having single ended decoupled Although the six transistor (6T) SRAM cell is a widely used
read-bitline (RBL) with a 4T read port for low power operation standard in industry, it has its own limitations. 6T SRAM
and leakage reduction. The RBL is precharged at half the cell’s
supply voltage, and is allowed to charge and discharge according not only has conflicting read and write requirements, it also
to the stored data bit. An inverter, driven by the complementary has read static noise margin (RSNM) degradation. The most
data node (QB), connects the RBL to the virtual power rails important factors to consider in the design of SRAM in
through a transmission gate during the read operation. RBL modern nanometer technologies are the: 1) read stability;
increases toward the VDD level for a read-1, and discharges 2) write stability; 3) cell supply reduction; 4) power dissipa-
toward the ground level for a read-0. Virtual power rails have
the same value of the RBL precharging level during the write tion; 5) leakage currents; 6) bitline (BL) ION to IOFF ratio; and
and the hold mode, and are connected to true supply levels 7) variability [6]. With increasing process variations, achieving
only during the read operation. Dynamic control of virtual rails specific yield is getting difficult, and novel designs and tech-
substantially reduces the RBL leakage. The proposed 10T cell niques, including read and write assist circuits, are adopted at
in a commercial 65 nm technology is 2.47× the size of 6T with the cost of area, power dissipation, or speed to improve the
β = 2, provides 2.3× read static noise margin, and reduces the
read power dissipation by 50% than that of 6T. The value of read/write stability and increase the number of cells in a single
RBL leakage is reduced by more than 3 orders of magnitude column [7].
and (ION /IOFF ) is greatly improved compared with the 6T BL Reduction of the supply voltage is the most straightforward
leakage. The overall leakage characteristics of 6T and 10T are technique to reduce the active power dissipation. However,
similar, and competitive performance is achieved. 6T SRAM power supply cannot be reduced aggressively due to
Index Terms— 10T, charge recycling, leakage reduction, low its RSNM degradation. Many SRAM cell have been proposed
power, precharging, single ended (SE) read bitline (RBL), static that improve RSNM, including single ended (SE) 8T [8],
random access memory (SRAM), virtual rails. 9T [9]–[11], 10T [12], [13] and differential 7T [14],
8T [15], 9T [16], [17], and 10T [18]. Also, numerous SRAM
I. I NTRODUCTION assist techniques have been described in the literature as a
cost-effective method to increase the write margin, and lower
P OWER dissipation has become a first class design
constraint [1], [2], as we have hit the utilization wall,
and the low power circuit, architecture, and system level
the leakage power dissipation compared to bitcell transistor
upsizing or operating the memory array at a higher supply
techniques are sought out [3], [4]. In addition, the static voltage [19]–[21]. A 10T cell in [22] uses virtual ground rail
random access memory (SRAM) is the most important digital for read port to achieve lower BL leakage and differential,
macro and its portion on a system-on-chip (SoC) is ever- while Kanda et al. [23] used row-by-row dynamic control of
increasing [5]. Decreasing the power dissipation of SRAM cell supply voltage and negative wordline voltage for 2 orders
will not only lower the overall system power dissipation, but of magnitude reduction in leakage currents.
In this paper, we present our half VDD precharge and charge
Manuscript received August 30, 2016; revised November 6, 2016; accepted recycling technique for low power read operation. A 4T read
December 6, 2016. This work was supported in part by the Basic Research port is designed to employ the proposed technique. Read
Program through the National Research Foundation of Korea funded by the
Ministry of Education under Grant NRF-2016R1D1A1B03933605, and in BL (RBL) is charged and discharged through the read port
part by the Industrial Strategic Technology Development Program (10052653) according to the state of stored bit. Read port is powered
funded by the Ministry of Trade, Industry & Energy, Korea. Design tools were by virtual power rails that run horizontal and are shared by
supported by IDEC, KAIST.
The authors are with the Department of Information and Communication the cells of a word. The dynamic control of read port power
Engineering, Sungkyunkwan University, Suwon 16419, South Korea (e-mail: rails reduces the RBL leakage substantially. The rest of this
[email protected]; [email protected]). paper is organized as follows. In Section II, we review the
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org. conventional and the state-of-the-art cell designs. Section III
Digital Object Identifier 10.1109/TVLSI.2016.2637918 presents the proposed cell and its associated scheme. The
1063-8210 © 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 1. Conventional 6T SRAM read. (a) Column of M bit-cells during read. (b) Top: hold and read SNM butterfly curve (with worst case noise polarity
during hold). Bottom: transient behavior showing read disturbance.
postlayout results are gathered in Section IV, and Section V increases during the read operation. This increase in volt-
presents some important discussion. Finally, a brief conclusion age (V ) is dependent upon transistor sizing. For a successful
is presented in Section VI. read operation, β ratio, defined as (((W/L) N )/((W/L) A ))
must be larger than 1 (typically 2 to 3) [5]. The vulnerability
II. C ONVENTIONAL AND THE S TATE - OF - THE -A RT of the internal nodes of an SRAM cell is captured through
SRAM D ESIGNS metrics HSNM, RSNM, and WNM/write trip point (WTP)
SRAM cell must robustly operate under hold, read, and during hold, read, and write mode, respectively. Fig. 1(b)
write mode. An SRAM cell uses the positive feedback of shows the HSNM and RSNM butterfly curve (top) [24] and the
cross-coupled inverters (INVs) to store a single bit of infor- internal state disturbance of node QB for a slow rising WL
mation in a complementary fashion. Access transistors provide signal (bottom). The increase in node Qa voltage not only
the mechanism for the read and write operation. Before every decreases the cell stability, but also increases the short-circuit
access, column BL pair (BL and BLB) is precharged to the current from VDD to VSS and lets pass (now) the higher amount
supply voltage. For the write operation, one of the precharged of leakage current from BLB (IaLeak1 ). This decreases the
BLs is discharged through the write driver. differential BL voltage (VBL ) and requires the increases in
Fig. 1(a) shows a single column of M 6T SRAM cells, WL pulse duration. A wider read pulse can cause dynamic
where one cell is accessed in read mode with data = 0 instability and increases the power dissipation [25]. In addi-
(Qa = 0), while other M − 1 cells are in the hold mode. tion, for a successful write operation, access transistors of the
Leakage components are labeled, and for the worst case 6T must be strong enough to take over the pull-up pMOS
leakage, all M − 1 cells store data = 1 (Qu = 1). Iread flows transistors. Thus, γ ratio, defined as (((W/L) P )/((W/L) A ))
from BL to the VSS through AL and NL of the accessed cell, must be smaller than 1. However, a stronger pMOS is benefi-
and the BL voltage is decreased. The unaccessed cell on the cial for read operation (to decrease the V ), and a weaker
BL exhibits BL leakage. IuLeak0 is the main component of pull-down nMOS is beneficial for write operation (to let
BL leakage while IuLeak1 is negligible, as VDS of AR of the access transistors have more strength for injecting current).
unaccessed cell is large, while VDS of its AL is very small Thus, proper sizing is required for specific conditions and
(varies from 0 to VBL ). These leakage components decrease application.
the differential BL voltage development. As there are a large The conventional 6T SRAM has two BL s (BL and BLB),
number of cells in a single column, the worst case BL leakage and for each read operation one of the BLs is decreased.
can decrease BLB voltage enough to make an erroneous read. We can model 6T SRAM by a single BL with an activity
Thus, Iread must be greater than (M − 1) × IuLeak0 , where M factor of 1. Thus, the dynamic read power dissipation of 6T
is the number of cells in a single column. is given as
During read operation, the internal node of the 6T cell
storing a zero (Qa) lies in the read current path and its voltage Pd6T = N × CBL × VDD × VBL × f (1)
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
MAROOF AND KONG: 10T SRAM USING HALF-VDD PRECHARGE AND ROW-WISE DYNAMICALLY POWERED READ PORT FOR LOW SWITCHING POWER 3
Fig. 2. SRAM read ports (a) 6T. (b) 8T. (c) 9T [11]. (d) 9T [10]. (e) 10T [12].
where N is the number of cells connected in a wordline In 6T SRAM read operation, one of the BL stays at the VDD
(known as word size), CBL is the total switching BL capac- while the other decreases by VBL amount. However, in the
itance, VDD is the precharging supply voltage, and f is the case of 8T SRAM, there is only one BL (RBL) and it either
frequency of operation. CBL is not decreasing gracefully with decreases or stays at the VDD level depending on the bit read.
technology, and a minimum of around 100 mV of VBL is Now, the sensing of SE BL can be done using different circuits
required for proper sensing considering noise, leakage, and such as: 1) domino sensing that requires full VDD swing ON
the process parameter fluctuations. Leakage power dissipation the local-BL; 2) psuedo-differential that requires a reference
is highly data dependent. As the subthreshold leakage current signal; and 3) ac coupled sensing that requires the use of
is a dominant part of the overall transistor leakage, the worst capacitors [32]. Using a reference-based sense amplifier, only
case leakage current of 6T SRAM during the hold can be a small voltage difference is required. In the case of 6T, one of
estimated as the BLs always serves as a reference, while in the case of 8T,
W −Vth −VDD reference is set at VDD − VBL . Thus, for a read-1, RBL stays
Il6T = M × N × I0 × × e ηvt × 1 − e vt (2) at VDD and Vref − VRBL = −VBL is sensed. For a read-0,
L
RBL must decrease to produce +VBL of the sensing margin,
where M × N is the size of the SRAM array, Vth is the
threshold voltage of access transistor, v t is the thermal voltage which means that now RBL must decrease by y2 × VBL ,
such that Vref − VRBL = VDD − VBL − (VDD − 2 × VBL ) =
(≈26 mV at 27 °C), I0 is the technology-dependent coefficient
+VBL .
and is the leakage current at VGS = Vth of a minimum sized
transistor, and η is the technology-dependent subthreshold Now, for the sensing of the proposed cell, Vref is set at
VDD /2, and for each read operation only VBL of the sensing
factor [26]. In Fig. 1(a), AR of the unaccessed cell is OFF (hav-
margin is required because for each read operation, RBL of the
ing VGS = 0 V), while its VDS = VDD , which exhibits large
leakage. LP10T either decreases or increases from VDD /2, depending
on the data read.
In essence, 6T SRAM has conflicting read and write require-
ments [5] and transistor sizing cannot be done independently. Thus, though the 8T has activity factor of 0.5, it still has the
same dynamic power dissipation as of 6T SRAM due to the
Also, 6T has inherit RSNM problem as the read current
higher differential voltage requirement. Also, the differential
passes through the cell internal node [27], and it further
degrades with VDD scaling [28]. Also, being considered as voltage needs to be developed in the same time (Tread ), to
provide similar performance. Thus, the read port of 8T is
baseline design, 6T has overall a higher power dissipation,
sized wider to provide higher Iread , which exhibits higher RBL
and higher BL leakages, as the low power techniques employ
a certain mechanism to lower the dynamic power dissipation, leakages.
SE 9T SRAM [11] uses a 3T read port shown in Fig. 2(c).
e.g., charge sharing [29], [30] and hierarchical BL [31] and
the leakages (by employing virtual rails) [22], [23]. The read It effectively stacks the M2 between M1 and M3 to reduce
the RBL leakage. Write performance and dynamic power
port of 6T SRAM cell is shown in Fig. 2(a) that highlights
dissipation of this cell is the same as 8T, however, speed
the internal node Q in the read current path. Many alternative
bitcells and techniques have been proposed in the literature to is degraded compared with 6T and 8T cell due to 3T read
path.
improve SRAM cell stability, reduce the leakage currents, and
Another state-of-the-art 9T SRAM cell [10] uses a 3T read
achieve low power operation compared with the conventional
6T design. port shown in Fig. 2(d). It provides the leakage current to
An 8T SRAM cell adds a separate 2T read port, shown the RBL through M3, to compensate the BL leakage when
in Fig. 2(b), and necessarily solves the problem of read the RBL is to stay at a higher level but decreases due to
stability. Internal nodes are isolated from the read current path, the leakage currents (of unaccessed cells). The cell improves
and thus a high RSNM is achieved. Also, sizing of 8T read the sensing margin, and provides better performance due to
port can be done independently without affecting the write 2T (M1-M2) read current path. However, its dynamic power
operation. is the same as 6T, and overall static power is increased.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 3. Proposed 10T SRAM cell with row-wise read port dynamic power
lines.
MAROOF AND KONG: 10T SRAM USING HALF-VDD PRECHARGE AND ROW-WISE DYNAMICALLY POWERED READ PORT FOR LOW SWITCHING POWER 5
read-1 operation, as the single nMOS could not charge well the TABLE I
RBL through P1. Furthermore, sizing of read port is important T RANSISTOR S IZING I NFORMATION (W μm). L min OF 60 nm I S U SED
in terms of area and performance.
C. Dynamic Power
RBL is precharged by V P to the vdd/2 level. For the read-0
operation (i.e., QB = 1), RBL is discharged by VBL amount
through TG-N2. Thus for the next precharge, CBL × VBL
amount of charge is transferred to the RBL through V P supply. D. Leakage Reduction
As the value of V P is vdd/2, the dynamic power dissipation
Virtual power rails run horizontal and are shared by the
of an LP10T cell due to read-0 is given as
cells of a row. These rails are activated during read operation,
VDD (i.e., VVDD is connected to VDD , and VVSS is connected to
P0 = α0 × CBL × × VBL × f (3)
2 ground). During the hold and write mode, these virtual rails
where α0 is the activity factor of read-0. Now, for a read-1 have value of vdd/2. These control signals are shown in
(i.e., QB = 0), RBL is charged from vdd/2 level toward vdd Fig. 4(a) for read and hold/write mode. Fig. 4(b) shows the
level by the amount of VBL through VVDD . The charge equal state of read port transistors in the hold/write mode. As both
to CBL × VBL is transferred from VVDD to RBL. For the next the virtual rails have voltage level vdd/2 during nonread,
precharge, RBL is discharged to the vdd/2 level through V P . the voltage of node Z stays near vdd/2 value. As RBL is
The charge CBL × VBL is recycled from RBL to V P , and precharged at vdd/2 level, and read signals are not activated
can be used for future pre-charging intervals. As the value of (TG is OFF), RBL leakage is reduced substantially due to near
V P is vdd/2, and the value of VVDD is vdd during the read zero VDS of TG. Also, the boosted read signals help reduce
operation, the read-1 dynamic power dissipation is given as the leakage currents, as the VGS of pMOS (P2) becomes more
positive.
VDD
P1 = α1 × CBL × VDD − × VBL × f
2
VDD E. Transistor Sizing and Layout
= α1 × CBL × × VBL × f (4)
2 The β and γ ratios of 6T must be considered for proper
where α1 is the activity factor of read-1 operation. Hence, read and write operation [5]. Thus, for a 6T a stronger pull-
assuming equal probability of read-1 and read-0, the dynamic down nMOS, a medium strength access-nMOS, and a weaker
power dissipation of LP10T is given as pull-up pMOS is used. Due to mobility difference, access
VDD and pull-up transistors are sized minimum. Pull-down nMOS
PdLP10T = N × CBL × × VBL × f. (5) are sized 2 × Wmin to make β ratio of 2 for a 6T cell. For
2
8T and LP10T, minimum sized transistors are used for the
Comparing (5) and (1) shows that the proposed half-vdd
cross-coupled INVs s and for the write access transistors.
precharging scheme and charge recycling mechanism, due to
This achieves relatively low write BL leakage currents and a
charging/discharging of RBL during read interval using the
higher write noise margin. Read port transistors of 8T are sized
proposed 4T read port, reduces the average dynamic read
2 × Wmin . Read port of LP10T is sized as pMOS 2.5 × Wmin ,
power dissipation by 50% compared with the 6T SRAM.
nMOS 1.5 × Wmin . Sizing information of 6T, 8T, and LP10T
However, LP10T incurs area and power overheads due to
is shown in Table I. Although the wider transistors used for
control signal complexity. There are two virtual rails, and each
read port may exhibit higher leakage, the RBL precharging
one is modulated by value VDD /2 from its nominal value
level and dynamic control of read port power rails of LP10T
of VDD /2 when the read control signal is asserted. VVSS is
substantially reduce the RBL leakage current.
generated by passing signal “R” through an INV between
Layouts have been produced in commercial 65 nm technol-
VSS and VDD /2. Now, the capacitance of VVSS rail is much
ogy, and are shown for both the 6T and LP10T in Fig. 5.
lower than the BL capacitances, because the number of bitcells
Metal-2 runs vertical, and connects the cells in a single
in a word are very small compared with the number of bitcells
column. Control signals runs horizontal on M3. 6T has only
in a single column. Considering M × N bit organization of
one control signal (WL), while LP10T has five control signals
SRAM, N M, where N is the number of bits per word and
(R, R B, W, VVDD , VVSS ). The layout of 6T shown in Fig. 5(a)
M is the number of words in the bank. Here, we can safely
stacks AL and NL transistors. Thin layout to avoid lithographic
assume that the Cvvss is 1/8th of the CBL (128 × 8 bit SRAM
defects [34] increases the cell area of 6T by 39%. For LP10T,
organization). The power dissipation due to virtual rails VVSS
however, it was necessary to choose a taller structure to allow
is Pvvss = 1 × Cvvss × ((VDD )/2) × ((VDD)/2) × f . As there
five M3 rails run horizontally. As AL/AR and NL/NR are sized
are two virtual power rails that modulate by vdd/2 for every
minimum for LP10T, such a taller layout structure allows their
read cycle, the total power penalty is
vertical stacking.
1 In the used technology, Wmin of 120 nm is used, while L min
Pv = CBL × VDD × × f (6)
16 of 60 nm is used. Poly–poly spacing of 130 nm (and 150 for
which is 6.25% of the 6T read power dissipation. polycontacts), along with increased sizes of vias and contacts
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
MAROOF AND KONG: 10T SRAM USING HALF-VDD PRECHARGE AND ROW-WISE DYNAMICALLY POWERED READ PORT FOR LOW SWITCHING POWER 7
Fig. 9. Leakage current (on log scale) in nA. (a) Total leakage current of 6T, 8T, and LP10T. (b) RBL leakage of 8T and LP10T.
MAROOF AND KONG: 10T SRAM USING HALF-VDD PRECHARGE AND ROW-WISE DYNAMICALLY POWERED READ PORT FOR LOW SWITCHING POWER 9
Fig. 11. Read delay time (on log scale) of all designs at different supply
voltage and temperature values for typical process corner.
leakage power of 6T, 8T, and LP10T is the same. Nevertheless, [8] L. Chang et al., “Stable SRAM cell design for the 32 nm node
with the use of row-by-row dynamic control of read-port and beyond,” in Symp. VLSI Technol. Dig. Tech. Papers., Jun. 2005,
pp. 128–129.
power rails, LP10T substantially lowers the RBL leakage, [9] M.-H. Tu et al., “A single-ended disturb-free 9T subthreshold SRAM
and thus many more cells can be added in a single column. with cross-point data-aware write word-line structure, negative bit-line,
8T worst case leakage is higher than 6T as it uses wider read and adaptive read operation timing tracing,” IEEE J. Solid-State Circuits,
vol. 47, no. 6, pp. 1469–1482, Jun. 2012.
port transistors to achieve 2 × VBL for a read-0 operation. [10] B. Wang, T. Q. Nguyen, A. T. Do, J. Zhou, M. Je, and T. T. H. Kim,
Virtual rail with 8T has also been used to reduce the RBL “Design of an ultra-low voltage 9T SRAM with equalized bitline leakage
leakage. However, 8T requires same (average dynamic) power and CAM-assisted energy efficiency improvement,” IEEE Trans. Circuits
Syst. I, Reg. Papers, vol. 62, no. 2, pp. 441–448, Feb. 2015.
as does the 6T. LP10T with the proposed precharging scheme
[11] S. Lin, Y.-B. Kim, and F. Lombardi, “A low leakage 9t SRAM cell for
and charge recycling mechanism only uses half of the (average ultra-low power operation,” in Proc. 18th ACM Great Lakes Symp. VLSI,
dynamic) power of 6T. LP10T achieves 2.3× RSNM of 6T, 2008, pp. 123–126.
and reduces read delay by 10% and 32% compared with 6T [12] B. H. Calhoun and A. P. Chandrakasan, “A 256-kb 65-nm sub-threshold
SRAM design for ultra-low-voltage operation,” IEEE J. Solid-State
and 8T. As LP10T uses two nMOS and two pMOS transistors Circuits, vol. 42, no. 3, pp. 680–688, Mar. 2007.
for read port, read-1, and read-0 performance is not the same. [13] T.-H. Kim, J. Liu, J. Keane, and C. H. Kim, “A 0.2 V, 480 kb
In addition, a single read operation can be effected differently subthreshold SRAM with 1 k cells per bitline for ultra-low-voltage
computing,” IEEE J. Solid-State Circuits, vol. 43, no. 2, pp. 518–529,
with the change in temperature. LP10T requires 5 control Feb. 2008.
signals, and an area of 2.47× the area of 6T (with β = 2). [14] K. Takeda et al., “A read-static-noise-margin-free SRAM cell for low-
Furthermore, LP10T requires V P supply voltage with value of VDD and high-speed applications,” IEEE J. Solid-State Circuits, vol. 41,
vdd/2 for precharging. RBL precharging transistors for LP10T no. 1, pp. 113–121, Jan. 2006.
[15] R. Saeidi, M. Sharifkhani, and K. Hajsadeghi, “A subthreshold symmet-
are wider than BL/BLB precharging. Yet, low dynamic power, ric SRAM cell with high read stability,” IEEE Trans. Circuits Syst. II,
high ION /IOFF , read stability and competitive read time are Express Briefs, vol. 61, no. 1, pp. 26–30, Jan. 2014.
salient features of the proposed LP10T SRAM. [16] Z. Liu and V. Kursun, “Characterization of a novel nine-transistor SRAM
cell,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 16, no. 4,
pp. 488–492, Apr. 2008.
V. C ONCLUSION [17] S. Lutkemeier, T. Jungeblut, H. K. O. Berge, S. Aunet,
In this paper, we have presented our 10T SRAM cell that M. Porrmann, and U. Ruckert, “A 65 nm 32 b subthreshold processor
with 9T multi-Vt SRAM and adaptive supply voltage control,” IEEE J.
uses a 4T read port and SE RBL. RBL is precharged at half the Solid-State Circuits, vol. 48, no. 1, pp. 8–19, Jan. 2013.
supply voltage and, during the read operation, is charged or [18] I. J. Chang, J.-J. Kim, S. P. Park, and K. Roy, “A 32 kb 10T
discharged according to the bit stored. For a read-0 operation, sub-threshold SRAM array with bit-interleaving and differential read
RBL discharges through TG and nMOS transistor, and for scheme in 90 nm CMOS,” IEEE J. Solid-State Circuits, vol. 44, no. 2,
pp. 650–658, Feb. 2009.
the next precharge, RBL is supplied current by VP. For a [19] E. Karl et al., “A 0.6 V, 1.5 GHz 84 Mb SRAM in 14 nm FinFET
read-1 operation, RBL is charged from vdd/2 to vdd by virtual CMOS technology with capacitive charge-sharing write assist circuitry,”
read port. For the next precharge, RBL level is decreased and IEEE J. Solid-State Circuits, vol. 51, no. 1, pp. 222–229, Jan. 2016.
[20] K. Nii et al., “A 45-nm single-port and dual-port SRAM family
current flows from RBL to VP. By precharging through VP with robust read/write stabilizing circuitry under DVFS environment,”
(which is half vdd) and charge recycling mechanism, LP10T in Proc. IEEE Symp. VLSI Circuits, Jun. 2008, pp. 212–213.
only dissipates half the average read dynamic power compared [21] F. Hamzaoglu et al., “A 3.8 GHz 153 Mb SRAM design with dynamic
stability enhancement and leakage reduction in 45 nm high-k metal
with 6T. In 65 nm, performance figure (mV /μW ) of 1.83× gate CMOS technology,” IEEE J. Solid-State Circuits, vol. 44, no. 1,
of 6T is achieved at 1 V, and 1.84× on average at different pp. 148–154, Jan. 2009.
supply levels. Due to decoupling of internal nodes, RSNM is [22] T. Song, S. Kim, K. Lim, and J. Laskar, “Fully-gated ground
increased by 2.3× compared with 6T. Overall leakage power 10T-SRAM bitcell in 45 nm SOI technology,” Electron. Lett., vol. 46,
no. 7, pp. 515–516, Apr. 2010.
of LP10T is similar to 6T, however, RBL leakage is reduced [23] K. Kanda, T. Miyazaki, M. K. Sik, H. Kawaguchi, and T. Sakurai, “Two
by more than 3 orders of magnitude, and thus a higher number orders of magnitude leakage power reduction of low voltage SRAM’s
of cell could be integrated on a single column. by row-by-row dynamic VDD control (RRDV) scheme,” in Proc. 15th
Annu. IEEE Int. ASIC/SoC Conf., Sep. 2002, pp. 381–385.
[24] E. Seevinck, F. J. List, and J. Lohstroh, “Static-noise margin analysis
R EFERENCES of MOS SRAM cells,” IEEE J. Solid-State Circuits, vol. 22, no. 5,
pp. 748–754, Oct. 1987.
[1] T. Mudge, “Power: A first-class architectural design constraint,” Com-
puter, vol. 34, no. 4, pp. 52–58, Apr. 2001. [25] M. H. Abu-Rahma, M. Anis, and S. S. Yoon, “Reducing SRAM power
[2] N. S. Kim et al., “Leakage current: Moore’s law meets static power,” using fine-grained wordline pulsewidth control,” IEEE Trans. Very Large
Computer, vol. 36, no. 12, pp. 68–75, Dec. 2003. Scale Integr. (VLSI) Syst., vol. 18, no. 3, pp. 356–364, Mar. 2010.
[3] G. Venkatesh et al., “Conservation cores: Reducing the energy of mature [26] M. Alioto, “Ultra-low power VLSI circuit design demystified and
computations,” ACM SIGARCH Comput. Archit. News, vol. 38, no. 1, explained: A tutorial,” IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 59,
p. 205, 2010. no. 1, pp. 3–29, Jan. 2012.
[4] N. Goulding-Hotta et al., “The GreenDroid mobile application proces- [27] G. Pasandi and S. M. Fakhraie, “A 256-kb 9T near-threshold SRAM
sor: An architecture for silicon’s dark future,” IEEE Micro, vol. 31, no. 2, with 1k cells per bitline and enhanced write and read operations,”
pp. 86–95, Mar./Apr. 2011. IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 23, no. 11,
[5] A. Pavlov and M. Sachdev, CMOS SRAM Circuit Design and Parametric pp. 2438–2446, Nov. 2015.
Test in NANO-Scaled Technologies: Process-Aware SRAM Design and [28] S. Ahmad, M. K. Gupta, N. Alam, and M. Hasan, “Single-ended
Test, vol. 40. The Netherlands: Springer, 2008. Schmitt-trigger-based robust low-power SRAM cell,” IEEE Trans. Very
[6] J. Singh, S. P. Mohanty, and D. Pradhan, Robust SRAM Designs and Large Scale Integr. (VLSI) Syst., vol. 24, no. 8, pp. 2634–2642,
Analysis. New York, NY, USA: Springer-Verlag, 2012. Aug. 2016.
[7] M. A. Rahma and M. Anis, Nanometer Variation-Tolerant SRAM: [29] B. D. Yang, “A low-power SRAM using bit-line charge-recycling for
Circuits and Statistical Design for Yield. New York, NY, USA: read and write operations,” IEEE J. Solid-State Circuits, vol. 45, no. 10,
Springer-Verlag, 2012. pp. 2173–2183, Oct. 2010.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
MAROOF AND KONG: 10T SRAM USING HALF-VDD PRECHARGE AND ROW-WISE DYNAMICALLY POWERED READ PORT FOR LOW SWITCHING POWER 11
[30] K. Kim, H. Mahmoodi, and K. Roy, “A low-power SRAM using bit-line Naeem Maroof (S’14–M’16) received the B.Sc.
charge-recycling technique,” in Proc. ACM/IEEE Int. Symp. Low Power degree in computer engineering from the COM-
Electron. Design (ISLPED), Aug. 2007, pp. 177–182. SATS Institute of Information Technology (CIIT),
[31] B.-D. Yang and L.-S. Kim, “A low-power SRAM using hierarchical bit Islamabad, Pakistan, in 2006, the M.Sc. degree in
line and local sense amplifiers,” IEEE J. Solid-State Circuits, vol. 40, electronic communications and computer engineer-
no. 6, pp. 1366–1376, Jun. 2005. ing from the University of Nottingham, Nottingham,
[32] H. Jeong, T. Kim, T. Song, G. Kim, and S. O. Jung, “Trip-point bit- U.K., in 2007, and the Ph.D. degree in electronics
line precharge sensing scheme for single-ended SRAM,” IEEE Trans. engineering from Hanyang University, Seoul, South
Very Large Scale Integr. (VLSI) Syst., vol. 23, no. 7, pp. 1370–1374, Korea, in 2016.
Jul. 2015. From 2007 to 2012, he was a full-time
[33] H. Noguchi et al., “Which is the best dual-port SRAM in 45-nm Faculty Member with the Department of Electrical
process technology?—8T, 10T single end, and 10T differential,” in Engineering, CIIT. He is currently a Post-Doctoral Research Fellow with the
Proc. IEEE Int. Conf. Integr. Circuit Design Technol. Tut., Jun. 2008, Integrated System Design Laboratory, College of Information and Commu-
pp. 55–58. nication Engineering, Sungkyunkwan University, Suwon, South Korea. His
[34] R. W. Mann and B. H. Calhoun, “New category of ultra-thin current research interests include low power and reliable integrated circuit
notchless 6T SRAM cell layout topologies for sub-22nm,” in Proc. design, memory architecture (SRAM, DRAM, NVM, RRAM, and hybrids),
12th Int. Symp. Quality Electron. Design (ISQED), Mar. 2011, and the power management ICs.
pp. 1–6.
[35] I. M. Filanovsky and A. Allam, “Mutual compensation of mobility Bai-Sun Kong (S’94–M’00) received the B.S.
and threshold voltage temperature effects with applications in CMOS degree in electronics engineering from Yonsei
circuits,” IEEE Trans. Circuits Syst. I, Fundam. Theory Appl., vol. 48, University, Seoul, South Korea, in 1990, and the
no. 7, pp. 876–884, Jul. 2001. M.S. and Ph.D. degrees in electrical engineering
[36] J. C. Ku and Y. Ismail, “On the scaling of temperature-dependent from the Korea Advanced Institute of Science and
effects,” IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., Technology, Daejon, South Korea, in 1992 and 1996,
vol. 26, no. 10, pp. 1882–1888, Oct. 2007. respectively.
[37] C. Park et al., “Reversal of temperature dependence of From 1996 to 1999, he was with LG Semicon
integrated circuits operating at very low voltages,” in Company Ltd., Seoul, where he was involved in
Proc. Int. Electron Devices Meeting (IEDM), Dec. 1995, the design of high-bandwidth DRAMs including
pp. 71–74. 18 M CONCURRENT RDRAM, 72 M Concur-
[38] A. Bellaouar, A. Fridi, M. I. Elmasry, and K. Itoh, “Supply voltage rent RDRAM, and 128 M Direct RDRAM. From 2000 to 2005, he was
scaling for temperature insensitive CMOS circuit operation,” IEEE with Korea Aerospace University, Goyang, South Korea, where he was
Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 45, no. 3, an Associate Professor at the School of Electronics, Telecommunication,
pp. 415–417, Mar. 1998. and Computer Engineering. In 2005, he joined Sungkyunkwan University,
[39] D. Wolpert and P. Ampadu, “Temperature effects in semiconductors,” Suwon, South Korea, where he is currently a Professor at the College of
in Managing Temperature Effects in Nanoscale Adaptive Information and Communication Engineering. His current research interests
Systems. New York, NY, USA: Springer-Verlag, 2012, include microprocessor, and memory architecture and circuit design, and VLSI
pp. 15–33. circuit and system design for low-power and/or high-speed applications.