ArticlePDF Available

A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM

Authors:

Abstract and Figures

A 1.2-V 72-Mb double data rate 3 (DDR3) SRAM achieves a data rate of 1.5 Gb/s using dynamic self-resetting circuits. Single-ended main data lines halve the data line precharging power dissipation and the number of data lines. Clocks phase shifted by 0°, 90°, and 270° are generated through the proposed clock adjustment circuits. The latter circuits make input data sampled with an optimized setup/hold window. On-chip input termination with a linearity error of ±4.1% is developed to improve signal integrity at higher data rates. A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM is fabricated in a 0.10-μm CMOS process with five metals. The cell size and the chip size are 0.845 μm<sup>2</sup> and 151.1 mm<sup>2</sup>, respectively.
Content may be subject to copyright.
This document is downloaded from DR-NTU, Nanyang Technological
University Library, Singapore.
Title A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM.
Author(s)
Cho, Uk Rae.; Kim, Tae Hyoung.; Yoon, Yong Jin.; Lee,
Jong Cheol.; Bae, Dae Gi.; Kim, Nam Seog.; Kim, Kang
Young.; Son, Young Jae.; Yang, Jeong Suk.; Sohn, Kwon
Il.; Kim, Sung Tae.; Lee, In Yeol.; Lee, Kwang Jin.; Kang,
Tae Gyoung.; Kim, Su Chul.; Ahn, Kee Sik.; Byun, Hyun
Geun.
Citation
Cho, U. R., Kim, T. H., Yoon, Y. J., Lee, J. C., Bae, D. G.,
Kim, N. S., et al. (2003). A 1.2-V 1.5-Gb/s 72-Mb DDR3
SRAM. IEEE Journal of Solid State Circuits, 38(11), 1943
-1951.
Date 2003
URL http://hdl.handle.net/10220/6438
Rights
© 2003 IEEE. Personal use of this material is permitted.
However, permission to reprint/republish this material for
advertising or promotional purposes or for creating new
collective works for resale or redistribution to servers or
lists, or to reuse any copyrighted component of this work
in other works must be obtained from the IEEE. This
material is presented to ensure timely dissemination of
scholarly and technical work. Copyright and all rights
therein are retained by authors or by other copyright
holders. All persons copying this information are
expected to adhere to the terms and constraints invoked
by each author's copyright. In most cases, these works
may not be reposted without the explicit permission of the
copyright holder. http://www.ieee.org/portal/site This
material is presented to ensure timely dissemination of
scholarly and technical work. Copyright and all rights
therein are retained by authors or by other copyright
holders. All persons copying this information are
expected to adhere to the terms and constraints invoked
by each author's copyright. In most cases, these works
may not be reposted without the explicit permission of the
copyright holder.
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 38, NO. 11, NOVEMBER 2003 1943
A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM
Uk-Rae Cho, Tae-Hyoung Kim, Yong-Jin Yoon, Jong-Cheol Lee, Dae-Gi Bae, Nam-Seog Kim, Kang-Young Kim,
Young-Jae Son, Jeong-Suk Yang, Kwon-Il Sohn, Sung-Tae Kim, In-Yeol Lee, Kwang-Jin Lee, Tae-Gyoung Kang,
Su-Chul Kim, Kee-Sik Ahn, and Hyun-Geun Byun
Abstract—A 1.2-V 72-Mb double data rate 3 (DDR3) SRAM
achieves a data rate of 1.5 Gb/s using dynamic self-resetting
circuits [5]. Single-ended main data lines halve the data line
precharging power dissipation and the number of data lines.
Clocks phase shifted by 0
,90 , and 270 are generated through
the proposed clock adjustment circuits. The proposed clock
adjustment circuits make input data sampled with optimized
setup/hold window. On-chip input termination with the linearity
error of
4.1% is developed to improve signal integrity at higher
data rates. A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM is fabricated in
a 0.10-
m CMOS process with five metals. The cell size and the
chip size are 0.845
m and 151.1 mm , respectively.
Index Terms—1.5 Gb/s, 72 Mb, CMOS memory circuits, DDR3,
high-speed SRAM, on-chip termination.
I. INTRODUCTION
H
IGH-SPEED SRAMs are used in ultrafast systems,
such as networking systems, servers and workstations.
The development of these systems has dramatically increased
the performance requirements of high-speed SRAMs. The
main features asked on high-speed SRAMs are higher data
rate and higher density, which are directly related to system
performance, but it is quite difficult to implement SRAM with
high density and high data rate. In high-speed SRAM, 32 Mb
has been considered as the limit due to the chip size, standby
current, speed, etc.
This paper describes a 1.2-V 1.5-Gb/s 72-Mb double data
rate 3 (DDR3) SRAM which satisfies the data rate and the
density required in recent ultrafast systems. The 72-Mb SRAM
is fabricated using a 0.1-
m CMOS technology with the
implemented SRAM, breaking the 1-
m barrier for the cell
size. Some papers have been published including six-transistor
(6T) SRAM cells near the 1-
m barrier [1]–[3]. One paper
described a 6T embedded SRAM cell breaking the 1-
m
barrier with the cell size of 0.998 m [4]. Fig. 1 shows the
trend in SRAM size recently reported. The implemented cell
size of this work is 0.845
m , which is the smallest size
to date. The chip size is 151.1 mm
. To halve the data line
precharging power dissipation and the number of data lines,
single-ended main data lines (SMDLs) are designed using
dynamic self-resetting circuits [5]. The SRAM operates in
two user-selectable modes: clock-aligned mode (CA mode)
and clock-centered mode (CC mode). In CA mode, clocks
phase shifted by 0
,90 , and 270 from the external clock are
generated through the proposed clock adjustment circuits to
Manuscript received April 9, 2003; revised June 23, 2003.
The authors are with the SRAM Memory Division, Samsung Electronics,
Gyeonggi-Do 445-701, Korea (e-mail: purekth.kim@samsung.com).
Digital Object Identifier 10.1109/JSSC.2003.818137
Fig. 1. Trend of SRAM cell size reported.
sample address, control, and input data. In CC mode, clocks
synchronized with the rising and falling edge of the external
clock are generated and used to sample input signals. On-chip
input termination with the linearity error of
4.1% is developed
to improve signal integrity at higher data rate. The impedance
of input termination is programmable by external resistor RT. A
programmable impedance controller (PIC) tracks the process,
voltage, and temperature (PVT) variations of the termination
impedance.
The detailed architecture is described in Section II. In
Section III, the SMDL scheme is presented. The proposed
clock adjustment circuit is explained in Section IV. On-chip
input termination is explained in Section V. Finally, the conclu-
sions and hardware results are presented in Section VI.
II. C
HIP ARCHITECTURE
Fig. 2 briefly shows a simplified architecture of the 72-Mb
DDR3 SRAM. The SRAM is configured as either 2M
36 or
4M
18. The SRAM is divided into four mats each having nine
I/Os, and each mat is subdivided by four submats for double-
data-rate and burst-mode operations. These submats consist of
32blocks,16blocksintheleftand16blocksintheright,sharing
section wordlines and wordline drivers. Each block has nine
I/Os and each I/O is divided into 512 wordlines by 32 columns.
Two submats are activated in each mat, and one block is ac-
tivated in each selected submats. That is, eight blocks are ac-
cessed in four mats at the same time to provide the 72 bits re-
quired in double-data-rate operation.
The wordline decoder is composed of a wordline predecoder,
main wordline decoders, and section wordline decoders. The
wordline predecoder is located in the center of the chip and the
main wordline decoders are located in the vertical channels be-
tween mat A and mat B and between mat C and mat D. Finally,
the section wordline decoders are located in the center of each
0018-9200/03$17.00 © 2003 IEEE
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
1944 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 38, NO. 11, NOVEMBER 2003
Fig. 2. Architecture of 72-Mb DDR3 SRAM.
block. The bitline decoder consists of a bitline predecoder and
main bitline decoders. The bitline predecoder is in the center
of the chip and the main bitline decoders are distributed at the
bottom of each block. The main data lines are distributed in
four mats and divided into two stages. The first main data lines
are located in the horizontal channels between submat X and
submat Z, and submat Y and submat W. The second main data
lines are routed vertically across the center of each mat to make
balanced timing and minimum capacitance [6]. All I/O circuits
and control circuits are located in the horizontal channel in the
center of the chip.
III. SMDL
A. SMDL Scheme
In high-speed SRAMs suchas the 4M DDRSRAM, differen-
tial main data lines have been used to reduce the delay through
data line [7].In the differentialmaindataline scheme, one of the
two differential data lines is precharged regardless of the data
after finishing operation. Furthermore, the main data lines have
large capacitive loads, causing large precharging power dissi-
pation. In the SMDL scheme, the data line precharging opera-
tion is executed when the data is only “0”. If the probability of
“0” is equal to that of “1”, SMDL reduces data line precharging
power dissipation and the number of main data lines by half.
Fig. 3 shows the SMDL scheme for an I/O in a mat. The outputs
of 16 sense amplifiers in the near or far area are connected to
one of four first main data lines (MDL1) and four MDL1s are
connected to second main data lines (MDL2) through nMOS
switches and drivers. The sense amplifier outputs of two sub-
mats are tied together because they are not selected at the same
time in any burst-operation situations. Only one block is se-
lected in submats W and Y. The other block is selected in sub-
mats X and Z.
When one out of 64 blocks is activated in two submats, the
output of the sense amplifier in the selected block is transferred
to MDL2 through nMOS switches and drivers. After that,
the sampling clock RS_KCORE samples the data of MDL2
and transmits it to the data output buffer DLATCH. MDL1s,
MDL_SUM, and MDL2 are initially precharged to
. If the
selected sense amplifier output is “0”, one MDL1 connected
to the selected sense amplifier falls to Gnd while the other
MDL1s remains at
. If one of four MDL1s falls to Gnd,
MDL_SUM also becomes Gnd. After
from the falling
edge of MDL_SUM, the reset signal turns the pMOS on and
MDL_SUM is precharged to
by a dynamic self-resetting
circuit [5]. MDL_SUM can be precharged with the pulsewidth
of
because it has a small capacitive load compared to the
main data lines. On the contrary, if the selected sense amplifier
output is “1”, the MDL1 connected to the selected sense ampli-
fier as well as the other MDL1s remain at
. In this case, four
MDL1s turn the nMOS drivers off, causing MDL_SUM and
MDL2 to remain at
. That is, all nodes are in their initial
states without additional power dissipation for precharging
of MDL1s, MDL_SUM, and MDL2 when transmitting “1”
to DLATCH. Precharging MDL1s, MDL_SUM, and MDL2
occurs only when the sense amplifier output data is “0”. If the
probability of “0” is equal to that of “1”, power dissipation is
reduced by half. In addition, the number of data lines is also
reduced by half. Power dissipation in main data lines is reduced
from 173 to 80 mA by adopting the SMDL scheme.
B. Sampling Clock (RS_KCORE) Generator
Another important point in SMDL is the timing of the sam-
pling clock RS_KCORE. Since the data window of MDL2 is
just
, controlling the timing of RS_KCORE is very impor-
tant to transfer the right data to DLATCH. In the conventional
design, the sampling clock for DLATCH is derived from the ex-
ternalclock by adding delays. Since thedata path isindependent
of the clock path, their timing difference becomes sensitive to
PVTvariations.Theamountoftimingdifferencecaused byPVT
variations can be neglected in low data rates. In high data rates,
however, an automatic internal sampling clock generator, which
is robust over PVT variations, is required.
As shown in Fig. 4, the sampling clock generator is com-
posed of the same circuits used in SMDL except that it is
differential. One I/O out of nine I/Os has differential outputs.
The differential outputs of one sense amplifier in a selected
block are used to generate the sampling clock. The operation
is similar to that of SMDL. In precharge state, differential
outputs of sense amplifiers are both precharged to
, but in
the access state, the differential outputs of the sense amplifier
in a selected block become complementary. The sampling
clock is generated from the moment when the outputs of the
selected sense amplifier start to be complementary. The com-
plementary sense amplifier outputs make MDL_SUM_T and
MDL_SUM_C complementary. This makes the MDL2_T_C
and RS_KCORE signal go high. After
from the
moment when RS_KCORE becomes complementary, the
path from MDL2_T_C to RS_KCORE is disconnected and
RS_KCORE becomes low. EN is an enabling signal indicating
read operation. In this way, a sampling clock having pulsewidth
of
is generated. Like SMDL, one of MDL_SUM_T
or MDL_SUM_C is precharged to
after from the
moment when MDL_SUM_T and MDL_SUM_C become
complementary. The timing between RS_KCORE and reset
signals is important because they are pulsed signals. For
RS_KCORE to sample the data safely, data in MDL2 needs
a setup margin. The timing difference between MDL2 and
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
CHO et al.: A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM 1945
Fig. 3. Schematic of SMDL scheme.
Fig. 4. Schematic of sampling clock (RS_KCORE) generator.
RS_KCORE plays the role of a setup time. To obey the setup
time, data must arrive at MDL2 earlier than at RS_KCORE. In
addition, the pulsewidth of RS_KCORE must be guaranteed
to have the time for sampling. If the reset signal (/reset_T,
/reset_C) is enabled while RS_KCORE is still sampling
MDL2, the pulsewidth of RS_KCORE is reduced. The reduced
pulsewidth of RS_KCORE can cause sampling to fail due to
insufficient sampling time. To prevent this,
must be larger
than
to guarantee the pulsewidth of RS_KCORE,
. The delay from the sense amplifier output to
MDL_SUM_T and MDL_SUM_C is the same as that from
MDL1 to MDL_SUM in SMDL because the same circuits and
the same layout architectures are used. Only the control circuit
delay from four-input
NAND gate to RS_KCORE is affected by
PVT variations. The timing variances of the sampling clock are
minimized because the delay from the four-input
NAND gate to
RS_KCORE is relatively small compared with that of the sense
amplifier output to MDL_SUM_T and MDL_SUM_C. In this
sampling clock generator, the timing of the sampling clock is
made highly correlated with that of the data. As a result, the
timing of RS_KCORE becomes robust over PVT variations.
Fig. 5 shows the timing of SMDL.
IV. C
LOCK ADJUSTMENT CIRCUIT (CAC)
A. Timing Diagram of DDR3
The DDR3 SRAM supports two user-selectable modes:
clock-centered (CC) mode and clock-aligned (CA) mode.
In CC mode, all input signals are clock centered. Therefore,
internal clocks synchronized with the rising and falling edges
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
1946 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 38, NO. 11, NOVEMBER 2003
Fig. 5. Timing of SMDL scheme.
Fig. 6. Input timing diagram of DDR3SRAM and internal clocks in CA mode.
of the external clock are needed to sample input signals. In CA
mode, address and control signals are clock centered, but input
data is clock aligned. Therefore, clocks phase shifted by 90
and 270 are needed to sample input data with the optimized
setup/hold window. Fig. 6 shows the timing diagram of the
DDR3 (CA mode) input signals. Clocks for address, clock1,
and clock2 are generated from clock adjustment circuits by
shifting the phase of the input clock. The clock for address is
synchronized with the rising edge of the external clock. Clock1
and clock2 are generated from the rising and falling edges of the
external clock, shifted in phase by 90
. Two clock adjustment
circuits are used to process the rising and the falling edges.
B. Clock Adjustment Circuit (CAC)
Fig. 7 showsthe principle ofthe multiphase shiftscheme. The
input clock goes through twopaths:the fast path and thedelayed
path, which consists of a delay block (Td) and a forward delay
chain (FWD). A phase comparator compares the phases of the
two paths and decides the length of FWD where the phase dif-
ference becomes 360
. If the delay Tm is added in both paths,
the delay of FWD causing a phase difference of 360
becomes
Tclk
Td Tm. The delayed clock goes through backward
delay chain (BWD), which has the same delay as FWD. The
Fig. 7. Principle of multiphase shift.
Fig. 8. Block diagram of the proposed clock adjustment circuit.
total delay is Td Tclk Td Tm Td Tclk Tm.
This means that the output clock phase shifted by
Tm Tclk
is generated after two clock cycles. In [8], Tm is zero. There-
fore, only a clock synchronized with an external clock could be
made, but, in the proposed CAC, Tm is not zero and the PVT
variationsof Tm are automatically compensated. By controlling
the amount of the delay Tm, clocks phase shifted by anydegrees
can be generated.
Fig. 8 shows the block diagram of the proposed CAC.
The CAC is composed of clock receivers (CLK_RCV), de-
tection circuit mirror delay (DT_Mirror), JTAG controller
(JTAG_CONT), clock driver (CLK_DRV), delay chain for
0
, and delay chain for 90 . The input clock goes through the
shorter path to make the clockfor address and control sampling.
The total delay of the clock for address and control is
Tclk Td Td Td Td Td
Tclk Td Td Td Td Td Td Td
That is, the internal clock synchronized with the external clock
is generated in two clock cycles, as described in [8]. The input
clock goes through the proposed longer path to make the clock
for data. The total delay of the clock for data is
Tclk
Tclk
Td Td Td Td Td
Tclk Td Td Td Td Td Td
Td Tclk Tclk Td4 Td Td
which means that internal clock phase shifted by 90 is gen-
erated after four clock cycles. Tm in Fig. 6 is 1/8 Tclk in the
proposed clock adjustment circuits. DT_Mirror is added in the
delayed path to compensate for the delay of the phase compara-
tors in the delay chains. CLK_RCV in the delayed path is to
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
CHO et al.: A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM 1947
Fig. 9. Block diagram of the delay chain for 90 phase shift.
compensate for the delay of CLK_RCV receiving the external
clock.
Fig. 9 shows how the proposed delay chain for 90
is con-
structed. As mentioned before, a clock phase shifted by 90
is
generated by adding delay units
in both the fast path and the
delayed path. The phase comparators
compare the phase of
clocks in the fast path and in the delayed path, and select aclock
path where the phase difference is 360
. But the delay of delay
unit changes over PVT variations. Therefore, the clock path dif-
fers even at a fixed clock frequency. To eliminate this effect
of PVT variations, the delay units for additional delay are uni-
formly distributed over the delay chains and the CAC controls
automatically the number of delay units to maintain the amount
of additional delay equal to 90
. In the following, we show how
the delay units for 90
phase shift are distributed. If is the
number of delay units for 0
phase shift, the total forward delay
can be obtained by (1). If
is the number of additional delay
units for the phase shift of Tclk
, the total forward delay be-
comes (2). Let us assume that one delay unit is added at every
units in both paths. Then the total additional delay can be ob-
tained as in (3) by dividing the total forward delay by
. This is
equal to Tclk
and . Finally, (4) can be obtained by solving
(1), (2), and (3). With
and , one can generate
a clock phase shifted by 90
. This is equivalent of adding one
delay unit at every nine delay units for 90
phase shift. Further-
more,
and are independent from the number of delay units
for 0
and 90 phase shift. Therefore, PVT variations changing
the
and do not affect the phase of the output clock because
the change in
and are compensated by the change in unit
delay, resulting in a phase shift of 90
.
Td
Tclk (1)
Td
Tclk
Tclk
(2)
Td
Tclk
(3)
Td Tclk
Tclk Tclk
for phase shift
(4)
The jitter of the clock shifted by 0
is 13 ps and that of the
clock shifted by 90
is 40 ps. In addition, the amount of phase
Fig. 10. Test waveforms of the proposed clock adjustment circuit.
shift can be controlled by JTAG_CONT to trim the setup/hold
time and the data valid window. The covered range of the phase
shift by JTAG_CONT is
20 with 7 steps. Fig. 10 shows the
testresult of theproposedCAC.Twoclocksphaseshiftedby90
from the rising and falling edge of external clock are measured.
V. O
N-CHIP TERMINATION
A. Termination Scheme
Termination has been used in high-data-rate systems to
prevent unwanted reflections and improve signal integrity
[9]. Off-chip termination has been widely used, but there
is a difference between on-chip termination and off-chip
termination. Off-chip termination has an unterminated stub
composed of package parasitic and internal circuitry. This
unterminated stub causes relatively large reflections compared
with on-chip termination due to the impedance mismatch.
Therefore, on-chip termination is developed to remove the
effect of the unterminated stub and improve signal integrity.
An on-chip input termination of the center-tapped-termination
(CTT) type is designed by using CMOS transistors. Fig. 11(a)
shows the input termination scheme of data pad. The off-chip
driver (OCD) is activated during the read operation and the ter-
minator is activated during the write operation. The impedance
of the OCD and the terminator is controlled by digital codes
generated by two PICs with reference resistors RQ and RT [10].
In 72-Mb DDR3 SRAM, RQ is 125
and RT is 150 . This
means that the output impedance of the off-chip drivers is 25
,
which is equal to the board characteristic impedance, and the
input impedance of terminators in the data pads is 75
. Ter-
mination impedance is not matched to the board characteristic
impedance to reduce the dc current dissipated in the terminator.
To reduce input capacitance, 1/5 of the OCD transistors are
used as terminators during nonread operation. Fig. 11(b) and
(c) shows the simplified schematic of the terminator for the
data pad and the address/control/clock pad. The terminator
for the data pads consists of transistor arrays of nMOS and
diode-connected nMOS pairs for pulldown, and pMOS and
diode-connected pMOS pairs for pullup. The terminator for
address/control/clock pads is composed of transmission gate
arrays of nMOS and pMOS pairs.
-bit impedance codes for
pullup and pulldown are generated from the PIC independently.
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
1948 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 38, NO. 11, NOVEMBER 2003
(a)
(b)
(c)
Fig. 11. (a) Input termination scheme of data pad. (b) Simplified schematic of
terminator for data pad. (c) Simplified schematic of terminator for address pad.
Therefore, the pullup impedance code (Pn) and pulldown
impedance code (Nn) are not always complementary.
B. Programmable Impedance Controller
The PIC generates digital impedance codes which make
the output impedance of driver and the input impedance of
the terminator close to RQ and RT within PIC resolution.
Because of the different characteristics of nMOS and pMOS
after fabrication, the PIC generates impedance codes for pullup
and pulldown independently [10]. In the previously published
approaches, the pulldown code is generated from the pullup
code which has a quantization error [10]. As a result, the
accuracy of the pulldown code is dependent upon the pullup
code. To eliminate the dependency of the pulldown code on
the pullup code, a new impedance code generation scheme
is proposed. Fig. 12 shows the block diagram of the PIC. By
feedback operation of AMP1 and a pMOS (M0), VZQ becomes
VREF (
VDDQ ). The reference current of VREF/RQ flows
through M0 and RQ. M3 copies the current in M0 to make
pulldown impedance code and M1, M2, M4, and AMP2 copy
the current in M0 to make pullup impedance code. Therefore,
pulldown impedance code is independent from the pullup code.
Control blocks control the size of the detector (NDET, PDET)
to make the detector output (DCUR, UCUR) equal to VREF.
When DCUR and UCUR become VREF, the bias condition
of NDET and PDET is equal to that of RQ and M0, which
means that impedance of NDET and PDET is RQ. But, due
to the digital control of NDET and PDET, DCUR and UCUR
Fig. 12. Block diagram of the PIC.
have quantization error. Quantization error can be reduced by
increasing the number of control bits and the resolution. The
quantization error of the designed PIC is within 2% with five
bits when RQ is 125
. This means that an error of 0.5 exists
in the off-chip driver when the target impedance is 25
.
C. Linearity of Terminator
The input impedance of terminator should be linear to main-
tain equal channel environment for changing pad voltage and to
improve the signal integrity and the input data valid windows in
a system. Fig. 13(a) shows the linearity of an ideal terminator.
In the ideal case, the relationship between pad voltage and input
current is perfectly linear. But due to the nonlinear characteris-
tics in transistors, there exists linearity error. Fig. 13(b) shows
the linearity error of the terminator. The impedance of termi-
nator is evaluated either by forcing a voltage to a pad and mea-
suring the current flowing into a pad, or by measuring pullup
impedance and pulldown impedance, respectively. Measuring
pullup impedance and pulldownimpedance, respectively, is just
executed in test mode. Fig. 13(b) shows the result of forcing a
voltageandmeasuringthecurrent. The total linearityerrorofthe
terminator is
4.1% over PVT variations. The linearity error is
measured between 0.3 and 1.2 V.
D. Eye Diagram
Fig. 14 shows the eye diagram of the input data at a data
rate of 1.5 Gb/s and a power supply of 1.5 V. In the case of
no termination, the signal swing is larger than that in on-chip
termination. But the noise and the reflections increase jitter and
reduce the input data valid window. In the case of on-chip ter-
mination, the signal swing is reduced because of the dc current
pathintermination.Butdue to the reduced noise and reflections,
the wider input data valid window is obtained. The data input
valid window with on-chip termination is 480 ps at 750 mV
200 mV when the terminator impedance and the board char-
acteristic impedance is 75 and 25
, respectively. Included in
theresults are 10% PVT variations,10% termination impedance
variations, and system models.
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
CHO et al.: A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM 1949
(a)
(b)
Fig. 13. (a) Relationship between pad voltage and input current in ideal termination. (b) Linearity error of the designed terminator over PVT variations.
Fig. 14. Eye diagram of on-chip termination versus off-chip termination.
VI. HARDWARE RESULTS
Fig. 15 shows the hardware results at a data rate of 1.5 Gb/s
and 750-MHz core frequencies. K is the external clock of
750 MHz. Data (DQ0, DQ1) of 1.5 Gb/s is well aligned with
the echo clock (CQ, CQb). DQ latency is 2.1 ns from the time
that an address is captured. Fig. 16 shows the chip micrograph
of the 72-Mb SRAM.
VII. C
ONCLUSION
A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM with the highest den-
sity and the smallest cell size to date in 6T SRAM is devel-
oped. The SMDL scheme is adopted to reduce main data line
precharging power dissipation and the number of main data
Fig. 15. Hardware results.
Fig. 16. Chip micrograph of 72-Mb SRAM.
lines by half. A multiphase clock adjustment circuit is imple-
mented to generate a clock phase shifted by 90
. With this clock
adjustment circuit, input data can be sampled with optimized
setup/hold window. Finally, on-chip termination controlled by
PIC is developed to improve signal integrity and to remove the
effect of the unterminated stub.
The SRAM is fabricated in a 0.10-
m CMOS process tech-
nology with five metals. The standby current in the memory cell
and the powerdissipation in the critical read/write path are min-
imized with the regulated voltage of 1.2 V. Off-current of the
72-Mb DDR3 SRAM measured is 80 mA. Gate oxide thicker
than that of the memory cell is used to support the 1.5-V HSTL
interface. Average core power dissipation with 50% read oper-
ation and 50% write operation is 1.2 W including off-current at
750 MHz and 1.5 V. Cell size and chip size are 0.845
m and
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
1950 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 38, NO. 11, NOVEMBER 2003
TABLE I
F
EATURES OF 72-Mb DDR3 SRAM
151.1 mm respectively. Table I summarizes the features of the
72-Mb DDR3 SRAM.
R
EFERENCES
[1] S. Huang et al., “High performance 50 nm CMOS devices for micropro-
cessor and embedded processor core application,” in IEDM Tech. Dig.,
2001, pp. 237–240.
[2] S. Parihar et al., “A high density 0.10
m CMOS technology using low
K dielectric and copper interconnect,” in IEDM Tech. Dig., 2001, pp.
249–252.
[3] S. B. Kim et al., “A 1.29
m
full CMOS ultra-low power SRAM cell
with 0.12
m spacer-on-stopper (SOS) CMOS technology,” in IEDM
Tech. Dig., 2001, pp. 253–256.
[4] K. Tomita et al., “Sub-
m high density embeddedSRAM technologies
for100 nm generation SOC and beyond,”inSymp.VLSITech.Dig.Tech.
Papers, 2002, pp. 14–15.
[5] T. Chappell et al., “A 2-ns cycle, 3.8-ns access 512-kb CMOS ECL
SRAM with a fullypipelined architecture,” IEEE J. Solid-StateCircuits,
vol. 26, pp. 1577–1584, Nov. 1991.
[6] H. Pilo et al., “An 833 MHz 1.5 W 18 Mb CMOS SRAM with
1.67Gb/s/pin,” in IEEE Int. Solid-State Circuits Conf. Dig. Tech.
Papers, Feb. 2000, pp. 266–267.
[7] H.-C. Park et al., “A 833 Mb/s 2.5V 4 Mb double data rate SRAM,”
in IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, vol. 464, Feb.
1998, pp. 356–357.
[8] T. Saeki, “A 2.5 ns clock access, 250 MHz 256 Mb SDRAM with
synchronous mirror delay,” IEEE J. Solid-State Circuits, vol. 31, pp.
1656–1668, Nov. 1996.
[9] T. J. Gabara and D. W. Thompson, “A 200 MHz 100K ECL output
buffer for CMOS ASICs,” in Proc. IEEE ASIC Seminar and Exhibit,
Sept. 1990, pp. P8/5.1–P8/5.4.
[10] T. J. Gabara et al., “Digitally adjustable resistors in CMOS for high-
performance applications,” IEEE J. Solid-State Circuits, vol. 27, pp.
1176–1185, Aug. 1992.
Uk-Rae Cho was born in Sang-Ju, Korea, in 1962.
He received the B.S. degree in electronicengineering
fromKyung-pookNationalUniversity,Taegu,Korea,
in 1985.
In 1984, he joined the Device Solution Network
Division, Samsung Electronics Company, Yong-in,
Korea, where he is currently a Project Leader of the
SRAM Development Team. He holds eight interna-
tional patents with 18 patents pending. His research
interests include core circuits of ultrahigh-speed
SRAM, high-bandwidth interface design, design for
test, device modeling, and analysis of BGA package substrate.
Tae-Hyoung Kim was born in Cheongju, Korea,
in 1973. He received the B.S. and M.S. degrees in
electrical engineering from Korea University, Seoul,
Korea, in 1999 and 2001, respectively.
In 2001, he joined the Device Solution Network
Division, Samsung Electronics Company, Yong-in,
Korea. Since 2001, he has been working on the
design of high-speed SRAM memories. His research
interests are analog circuits and high-speed I/O
interface.
Yong-Jin Yoon was born in Seoul, Korea, in 1964.
He received the B.S. and M.S. degrees in electrical
engineering from Seoul National University in
1987 and 1989, respectively. Since 1998, he has
been working towards the Ph.D. degree at the same
university.
In 1989, he joined the Device Solution Network
Division, Samsung Electronics Company, Yong-in,
Korea, where he is currently a Member of Technical
Staff of the SRAM Development Team. His current
research is in gh-speed synchronous SRAM.
Jong-Cheol Lee received the B.S. and M.S. degrees
in electrical engineering from Yonsei University,
Seoul, Korea, in 1996 and 1998, respectively.
He joined Samsung Electronics Company,
Yong-in, Korea, in 1998 and has been working on
design and test of high-speed SRAM. His work
experience includes design and analysis of chip
critical timing path including decoder and sense
amplifier, power analysis with voltage regulators,
and floorplanning of high-speed SRAM. He is
currently involved in chip verification for Samsung
DDR3 SRAMs.
Dae-Gi Bae was born on June 1, 1970, in Young-Ju,
Korea. He received the B.S. degree in electric engi-
neering from In-ha University, Korea, in 1996.
He joined the Memory Division, Samsung Elec-
tronics Corporation, Kiheung, Korea, in 1996, where
he was involved in the circuit design of SRAM.
From 1996 to the present, he has been working on
the circuit design of high-speed synchronous SRAM
memories.
Nam-Seog Kim was born in Seoul, Korea, in 1974.
He received the B.S. degree in electrical engineering
from Korea University, Seoul, Korea, in 1997 and the
M.S. degree in electrical engineering from Seoul Na-
tional University in 1999.
Since 1999, he has been a Member of Tech-
nical Staff with Samsung Electronics, Kiheung,
Korea, where he is working on SRAM design
and high-speed I/O. His research interests include
low-power and high-performance circuits, clock
recovery circuits, and high-speed link design.
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
CHO et al.: A 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM 1951
Kang-Young Kim was born on June 6, 1970, in
Kang-wondo, Korea. He graduated from Suwon
Science College, Korea, in 1994.
He joined Samsung Electronics Corporation, Ki-
heung, Korea, in 1988, where he was involved in the
circuit design of high-density
NAND flash memories
and MROMs. He has been working on the circuit de-
sign of high-speed SRAMs.
Young-Jae Son was born on November 17, 1971, in
Busan, Korea. He received the B.E. degree in elec-
tronics engineering fromHanyangUniversity, Korea,
in 1994.
He joined the Memory Division, Samsung
Electronics, Kiheung, Korea, in 1998, where he
was involved in the circuit design of asynchronous
fast SRAMs. From 1999 to the present, he has been
working on the circuit design of ultrahigh-speed
SRAMs.
Jeong-Suk Yang was born in Taejon, Korea, on
March 27, 1976. She received the B.S. and M.S.
degrees in electronic engineering from Chungnam
University, Taejon, in 1999 and 2001, respectively.
She joined Samsung Electronics Company,
Kiheung, Korea, in 2001, where she has been
working on the circuit design of high-speed SRAM.
Currently, she is involved in developing the next
generation of high-speed SRAM.
Kwon-Il Sohn joined the Memory Division, Samsung Electronics Corporation,
Kiheung, Korea, in 1999. From 1999 to the present, he has been working on the
circuit design of ultrahigh-speed SRAMs.
Sung-Tae Kim was born on July 21, 1974, in
Kwang-Ju, Korea. He received the B.S. degree in
electronics engineering from A-Ju University, Korea,
in 2001.
He joined the Memory Division, Samsung Elec-
tronics Corporation, Kiheung, Korea, in 2001, where
he has been working on the circuit design of high-
speed SRAM.
In-Yeol Lee was born on May 12, 1971, in
Kyung-Nam, Korea. He received the B.E. degree
in electronics engineering from Kyungbuk National
University, Korea, in 1998.
He joined Samsung Electronics Corporation, Ki-
heung, Korea,in1998,where he has been workingon
thecircuit designof asynchronousfastSRAMs,high-
speed DDR SRAM, and ternary content-addressable
memory.
Kwang-Jin Lee was born on May 1, 1970, in
Chonnam, Korea. He received the B.S. and M.S.
degrees in electronics engineering from Korea Uni-
versity, Seoul, Korea, in 1994 and 1996, respectively.
He is currently working toward the Ph.D. degree in
electronics engineering at the same university.
He has been with Samsung Electronics, Kiheung,
Korea, since 1996. His research interests are in high-
speed memory design, special memory design such
as CAM, and arm-based MCU design.
Tae-GyoungKangwas born on October30, 1967, in
Jeju, Korea.
He joined Samsung Electronics Corporation,
Kiheung, Korea, in 1991, where he has been working
on the layout of BiCMOS,
NAND flash, 1T SRAM
(DRAM cell SRAM interface), and high-speed
SRAM.
Su-Chul Kim was born on Feb 23, 1970, in Pusan,
Korea. He received the B.S. and M.S. degrees in
electronic engineering from Korea University, Seoul,
Korea, in 1995 and 2002, respectively.
In 1995, he joined the Samsung Electronics
Company, Kiheung, Korea. Since then, he has been
working on the design of 4M DDR and 8M SP
and 32M DDR3 and high-speed SRAM memories.
Currently, he is involved in developing 32M DDR
SRAM memory.
Kee-Sik Ahn was born on September 26, 1964, in
Seoul, Korea. He received the B.S. degree in elec-
trical engineering from Seoul National University,
Seoul, Korea, in 1987.
He joined Samsung Electronics Corporation,
Kiheung, Korea, in 1987, where he was involved in
the modeling of BiCMOS device. From 1992 to the
present, he has been working on the circuit design of
high-speed synchronous SRAMs and asynchronous
fast SRAMs.
Hyun-Geun Byun was born on October 17, 1957,
in Kyungbook, Korea. He received the B.S degree
in electronic engineering from Kyungbook National
University, Taegu, Korea, in 1983.
He joined the Memory Development Division,
Samsung Eelctronics, Korea, in 1983, where he was
engaged in the development of low-power SRAM
and high-speed SRAM.
Authorized licensed use limited to: Nanyang Technological University. Downloaded on May 19,2010 at 08:11:51 UTC from IEEE Xplore. Restrictions apply.
... The BS needs to provide the computation result of the l-th service if and only if |K l | ≥ 1, ∀s l ∈ S. We adopt a commonly used computation model [13], in which the total number of 1 The delay of proactive offloading and executing the input data is sufficiently small compared with the deadline, and therefore the computation results can be reused for a long period of time in the future. 2 We assume a type of on-chip caching facilities that incurs negligible accessing delay, e.g., SRAM, with reading/writing speed of 1.5Gb/s and S around 72Mbits [25]. CPU cycles required for performing one computation task is linearly proportioned to its task input bit length. ...
Preprint
With the growing demand for latency-critical and computation-intensive Internet of Things (IoT) services, mobile edge computing (MEC) has emerged as a promising technique to reinforce the computation capability of the resource-constrained mobile devices. To exploit the cloud-like functions at the network edge, service caching has been implemented to (partially) reuse the computation tasks, thus effectively reducing the delay incurred by data retransmissions and/or the computation burden due to repeated execution of the same task. In a multiuser cache-assisted MEC system, designs for service caching depend on users' preference for different types of services, which is at times highly correlated to the locations where the requests are made. In this paper, we exploit users' location-dependent service preference profiles to formulate a cache placement optimization problem in a multiuser MEC system. Specifically, we consider multiple representative locations, where users at the same location share the same preference profile for a given set of services. In a frequency-division multiple access (FDMA) setup, we jointly optimize the binary cache placement, edge computation resources and bandwidth allocation to minimize the expected weighted-sum energy of the edge server and the users with respect to the users' preference profile, subject to the bandwidth and the computation limitations, and the latency constraints. To effectively solve the mixed-integer non-convex problem, we propose a deep learning based offline cache placement scheme using a novel stochastic quantization based discrete-action generation method. In special cases, we also attain suboptimal caching decisions with low complexity leveraging the structure of the optimal solution. The simulations verify the performance of the proposed scheme and the effectiveness of service caching in general.
... A programmable impedance controller (PIC) creates the impedance codes, having as reference the external resistors R T and R Q for ODT and OCD, respectively. In [11] a 1.2-V 1.5-Gb/s 72-Mb DDR3 SRAM is described, which satisfies the need for high data rate and area density required in recent ultra fast systems together with the corresponding on-chip termination. In [12] the implementation of a third-generation 1.1-GHz 64-bit microprocessor is presented whereas it provides some details on the DDR1 SSTL I/O circuit solutions at the memory interface. ...
Article
Full-text available
A 1 GHz Double Data Rate 2/3 (DRR2/3) combo Stub Series Terminated Logic (SSTL) driver has been developed for the first time to our knowledge using a 90 nm CMOS process. To satisfy the signal integrity requirements the driver strength is dynamically calibrated and the input/output port is efficiently terminated by on-die resistors. Furthermore, the slew-rate can be sufficiently controlled by selecting an appropriate external resistor. The proposed driver design provides all the required output and termination impedances specified by both the DDR2 and DDR3 standards and occupies a small die area of 0.032 mm2 (differential). Experimental results demonstrate its robustness over process, voltage, and temperature variations.
Article
A Clock-shared differential signaling(CSDS) transmitter is fabricated in 0.13 μm CMOS for 120 Hz 10-bit Full HD TVs. The proposed Tx driver takes advantages of PVT-insensitive tunable termination resistance with double feedback loops, and small reference voltage fluctuation. Moreover, a fully-digital duty cycle corrector is proposed, and compared to non-clock spreading, the relative near-field EMI level of multi-phase clock spreading is enhanced by 4.4 dB at the operating frequency of 500 MHz. The CSDS Tx with 34 channels consumes 300 mW at a 2.5 V power supply and 1.0 Gb/s/ch.
Article
The effective design of semiconductor memory pertaining to the power consumption, speed and area penalty has always been the crucial task in embedded computing applications. The work presented in this paper is exact and innovative mathematical model based implementation of 32 kb SRAM optimized for power and speed. The model has been developed for a cell, array, and pre-charge, I/Os and periphery devices for their exact behavior and then effective design is obtained by running the model through computing engine. The supply and pre-charge to an array of SRAM are swept and optimized combination is found out for minimum power dissipation and highest achievable access time. The SRAM array rows are controlled by the Gating Transistor Power Saving Technique (GTPST). Redundant columns have been found to make the memory fault tolerant. Similarly the the bitline passive leakage sensing and compensation scheme also has been presented. The experimental result shows 0.25 ¿W dissipation at VDD of 620 mV and pre-charge of 300 mV. The minimum attainable bit line swing is 200 ¿V/ns at VDD of 620 mV and precharge of 500 mV, both of which are state-of-art of its kind. The power saving of 13% is reported. The design by mathematical model, schematic and layout of 32 Kb memory chip and simulation are carried out for development of codebook memory that finds application in embedded signal processing.
Conference Paper
A 72Mb 6T SRAM is designed with 2times144 separate-I/O and random R/W in parallel per cycle running at 875MHz DDR to achieve 504Gb/s bandwidth. It is fabricated in a 90nm CMOS process. Dual R/W self-timed clocks with core emulators are multiplexed to operate the SRAM core at 875MHz. On-chip DLL, programmable I/O skews, and programmable input termination and output driver impedance with precise linearity are essential for this 504Gb/s interface
Conference Paper
A versatile I/O buffer is proposed to interface DDR/DDR2/GDDR3 memory types. A new robust impedance calibration scheme which fills the role of off-chip driver (OCD) and on-die terminator (ODT) for improving signal integrity is introduced. The proposed calibration scheme minimizes quantization error and maintains 30~300Omega impedance within 3% variations
Conference Paper
A novel differential pulse-width control loop circuit based on high speed frequency-to-voltage converters is proposed. To demonstrate its functionality, a circuit has been designed and simulated in 0.18mm CMOS technology. Results show that the proposed circuit can correct a clock signal's duty cycle even for frequencies as high as 5 GHz. This design can be used to correct clock signal distortion due to process variations in high speed applications such as half-rate clock and data recovery systems.
Conference Paper
Effective design of cache SRAM has always been the challenging task in embedded systems dedicated to image processing applications such as vector quantizer (VQ). The low power high speed SRAM array is the need of VQ. The mathematical model and simulation results for low power, high speed, fault tolerant codebook SRAM is presented in this paper. The cell, precharge, transmission logic, sense amplifier, redundant bits and IOs are modeled and SPICE simulated. Since the codebook has rhythmic nature, the successive multiple read cycles are important than write. The implementation is done at 0.25 mum technology. The results show that the least precharge is at 300 mV. The array operates minimum at 600 mV. The dissipation of 256 b array is 1.8 mW at read speed of 5 Gbits/sec at precharge of 1.25 V and supply of 2.5 V.
Conference Paper
Full-text available
In this work components of the next generation 0.10 μm CMOS technology are presented. They form the core of a platform encompassing logic, non volatile memory, and analog blocks. High performance bulk devices use 18 Å gate oxide (24 Å inversion Tox) while low power devices use 25 Å gate oxide (31 Å inversion Tox) for reduced gate leakage. Gate lengths range from 65 nm for the high performance devices to 90 nm for the low power devices. Both 3.3 V and 2.5 V I/Os are supported using 70 Å and 50 Å oxide devices. The backend employs low-k (k~3) dielectric with multiple levels of Cu metallization. The high density 6T SRAM cell size is 1.33 μm<sup>2 </sup>
Conference Paper
We have integrated a high speed and high density 6T-SRAM cell (0.998 μm<sup>2</sup>) for system-on-a-chip (SOC) using enhanced 100 nm CMOS logic technology. This is achieved by a systematic integration methodology, which includes high-NA ArF lithography, optimized optical proximity correction (OPC) CAD, narrow well isolation, poly-buffered shallow trench isolation (STI), offset spacer transistor, and 9-level Cu interconnect and low-k dielectric technologies with the lithographically scalable SRAM cell design.
Conference Paper
50 nm CMOS transistors for high performance and low active power applications are presented. Good short-channel effect control is achieved down to 35 nm gate length. These transistors will be incorporated in a leading edge 100 nm technology, with optimized triple well, nitrided oxide gate dielectrics, 193-nm lithography, 9-level hierarchical Cu interconnects, and low-k dielectrics. These high performance transistors have the best current drive at a given leakage current reported in the literature
Conference Paper
We have developed a 1.29 um2 full CMOS SRAM cell for low power applications, which is the world-smallest one by using 0.12 um single gate CMOS technology and optical enhancement techniques for extending use of 248 nm KrF lithography. It includes (1) 0.28 um pitch contacts formed by aerial image controlled patterns on phase shift mask (PSM) and photo resist flow, (2) gate patterns with 0.24 um pitch, (3) 0.13 um buried channel pMOS, and (4) spacer-on-stopper (SOS) MOSFET structure for expanding contact area and reducing band-to-band tunneling leakage
Conference Paper
The authors present an 18 Mb CMOS SRAM which operates at 833 MHz with 1.67 Gb/s/pin. The 114.4 mm<sup>2</sup> die consumes 1.5 W and is fabricated in a 0.18 μm CMOS process with four levels of copper interconnect. The SRAM operates in two user-selectable double-data-rate modes (DDR and DDR2). High-frequency operation is achieved by solving three frequency-limiting issues identified in previous SRAM designs: managing data timing constraints associated with high-frequency operation in a high density SRAM core; maintaining coherency between SRAM output data timings and echo clock timings; and delivering symmetric data windows for 1 s and 0 s across a wide range of output driver supply levels
Conference Paper
A double-data-rate (DDR) SRAM overcomes the limitation of a single-data-rate (SDR) SRAM. The main features are an auto-tracking bitline scheme to reduce core cycle time, a shortened main data line for current reduction, a noise immune circuit having high-speed transfer characteristics through a dual-rail reset dynamic circuit, a two bit pre-fetched operation, and strobe clocks synchronized with the output data to guarantee CPU data-validation time
Conference Paper
A 245.7 mm<sup>2</sup> 256 Mb SDRAM uses: (1) 60.2% cell-occupancy ratio array, (2) prefetched pipeline using first-in first-out buffer with parallel/serial converter, (3) synchronous mirror delay circuit
Conference Paper
The operation and design of 200-MHz 100 K ECL output buffers for CMOS ASICs are described. It is shown how the components of the buffer output driver transistor, gate voltage generator, and low skew input drivers are combined into unique clock and data output buffers. A section on unity gain op-amp design describes how a number of these buffers are used on an ASIC. Application guidelines (curves) to illustrate the tradeoff between the buffer frequency and the number of buffers on an ASIC application are presented. The advantages that this input buffer provides in the area of low ground bounce generation is presented. Waveforms from an ASIC with 24 balanced and 16 single ended ECL output buffers are presented
Article
A 256-Mb SDRAM (245.7 mm<sup>2</sup>) has been developed using (1) a high cell occupation ratio (60.2%) array design for chip size reduction and a high yield, (2) a prefetched pipeline scheme (PPS) using a first-in first-out (FIFO) buffer with parallel serial converter for 250-MHz clock frequency operation, and (3) a synchronous mirror delay (SMD) circuit for 2.5-ns clock access and low standby current
Article
Methods by which CMOS circuits can be adjusted digitally to generate controlled impedances for use in high-performance circuits are described. Since digital signals are the only inputs to these circuits, on-chip DC power dissipation can be reduced, the circuit can be made more robust, and the impedance of the circuit can be adjusted by manipulating the input digital information. A design of a CMOS series terminated line driver is discussed, and the utilization of the controlled impedance in terminating transmission lines on-chip, constant delay lines, and controlled di / dt output buffers is discussed