Figure 4 - uploaded by Richard Berger
Content may be subject to copyright.
The layout of a SRAM unit cell 

The layout of a SRAM unit cell 

Source publication
Conference Paper
Full-text available
This paper presents the design of a 128 times 12 SRAM block with each SRAM unit containing 2 pFET access transistors and a precharge-to-ground system for the purpose of minimizing its layout feature size. By using these techniques, the feature size of the SRAM cell with guard rings was effectively reduced to 12.6 mum times 17.9 mum, incorporating I...

Context in source publication

Context 1
... paper presents the design of a 128×12 SRAM block with each SRAM unit containing 2 pFET access transistors and a pre- charge-to-ground system for the purpose of minimizing its layout feature size. By using these techniques, the feature size of the SRAM cell with guard rings was effectively reduced to 12.6μm × 17.9μm, incorporating IBM SiGe5AM 0.5 μm process in order to make the SRAM suitable for extremely low temperature environment. To improve the reliability, several radiation hardening techniques have also been applied: guard rings were designed to separate pFET and nFET regions of the SRAM cell to prevent potential latch-up, and an error-correcting code (ECC) was also included to correct soft errors caused by ionizing radiation. Cryogenic testing has been performed to verify the SRAM is able to operate correctly over a temperature range of 2K to 297K. Memory is a necessary part of any microprocessor or microcontroller system. Static Random-access memory (SRAM) is often embedded in these systems as the main memory because of its short access time and high reliability. The proposed SRAM design is for extreme environment applications such as robotic and human explorations on the Moon and Mars. The specification requires the designed SRAM to be capable of operating reliably down to -180°C during the lunar night and -230°C in the shadowed polar craters in NASA’s planned lunar missions. In such a situation, the reliability of electronics in the spacecraft becomes one of the most important concerns. It has been well known that cryogenic operation of silicon CMOS technology can significantly improve the device performance [2]. Previous research on successful cryogenic circuit operation as well as radiation behavior has shown that SiGe BiCMOS processes are suitable for such extreme environments [3]. The IBM SiGe5AM process was selected, which has a 0.5 μm gate length and four layers of metal, including a thick AM layer [4]. Because of the limited number of metal layers, routing signals over the SRAM array is not allowed. Hot carrier injection at low temperatures is addressed by increasing the gate length of nFET devices [5-6]. The designed 128×12 SRAM block consists of a 15-bit shift register, a word line decoder, a bit line access X12 block, a 32×48 SRAM matrix, and an ECC unit as shown in Figure 2. Due to the limited space available on the chip for pin pads, a shift register was used to reduce pin count. The shift register outputs Y0~Y14 are used to supply the address and write data. The word line decoder is a 5-to-32 tree decoder using modified transmission gates to activate one row at a time. Transmission gate multiplexers are used to select the correct bit column. Each set of four bit columns are connected to an 8-to-2 multiplexer. The single output is connected to a pre-charge, write driver, and sense amplifier block to perform read and write operations to the addressed bit. The basic function of the bit line access blocks is to provide a digital interface between the SRAM array and the input/output processing. Hamming error correction codes are used to provide single bit error correction on data read from the SRAM array. During a write operation, the address and the write data are in the shift register. The write data, Y7~Y14, is fed into the ECC block, where four check bits are generated. After a period of time, the output of the ECC block will become valid, and the data word is ready to be written into the SRAM array. Proper operation of the ECC can be verified by evaluating DOUT0~11. The data address will be decoded, and the addressed data word can be activated by the row and column enable signals. Read operations follow a similar path, but only the address is used from the shift register. After the address is decoded, the sense amplifiers are enabled, and the data is read from the bit lines. The resulting data word is then fed into the ECC block, and check bits are generated and compared with the check bits present in the read data word. The differences are then compared, and the resulting syndrome is used to correct an error if one is present. If an error is detected, the corrected data can be written back into the original memory location directly by latching the corrected data and enabling the write drivers. The writing and reading operations will be further described in Section 3. The SRAM architecture used is a variation of the standard 6T cell [9]. Each SRAM bit consists of two cross-coupled inverters as a positive feedback loop, and two transistors to selectively access the two inverters. Due to the requirement of all nFET devices to use a gate length twice as wide as the process minimum, the drive strength of a nFET is only 1.5 times stronger than the drive strength of a pFET with the same width. This reduced the effective area penalty for using pFET devices for the access and drive transistors to only 12%. An additional benefit of using pFET devices as the main component in the SRAM cell is higher radiation tolerance. Off-state leakage currents increase on nFET devices after radiation bombardment, but pFET devices are largely unaffected. Excessive leakage in access transistors is highly undesirable and may cause data corruption. Usage of pFET access and devices should improve the total dose survivability of the SRAM. Transistor W/L ratios were designed with a signal to noise margin of 0.35V. Figure 3 indicates the final transistor sizing for the SRAM cell. The layout of the SRAM unit is shown in Figure 4. The bit cell is a mostly symmetrical design to keep the bit line true (BLT) and the bit line complimentary (BLC) parasitic effects as closely matched as possible. Power and ground wires are on the perimeter of the SRAM cell, which allows easy tiling of the cells as well as additional isolation between cells. V DD and ground wires are present on every metal layer to provide low resistance V DD and ground connections throughout the SRAM array. Cell size is reduced as much as possible, with a resulting size of 12.6μm×17.9μm. Guard-rings are added around the pFET and nFET regions to increase latch-up immunity from switching and ion impact transient conditions. The layout of a 128×12 SRAM block has a footprint of about 690×770 μm 2 . The bit line access block provides an I/O interface for the SRAM array. It consists of 12 bit line access units. During a write operation, the input data, WRD0-11, will be powered by the write driver and then loaded onto each bit line through a 4-to-1 multiplexer. While reading, the SRAM bit releases a data bit onto bit lines through the 4-to-1 MUX X2. Since there is a relatively large parasitic capacitance on bit lines, this output data can only make very small voltage difference on the bit lines. A sense amplifier is used to amplify this voltage difference to a full logic. This full logic output will then be sent out as output signal and written back into SRAM cell to restore the original data. The pre- charging network is used here to prevent accidentally writing during performance of reading operation. More design detail for each block will be given in the following subsections. When PRE-EN goes high prior to a read or write operation, all three transistors conduct and both BLT and BLC are pulled down to ground (logic 0). nFET N3 is used here to equalize the voltage on the two bit lines, which ensures proper operation of the sense amplifier. The equalization process is also improved. The sense amplifier provides a high impedance amplification path from the bit lines to a logic output. After the bit lines are pre- charged to 0V, the activated row will begin to pull one bit line towards V DD . Since a relatively large parasitic capacitance exists on the long bit lines, the voltage will change slowly. The sense amplifier generates a valid output without requiring the bit line to fully charge to V DD , which saves power and increases the read performance. There is no feedback signal from the output of the amplifier to the bit lines, so data corruption should not be an issue. The RD- EN signal switches the sense amplifier between an active sensing mode and a reset state. When RD-EN is high, the amplifier is deactivated, and the output nodes are discharged and equalized. This removes the bias towards the previous reading. Like the SRAM cell, the layout of the sense amplifier is as symmetrical as possible. When WR-EN goes high, the output of two NAND gates will be determined by the data to be written, WRD n. When WRD n is logic 1, pFET P1 will be open and connect BLT port to V DD . When WRD n equals 0, BLC will be connected to V DD . Special care was taken to ensure that the drive strength was high enough to write to the SRAM bits while traveling through the column multiplexer. ECC is used to provide a higher immunity against single event upsets (SEUs) and manufacturing defects. The ECC used is a (12, 8) Hamming code with single error correction (SEC). The ECC block diagram is given in Figure 10. A primary component is the Hamming code check generator (ECC_CHK_GEN). The check code generator produces the outputs required while minimizing the gate count and delay. When performing write operations, the signal RD/WR will be set to logic 0, and the ECC data path will be configured to generate the required check bits and place the check bits with the original data on the output, DOUT. The check code generator produces the following outputs. ...

Similar publications

Conference Paper
Full-text available
Ferroelectric-based nonvolatile memory (FeRAM) is now a mainstream product category supported by Ramtron, Fujitsu, Texas Instruments, IBM, and Matsushita. The internal array of ferroelectric capacitors in FeRAMs consumes absolutely no energy when the chip is powered. This lack of energy consumption combined with SRAM-like operation at SRAM speeds c...

Citations

... When evaluating the cooling cost and bit cost scaling in relation to workload duty, they found that SRAM, with its relatively extended standby time and limited workload, is ideally suited for cryogenic operation due to its associated lower cooling costs [74]. In 2008, Barlow et al. [79] presented the design of an SRAM block intended for extreme environmental applications, such as robotic and human explorations on the Moon and Mars. This SRAM block was designed to operate reliably under extreme conditions, including temperatures as low as −180°C during lunar nights and −230°C in the shadowed polar craters, aligning with NASA's planned lunar missions. ...
Article
Full-text available
Recently, there has been a surge of research in the field of superconducting spintronics, which combines superconductivity and magnetism. This emerging field is considered an alternative or complementary approach to traditional complementary metal-oxide semiconductor (CMOS) technology, offering high efficiency and effectiveness. Furthermore, the unique physical phenomena resulting from the interplay of these two competing properties have attracted increasing attention for their potential application in low-power quantum computing. In this review, we focus on the latest advancements in spin-related superconducting logic devices, specifically categorized as superconducting diodes based on their semiconductor counterparts. Additionally, given the ultralow operating temperatures required for these devices, we provide a comprehensive overview of compatible cryogenic memory technologies that incorporate spin-related principles. Finally, we address the key challenges currently hindering the practical implementation of spin-related superconducting electronics and offer insights and directions for future research in this field.
... In addition to deep space electronics applications, cryogenic embedded memories can be used in high-performance computing and fuel-cell electric vehicles, and are a viable choice to support superconducting operation in quantum computing [1]- [4]. Despite some previous explorations focusing on cryogenic memories [1], [2], [4]- [6], there is still the need for a cost-efficient and high-density memory design capable of operating within a cryogenic environment. ...
Preprint
Full-text available
p>This work explores non-volatile (NV) embedded memories implemented by spin-transfer torque magnetic random access memories (STT-MRAMs). Our designs are based on state-of-the-art perpendicular magnetic tunnel junctions (MTJs) along with a commercial 65 nm planar CMOS Bulk technology node, both operating at the liquid nitrogen temperature, 77 K. We evaluate the impact of cooling down to 77 K of the STT-MRAMs based on single- and double-barrier MTJ (SMTJ and DMTJ), and DMTJ with the relaxed non-volatility. All NV designs were benchmarked against the six-transistor SRAM (6T-SRAM) baseline. Simulation analysis relies on a 512 kB cache memory operating at 77 K. Overall, results show that the implementation of STT-MRAMs with DMTJ devices, and in particular when using the non-volatility approach by reducing the cross-section area, excel in terms of energy consumption, leading to energy savings for write/read access of about 35%/54%. This saving is obtained while also dissipating less leakage power and requiring a smaller bitcell footprint. Moreover, it presents reduced write latency overhead (as much as 1.9 × lower), at the expense of increased read latency and reduced sensing margins of about 1.8 × and 88%, respectively. The results suggest that STT-MRAM technology can be a solid alternative for energy-efficient cryogenic memory applications. </p
... In addition to deep space electronics applications, cryogenic embedded memories can be used in high-performance computing and fuel-cell electric vehicles, and are a viable choice to support superconducting operation in quantum computing [1]- [4]. Despite some previous explorations focusing on cryogenic memories [1], [2], [4]- [6], there is still the need for a cost-efficient and high-density memory design capable of operating within a cryogenic environment. ...
Preprint
Full-text available
p>This work explores non-volatile (NV) embedded memories implemented by spin-transfer torque magnetic random access memories (STT-MRAMs). Our designs are based on state-of-the-art perpendicular magnetic tunnel junctions (MTJs) along with a commercial 65 nm planar CMOS Bulk technology node, both operating at the liquid nitrogen temperature, 77 K. We evaluate the impact of cooling down to 77 K of the STT-MRAMs based on single- and double-barrier MTJ (SMTJ and DMTJ), and DMTJ with the relaxed non-volatility. All NV designs were benchmarked against the six-transistor SRAM (6T-SRAM) baseline. Simulation analysis relies on a 512 kB cache memory operating at 77 K. Overall, results show that the implementation of STT-MRAMs with DMTJ devices, and in particular when using the non-volatility approach by reducing the cross-section area, excel in terms of energy consumption, leading to energy savings for write/read access of about 35%/54%. This saving is obtained while also dissipating less leakage power and requiring a smaller bitcell footprint. Moreover, it presents reduced write latency overhead (as much as 1.9 × lower), at the expense of increased read latency and reduced sensing margins of about 1.8 × and 88%, respectively. The results suggest that STT-MRAM technology can be a solid alternative for energy-efficient cryogenic memory applications. </p
... MOS technology operating at cryogenic temperatures provides some benefits, such as steeper subthreshold slope, increased carrier mobility, and increased saturation velocity, leading to semiconductor-based circuits with faster operation, reduced leakage, and improved energy-efficiency [4,5]. As shown in Figure 1, MOS technologies operating at cryogenic temperatures are interesting for a wide spectrum of applications including high-performance computing [6,7], control systems for quantum processors [8,9], and aerospace applications [5,10,11]. The need for electronic devices capable of operating at cryogenic temperatures has always been a sought-after feature in deep space applications; however, high-performance computing and especially quantum computing are now increasing the demand for processors and memories that can operate at very low temperatures. ...
... The benefits of cooling down processors and memory systems to cryogenic temperatures as low as 77 K have recently been demonstrated [2,13,14]. The studies reported in [11,13,15] mainly focus on traditional embedded memories based on six-transistor static random access memory (6T-SRAM), which are shown to provide significant improvements in terms of performance. However, the relatively large bitcell area of 6T-SRAM limits the overall on-chip memory density and the many leakage paths present in these memories limit the achievable power savings [16]. ...
Article
Full-text available
The ever-growing interest in cryogenic applications has prompted the investigation for energy-efficient and high-density memory technologies that are able to operate efficiently at extremely low temperatures. This work analyzes three appealing embedded memory technologies under cooling—from room temperature (300K) down to cryogenic levels (77K). As the temperature goes down to 77K, six-transistor static random-access memory (6T-SRAM) presents slight improvements for static noise margin (SNM) during hold and read operations, while suffering from lower (-16%) write SNM. Gain-cell embedded DRAM (GC-eDRAM) shows significant benefits under these conditions, with read voltage margins and data retention time improved by about 2x and 900x, respectively. Non-volatile spin-transfer torque magnetic random access memory (STT-MRAM) based on single- or double-barrier magnetic tunnel junctions (MTJs) exhibit higher read voltage sensing margins (36% and 48%, respectively), at the cost of longer write access time (1.45x and 2.1x, respectively). The above characteristics make the considered memory technologies to be attractive candidates not only for high-performance computing, but also enable the possibility to bridge the gap from room-temperature to the realm of cryogenic applications that operate down to liquid helium temperatures and below.
... Proposed cell design uses PMOS access transistor to earn the benefit of its higher tolerance towards radiation. Leakage current remains unaffected from radiation bombardment in PMOS transistor, but leakage current in NMOS transistor gets increased (Barlow et al. 2008). In addition, PMOS transistor exhibits very low flicker (1/f) noise compared to NMOS transistor. ...
Article
Full-text available
This article presents a low power and highly stable source feedback SE7T (single-ended 7T) SRAM cell. Using Monte-Carlo simulations critical design metrics of proposed SE7T SRAM cell are estimated and the estimated results are compared with that of conventional 6T SRAM cell and conventional 7T SRAM cell (CONV7T). The proposed source feedback single Ended (SE7T) SRAM cell achieves 8.6 ×/12.5 × and 1.2 ×/5.3 × lower write power and hold power as compared to CONV6T/CONV7T respectively. The proposed bitcell takes 1.3 × longer but 1.3 × less Read Access Time (TRA) as compared to CONV6T and CONV7T at 200 mV respectively. The proposed bitcell also provides 1.67 × and 1.07 × higher read stability and write ability as compared to 6T SRAM Cell.
... The proposed cell contains majority of PMOS transistors. There are several advantages of PMOS over NMOS: In PMOS, leakage current does not get affected from radiation bombardment, whereas it increases in NMOS [13]. PMOS offers lower flickering noise (1/f) than NMOS. ...
Article
Full-text available
SRAM design metrics such as read static noise margin (RSNM) and hold power dissipation need to be improved in future technology generation, since excessive scaling leads to degradation of these metrics. In this paper, a fully differential read decoupled 7T SRAM cell is proposed which shows better metrics. The proposed cell shows 1.84×/15.44× shorter read delay/write delay and 4.11×/1.44× improvement in RSNM/WSNM (read static noise margin/write static noise margin) @ 800 mV supply voltage.
... Proposed cell design uses majority of PMOS transistors to earn the benefit of its higher tolerance towards radiation. Leakage current remains unaffected from radiation bombardment in PMOS transistor, but leakage current in NMOS transistor gets increased [12]. In addition, PMOS transistor exhibits very low flicker (1/f) noise compared to NMOS transistor. ...
Conference Paper
Full-text available
This work presents a new single-port 6T SRAM cell with a single-ended read operation (read-decoupled) and shows significant improvement in read stability and write ability as compared to the conventional 6T (CON6T) SRAM cell. Unlike the CON6T cell where the pull-up transistors are powered by the supply voltage (VDD), the proposed cell has these transistors powered by the respective bitlines. Another distinct feature of the proposed design is the presence of a PMOS transistor between the two storage nodes which is used during the write operation. The design metrics of proposed highly-stable read-decoupled 6T-SRAM cell are compared with those of the CON6T SRAM cell. The proposed cell shows 4× improvement in read static noise margin (RSNM) and 1.08× improvement in write static noise margin (WSNM) @ 700 mV.
... Proposed cell design uses majority of PMOS transistors to earn the benefit of its higher tolerance towards radiation. Leakage current remains unaffected from radiation bombardment in PMOS transistor, but leakage current in NMOS transistor gets increased [12]. In addition, PMOS transistor exhibits very low flicker (1/f) noise compared to NMOS transistor. ...
Conference Paper
Full-text available
This work presents a new single-port 6T SRAM cell with a single-ended read operation (read-decoupled) and shows significant improvement in read stability and write-ability as compared to the conventional 6T (CON6T) SRAM cell. Unlike the CON6T cell where the pull-up transistors are powered by the supply voltage (VDD), the proposed cell has powered these transistors by the bitline driver. Another distinct feature of the proposed design is the presence of a PMOS transistor between the two storage nodes which is used during the write operation. The design metrics of proposed stable read-decoupled 6T SRAM cell are compared with those of the CON6T SRAM cell. The proposed cell shows 4× improvement in read static noise margin (RSNM) and 8% improvement in write static noise margin (WSNM) @ 700 mV
... One of the extra transistors for read operation, RT (read transistor) and the tail transistor, called write assist transistor, are made common for entire row (Fig.3). Read buffer is made up of one NMOSFET (RT) and two PMOSFETs (MP3/MP4) (PMOSFETs are used instead of NMOSFETs because PMOSFET shows higher tolerance There is no effect of radiation bombardment on leakage current in case of PMOS, whereas in NMOS, the leakage current increases [16]). ...
Article
Full-text available
Scaling of SRAM cell beyond 65-nm poses a serious threat to the stability of the cell and is a cause of major concern for the upcoming technologies. Due to random dopant fluctuation (RDF) and other process parameter variations, the cell turns out to be unstable. In this paper, an 8T (8- Transistor) SRAM cell is proposed which offers enhanced data stability during read operation. While reading, the voltage level of the ‘0’ holding node does not increase and thus a near ideal butterfly-curve is achieved, which is crucial to design a robust SRAM cell. In 16-nm technology node, the read static noise margin or read SNM (RSNM) as high as 159 mV at supply voltage (VDD) of 600 mV is achieved by the proposed cell. Therefore, the cell is 4.18× more stable than the standard 6T SRAM cell during read operation and has 1.87× shorter delay than the standard 6T SRAM cell at the same voltage. Also the proposed cell offers 4.15 × improvements in write delay. All these are achieved at a marginal penalty of write static noise margin (WSNM) 1.07× compared to standard 6T.
... PMOS transistors are used for read operation because PMOS transistor has an advantage of higher tolerance against radiation. In PMOS, leakage current does not get affected from radiation bombardment whereas it increases in NMOS [11]. Moreover, PMOS offers lower flicker (1/f) noise than NMOS transistor. ...
Conference Paper
Full-text available
Due to aggressive scaling of device dimensions, the stability of an SRAM cell is one of the major concern for upcoming technology. The cell turns out to be unstable due to random dopant fluctuation (RDF) and other process parameter variations. In this paper, an 8T (8-Transistor) SRAM cell is proposed. The proposed cell offers enhanced data stability during read operation. While reading, the proposed cell do not increases the voltage level of the node holding “0” and thus it achieves a near ideal butterfly-curve which is crucial to design a robust SRAM cell. The read static noise margin or read SNM (RSNM) of the proposed cell is as high as 159 mV @ 600 mV in 16-nm technology node and it is 4.18× more stable than standard 6T SRAM cell during read operation. The proposed cell also achieves 1.87× shorter delay at the same voltage compared to standard 6T SRAM cell.