SRAM Memory Testing Basics
SRAM Memory Testing Basics
Introduction to Memories
The most basic classification of a memory device is whether it is Volatile or Non-Volatile
(NVMs). These terms refer to whether or not a memory device loses its contents when power
is removed. Volatile memories lose their contents when power is removed. The physical design
of the storage circuit, called a cell, determines whether a device is volatile or non-volatile.
Volatile memories can be further divided into two groups called static and dynamic. Static
memories tend to retain their contents for as long as power is applied to the device. While
dynamic memories require periodic reminding, known as refreshing, to retain their data.
Dynamic RAMs require a refresh cycle every few milliseconds or the data stored in them is
lost.
In this book the SRAM is used as the reference device for the discussions describing the basic
functional blocks of a memory device, the various faults related to that function, and various
tests to detect each type of fault. Later in this book other memory device types are discussed by
describing the differences between the SRAM and that device.
In the following sections the block diagram of a SRAM device is built one major functional
block at a time. This will help you to understand the function of the array, the word line, the bit
line, the X & Y addresses lines, and all the other components of a memory device.
Note: The semantics in the memory world are confusing because a bit is used to represent
several things. There are address bits, data bits, and control bits to name a few. Each
has a unique purpose and is described later in this book.
Static RAM
D0
CS2
D1
D2 CS1
D3
Data In WE
OE
A0 O0
Row Decoder
Sense Amps
A1 O1
256x4 Array O2
A2
A3 O3
Column Decoder
A4
A5
A6
A7
If you have taken the logic course from Soft Test, you might recognize or remember this
device. It is the same one used as an example for the logic course. In that class the device
functionality is discussed in general terms, but we will go into more detail. There, the DUT
was tested using vectors and here, we will use an APG (Algorithmic Pattern Generator).
Every memory device consists of addresses, data in, data out, and some type of control logic.
The 256x4 static RAM used in this example is made up of 8 address lines, 4 X and 4 Y
(arbitrarily assigned to row and column), Data In, and Data Out.
It has Control pins consisting of Write Enable bar (take it low to write the device), Output
Enable bar (take it low to read the device), and two Chip Selects; one high true logic; one low
true logic.
D0
CS2
D1
D2
D3
CS1 3
Data In Cont. WE
OE
A0 O0
Row Decoder
Sense Amps
A1 O1
256x4 Array O2
A2
A3 O3
A Cell
Column Decoder
A4
A5
A6
A7
The first thing to understand with any memory device is what type of cell is used. In this case,
it is a Static-Ram cell. Different cells have radically different characteristics regarding power,
speed, and functionality.
D0
CS2
D1
D2 CS1
D3
Data In Cont. WE
OE
A0 O0
Row Decode
Sense Amps
A1 O1
256x4 Array O2
A2
A3 O3
The basic cell is repeated over and over again to create an array. Before we move on to the
array itself, lets continue to focus on the individual cells.
VDD
Q3 Q4
Word Line
Q5
Q1 Q2
VSS
The SRAM memory cell is actually a flip-flop that uses dual bit lines for Q and Q/. The bit
lines actually carry the data that was stored in the cell. In the case of an SRAM, there is a bit
line not, that carries data bar. The word lines are activated by the address applied to the device.
After the bit line exits the array, it gets manipulated by the address decoder to choose a specific
bit to show as data out from the device. Note, that the Word Line actually turns on both bit
lines in the cell.
The six transistor design (See Figure 2-4) of a static RAM has a much larger die size for a
given memory array and requires more power than that of dynamic memory.
Fault Types
Stuck at Fault.
Coupling Fault.
Neighborhood Pattern Sensitive Fault.
Address Fault.
Transition Fault.
Lets pause in development of our memory block diagram and consider faults. Weve chosen
some specific faults so the class will be able to see what kinds of problems can be associated
with each different piece of the device block diagram.
We are going to cover each of these in detail, but lets look at this from a high level, first. What
is a fault? It is the same thing as on any other semiconductor device. A fabrication process flaw
that causes the device to function improperly. It could be a shorted metal or poly line or a
missing contact.
Since memory devices are really large arrays of repeated cells, the types of faults normally
found are predictable. Specific types of faults on memory devices have been given generic
names. A few of these are listed below:
Stuck at Fault a cell is stuck at a zero or one level (think of a short in the transistor) 3
Coupling Fault connects two or more cells together.
Neighborhood Pattern Sensitive Fault (NPSF) cell is affected by what happens around it
usually during writing (a capacitive coupling fault).
Address fault connects two or more addresses together.
Transition Fault when a cell or line fails to undergo a 0->1 or a 1->0 transition.
Stuck at Fault
Although, a stuck at fault is typically caused by one or more shorted transistors within the
cell, it can be caused by other things. Regardless of the cause, the detection process is the
same.
If a cell was shorted to a zero level and we wrote (then read) all zeroes into the array, the
device would pass. So to detect this fault we have to test every cell for the ability to store both
a one and a zero.
The main focus in a memory test program is the memory array. Arrays are a large collection of
circuits (cells) such as the one in Figure 2-4 which in turn defines the size, or storage capacity,
of the memory device. When making a graphical representation of an array most people think
of it as two dimensional with each box representing a single memory cell.
The Array
D0
CS2
D1
D2 CS1
D3
Data In Cont. WE
OE
A0 O0
Row Decode
Sense Amps
A1 O1
256x4 Array O2
A2
A3 O3
Column Decode
A4
A5
A6
A7
Array related faults are indicated by interaction between two or more cells. As an example, if
two cells are shorted together, it is easy to imagine that writing to either cell would cause both
cells to store the same written information.
Typically there are only a few places that one cell could short to another cell. By talking to the
Design Engineer, it is possible to identify those places. Thus, in turn, the scope of the testing
can be reduced to only look for those specific shorts that can occur. It doesnt make sense to
spend test time looking for things that can not occur.
D0
CS2
D1
D2 CS1
D3
WE
3
Data In
OE
A0 O0
Row Decoder
Sense Amps
256 Cells
256 Cells
256 Cells
256 Cells
A1 O1
A2 O2
A3 O3
Column Decoder
A4
A5
A6
A7
Figure 2-6 is an example of a Static RAM that has four data lines going in (D0-D3) and four
data lines going out (O0-O3) with each data line going to its own array. This allows the four
independent bits of data to be written to (or retrieved from) the same logical location within
their respective arrays.
The device in Figure 2-6 has eight address lines (A0-A7) that produce 256 (28th) unique
addresses. Because the device has four arrays it can store 1024 (1K) single bits of data.
Because each address applied to the outside of the DUT is sent to all four arrays, the four
output pins produce separate data at each address location.
The data arrays share address, power, sense amps, control logic, and all other blocks on the
device, but they are really tested in parallel. We are effectively testing four memory 256 cell
arrays in parallel.
D0
CS2
D1
D2 CS1
D3
Data In Cont. WE
OE
A0 O0
Row Decode
Sense Amps
256 Cells
256 Cells
256 Cells
256 Cells
A1 O1
A2 O2
A3 O3
Column Decode
A4
A5
A6
A7
The Data In and Data Out lines for this device are separate. However, most devices use a
common data bus that is used to write and read data on the same DUT pins. This same pattern
of the input data bit Dn corresponding to the output data bit On continues through all the data
bits.
Lets focus on a single output and examine how its array is organized.
Figure 2-8 is a graphical representation of a memory array having sixteen cells with four rows,
four columns, and one data bit. The number of rows and columns within the array defines how
many row address lines (X address) are needed and how many column address (Y address)
lines are needed.
3
While keeping the example device in mind, this table type drawing can make things easier to
communicate. We can see details in 16 cells that get lost in a drawing of 256 cells.
Array Faults
Coupling Fault
0 0 0
0 1 0
0 0 0
If a Stuck at Fault is categorized as a short in one or more transistors that make up a cell, we
can categorize a Coupling Fault as a short between two or more cells. The only way to prove
that cells are not shorted together is to store and retrieve different data from adjacent cells.
The pattern shown in Figure 2-9 is theoretically the only way to test for coupling faults. What
if we worked with the design group and found that the device could not have a coupling fault
on the diagonal? How could our above pattern be changed to make the pattern more efficient?
Think checkerboard.
Shorts are only one type of problem caused by process errors. Process errors can also create
capacitors between cells. If the a capacitor is large enough and testing is done very quickly
(with a short cycle time), this type of failure can go undetected.
A basic premise for a memory device is that every cell is completely independent. Whatever is
written to one cell should not affect any other cell in the array. Due to the test time required to
check for this fault on every cell (and the decreasing likelihood that the fault exists) this is
seldom tested during production. The algorithm technique used to test for an NPSF is called
surround disturb.
There is a pattern shown later in this text called a modified Butterfly, that tests for this fault.
Addressing
D0
CS2
D1
D2 CS1
D3
Data In Cont. WE
OE
A0 O0
Row Decoder
Sense Amps
A1 O1
256x4 Array O2
A2
A3 O3
Column Decoder
A4
A5
A6
A7
Memory addresses are comprised of two parts; Row and Column. Although the Row address,
also known as the X address, implies a horizontal direction. There is no standardization that
forces us to relate Row, X, and Word lines. Different companies use different semantics and
nomenclature. In this manual X is equivalent to Row and they run horizontally through the
array. 3
The next several slides describe the addressing scheme for memory devices in detail.
A3 A2 A1 A0
The array shown in Figure 2-11 can be depicted by two X address lines (bits) and two Y
address lines (bits). If we combine all four lines or bits into a single register, as shown at the
top, we can use one binary four bit number to designate a specific address. We can use other
numerical bases, besides binary. Hexadecimal is often used as a shorter, more efficient
expression of binary numbers.
0 0 0 0 = 0 = Amin
A3 A2 A1 A0
Address zero, which can also be referred to as address minimum, could be designated as the
highlighted cell as shown in Figure 2-12.
0 0 0 1 =1
A3 A2 A1 A0
As we toggle the least significant bit of the address register from zero to one we would step to
the next address in the binary sequence. As represented in Figure 2-13 this could be seen as
incrementing to the next column. Toggling the A2 from a zero to a one can be seen as
incrementing to the next row as seen in Figure 2-14.
0 1 0 1 =5
A3 A2 A1 A0
We could assign each bit in the address register a name. These bit names would match the pin
names given on most memory devices, A0-A3 in the example in Figure 2-14.
0 1 0 1 =5
A3 A2 A1 A0
Y bits X bits 3
We will also need to be able to refer to the same bits in the address registers by the names used
inside the tester. In order to accomplish this we need to define the direction of X and Y.
Although, the direction is really an arbitrary decision, the number of bits grouped into X and
the number of bits grouped into Y, is not. They must match the number of cells corresponding
to the rows and columns.
1 1 1 1 = 15 = Amax = XYmax
A3 A2 A1 A0
Y bits X bits
Y1 Y0 X1 X0
If we assume X to be in the horizontal direction, A0 and X0 would refer to the same bit in the
address register. Y1 and A3 would refer to the most significant bit in the register.
Figure 2-16 shows that with all address bits high the maximum address is accessed. This
address is also known as XY Maximum.
X
00 01 10 11
Y 00
01
10
11
For documentation purposes, in this book we will use the style shown in Figure 2-17. When
speaking about a specific address, we will refer to it as if it is a single register. We will usually
refer to it in hex.
Addresses are really two separate registers. One for the X addresses and one for the Y. They
can be tied together so they look like a single register when referenced from inside our test
program.
Note that either the X address or the Y address can be the LSBs (least significant bits) of the
complete address. If the X address is the LSB then it is called an X fast pattern because the
LSBs change faster than the MSBs. If the Y address is the LSB then it is called a Y fast
pattern. We can develop our pattern in either or both of these techniques.
Referencing our example device (Figure 2-17), we can see that there are specific address pins
connected to row decoders and column decoders inside the device. It is possible to connect X
address lines of a memory tester to either the row or column decoder pins with the Y address
lines connected to the remaining decoder. As long as we know which address line goes to
which decoder (column or row) on the memory device we will be able to trace a specific
address back to a physical location within the memory device.
D0
CS2
D1
D2 CS1
D3
WE
3
Data In Control
OE
A0 O0
Row Decoder
Sense Amps
A1 O1
256x4 Array O2
A2
A3 O3
Column Decoder
A4
A5
A6
A7
Inside the array we dont really refer to X or Y, or even to row or column. We refer to word
lines and bit lines.
As soon as a choice like X or Y connections becomes arbitrary we open the door for confusion.
Row and column definitions add to that confusion because their meaning is not clear.
One decoder (row or column) will select an entire group of cells while the other will select
which cell in the group is selected when reading or writing.
Address Faults
With an understanding of what addresses are and how they work, we can start to understand
how they can fail. The key is to test for address uniqueness. Each address must access one, and
only one, cell within an array.
If we write the array to all ones as we have mentioned earlier, how many cells must function in
order for us to be able to read back all ones? Thats right, one single cell. All addresses could
be shorted together and we would still pass the all ones and also the all zeroes patterns.
Output Buffers
D0
CS2
D1
D2 CS1
D3
Data In Control WE
OE
A0 O0
Row Decode
Sense Amps
A1 O1
256x4 Array O2
A2
A3 O3
Column Decode
A4
A5
A6
A7
Cells are really transistors (in the case of a DRAM they are a single transistor) and silicon real
estate is the most expensive in the world, we try and shrink transistors as much as possible. It is
easy to understand that the cells have very little drive capability. Therefore we need sense
amps to be able to recognize the very small voltage swings and translate them into levels that
can be distinguished as logical ones and zeroes.
Once the sense amps have amplified the voltages, the output buffers can then provide current
drive capability sufficient to use the device in a bus oriented environment.
Vdd
3
Bit Data out
Vss
Figure 2-20 illustrates an output buffer with a totem pole design, named for the way the two
transistors are stacked on top of one another. It is the most common output structure for MOS
devices.
The waveform that data out produces is the same as that on the bit line, but with added drive
capability supplied by the two transistors shown. The output current capacity (IOH and IOL) is
determined by size of these two transistors.
In most cases we have the ability to turn off both transistors using the Output Enable (OE) or
the Write Enable (WE) pin. This feature is not shown in Figure 2-20.
Other types of output buffers include open drain and open collector. Both of these require an
external circuit (usually just a resistor) to provide a pull up, or pull down, voltage.
Transition Faults
This type fault might have exactly the same symptoms as a stuck at fault, but the actual flaw in
processing is in a completely different physical location on the die.
An example would be when a Bit Line is charged to a high voltage in order to write data into
an EPROM or FLASH cell. If the bit line was accessed to read/write the opposite data before
the discharge was complete the wrong data would be presented on the output of the device.
But if discharge time had expired the Bit Line would work fine.
Once again in English. If a cell is stuck, or coupled, the erroneous state does not change based
on time, but based on some other action the user takes (writing a different cell or trying to write
different data). With a transition fault, the fault can only be seen based on specific, time
related, conditions. The cell fails until after a certain amount of time has expired.
Control Logic
D0
CS2
D1
D2 CS1
D3
Data In Cont. WE
OE
A0 O0
Row Decode
Sense Amps
A1 O1
256x4 Array O2
A2
A3 O3
Column Decode
A4
A5
A6
A7
If we over simplify the array as a group of cells that hold data until we need it again, we can
use addresses as a way to point to a specific cell, but we still need a way to decide whether we
are putting data into a given cell, or reading data from a given cell.
Obviously, the Write Enable (WE) pin allows us to write into the location that is applied to the
address pins when the WE pin is true (low in this case).
The Output Enable (OE) turns on the output buffers and presents data to the outside world.
The Chip Selects (also called Chip Enable on some devices) are really on/off switches. They
allow us to select or deselect (turn on or turn off) this specific device in its entirety. This is
useful to conserve power when we know the device wont be used for a long time (e.g. several
seconds). Deselecting the device will not destroy the data that has been stored in the array (as
would happen if power were removed) but it will turn off the circuitry used for everything else
(control pins, decoders, etc.) and thus reduce the power consumption of the device while in
stand-by mode.
Logic Tables
Most device data sheets show a truth table of the control pins and most devices use the same
control pin organization.
In Figure 2-22 it is important to remember that although the conditions marked with an X
represent a dont care state for that pin, inputs to a CMOS device can never be placed in a
high impedance state. If an input pin is allowed to float, the pin will most likely go to the
threshold voltage of that input and oscillate just enough to cause the pin to switch at maximum
frequency, finally resulting in latch up. Latch up can damage or destroy the device.
Write Timing
Timing
T0 Reference T0
ADDR NRZ
DATA IN SBC
CS1 RO
CS2 RZ
WE RO
(Logic One)
OE RO
2ns 11ns 2ns
As we transition from the device block diagram to the patterns, lets consider what the control
truth table in Figure 2-22 would look like on a scope for a write (and then read) cycle.
In Figure 2-23 the Address and Data In signals are drawn in such a manner as to indicate they
are bus signals representing a group of pins.
The mnemonics on the right hand side of the diagram in Figure 2-23 indicate which format
could be used to produce that waveform. These mnemonics have not been defined in this class
since they are discussed in detail in the digital logic class.
The times listed across the bottom of the diagram are examples of what we might see on our
example device.
Notice that the Data Out pins are not indicated during a write cycle. Typically they would
present a high impedance (float) condition. It is acceptable for the device to present a high
impedance to the outside world (on an output pin from the DUT) but it is unacceptable to
present a high impedance to any input pin of a CMOS device.
Read Timing
T0 T0
TRC 3
Address
TAA
TACS THZCS
TLZCS
TDOE
OE
TLZOE THZCS
Unlike the timing shown in Figure 2-23, Figure 2-24 has device specifications (i.e. Trc is read
cycle time) listed on it, just as you would see in the data book.
The write signal isnt included in the drawing since it is expected to be at a logic one level
(false) during any read operation.
Waveforms are a good communication medium between engineers. When combined with good
pattern documentation, timing drawings can describe exactly what is happening in a test.