Module 4 VLSI
Module 4 VLSI
1 Introduction
In earlier discussions, we introduced CMOS logic under the assumption that MOS transistors
behave as ideal switches.
However, real MOS transistors exhibit certain limitations that deviate from this idealized
behavior.
So far, we have primarily focused on fully complementary CMOS logic structures and the
ratioed CMOS inverter.
1. Circuit (Structural) Design – This involves selecting the appropriate logic configuration
and transistor arrangements.
2. Layout (Physical) Design – This deals with the physical placement and interconnections
of transistors on a chip.
The behavior of circuits at a lower level can influence high-level architectural decisions.
In such cases, alternative CMOS logic structures can be employed to design smaller and faster
gates.
However, these improvements often come at the cost of increased design complexity, greater
operational challenges, and potential reductions in circuit stability.
Area Constraints: Fully complementary CMOS gates can occupy a large silicon area, which is
undesirable in high-density designs.
Speed Limitations: The propagation delay of traditional CMOS logic may be too high for
certain high-performance applications.
Design Feasibility: Some logic functions, such as those required in large PLAs, may not be
efficiently implemented using complementary CMOS alone.
To address these issues, alternative logic styles are explored, each with its own trade-offs in terms of
power consumption, speed, and complexity.
Optimization: Although basic functionality is achieved with uniform transistor sizing, speed
optimization techniques involving different transistor sizes will be introduced later.
Complex Logic Implementation: More complex gates can also be implemented using CMOS
logic.
Z = (A · B) + C · (D + E)
Body Effect: Affects the threshold voltage of transistors, influencing circuit performance.
Leakage Currents: Improper substrate connections may lead to increased leakage or latch-up
issues.
Power Integrity: Proper substrate biasing ensures reliable operation across varying process
conditions.
This understanding of complementary CMOS logic forms the foundation for further discussions on
optimizing logic circuits for speed and efficiency.
In this configuration, a single pMOS transistor acts as the load device, with its gate
permanently connected to ground (VSS).
This design is similar to conventional nMOS logic, where the depletion or enhancement-mode
nMOS load transistor is replaced with a pMOS device.
One of the main drawbacks of pseudo-nMOS logic, as with conventional nMOS logic, is static
power dissipation. Since the pMOS load is always on, current continuously flows whenever
the pull-down network is active.
In a fully complementary CMOS gate, the capacitive load on each input is at least two unit gate
loads. In pseudo-nMOS logic, the minimum input load can be just one unit gate load since only
a single transistor is used per input term.
However, if minimum-sized driver transistors are used, the pull-up gain must be reduced to
maintain adequate noise margins, which in turn slows the gate’s rise time.
One potential advantage of using a pMOS load is the absence of the body effect, which affects
nMOS depletion loads.
A pseudo-nMOS gate can offer a higher circuit density compared to fully complementary
CMOS gates due to the reduced transistor count.
In summary, pseudo-nMOS logic provides a method for implementing logic gates with fewer
transistors, but at the cost of static power dissipation and slower rise times in some cases.
Unlike static CMOS logic, which maintains a stable output using both pull-up and pull-down
networks, dynamic CMOS logic relies on clocked operation and charge storage principles.
It consists of:
An nMOS evaluation transistor, which selectively discharges the output during evaluation.
A clock signal (ϕ), controlling the switching between precharge and evaluation phases.
The dynamic CMOS gate operates in two main phases, controlled by the clock signal ϕ:
1. Precharge Phase (ϕ = 0)
The pMOS precharge transistor turns ON, charging the output node to VDD.
The nMOS evaluation transistor is OFF, preventing any discharge.
The output is temporarily set to HIGH regardless of input values.
2. Evaluation Phase (ϕ = 1)
The pMOS precharge transistor turns OFF.
The nMOS evaluation transistor turns ON, allowing the nMOS logic block to evaluate the
function.
If the input logic network forms a conducting path to ground, the output discharges to LOW.
If no conducting path exists, the output remains at its precharged HIGH state.
This approach reduces the number of transistors needed, since a pull-up network is not required.
Phase ϕ1 (Precharge of PZ): During this phase, node PZ is precharged to VDD, while node Z
retains its previous value.
Phase ϕ2 (Precharge of Z): The precharge of node PZ is maintained, and a transmission gate
turns on, allowing node Z to also precharge.
Phase ϕ3 (Evaluation): The gate evaluates, and if the pull-down network is activated, node PZ
discharges conditionally.
Phase ϕ4 (Hold Phase): Node Z is held in its evaluated state, ensuring that it does not change
prematurely.
Advantages
Reduces charge-sharing issues by ensuring proper storage of values before evaluation.
Enables cascading of multiple logic stages without the risk of premature discharge.
Ensures that evaluated values remain stable during computation.
Limitations
Requires four separate clock signals, increasing circuit complexity.
Additional clocking transistors increase power consumption.
Strict timing requirements must be met for reliable operation.
5.3.2 Two-Phase Dynamic Logic
An alternative to four-phase logic is a two-phase approach, which simplifies clocking at the cost of
slightly reduced reliability.
Phase ϕB: Other gates evaluate, while the outputs of the previous phase are held constant.
Advantages
Reduces the number of required clock signals from four to two.
Simpler circuit design and reduced power consumption compared to four-phase logic.
Limitations
More susceptible to charge leakage compared to four-phase logic.
Requires careful synchronization to prevent timing mismatches.
5.7 Conclusion
Dynamic CMOS logic is a powerful alternative to static CMOS for high-speed, low-power digital
circuits. However, its reliance on clock-driven operation introduces challenges such as charge
leakage and complex timing synchronization. Proper clocking techniques, such as two-phase and
four-phase logic, are necessary for reliable operation.
†
6 Clocked CMOS Logic (C MOS)
Clocked CMOS logic, also known as C†MOS, is a dynamic logic technique that enhances power
efficiency and integrates latches within logic circuits. Originally developed for low-power CMOS logic
†
design, C MOS is now primarily used for synchronizing dynamic circuits and interfacing with other
clocked logic structures.
During operation:
Latching Capability: C†MOS logic inherently functions as a latch, holding data when the clock
is low.
Compatibility with Dynamic Logic: It effectively interfaces with other dynamic logic families,
improving timing synchronization.
Higher Input Capacitance: Although similar to static CMOS gates, the presence of clocking
elements increases parasitic capacitance.
Complex Timing Requirements: The circuit behavior depends on the correct synchronization
of clock signals.
6.4 Applications
C†MOS logic is widely used in:
Low-power digital circuits.
Unlike conventional static CMOS logic, CVSL relies on two complementary nMOS switching
networks, combined with cross-coupled pMOS transistors for positive feedback.
Two complementary nMOS logic trees that implement the desired logic function.
Cross-coupled pMOS pull-up transistors that ensure positive feedback and bistable
operation.
1. Differential Signaling: The logic trees receive both the input and its complement, ensuring
robust operation.
2. Pull-Up Mechanism: The cross-coupled pMOS transistors ensure that one output node is
strongly pulled high while the other remains low.
3. Switching Mechanism: When the inputs change, one of the nMOS logic trees will conduct,
forcing one node to be pulled low while the other remains high, resulting in a strong logic
transition.
7.2 Advantages of CVSL
Strong Signal Integrity: Positive feedback ensures sharp transitions and improved noise
immunity.
Logical Completeness: CVSL can implement any logic function efficiently, making it suitable
for automated logic synthesis.
Lower Power Consumption: Unlike domino logic, CVSL gates do not require precharge and
evaluate cycles, reducing dynamic power dissipation.
Larger Area Requirement: Additional routing and double-rail logic consume more chip area.
Slower Switching Speed: During transitions, the pMOS pull-ups must counteract the nMOS
pull-down networks, leading to delays compared to conventional static CMOS gates.
7.4 Applications
High-speed arithmetic and logic units (ALUs) in microprocessors.
7.5 Conclusion
CVSL is a powerful logic family that provides robust differential operation and logical completeness
at the cost of increased complexity and area. While it may not always be as fast as traditional CMOS
or domino logic, its ability to implement any logic function makes it useful in specific applications.
It is particularly popular in nMOS circuits, with a fundamental application being the 2-input
multiplexer.
A notable implementation of PTL is found in the ALU function unit of the OM-1 computer.
8.1 nMOS and CMOS Implementations of Pass Transistor Logic
8.1.1 nMOS Pass Transistor Logic
In nMOS-based PTL, the pass transistors are used to steer signals to the output based on
control inputs.
Figure below illustrates the nMOS structure, where only nMOS transistors are used.
Advantages:
Suffers from threshold voltage loss (Vth drop), leading to degraded high-level signals.
To overcome the limitations of nMOS PTL, a CMOS version can be designed by replacing each
nMOS transistor with a full transmission gate, as shown in figure below.
CMOS PTL uses both nMOS and pMOS transistors, ensuring full voltage swing.
This implementation provides strong pull-up and pull-down characteristics.
Figure above shows a dynamic version of PTL, which requires a precharge phase before operation.
Performance comparison:
Comparable speed to nMOS PTL.
Requires a precharge period, which may extend clock cycle times.
8.2.3 Static Pass Transistor Logic with Feedback Buffer
To ensure correct operation, the p-transistor pull-up and n-transistor pull-down must be
properly ratioed.
1. A model as shown in figure below, where control variables steer a pass transistor network.
2. The pass function, which determines how input variables propagate through the network.
– When A = 0; Y = B.
– When A = 1; Y = B
The output voltage may not reach the full supply level due to threshold voltage loss in
nMOS PTL.
Complementary pass networks (nMOS + pMOS) help mitigate this but introduce
additional delay.
The merging of source and drain regions makes PTL design complex, leading to higher
internal capacitances.
Both true and complemented versions of control variables are needed, increasing
circuit complexity.
Complementary Pass Networks (CVSL): Can be integrated with CVSL logic for improved
performance.
Finally, any PTL design should be evaluated through simulation and layout analysis to determine
its effectiveness for a given application.
8.6 Conclusion
Pass Transistor Logic (PTL) provides an efficient alternative to conventional logic gates,
particularly in multiplexers, arithmetic logic units (ALUs), and XOR gates.
While PTL reduces transistor count and improves speed, it introduces challenges such as
threshold voltage drop, increased capacitance, and control signal complexity.
The fact that direct n-diffusion to p-diffusion connections are not possible in bulk CMOS.
The inverter’s drains are connected using a metal wire and two contacts.
Power (VDD) and ground (VSS) connections use metal for low resistance.
An alternative layout is shown in figure below where the transistors are aligned horizontally.
Figure 20: CMOS inverter layout with diffusion power and ground
Performance Considerations
Using a vertical polysilicon drain connection introduces extra resistance and capacitance:
Rtotal ≈ 2Rcontact + Rpoly
where Rcontact is the resistance of a metal-polysilicon contact, and Rpoly is the resistance of the
polysilicon wire.
Using diffusion for power and ground connections adds series resistance and
capacitance.
To ensure good performance, the resistance should be at least an order of magnitude lower
than the transistor’s “on” resistance.
Using Additional Metal Layers The introduction of a second metal layer enhances layout
flexibility:
The second metal layer can be used for VDD and VSS supply lines.
Alternatively, it can be used to strap polysilicon to reduce resistance and improve signal
propagation.
Layouts remain largely unchanged except for the addition of metal-2 wires and metal-1
connection stubs.
Figure 21: CMOS inverter layout using additional metal layers
Further drain capacitance reduction can be achieved using the star connection.
No corner gaps exist, which increases transistor gain (β) while minimizing capacitance.
A 2-input NAND gate can be implemented using a combination of series nMOS transistors and
parallel pMOS transistors.
Figure below represents the direct translation of schematic into layout of a 2-input NAND gate.
By orienting the transistors horizontally, we obtain the layout shown in figure below, which is
cleaner and more compact.
Figure 26: Optimized layout for NAND Gate
In cases where deviations from this style occur, specific design reasons will be provided.
The NAND gate could be rotated by 90◦ to have vertical metal and horizontal polysilicon
connections.
Symbolic Layout A 2-input NOR gate follows the opposite arrangement of the NAND gate, where:
Alternative Layout for Faster Performance An alternative NOR layout is shown in figure below,
where the connection to the parallel transistors is optimized.
Faster gate operation, since the reduced capacitance improves switching speed.
The same optimization can be applied to the NAND gate, improving its speed.
9.4 Factors Affecting Complex Gate Design
For more complex logic gates, four key factors influence their electrical and physical design:
1. Series Transistor Connection: Increased resistance affects speed.
Initially, nMOS transistors A, B, and C are OFF (VgsA = VgsB = VgsC = 0).
When transistor D is switched OFF, the internal node capacitance C1 gets charged, leading to a
nonzero Vsb.
If all inputs are then driven HIGH (VgsA = VgsB = VgsC = VDD), the source of transistor D will be at
VDD − Vt.
The fall time of the gate output will be slower than expected due to this internal charge
buildup.
Consider the relative impact of body effect in nMOS and pMOS transistors.
If nMOS body effect is worse, using NOR structures instead of NAND may be preferable.
3. If none exist, break the gate in the minimum number of places to achieve this condition.
An Euler path example and the corresponding layout generation is shown in figure below.
Figure 32: Euler paths in CMOS gate and the corresponding layout
9.6.3 Automated Layout Techniques
A graph-theoretic approach can automate layout generation, using interval graphs to optimally
place transistors in a gate matrix style.
The final layout is shown in figure below.
These notes provide foundational insights into the structure and layout of I/O pads.
Connection points and VDD/VSS rails are placed consistently for uniformity.
Wider power/ground buses are used based on worst-case power estimates; multiple pads may
be employed to reduce noise.
LEFT;
INPUT A
INPUT B TOP;
VDD VDD
INPUT C
RIGHT;
OUTPUT Z
OUTPUT Y
BOTTOM;
OUTPUT W
VSS VSS
Design Strategy
Use two-stage inverters with optimal sizing (2.7:1 or within 2–10 range).
Include buffering to reduce internal loading.
Apply latch-up protection:
However, one critical issue must be addressed in input pad design: electrostatic discharge
(ESD) protection.
As a result, even tiny leakage currents can cause a substantial voltage build-up across the gate
oxide.
The voltage V that develops on the gate can be estimated using the equation:
where:
Example: Let:
I = 10µA, Cg = 0.03pF, ∆t = 1µs
Then:
330V
Such high voltages can easily exceed the oxide breakdown voltage (typically 40 - 100 V),
causing permanent damage.
To protect the gate from high-voltage transients, input pads include a resistor and diode
clamps:
– Clamp Diodes (D1 and D2): Turn on if the input voltage exceeds VDD or falls below VSS.
– Series Resistor (R): Limits the peak current during voltage excursions. Typical values:
200Ω to 3kΩ.
Note: The resistor and the parasitic input capacitance form an RC time constant, which
can limit high-speed operation.
– A punch-through device has closely spaced source and drain regions but no gate.
– It acts like an avalanche diode, turning on around 50 V.
– This structure does not require additional well formation.
In some cases, the TTL output may use an external resistor to 5 V to improve VOH. This resistor
can even be integrated into the pad using a p-MOS transistor.