## - Digital VLSI Design .

Digital VLSI Design


Carleton 2003

Curleton $\quad$ Dig Cirp. $0 \quad$| © John Knight |
| :---: |
| Revised; October 9, 2003 |

Digital VLSI Design -

Good Afternoon

## Digital Design

## Abstraction

Hiding postponable design details inside higher level model. Digital is more successful in doing this than analog.

## The Three Axes for Digital Design Methods



Dig Cir p. 2
© John Knight
Revised; October 9, 2003
Slide 1

Digital Design •
Abstraction

## The Universe of Digital Abstraction

## Abstraction

A model which hides postponable details so that Engineers can think on a grander scale.

## The Gate Abstraction

The most common digital abstraction is the gate.

- It allows us to think only of 1 and 0 signals.
- We ignore real voltage levels, real switching thresholds.
- We lump timing into a few delays, $\mathrm{T}_{\text {rise }}, \mathrm{T}_{\text {fall }}, \mathrm{T}_{\text {prop }}$


Three Axes for Design Methods
There are different levels of abstraction along each axes

## Behavioural

Think about the result and the algorithm to produce it; ignore internal components; ignore their interconnections.

## Structural

Think about the components and how they are interconnected; ignore how they are actually built.
Mux symbol most abstract, gate design medium, the transistor mux design is least abstract.

## Physical

Think about how it is built.

## The Universe of Digital Abstraction

## The Y Diagram



## The Universe of Digital Abstraction

## The Y Diagram

## Abstraction

The further away from the origin one gets, the more abstract the concept.
Compare: the transistor MUX, the gate MUX, and the symbol for the MUX.
The Path for a Computer Aided Design (CAD) Tool
Most tools

- start in the behavioural axis,.
- transform the design to the structural axis.
- travel down the structural axis.
- Finally transfer over to layout on the physical axis.

However floorplanning goes from the structural at a high level to physical at a high level.

Clock tree design is physical.

## Path for Most Design in This Course, the Digital Part

1. Start at a behavioural RTL Description, (Register Transfer Level).
2. Go over to macros and/or gates on the structural axis
3. Then go to the physical axis to predesigned standard cells.
4. The cells were designed bottom up from transistors.


## Other Things Besides Raw Circuit Design

## Other Things

## Circuit Design

1. Transmission line effects.

2. Clock distribution, skew.
3. Asynchronous inputs, metastability.
4. Verification.


## Production Issues

1. Production Testing.

2. Packaging, Cooling.
3. Reliability
4. Pads, connections, protection.
5. Power Supply.

## Group Dynamics

1. Incomplete fuzzy specification.
2. Division of labour, communication!!!
3. Design style, coding style. Reusability, documentation.


## The Under-Advertised Aspects of VLSI Design <br> That All Should Be

## Some Less Common Topics ${ }^{\wedge}$ Covered in This Course

1. Power Supply
distribution droop, ground bounce, single supply
2. High-frequency lines,

When is a connection a transmission line, ringing, terminations, open-collector glitch
3. Clock distribution
clock-skew, direction of data-flow v.s. direction of clock-flow, latching for skew tolerance divided clocks, resynchronization, clock gating, clock trees.
4. Production Testing

BIST (built-in-self-test). scan-chain testing, $\mathrm{I}_{\mathrm{DDQ}}$ testing, boundary-scan
5. Packaging,
surface mount, ball-pin grid, heat dissipation, pad protection, ground and power connections.
6. Pads and connections

Drivers, lead inductance, electrostatic protection, latch up, pad models
7. Reliability,

Hot electrons, metal migration, Arrhenius model,
8. Group Dynamics

Fuzzy specifications, communications, software people make different assumptions, documentation.
9. Asynchronous and Synchronous

What synchronous means, asynchronous inputs, asynchronous reset, metastability.
10. Design style

Coding style, reusability, using cores, documentation.
11. Verification

Simulation, synchronous simulation, test benches, static timing analysis, formal verification.

## - Digital Design .

## Switching Circuits

## Logic With Switches

## Series and Parallel Circuits



OR with switches


AND with switches


## General Switch Logic

## Loop Analysis



Non Series-Parallel Circuit

$F=A \cdot D+B \cdot G+A \cdot E \cdot G+B \cdot E \cdot D$

Draw all the paths from Vcc to Output

Carleton
university

Digital Design •

Dig Cir p. 8
© John Knight
Revised; October 9, 2003
Slide 4

## Switching Circuits

## Logic With Switches

Logic can be done with switches as well as gates.
a. A parallel connection implements OR.
b. A series connection implements AND.
c. Series and parallel combinations can do complex logic.

## General Switch Logic

However not all switching circuits can be solved using series and parallel combinations.

## Loop Analysis

Construct all paths between a logic " 1 " and the output. Each path is a string of ANDs. which are ORed together.
The expression comes out as a Sum-of-Products ( $\Sigma$ of $\Pi$ )

- Cut-Set Analysis.
- Make all the cuts that completely separate the output and Vcc.
- The cuts must only pass through switches.
- The switches in the cut are ORed together.
- The expression comes out as a Product-of-Sums (П of $\Sigma$ )


Cut-Set Analysis

Example: What logic function does this circuit implement?.


One can do this by loops or cutsets and remove $x \cdot \bar{x}$ algebraically. Alternately one can note a loop with $x \cdot \bar{x}$ can never be completed, or a cut set with $x+\bar{x}$ is always true.

- Digital Design .


## MOS Transistors, (in case you forgot)

The PMOS transistor


## The NMOS transistor




Think "Electrons are lighter than holes."

Carleton
university
© John Knight
Revised; October 9, 2003

## MOS transistors

## Conventions and Symbols

The p-channel MOS transistor was built first. It was natural to connect it to the high voltage and drain into a load. This is the reason for the names drain and source.
The n-channel MOS transistor kept the same terminology even though it had to drain uphill.

## Analog symbols

The MOS transistors used in logic are enhancement mode. Enhancement mode means they are shut off for zero gate-source voltage. The analog symbol shows the source-drain connection is off by using a broken line for the channel. (A depletion mode transistor, which conducts when $\mathrm{V}_{\mathrm{G}-\mathrm{S}}=0$, has a solid line).

An NMOS transistor forms a thin N-channel (barely shown) between the two N+ sections on top of the P-type body (P-well). This channel forms just under the thin oxide (not labelled) under the gate.
The body (well) of the transistor and the channel form an NP junction. The arrow on the analog symbol (c) shows the direction conventional current could flow in this junction. However the body is always back biased so this current will not flow.

The analog symbol information is mostly redundant in digital circuits:

- The body of an NMOS transistor is nearly always grounded (PMOS is connected to $\mathrm{V}_{\mathrm{DD}}$ ).
- The source and drain are physically identical and are not distinguished. Which is which is determined by the circuit.
Thus the symbol (a) below is used in digital descriptions and in these notes.
- Digital Design .


## Threshold Voltage in MOS Transistors

In a Conducting MOS Transistor
The Gate-Source Voltage $\geq$ a Threshold Drop,
Much More for Good Conduction.


## Replacing the switches with transistors


© John Knight
Revised; October 9, 2003
Slide 6

MOS transistors

NMOS transistor symbols

|  | shows current direction at the source | Symbol used in many analog circuits |  |
| :---: | :---: | :---: | :---: |

PMOS transistor symbols.
Symbol often used

used for CMOS circuits | Symbol which shows |
| :--- |
| current direction at the |
| source |

```
Voltage Loss From Using NMOS With Its Feet In the Air
    Output Voltage Lowering With Common Drain
    - Vout = \(5-V_{D S}\)
    - \(\mathrm{V}_{\mathrm{GS}}=\mathrm{V}_{\mathrm{DS}}\)
    - \(\Rightarrow V_{\text {OUT }}=5-V_{\text {GS }}\)
    - (Below Right) \(\mathrm{V}_{\text {Out }}\) for various \(\mathrm{I}_{\mathrm{DS}} \backslash\)
        The straight lines show Vout \(=I_{D S} R\) for a few \(R\)
        The intersections show \(V_{\text {OUT }}\) for that value of \(R\).
    - One needs over 100 K to get \(\mathrm{V}_{\text {OUT }}\) near \(\mathrm{V}_{\mathrm{DD}}-\mathrm{V}_{\text {THRESHOLD }}\)
```



$I_{D}$ vs $V_{D S}$


Rotate the axes


5 v - the previous graph

$\mathrm{V}_{\text {OUT }}$ for different R Gate at 5V
© John Knight
Revised; October 9, 2003

The Threshold Voltage
The Minimum Gate Voltage to Turn On the Transistor

- Can be adjusted by ion implanting or body biasing.
- For 5 V logic it is about a volt.
- For 3.3 V or 2.5 V logic it is lowered to about 0.5 V .
- If made too low, transistors will have a high leakage current when OFF.
- $\quad \mathrm{V}_{\mathrm{G}-\mathrm{S}}$ does not stay constant near the threshold like the the $0.7 \mathrm{~V}_{\mathrm{B}-\mathrm{E}}$
 junction voltage of a Si bipolar transistor.
- $I_{D(M A X)}$ v.s. $V_{G S}$ is quadratic for long channel transistors. ( ) It is roughly linear for short channel transistors due to velocity saturation.( )

The Model and Spice Parameters For the Threshold Voltage
NMOS

$$
\mathrm{v}_{\mathrm{T}}=\mathrm{V}_{\mathrm{T} 0}+\gamma\left\{\sqrt{\left|\mathrm{V}_{\mathrm{SB}}\right|+2\left|\phi_{\mathrm{F}}\right|}-\sqrt{2\left|\phi_{\mathrm{F}}\right|}\right\}
$$

$\mathrm{V}_{\mathrm{TO}}=\mathrm{VT} 0=$ Zero-bias threshold voltage. Typically 0.8 V )
$\gamma=$ GAMMA $=$ Bulk threshold parameter (body coefficient). Typically $0.37 \mathrm{~V}^{1 / 2}$
$2 \phi_{\mathrm{F}}=\mathrm{PHI}=$ Surface Potential. Typically 0.65 V
$\mathrm{V}_{\mathrm{SB}}=$ Source-to-body voltage.
For PMOS transistors used in CMOS (enhancement type) the threshold voltage $\mathrm{V}_{\mathrm{T}}$ and the zero-bias threshold voltage $\mathrm{V}_{\mathrm{T} 0}$ are negative:

> PMOS

$$
\mathrm{v}_{\mathrm{T}}=-\left|\mathrm{V}_{\mathrm{T} 0}\right|-\gamma\left\{\sqrt{\left|\mathrm{V}_{\mathrm{SB}}\right|+2\left|\phi_{\mathrm{F}}\right|}-\sqrt{2\left|\phi_{\mathrm{F}}\right|}\right\}
$$

## - Digital Design .

## Output Voltage Loss from NMOS With Its Feet In the Air



- Use approximate transistor curves
- Plot where VDS=VGS
- Note: Vout = 5-VDS
- Vout /(Current from curve) $=$ R
- Vout MAX is 4V, but unless $R$ is very high, it is much less.

Do not use NMOS up in the air

The voltage will be very low unless the resistance to ground is very high.
If the resistance is very high, the circuit will take a long time to discharge its stray capacitance.

It will be very slow

## One way to use NMOS up in the air

One can overcome these problems by raising the gate voltage above $V_{D D}$.
The down side is one must generate another supply.


- Digital Design .


## Replacing the switches with transistors

Use NMOS For Loads Connected To $\mathrm{V}_{\mathrm{DD}}$.


NMOS gives the full logic swing
But connecting the load to $\mathrm{V}_{\mathrm{DD}}$ inverts the function.


Carleton
Dig Cir p. 18
© John Knight
Revised; October 9, 2003
Slide 9

Digital Design -
The Threshold Voltage
PMOS transistors don't like their feet on the ground.
The minimum $\mathrm{V}_{\text {GATE }}$ is 0 V .
The SOURCE must be at least $\mathrm{V}_{\text {THRESHOLD }}$ above the gate for the transistor to be on.
Using Kirchoff,
the output cannot go below $\mathrm{V}_{\text {THRESHOLD }}$.
The PMOS threshold is written $\left|\mathrm{V}_{\text {THRESH }}\right|$ with absolute value signs. This is because it is negative and writing $\geq$ with negative quantities is confusing.

## NMOS does not like its feet off the ground

In logic, the gate voltage is limited to 5 V .
With a 1V threshold drop, the
output voltage $=5 \mathrm{~V}-1 \mathrm{~V}=4 \mathrm{~V}$ maximum.
With only a 1 V threshold drop, the current $\mathrm{I}_{\mathrm{D}}$ will be very low, and can only charge the stray capacitance slowly.
Thus even if one tolerates the 4 V maximum output, gates built this way will be very slow.
The shaded curve is supposed to look like a slow rise time.


## - Digital Design .

Replacing The Switches With Transistors (cont.)


## Use PMOS For Loads Connected To Ground.

PMOS gives the full logic swing.


Using PMOS inverts the inputs and "DeMorgan's" the function.

Dig Cir p. 20
© John Knight
Revised; October 9, 2003
Slide 10

## Digital Design •

The Threshold Voltage

## PMOS loves dangling its feet

A low gate voltage turns on PMOS
Here $\mathrm{V}_{\mathrm{GS}}=5 \mathrm{~V}$.
The transistor conducts very well and can supply a full 5 V at the drain.
The large current can charge the stray capacitance quickly.


## . Digital Design.

## Combining NMOS and PMOS.

- NMOS pulls down for $A=1$ to make $F=0$.
- PMOS pulls up when $A=0$ to make $F=1$..

- Combine the circuits and throw out the resistors.



## Active Pull-up and Pull-Down

Circuits with Resistive Pull-Ups (Pull-Downs)
Circuits with a transistor too pull the output down and a resistor to pull it up are:

- Are called ratioed logic because there low output voltage is the ratio:-

$$
\mathrm{V}_{\text {DD }}\left(\mathrm{R}_{\text {CHANNEL }} / \mathrm{R}_{\text {RESISTOR }}\right)
$$

- They pull-low quickly, but pull-high with time constant $\mathrm{R}_{\text {RESISTOR }} \mathrm{C}$.
- They consume power whenever the transistor is on.


## Circuits with Transistors to Pull-Up and to Pull-Down

- These circuit can give a low output of zero and a high output of $\mathrm{V}_{\mathrm{DD}}$.
- They can switch quickly in both directions.
- They do not consume power except during the switching time.


## CMOS

## The CMOS Configuration

a. A CMOS gate is inverting. Call its output $\bar{Z}$ to indicate this.
b. The switch function in the upper box and the output are the same. Thus the PMOS function will also be $\bar{Z}$.
c. The switch function in the lower box is the inverse of the output. The NMOS function will be $Z$.
d. An example of the NAND function.
e. The NAND function showing the transistor switches.

© John Knight
Revised; October 9, 2003
Slide 12

CMOS
The Threshold Voltage

## CMOS

## Combining NMOS and PMOS

## Complimentary Logic

This type always has one-and-only-one transistor turned on. One transistor always gives a path to a supply (VDD or ground). It never gives a path through both transistors at the same time.


## 1.• Problem

Draw the circuit for a CMOS NOR gate.


## Demorgan's Theorem

## Demorgan's Theorems, Simple Forms

```
\overline{\overline{A}+\overline{B}}=\mathbf{A}\cdot\textrm{B}}\quad\overline{A}\cdot\mathbf{B}=\overline{A}+\overline{B}\quad\overline{D+E}=\overline{D}\cdot\overline{E}\quad\overline{D}\cdot\overline{E}=D+
```


## Demorgan's Theorems, General Form

| $\overline{\mathbf{F}(\mathbf{A}, \mathbf{B}, \mathbf{C}, \ldots+, \cdot,)}=\mathbf{F}(\overline{\mathbf{A}}, \overline{\mathbf{B}}, \overline{\mathbf{C}}, \ldots, \cdot,+$, | $F=[\bar{A} \cdot \mathbf{B} \cdot \mathbf{C}+\mathrm{D} \cdot(\mathbf{A} \cdot \mathbf{B}+\mathbf{C})] \cdot \bar{A}$ |
| :---: | :---: |
| a) Bracket all groups of ANDs | $F=\{[\{\bar{A} \cdot B \cdot C\}+\{D \cdot(\{A \cdot B\}+C)\}] \cdot \bar{A}\}$ |
| b) Change AND to OR and OR to AND |  |
| Clean brackets | $\{\bar{A}+B+C\}\{D+\{A+B\} C\}+\bar{A}$ |
| c) Invert all variables | $\bar{F}=\{A+\bar{B}+\bar{C}\}\{\{\bar{D}+\{\bar{A}+\bar{B}\} \bar{C}\}+A$ |

Examples

| $\mathbf{F}=\overline{\mathbf{A}} \cdot \mathbf{B} \cdot \mathbf{C}$ |  |
| :--- | :--- |
| $\mathbf{F}=\overline{\mathbf{A}} \cdot \mathbf{B} \cdot \mathbf{C}+\mathbf{A} \cdot \overline{\mathbf{B}} \square$ |  |
| $\mathbf{F}=\overline{\mathbf{A}} \cdot \mathbf{B} \cdot(\mathbf{C}+\overline{\mathbf{A} \cdot \mathbf{B})} \square$ | $\overline{\mathbf{A}} \cdot \mathbf{B} \cdot \mathbf{C}\}$ |
| $\{\overline{\mathbf{A}} \cdot \mathbf{B} \cdot \mathbf{C}\}+\{\mathbf{A} \cdot \overline{\mathbf{B}}\}$ |  |

Dig Cir p. 26
© John Knight
Revised; October 9, 2003
Slide 13

CMOS -
Duality

## Duality

Duality is a concept closely related to DeMorgan's Laws.
Duality says that if
$\mathbf{F}(\mathbf{A}, \mathbf{B}, \mathbf{C}, \ldots+, \cdot$,$) is a valid formula, then$
$\mathbf{F}(\mathbf{A}, \mathbf{B}, \mathbf{C}, \ldots, \cdot,+$, ) is also a valid formula.

## Example

Suppose you remember the most useful of the simplification formulas -

$$
X+A X=X
$$

Then the dual is also a valid simplification formula -

$$
X(A+X)=X
$$

Example
Everybody remember the distributive law -

$$
X(A+B)=A X+B X
$$

Nobody remembers the other distributive law which is its dual -

$$
X+(A B)=X A+X B
$$

$\qquad$


## Using DeMorgan's Law to Design CMOS

In PMOS implement $\overline{\mathbf{Z}}$
In NMOS implement $\mathbf{Z}$
Design a CMOS gate for $\overline{\mathbf{Z}}$

a. Apply generalized DeMorgan to $\overline{\mathbf{Z}}$
$\overline{Z(a, b \ldots c,+, \cdot, 0,1)}=Z(\bar{a}, \bar{b} \ldots \bar{c}, \cdot,+, \mathbf{1 , 0})$
b. Implement the DeMorgan form of $\bar{Z}$ with PMOS switches.
c. Implement the noninverted $Z$ with NMOS switches.
d. Put NMOS and PMOS together.

Call this a $\bar{Z} / Z$ gate

Example
a) $F=\bar{Z}=\overline{(A+B) \cdot C+D \cdot E}$
d)
$=\overline{((A+B) \cdot C)+(D \cdot E)} \quad$ Put brackets around AND terms $=((\bar{A} \cdot \bar{B})+\bar{C}) \cdot(\bar{D}+\bar{E}) \quad$ Change to DeMorgan form of $\bar{Z}$
b) $\operatorname{PMOS}=\bar{Z}=((\bar{A} \cdot \bar{B})+\bar{C}) \cdot(\overline{\mathrm{D}}+\overline{\mathrm{E}})$
c) $\mathrm{NMOS}=\mathrm{Z}=(\mathrm{A}+\mathrm{B}) \cdot \mathrm{C}+\mathrm{D} \cdot \mathrm{E}$

UNIVERSITY

## Using the Sum of Products ( $\Sigma$ of $\Pi$ ) for the PMOS function

## Carry circuit for full adder

$\mathrm{C}_{\mathrm{Y}}=\mathrm{A} \cdot \mathrm{B}+\mathrm{B} \cdot \mathrm{C}+\mathrm{C} \cdot \mathrm{A}$
DeMorgan changes $F$ in ( $\Sigma$ of $\Pi$ ) into a $\bar{F}$ in ( $\Pi$ of $\Sigma$ ).
$C_{Y}=A \cdot B+B \cdot C+C \cdot A \quad \Rightarrow \quad \bar{C}_{Y}=(\bar{A}+\bar{B}) \cdot(\bar{B}+\bar{C}) \cdot(\bar{C}+\bar{A})$
Sometimes converting $\overline{\mathrm{F}}$ back to ( $\Sigma$ of $\Pi$ make it simpler,
or have smaller AND chains,


Carleton
Dig Cir p. 30
© John Knight
Revised; October 9, 2003
Slide 15

CMOS -
PMOS Circuit

## PMOS Circuit

It is usual to change NMOS parallel into PMOS parallel
NMOS series into PMOS parallel.
However sometimes rearranging the logic is better.

Circuits in which the PMOS and NMOS connections are the same (as here) are called self-dual.

## CMOS Complex Gates

## General Properties

- Can implement any function with a single inversion bar. Only one "bar" in the function, and that over the whole thing. Example $\overline{\mathbf{A} \cdot \mathbf{B}+\mathbf{C}+\mathbf{A} \cdot \mathbf{D}}$


## Max Size on ANDs

Gates are limited to about 4 series transistors.
The total channel resistance of the 4 series transistors is

$$
\approx 4 \times(R \text { of a single transistor). }
$$

The output time-constant is at least $4 \times$ larger.
If ( $\sum$ transistors stray C) >> (C Load)


The output time constant approaches
(4R)(4C) or $16 \times($ an inverter)

## Max Size on NOR.

- PMOS is $1 / 2$ to $1 / 3$ the speed of NMOS
- A 4-input NOR is Really Slow. for output going low to high.
- Output going high $\square$ to low is faster.



## Gate Speed

Discharging an $\eta$ input NAND is quadratic in $\eta$
The total NMOS channels resistance $=\eta \mathrm{R}_{\text {CHANNEL }}$.
The total PMOS drain-to-substrate capacitance $=\eta C_{\text {DRAIN }}$.
If $\mathrm{C}_{\mathrm{L}}$ is the load capacitance external to the gate. Then the-
Discharge time-constant
$\approx \eta R_{\text {CHANNEL }} *\left(\eta C_{\text {DRAIN }}+\left(\right.\right.$ effects of $\left.\left.C_{\text {drain }} s\right)+C_{L}\right)$
If $C_{L} \gg C_{\text {DRAIN }}$ or $C_{\text {drain }}$, then the discharge time constant is about $\eta \times$ longer than for a single transistor.

$$
\text { Discharge time constant } \approx \eta R_{\text {CHANNEL }} * C_{L}
$$

If $\mathrm{C}_{\mathrm{L}} \ll \mathrm{C}_{\text {DRAIN }}$ then-
Discharge time-constant $\approx \eta R_{\text {CHANNEL }} *\left(\eta C_{\text {DRAIN }}+\left(\right.\right.$ effects of $\left.\left.C_{\text {drain }} s\right)\right)$
The NMOS drain capacitances, distributed between the Rs, contributes less than the
sum of the $\mathrm{C}_{\text {drains }}$. An approximation (called the Elmore delay ${ }^{1}$ ) gives-
$\eta R_{\text {CHANNEL }}$ (effects of $C_{\text {drain }} s$ )

$$
\left.\approx \mathrm{R}_{\mathrm{CHANNEL}} * \mathrm{C}_{\mathrm{drain}} *(\eta+1) \eta / 2\right)
$$

For $\eta=4$ this gives a discharge time-constant $=14 R C=14 x$ an inverter.

- $\mathrm{C}_{\text {drain-to-gate }}$ is lumped in with $\mathrm{C}_{\mathrm{drain}}$ (drain-to-substrate ) here.
- Because $\eta$ series transistors will act like a long transistor with less velocity saturation. $R_{\text {CHANNEL }}$ will increase less than linearly with $\eta$.

Though not quite $16 x$, the rapid time constant increase discourages large $\eta$.


[^0]
## Using DeMorgan Graphically (Review)

## Equivalent graphical forms for AND, OR, NAND and NOR.



## Getting Rid of Big ORs.

Using DeMorgan graphically to partition big ORs into faster structures.


## Carleton <br> UNIVERSITY

Dig Cir p. 34
© John Knight
Revised; October 9, 2003
Slide 17

CMOS •
PMOS Circuit

## Charging $\mathrm{C}_{\mathrm{L}}$ through parallel transistors

With parallel transistors only one might be on, so the worst case charging resistance is as for one. Parallel transistors can share the same substrate well so additional transistors add less to the total $C_{\text {DRAIN }}$ than the first ${ }^{1}$.

## NANDs beat NORs

Since electrons are more mobile than holes, PMOS transistors are $1 / 2$ to $1 / 3$ of the speed of NMOS. NORs have a PMOS string which makes them slow.

To compensate cell designers make the PMOS transistors double width or more. Check how well the compensation is done by comparing the cell high-tolow propagation delay ( $\mathrm{t}_{\mathrm{PHL}}$ ) and the low-to-high propagation delay ( $\mathrm{t}_{\mathrm{PLH}}$ ). Two-to-one differences are common for the 2-input NOR.

## Large Fan-In Gates



CMOS gate propagation delay changes according to the following approximate formula.
$\eta=$ fan-in; the number of input leads coming into the gate. ${ }^{2}$

$$
t_{\mathrm{PROP}}=\mathrm{a}_{1} \eta+\mathrm{a}_{2} \eta^{2}
$$

The $a_{1} \eta$ is important with large $C_{L} \gg C_{\text {DRAIN. }}$
The $a_{2} \eta^{2}$ is important when $C_{L} \ll C_{\text {DRAIN }}$,
2.• Problem

Break a 6-input NAND into 2-input gates (one 3-input gate is allowed).

[^1]
## Basic CMOS Gates <br> NAND

Verilog textual description

nand NANDY (OUT, IN1, IN2), namby (F, A, B, D) ;


© John Knight
Revised; October 9, 2003
Slide 18

CMOS $\quad$
Verilog Gate Description

## Verilog Gate Description

## The Interpretation of Delay Times Depends on the Number of Arguments

NAND \#(All_transition_delay) instance_name(output, ina, inb, ... )
NAND \#(Rise_time, Fall_time) instance_name(output, ina, inb, ... )
NAND \#(Rise_time, Fall_time, Turn_off_delay) instance_name(output, ina, inb, ... )

- Rise_time. The output starts in $\mathrm{X}, \mathrm{Z}$ or 0 and ends in 1 .

Interpretation I. The delay from the $1 / 2$ value of an input change until the output reaches $1 / 2$.
Interpretation II. (older) The delay from the start of an input change until the corresponding output=1.
The cell designers tend to use interpretation I. Circuit designers must follow cell designers.

- Fall_time. The output starts in X, Z or 1 and ends in 0 .

Interpretation I. (More common) The delay from the $1 / 2$ value of an input change until the output reaches $1 / 2$. Interpretation II. (older) The delay from the start of an input change until the corresponding output= 0 .

- Turn_off_delay. The output starts in X, 0 or 1 . Turn_off_delay is the time from an input change until the output= Z. The built-in NANDs do not have 3-state control but bufifo, and bufif1 gates do. See page 44.
- All_transition_delay. One number is used for all of the above delays.


## Time Units

The delays are in simulator units. Thus 3 could be 3 ns , $3 \mu \mathrm{~s}$ or 3 years, whichever you say it is. However the timescale command can specify units. This is useful for combining a modules specified in say ps with one using ns.
-timescale $100 \mathrm{ps} / 10 \mathrm{ps} / /$ Reference time unit is 100 ps with accuracy of 10 ps . // Only 1, 10, and 100 are allowed as numbers.

## Case Sensitivity

- Verilog is case sensitive
- Reserved words are in lower case.
- User variables can be either case but NANDY is not Nandy, which is not nandy .


## NAND Gate Definition Using 4 Logic Values

## Four-Value Logic

- 1
- 0
- $\mathrm{X}=$ unknown to simulator
- $Z=$ disconnected, tri-state ${ }^{(8)}$


Four-Value NAND

| IA | NAND | 0 | 1 | X | Z |
| :---: | :---: | :---: | :---: | :---: | :---: |
|  | 0 | 1 | 1 | 1 | 1 |
|  | 1 | 1 | 0 | X | X |
|  | X | 1 | X | X | X |
|  | Z | 1 | X | X | X |

## Meaning of NANDing with $X$ and $Z$




$\begin{array}{r}I A=x, z \\ I B=1, x, z\end{array}-\square 0-x$

- A simulated input $Z$ gives the output as input $X$ for NANDs, NORs, ANDs and ORs.
- $X$ is unknown in the simulator. Do not confuse with "don't care" in logic design.

Carleton

Don't Care Outputs are not Simulation Unknowns
BCD to 7 -segment display
Inputs WVYU are always decimal numbers, so inputs $1100,1101,1111$, 1110,1011 and 1010 , never happen.
The output for those inputs are never displayed and are don't care outputs. These can be used to simplify the logic.


Simulation Unknown From a Turned Off Bus


- For CMOS, the value on a disconnected bus may be the previous value stored in the bus capacitance. Verilog has a model trireg which can be used to simulate this.
- Once started, X can propagate through a circuit like cholera.
- Another unknown type is the don't care input. It means the output is independent of that input. Verilog uses a "?" for this in functions defined by a table.
For example the table for an OR might be written:

$$
\text { table } 00 ; 0 ; 1 \text { ? : } 1 \text {; ? } 1 \text { : } 1 \text {; endtable }
$$

The "?" stands for any of $1,0, \mathrm{X}$, or Z .

- "?" has a different meaning in a casex or casez statements. There it is equivalent to Z.


## Basic CMOS Gates (continued) <br> NOR

Verilog textual description nor \#(RISE_T, FALITI) NORKY (OUT, IN1, IN2);

## AND and OR

These are implemented as NAND and NOR with an extra inverter
and \#(RISE, FALL) ANNY(F1, IA, IB);
or \#(RISE, FALL) ORGY(F2, IA, IB) ;

## XOR and XNOR


© John Knight
Revised; October 9, 2003
Slide 20

CMOS -
Basic CMOS Gates (continued)

## Basic CMOS Gates (continued)

## Built-in Gates

There are 6 built-in primitive gates: nand, nor, and, or, xor, xnor.

## The Number of Inputs

Verilog builds the gate to agree with the number of inputs in the expression.

## Optional Arguments

The rise_time and fall_time arguments are optional. The default is zero delay.

## Behavioural Statements

This is behavioural Verilog, part of the Register Transfer Level, which is the level of abstraction at which many circuits are synthesized.
See "The Path for a Computer Aided Design (CAD) Tool" on page 5.

## Conditional Statements

The general form is
<condition> ? <expression_if_condition is true> : <expression_if_condition is false>

## Relational Statements

$A==B$ gives output 1 if $A=0=B$, or $A=1=B$. If either input contains an $X$ or a $Z$ the output is $X$.
$A===B$ also gives output 1 if $A=X=B$, or $A=Z=B$. It never gives an $X$ output.
3. - Problem

Construct the truth table for an XOR gate. The inputs may be $0,1, \mathrm{X}$ or Z . There is no such thing as level $\overline{\mathrm{X}}$.
The solution should contain 12 X entries.

## XOR Construction

## XOR Gate Construction

 XOR is Not a $\bar{Z} / Z$ GateXOR needs internal inversions.
Build an XOR from:
$C=\overline{A+B}$ and $F=\overline{A \cdot B+C}$
see side block
Construct a $\bar{Z} / \mathbf{Z}$ gate for $F$
$\bar{Z}=\bar{A} \cdot \bar{B}+\mathbf{C}=(\bar{A}+\bar{B}) \bar{C}$
$Z=A \cdot B+C$


Carleton
university
Dig Cir p. 42
© John Knight
Revised; October 9, 2003
Slide 21

CMOS -
XOR Construction

## XOR Constructed With Two $\overline{\mathbf{Z}} / \underline{Z}$ Gates

An alternate way to get the expression for XOR is to start with the product-of-sums representation for XOR:

$$
\mathrm{F}=(\overline{\mathrm{A}}+\overline{\mathrm{B}})(\mathrm{A}+\mathrm{B})
$$

Apply DeMorgan

$$
\overline{\mathrm{F}}=(\mathrm{A} \cdot \mathrm{~B})+(\overline{\mathrm{A}} \cdot \overline{\mathrm{~B}})
$$

Apply DeMorgan again

$$
\overline{\overline{\mathrm{F}}}=\mathrm{F}=\overline{(\mathrm{A} \cdot \mathrm{~B})+(\overline{\mathrm{A}} \cdot \overline{\mathrm{~B}})}
$$

## XOR Built With Transmission Gates

This 6-transistor XOR implementation is used in some cell libraries.
It uses a transmission gate, a PMOS and an NMOS transistor in parallel which are both turned on and off together. A is connected to $F$ when $B$ is 1 and disconnected when $\mathrm{B}=0$.

- When $B=1$, the transmission gate is off, and $\mathrm{B}=1, \overline{\mathrm{~B}}=0$ powers the inverter.


The output is $\overline{\mathrm{A}}$, but since $\mathrm{B}=1$, one can say the output is $\overline{\mathrm{A}} \mathrm{B}$

- When $B=0$, the inverter is off, but the transmission gate is on. The output is $A$ but since $B=0$, it is actually $A \bar{B}$.


## CMOS Gates (continued)

## Buffer and Not Gates Construction

Regular Buffer and NOT
buf \#(RISE, FALL) DRV(OUT, IN);
not \#(RISE, FALL) DRV(AFT, FORE) ;
TRI-STATE ${ }^{\circledR}$ Buffer and NOT.
bufif1 \#(Tlh, Thl, Tz) DRV1 (BUS, BK, ENB);
bufif0 \#(Tlh, Thl, Tz) DRVO(BUS, A, CTRL) ;
notif1 \#(Tlh, Thl, Tz) Not1 (BUS, D, K) ;
notif0 \#(Tlh, Thl, Tz) Not0(BUS, IN, CNTROL) ;

## bufifo means

normal buffer if control=0,
tristate if control=1.


## Four-Value Truth Tables

| CTRL |  |  |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
|  | bufifo | 0 | 1 | X | Z |
|  | 0 | 0 | Z | Z | Z |
| A | 1 | 1 | Z | Z | Z |
|  | X | X | Z | X | X |
|  | Z | X | Z | X | X |


© John Knight
Revised; October 9, 2003
Slide 22

CMOS
XOR Construction

## Legal Notice

TRI-STATE is the registered trade mark of National Semiconductor Corp. Their lawyers have notified at least one text book writer ${ }^{1}$ to display this fact. It is not clear if this applies only to the spelling "tristate." The word "3-state" appears to be in the public domain.

## Gate Timing

## Time to go 3-State

There are three times that can be associated with bufif and notif type gates.

- The rise time or $t_{p L H}$ (time propagation high to low), the first argument
- The fall time or $\mathrm{t}_{\mathrm{pHL}}$, the 2 nd argument.
- The time to go into (and come out) of 3-state, or $\mathrm{t}_{\mathrm{Z}}$.

In user-defined gates one can use a construction called a specify block to make more complex timing. For example one could have a different $\mathrm{t}_{\mathrm{Z}}$ for turn-on and turn-off. This is not available in the built-in gates.

## Min:Typical:Max Delays

Verilog simulators such as Cadance's Verilog-XL simulator can run either slow, typical or fast gates.
This is specified by a 3 numbers arranged min:typ:max

When the simulator is invoked, which delays will be used is chosen as an option.
In the Cadence verilog-XL simulator a command-line options is used when starting the simulator.
For example to run with $\mathrm{t}_{\text {RISE }}=35$ and $\mathrm{t}_{\text {FALL }}=31$ for this gate, use
? verilog your filename.v........axaiays.....
The other command-lines options are +mindelays and +typdelays.

[^2]
## . CMOS .

## Buffer Construction

## Simple Buffer

$K=0$ isolates BUS from D.
Leaves bus floating as far as $D$ is concerned.
Connecting control K to the middle transistors makes the bus slightly faster.
It disconnects the internal stray capacity shown from the bus, when driver is off $(\mathrm{K}=0)$.

## Faster Buffer

This has only one channel resistance in the bus charge/discharge path.
If

- the bus capacitance is large,
- D and $K$ are small transistors,
the NAND and NOR can be made stepped drivers for minimum delay.


Dig Cir p. 46
© John Knight
Revised; October 9, 2003
Slide 23

## CMOS -

## Stepped Drivers

These consider the fact that when buffer transistors are made wider to charge their output load faster, their gate width increases and puts more load on the buffer's driver stage.
Rabaey ${ }^{1}$ shows that the optimum increase in transistor widths is about 3.6 , and that the buffer load must be over 4 times the buffer driver load before one can get any improvement.

[^3]
[^0]:    ${ }^{1 \cdot}$ [RABAEY96] Example 4.5 p. 476.

[^1]:    ${ }^{1}$ [Rabaey96] p. 201 shows how overlap capacitance is shared. Also the Miller effect doubles C but only for the first transistor to switch.
    2. [Rabaey96] p. 196 discusses this quadratic relationship.

[^2]:    1. John Wakerly in Digital Design, Principles and Practices, second edition, p. 122.
[^3]:    1. [RABAEY96], pp. 448-454
