Non-Uniform Chip-Temperature Dependent Signal Integrity

Amir H. Ajamí, Kaustav Banerjee*, Massoud Pedram

Department of Electrical Engineering - Systems, University of Southern California, Los Angeles, CA 90089
*Center for Integrated Systems, Stanford University, Stanford, CA 94305

Abstract

In traditional design flows the temperature of the chip is assumed to be uniform across the substrate. However, for most high-performance designs the substrate temperature is non-uniform, which can be a major source of inaccuracy in delay and skew computations. This paper introduces the analysis and modeling of non-uniform substrate temperature and its effect on signal integrity. Using a novel non-uniform temperature-dependent analytical distributed RC interconnect delay model, the thermally dependent signal integrity metrics, i.e. signal delay and clock skew, are analyzed and some design techniques are provided to eliminate the non-uniform temperature-dependent clock skew.

I. Introduction

Management of thermally related issues is rapidly becoming one of the most challenging efforts in high-performance chip designs [1]. At the circuit level, thermal problems have important implications for circuit performance and reliability [2]. Dynamic power management and functional block clock gating tend to cause large thermal gradients over the substrate. Furthermore, as the technology feature size shrinks down, the upper metal layers that carry the global signals (including the clock) get closer to the substrate [1]. Consequently, the effect of non-uniform substrate temperature on the interconnect temperature profile becomes more critical. This thermal non-uniformity causes a non-uniform temperature profile along global signal lines, resulting in a non-uniform interconnect resistance per unit length. In high performance designs, it is of utmost importance to provide design techniques to overcome physical non-uniformities and process variations. This is necessary to provide uniform unit length interconnect capacitance and resistance in order to ensure a near-zero clock skew [3]. Hence, analysis and modeling of the effect of non-uniform substrate temperature on the integrity of global signal lines is critical.

II. Non-Uniform Chip and Interconnect Thermal Profile

Due to the different switching activities of the cells in the substrate, a non-uniform temperature gradient, which is created by the so-called hot spots in the substrate, is inevitable. Since long global wires span a large area of the substrate surface, they will pass over these hot spots. As a result, determining the interconnect thermal profile is crucial to the signal integrity analysis of the global interconnects.

Consider an interconnect with length $L$, width $w$ and thickness $t_0$ that passes over the substrate with an insulator of thickness $t_i$ and thermal conductivity $k_i$ separating the two (Figure 1). The interconnect is connected to the substrate by vias/contacts at its two ends. Using appropriate boundary conditions, the temperature profile along the interconnect can be obtained by solving the heat diffusion equation in the steady state, by assuming that the four sidewalls and the top surface of the chip are thermally isolated (which are in general valid assumptions), the heat diffusion equation can be written as follows:

$$a \frac{dT_{\text{IC}}(x)}{dx} = k \frac{T_{\text{IC}}(x) - T_{\text{IC}}(x)}{w^2}, \quad \theta = \frac{T_{\text{IC}}(x) - T_{\text{IC}}(x)}{w^2}$$  

where $\lambda$ and $\theta$ are constants given as follows:

$$\lambda = \frac{k_i}{k_i t_i}, \quad \theta = \frac{k_i}{k_i t_i}$$

$T_{\text{IC}}$ is the interconnect temperature as a function of position along the length of the interconnect, $T_{\text{IC}}$ is the substrate temperature, $p$ is the metal electrical resistivity at the reference temperature (0 °C), and $\beta$ is the temperature coefficient of interconnect metal resistance in $1/°C$. In order to have a unique solution for (1), we need to provide two boundary conditions. Figure 2 and 3 show the thermal profiles of the point-to-point interconnect depicted in Figure 1 for two different technology nodes under various substrate thermal profiles.

III. Non-Uniform Temperature-Dependent Signal Delay Model

The resistance of the interconnect is a function of the line temperature as $r(t) = r_0(1 + \beta T(t))$, where $r_0$ is the unit length resistance at 0 °C and $\beta$ is the temperature coefficient of resistance (1/°C). Consider an interconnect with length $L$ and uniform capacitance per unit length $C_{\text{IC}}$ that is driven by a driver of output resistance $R_D$ and terminated by a load with capacitance $C_L$. Using a distributed RC Elmore delay model, the delay $D$ of a signal passing through the line can be written as follows:

$$D = R_D (C_{\text{IC}} + \int_0^L c_i(x) dx) + \int_0^L r_i(x) \left[ \int_0^L c_i(x) dx + C_{\text{IC}} \right] dx$$

(3)

By applying the temperature-dependent resistance, the signal delay is:

$$D = D_0 + (c_{D} + c_0) \sigma \int_0^L r_i(x) dx - c_{D} \beta \int_0^L T(x) dx$$

(4)

where:

$$D_0 = R_D (C_{\text{IC}} + c_0 L) + \sigma \int_0^L T(x) dx$$

$D_0$ is the Elmore delay (at 0 °C) when the effect of temperature on the line resistance is neglected. Figure 5 shows the delay degradation due to temperature increase in an interconnect for various lengths using the parameters listed in Table 1 (assuming a constant thermal profile). In reality, and especially for long global lines, the thermal profile along the length of an interconnect is non-uniform. Figure 6 shows the importance of the line temperature non-uniformities on the delay by considering two different exponential thermal profiles depicted in Figure 4 for two different lengths. This shows that the assumption of a constant temperature along the wire (with peak-value) can introduce a large error in wire planning, wire routing, wire sizing and clock-skew analysis.

IV. Non-Uniform Temperature-Dependent Clock Skew

In addition to the performance degradation introduced by increasing temperature in the interconnect, the non-uniform thermal profile along upper layer interconnects has a major impact on the skew of the clock signal net. To ensure zero skew clock distribution, symmetric H-Tree structure or bottom-up merging are most commonly used [4]. In general, the top-level segments of the clock tree are long and assigned to the upper metal layers (Figure 7). As a result, they are exposed to the thermal non-uniformities in the substrate. Such non-uniformities result in different signal delays at the two ends of the clock tree segments, creating a non-zero skew in the tree. To ensure zero skew, we need to find the division point $x$ along the length of the global segment (Figure 8) such that when the clock signal driver is connected to that point, the delay at the two ends of the segment becomes the same. By considering propagation delay from the driver to the two ends $p$ and $q$ and using (4), the optimum length $T$ that guarantees zero clock skew can be obtained by solving the following equation:

$$\beta \int_0^T r(x) dx + \int_0^T T(x) dx = 0$$

(6)

where $A$ is a constant and can be written as follows:

$$A = \frac{1}{L_0} \int_0^L C_{\text{IC}} + \beta (L_0 + C_L) \int_0^L T(x) dx - C_{\text{IC}} \beta \int_0^L T(x) dx$$

(7)

Given circuit parameters $L$, $C_{\text{IC}}$, $C_L$, and $\beta$, one can easily compute the constant $A$ and solve (6) to obtain the optimum position for the clock signal connection to the net segment. From (6), it is seen that with a constant thermal profile $T(x)$ along the length of the interconnect, we can guarantee a zero skew by connecting the clock signal at $x=L/2$. In fact, even a non-uniform but symmetrical thermal profile with the symmetry axis at $L/2$ results in a zero clock skew when the driver is connected to the middle of the line. We also see that a gradually decreasing (increasing) thermal profile along the length of the line from $0$ to $L$ (from $p$ to $q$), results in an optimum length $T$ less than (greater than) $L/2$. Table 2 shows the behavior of temperature-dependent clock skew for a 2000 µm line by applying three different interconnect thermal profiles. The third set of profiles approximates a hot spot along the length of an interconnect by using a constant-peak normal distribution with peak $T_{\text{max}}$ (°C) at $100$, mean $m$ (µm) and standard deviation $\sigma$ (µm). The reported normalized skew percentage in column 4 represents the ratio of the clock skew when $L=\text{L/2}$ over the delay from the driver to any endpoint of the interconnect when $l=L$. Hence,
the normalized skew percentage represents the skew if the location of the driver insertion into the clock line did not account for the non-uniform temperature profile along the interconnect length. These results show that neglecting the effects of thermal profiles on the clock tree delay fluctuations changes the normalized skew by as much as 10 percent and can dramatically affect the clock signal performance.

V. Conclusion

In conclusion, a detailed analysis of the impact of non-uniform chip temperature distributions on the signal integrity of global lines was presented using a new analytical distributed RC delay model that incorporates the non-uniform interconnect temperature dependency. It was shown that non-uniform temperature distributions along long global wires in high-performance ICs can have a significant impact on the interconnect performance and the worst-case clock skew. Finally, an analytical model that helps the designers cope with the non-uniformities in the chip temperature during the clock net routing has been presented for the first time.

References:

Table 1: Parameters used in this work.

<table>
<thead>
<tr>
<th>Parameter</th>
<th>Nominal value</th>
<th>Unit</th>
</tr>
</thead>
<tbody>
<tr>
<td>$\beta$</td>
<td>3E-03</td>
<td>°C/K</td>
</tr>
<tr>
<td>$r_{dd}(25^\circ C)$</td>
<td>0.077</td>
<td>$\Omega$</td>
</tr>
<tr>
<td>$c_{sh}$</td>
<td>0.2</td>
<td>$\Omega$</td>
</tr>
<tr>
<td>$w$</td>
<td>0.32</td>
<td>$\mu m$</td>
</tr>
<tr>
<td>$R_d$</td>
<td>10</td>
<td>$\Omega$</td>
</tr>
<tr>
<td>$C_d$</td>
<td>1000</td>
<td>$fF$</td>
</tr>
</tbody>
</table>

Table 2: Comparison between the different non-uniform substrate thermal profiles and their effects on clock skew.

<table>
<thead>
<tr>
<th>Thermal Profile</th>
<th>Parameters</th>
<th>$I^*$</th>
<th>Normalized Skew %</th>
</tr>
</thead>
<tbody>
<tr>
<td>$T(x)=ax+b$</td>
<td>$T_d=170$, $T_s=90$</td>
<td>1012</td>
<td>5.62</td>
</tr>
<tr>
<td>$T(x)=ax+b$</td>
<td>$T_d=170$, $T_s=110$</td>
<td>1012</td>
<td>5.98</td>
</tr>
<tr>
<td>$T(x)=ax+b$</td>
<td>$T_d=170$, $T_s=130$</td>
<td>1021</td>
<td>2.65</td>
</tr>
<tr>
<td>$T(x)=ax+b$</td>
<td>$T_d=170$, $T_s=150$</td>
<td>1012</td>
<td>1.29</td>
</tr>
<tr>
<td>$T(x)=a\cdot e^{bx}$</td>
<td>$T_d=170$, $T_s=90$</td>
<td>957.5</td>
<td>5.24</td>
</tr>
<tr>
<td>$T(x)=a\cdot e^{bx}$</td>
<td>$T_d=170$, $T_s=110$</td>
<td>968.66</td>
<td>3.63</td>
</tr>
<tr>
<td>$T(x)=a\cdot e^{bx}$</td>
<td>$T_d=170$, $T_s=130$</td>
<td>979.5</td>
<td>2.40</td>
</tr>
<tr>
<td>$T(x)=a\cdot e^{bx}$</td>
<td>$T_d=170$, $T_s=150$</td>
<td>989.7</td>
<td>1.19</td>
</tr>
</tbody>
</table>