Research of thermal sensor allocation and placement based on dual clustering for microprocessors

Li, Xin; Rong, Mengtian; Liu, Tao; Zhou, Liang

doi:10.1186/2193-1801-2-253

Research
Open access
Published: 04 June 2013

Research of thermal sensor allocation and placement based on dual clustering for microprocessors

Xin Li¹,
Mengtian Rong¹,
Tao Liu¹ &
…
Liang Zhou¹

SpringerPlus volume 2, Article number: 253 (2013) Cite this article

2749 Accesses
1 Citations
Metrics details

Abstract

Dynamic thermal management techniques employ a set of on-chip thermal sensors to measure runtime thermal behavior of microprocessors so as to prevent the on-set of high temperatures. Therefore, effective analysis of thermal behavior and determination of the best allocation and placement of thermal sensors directly impact the effectiveness of the dynamic thermal management mechanisms. In this paper, we propose systematic and effective techniques for determining the fewest number of thermal sensors and the optimal locations based on dual clustering to provide a high fidelity thermal monitoring. Initially, we utilize the dual clustering algorithm to devise method that can reduce the number of sensors to a great extent while satisfying an expected accuracy. Then we identify an optimal physical location for each sensor such that the sensor’s attraction towards steep thermal gradient is maximized. Experimental results indicate the superiority of our techniques and confirm that our proposed methods are capable of creating a sensor distribution for a given microprocessor architecture using the number of thermal sensors of 2, 8, 15, 24, 35, depending on different expected hot spot temperature error accuracy of 5%, 4%, 3%, 2%, 1%, respectively.

Introduction

Large-scale circuit integration and exponentially increasing power densities have resulted in high temperature in current microprocessors. Elevated chip temperature slows down transistor speed and increases interconnect delays (Brooks et al. 2007). The results of these trends are timing failures and thermal runaway (Lin & Banerjee 2008). Therefore, effective assessment and analysis of the thermal behavior of microprocessors have become a major issue to be considered.

Traditionally, the problem of temperatures on chips has been solved by employing dynamic thermal management techniques (Jayaseelan & Mitra 2009) which use a set of on-chip thermal sensors that continuously monitor temperatures at a few selected die locations during the runtime. The most well-known dynamic thermal management techniques include clock gating, dynamic voltage and frequency scaling (DVFS) (Hanson et al. 2007). Several microprocessors have been equipped with thermal sensors. For instance, AMD Opteron employs 38 thermal sensors (Zhang & Srivastava 2009; Zhang & Srivastava 2011) that trigger alarms if the junction temperature exceeds a specified limit (Coskun et al. 2008).

Moreover, accuracy is another crucial criterion for dynamic thermal management techniques. Overestimation of temperature results in spurious alerts that lead to unnecessary triggering of thermal control mechanisms, e.g., DVFS (Long et al. 2008; Memik et al. 2008). On the other hand, underestimation of temperature greatly reduces the reliability since the processor will continue to operate at a higher temperature than its rated operating condition (Long et al. 2008; Memik et al. 2008). Embedding a large number of thermal sensors on the die is an unadvisable option to increase the accuracy. In fact, chips need to use the fewest number of thermal sensors to reduce manufacturing costs, die area and design complexity. In addition, allocating arbitrarily large number of sensors employed by the monitoring infrastructure, constructing the sensor networks will also pose a challenge (Long et al. 2008). An ideal goal is to monitor the highest temperatures on a microprocessor with allocating a minimum number of thermal sensors. As a result, how to provide accurate thermal monitoring in a given system while maintaining a reasonable number of sensors becomes crucial.

In this paper, we propose systematic and effective techniques for determining the fewest number of thermal sensors and the optimal locations based on dual clustering algorithm to provide a high fidelity thermal monitoring.

The organization of this paper is as follows. Related Work section overviews some of the recent relevant methods in the literature. In Proposed Thermal Sensor Allocation and Placement Techniques section we provide an overview of our methodology, where we introduce the thermal gradient calculation method in Thermal Gradient Calculation section and propose effective technique for thermal sensor allocation based on the dual clustering algorithm in Sensor Allocation Scheme section, and in Sensor Placement Strategies section we identify an optimal strategy for thermal sensor placement. We demonstrate the effectiveness of our methods through an extensive set of experimental results in Experimental Results section. Finally, Conclusion section summarizes the main conclusions of this work and indicates directions for future work.

Related work

It is intriguing to observe that several recent studies aiming to address thermal sensor allocation problem and reconstruct the full thermal characterization seemed to have a few works. For thermal sensor allocation and full thermal reconstruction, some representative techniques have been proposed shown in Table 1.

Table 1 Overview of related works

Full size table

Proposed thermal sensor allocation and placement techniques

Although it is a clear trend in elevating the number of thermal sensors in high performance microprocessors (Long et al. 2008), allocating the number of sensors arbitrarily will create several overheads as mentioned earlier. Reducing the number of sensors may help relieve these overheads. However, this will cause inaccuracies. Our goal is to provide accurate thermal monitoring while maintaining a reasonable number of sensors. In this section we first introduce the thermal gradient calculation method, and then we propose systematic and effective thermal sensor allocation and placement techniques to overcome this challenge.

Thermal gradient calculation

Thermal gradient describes that in which direction and at what rate the temperature changes the most rapidly around a particular location. The magnitude of the thermal gradient determines how fast the temperature changes in the corresponding direction rather than the value of the temperature at the measuring point.

Any representation in computer memory must be discretized, we utilize the classical Sobel operator (Wang 2009) to calculate an approximation of the gradient of the thermal map. At each point in the thermal map, the result of the Sobel operator is either the corresponding gradient vector or the norm of this vector. The Sobel operator is implemented using the following two 3 × 3 matrixes which are convolved with the original thermal map to calculate approximations of the derivatives: one for horizontal changes, and the other for vertical.

\begin{array}{c} M_{x} = [\begin{array}{c} - 1 & 0 & 1 \\ - 2 & 0 & 2 \\ - 1 & 0 & 1 \end{array}] & and & M_{y} = [\begin{array}{c} - 1 & - 2 & - 1 \\ 0 & 0 & 0 \\ 1 & 2 & 1 \end{array}] \end{array}

(1)

If we define T as the source thermal map, at each point in the thermal map, the approximation of the magnitude of the thermal gradient is expressed as follows:

G = \sqrt{{(M_{x} * T)}^{2} + {(M_{y} * T)}^{2}}

(2)

where ‘*’ here denotes the 2-dimensional convolution operation.

. (Memik et al 2008) indicated that the thermal gradient around a high-temperature location is larger than that at a low-temperature point. However, our experimental results find that the thermal gradient at one point has no relation with its own temperature. For example, we simulated the bzip2 benchmark (Henning 2000) using the experimental flow shown in Simulation Infrastructure section. Figure 1 (a) exhibits the full thermal characterization, and Figure 1 (b) shows the thermal gradient distribution calculated by Sobel operator. It’s observed that the RUU block has relatively high temperatures, while attaining lower values in thermal gradient distribution.

Sensor allocation scheme

In general, placing sensors at the hot spot locations for one application will cause large temperature errors for other applications (Memik et al. 2008; Mukherjee & Memik 2006). Our objective is to address this deficiency by systematically analysis of thermal maps across a wide set of applications. We formulate the sensor allocation problem as a dual clustering of the points of interest in the spatial and non-spatial domains. We try to partition the hot spot data set into several groups, so that these groups form nonoverlapping compact regions in the spatial domain while minimizing the dissimilarity of the data points in a group on the non-spatial domain (Lin et al. 2005). Then, each group will be allocated one sensor, which will monitor the hot spot points associated with that group. In the remaining part of this section, we will first briefly introduce the basic concept of the dual clustering. Based on the dual clustering, we propose an effective sensor allocation algorithm.

Dual clustering

The dual clustering (Jiao et al. 2011) can be defined as: given a set of objects {o ₁, o ₂, …, o _n}, each object has two attribute domains, i.e., spatial domain and non-spatial domain, as shown in Equation 3.

o_{n} = \{g_{n}^{(1)}, \dots, g_{n}^{(L)}, a_{n}^{(1)}, \dots, a_{n}^{(T)}\}

(3)

Where $\{g_{n}^{(1)}, \dots, g_{_{n}}^{(L)}\}$ is the spatial location (L is usually set to 1, 2 or 3), and $\{a_{n}^{(1)}, \dots, a_{n}^{(T)}\}$ is the non-spatial attributes (T is the number of non-spatial attributes). The spatial distance between two objects is defined as Euclidean distance, and the non-spatial distance between two objects is given by Equation 4.

D_{ij}^{(A)} = \sqrt{\sum_{t = 1}^{T} w_{t} {(a_{i}^{(t)} - a_{j}^{(t)})}^{2}}

(4)

Where $D_{ij}^{(A)}$ is the non-spatial distance between object i and object j, $a_{i}^{(t)}$ and $a_{j}^{(t)}$ represent the values of attribute t for object i and object j, w _t is the weight of attribute t, and $\sum_{t = 1}^{T} w_{t} = 1$ .

Dual clustering is the process of partitioning the object data set into several groups, while clustering dispersion in the non-spatial domain is less than the given threshold and each group is a connective cluster (Jiao et al. 2011). The result of dual clustering should be spatial continuous and attributively aggregative.

Sensor allocation algorithm

Based on dual clustering, we devise an effective sensor allocation algorithm. Initially, we construct a Voronoi diagram (Bhattacharya & Gavrilova 2007) according to the locations of all the hot spots on the die. After that, the hot spot fields are divided into subregions of Voronoi cells, and each stationary hot spot node is within a Voronoi cell shown in Figure 2.

Definition 1. If the Voronoi cells of two hot spots share a Voronoi edge (have more than a single point in common), then the two hot spots are considered neighbor, i.e., the hot spots H2, H3, H4, H6 and H8 are Voronoi neighbors of hot spot H1 in Figure 2.

Definition 2. Set the number of non-spatial attributes T to 1 and the non-spatial attribute is defined as the temperature of hot spot.

Definition 3. If two hot spots are neighbor to each other and the non-spatial distance between them is less than the given threshold $D_{\max}^{(A)}$ , then the Voronoi cells of the two hot spots are merged into a new cluster, i.e., in Figure 2, the Voronoi cells of hot spot H1 and H6 are merged into a cluster when $D_{{_{H}}_{1, H 6}}^{(A)} < D_{_{\max}}^{(A)}$ .

Definition 4. Set the threshold of non-spatial distance to:

D_{max}^{(A)} = α \times ϵ_{max} \times \frac{1}{n} \sum_{i = 1}^{n} a_{i}

(5)

Where α is a correction coefficient, ϵ _max is an expected hot spot temperature error accuracy, n is the number of hot spots in a cluster and a _i is the value of non-spatial attribute at each hot spot in a cluster.

Our sensor allocation algorithm can be presented as follows:

1.
Select a hot spot with maximum value of thermal gradient as initial cluster center.
2.
Apply the definition 3 to obtain a new cluster C _new.
3.
Set the hot spots in cluster C _new to new cluster centers and go to Step 2.
4.
If the cluster C _new cannot be merged with other cells, it is defined as an integrated cluster, then allocate one sensor to it.
5.
Perform the Step 1–4 in residual hot spots until each hot spot belong to a certain cluster.

The details of sensor allocation algorithm are shown in Figure 3.

Sensor placement strategies

Once we finish the hot spot clustering, the allocation number of sensors is determined. Then we need to determine the physical location of thermal sensors. In this section we identify two different strategies for thermal sensor placement.

Geometric-Center Sensor Placement. In this strategy, a sensor is placed at the geometric center of each cluster region.

As we know, ideal thermal sensor placement methods that focus on placing sensors only near potential locations which have the highest absolute temperatures will achieve the best results for hot spot temperature estimation. However, these methods might lead to poor results for full thermal reconstruction as they will have no information at the locations which temperatures change the most rapidly. Thus, we choose thermal gradient, instead of absolute temperature, as the base for sensor placement method. We propose here another strategy-which is inspired by improved k-means clustering method (Memik et al. 2008; Mukherjee & Memik 2006)-that takes into account the diversity of thermal gradients within a cluster.

Thermal-Gradient-Attraction Sensor Placement. The basic idea behind this strategy is to move the sensors closer to the relatively higher thermal gradient hot spots. This is equivalent to the sensor being attracted to the hot spots with high thermal gradient values by a larger force (Memik et al. 2008; Mukherjee & Memik 2006). The details of this strategy are described as follows:

For each addition of hot spot h _i of cluster C _j, the sensor coordinates are the cumulative sum of the corresponding member coordinates. The cumulative sum computation is shown in Equation 6.

\begin{array}{l} s_{jx, y} = & s_{jx, y} + h_{ix, y} + β (h_{ix, y} - s_{jx, y} / n_{iteration}) \\ \times (h_{ig} - s_{jg} / n_{iteration}) \\ s_{jg} = & s_{jg} + h_{jg} \end{array}

(6)

Where s _{jx, y}, h _{ix, y} and s _jg, h _ig are the coordinates and thermal gradient of sensor s _j and hot spot h _i, respectively. n _iteration is the number of iterations, and β is an attraction coefficient. We have determined experimentally that an attraction coefficient value β = 0.3 performs best. The (x, y) coordinates of the sensor s _j are closer to the hot spot h _i if the g dimension of s _j is less than that of h _i, otherwise the sensor moves further from the position of h _i. The illustration for the thermal gradient attraction approach in the n + 1 iteration is shown in Figure 4 (γ = β(h _ig − s _jg/n _iteration)). After iterating over all the hot spots in cluster C _j, the final position of s _j is updated as shown in Equation 7.

s_{jx, y} = s_{jx, y} / size (C_{j})

(7)

Experimental results

In the following two sections we first describe our experimental methodology and then we present our results.

Simulation infrastructure

To evaluate the effectiveness of our methods, we design an experimental flow that simulates thermal distribution for a 65 nm microprocessor based on Alpha EV6 architecture. We first give the definition of power consumption (Shauly 2012) and then we describe our experimental flow.

Power consumption

There are two main components that constitute the power used by a CMOS integrated circuit: static power and dynamic power. Static power essentially consists of the power used when the transistor is not in the process of switching. Typically, CMOS technology has been praised for its low static power. However, as devices are scaled, gate oxide thicknesses decrease and there is increased probability of tunnelling, resulting in larger and larger leakage currents. Therefore, static power (also called leakage power) dissipation will become increasingly significant. Dynamic power is the sum of transient power consumption and capacitive load power consumption. The total power dissipation is summarized as shown in Equation 8:

\begin{array}{l} P_{total} & = P_{dynamic} + P_{static} \\ = (P_{short} + P_{switch}) + P_{static} \\ = I_{SC} V_{dd} + ψ C_{L} V_{dd}^{2} f + I_{leakage} V_{dd} \end{array}

(8)

P _short is the power consumed during gate voltage transient time, that in CMOS technology is only related to the direct path short circuit current (I _SC) which flows when both the NMOS and PMOS transistors are simultaneously active, conducting current directly from supply to ground. Significant short circuit power dissipation can be avoided if the output rise/fall time of a gate is much longer than the input rise/fall time. P _switch refers to the dynamic component of power, where C _L is the total loading capacitance, f is the clock frequency, and ψ is the average switching activity factor. P _static is due to the leakage current I _leakage. Imperfect cut-off of the transistor leads to leakage (I _leakage) and power dissipation (P _static) even without any switching activity.

Experimental flow

The complete experimental flow shown in Figure 5 is performed using the following tools:

We use the Alpha EV6 as our base processor (Kessler 1999) with a 3 GHz clock frequency. The Alpha EV6 is an out-of-order speculative execution core that is commonly used as a test-bench core in thermal management research.
For workloads, we simulated the SPEC2000 benchmark (13 floating points and 12 integer benchmarks) suite (Henning 2000), using Simple Scalar (Burger & Austin 1997) 3.0e. The Simple Scalar simulates a superscalar processor with out-of-order issue and execution. For each application, we simulated 10 million instructions.
For dynamic power estimation, we use Wattch (Brooks et al. 2000), a power simulator for analyzing and calculating microprocessor power dissipation at the architecture-level. We integrate the Wattch power model into Simple Scalar simulator in order to gain the power statistics in each time interval. For each functional unit in the processor, we add an access counter to record the access information, which is fed into the Wattch power model to calculate the dynamic power traces. In our experiments, we assume clock gating to all components and that clock gating can reduce dynamic power by 75%, as proposed by Liao et al. (2005). For leakage power estimation of processor core units, we construct a leakage model (Liao et al. 2005) and use CACTI 5.0 (Wilton & Jouppi 1996) to accurately model cache leakage power.
We utilize HotSpot (Huang et al. 2006) version 5.0 for thermal simulation in the grid level (discretized into 128 × 128 grids). The floor-plan of Alpha EV6 and the workload power traces from Wattch are used as inputs to the HotSpot, and finally the steady-state temperatures for a set of grid locations can be produced as output. This type of grid level thermal modelling is useful for capturing spatial temperature variation within a processor unit. The initial temperature of processor, which represents the die temperature if the processor was already executing instructions prior to execution of benchmarks to model the warm up period, was assumed to be 60°C. The ambient temperature is set to 45°C. For 3GHz clock frequency, HotSpot calling interval of 10 K cycles gives the best trade-off between precision and overhead (Mukherjee & Memik 2006). The package assembly model in HotSpot, whose physical and thermal properties of all packaging layers are evaluated according to a practical packaged high-performance microprocessor shown in Table 2 (Lin et al. 2007), was also created shown in Figure 6.

Table 2 Dimensions and thermal properties of different package layers

Full size table

The point of interest for our experiments is the hottest point per component. For each benchmark, each component will exhibit a hot spot. As the location of this hot spot may change for different applications as verified by Memik et al. (2008), we first combine these locations to find the distribution of hot spots across different benchmarks. Figure 7 depicts the distribution of hot spots for each processor block, which was obtained from our simulations across the SPEC2000 benchmarks in the same Alpha EV6 architecture. Dotted lines represent the region of L2 cache blocks containing the hot spots. We partition the L2 cache into three regions: L2_left, L2_right and L2_bottom. Across 25 benchmarks and 20 different components of the processor, the theoretical number of block-level hot spots is 500. However, some hot spots reoccur due to correlation of activity and power density, and the temperatures of some hot spots are obviously lower than those of other hot spots in the same block, leaving us with 132 distinct points. Based on this distribution we make decisions of the allocation number and locations of sensors using our proposed thermal sensor allocation and placement techniques as described in Proposed Thermal Sensor Allocation and Placement Techniques section.

Results

Extensive experiments are conducted to examine the effectiveness of our proposed thermal sensor allocation and placement techniques. All experiments are implemented by MATLAB code and run on a Pentium 3.0 GHz PC with 1GB SDRAM. In our experiments we report the following three metrics:

Number of thermal sensors. Given a maximum allowable hot spot temperature error accuracy: for our proposed thermal sensor allocation and placement techniques, we determine the number of integrated cluster and each integrated cluster will be allocated one sensor; for improved k-means clustering technique (Memik et al. 2008), we iteratively perform the improved k-means clustering algorithm until the maximum hot spot estimation error is less than the given allowable hot spot temperature error (initially, set the value of k to 1). Finally, the value of k is the number of thermal sensors.
Hot spot estimation error. The computation of the hot spot estimation error is equal to the difference between the hot spot temperatures in the true temperature distribution signals as obtained by executing the experimental flow and the temperatures at the locations of the thermal sensors.
Full thermal reconstruction error. For each application, we reconstruct the full thermal characterization with the different strategies for thermal sensor placement, using the inverse distance weighting method based on a dynamic Voronoi diagram (Li et al. 2011). Then, we compute the average absolute temperature error between the true temperatures and the estimated temperatures calculated by the reconstruction method. We report the average absolute error computed for all 25 benchmarks.

In our first set of experiments, we determine the allocation number of thermal sensors while varying the maximum allowable hot spot temperature error accuracy from 1% to 5%. We compare three different methods for thermal sensor allocation and placement: improved k-means clustering (IKmC) (Memik et al. 2008), geometric-center (GC) and thermal-gradient-attraction (TGA). The plot in Figure 8 gives the allocation number of thermal sensors while satisfying different maximum allowable hot spot temperature error accuracy. Comparing the results, it’s observed that our proposed thermal gradient attraction method gives the fewest number of thermal sensors and all of our proposed methods significantly outperform the improved k-means clustering (Memik et al. 2008). Allocating arbitrarily large number of sensors will not only create a significant area overhead, but constructing the sensor networks will also pose a challenge. Thus, reduce the number of thermal sensors to a great extent while satisfying the maximum allowable hot spot temperature error accuracy is a desirable property for microprocessors.

In the second set of experiments we demonstrate that the maximum hot spot estimation temperature error obtained by our proposed sensor allocation and placement techniques is assuredly less than the corresponding maximum allowable hot spot temperature error. We repeat this experiment using different maximum allowable hot spot temperature errors and report the errors in hot spot estimation in Figure 9. The results are summarized in Table 3. The results show that our proposed sensor allocation and placement methods give close results to improved k-means clustering (Memik et al. 2008), while reducing the number of sensors to a great extent as shown in Figure 8. The difference between thermal gradient attraction and geometric center is that thermal gradient attraction method gives relatively poor results compared with geometric center method, while requiring a fewer number of thermal sensors.

Table 3 Hot spot temperature error and corresponding number of sensors using different sensor allocation and placement approaches

Full size table

The optimal value of correction coefficient α as a function of the maximum allowable hot spot temperature error is given in Figure 10. Note that the relationship between them is an increasing function. The reason is that when the maximum allowable hot spot temperature error increases, the non-spatial distance correspondingly increases and as the relationship between the threshold of non-spatial distance and correction coefficient is linear as defined in Sensor Allocation Algorithm section, the value of correction coefficient also increases. In addition, the optimal values of correction coefficient of thermal gradient attraction method are larger than those of geometric center method, which illuminates that thermal gradient attraction method achieves the hot spot estimation error limit bounded by the corresponding maximum allowable hot spot temperature error even with larger threshold of non-spatial distance.

The objective of our third set of experiments is to determine the full thermal reconstruction error while varying the maximum allowable hot spot temperature error accuracy from 1% to 5%. Figure 11 summarizes the errors in full thermal reconstruction and the corresponding allocation number of thermal sensors. It’s clear that thermal gradient attraction method gives the superior results: obtaining the least full thermal reconstruction error and requiring the fewest number of thermal sensors. The reason for the superior performance of the thermal gradient attraction strategy is that the sensor’s attraction towards steep thermal gradient is maximized. The difference between geometric center method and improved k-means clustering (Memik et al. 2008) is that geometric center method gives relatively poor results compared with improved k-means clustering (Memik et al. 2008), while requiring a fewer number of thermal sensors.

In conclusion, using our proposed thermal gradient attraction method, the allocation number of thermal sensors are 2, 8, 15, 24, 35, and the average absolute full thermal reconstruction errors are 9.34%, 4.12%, 2.88%, 1.65%, 0.91%, depending on different maximum allowable hot spot temperature error accuracy of 5%, 4%, 3%, 2%, 1%, respectively. These values confirm that our proposed thermal sensor allocation and placement techniques are capable of accurately characterizing the temperature of microprocessors, while requiring the fewest number of thermal sensors.

Conclusion

In this paper, we have proposed systematic and effective techniques for determining the fewest number of thermal sensors and the optimal locations based on dual clustering algorithm in a complex microprocessor system. Our goal is to provide accurate thermal monitoring while maintaining a reasonable number of sensors. We first develop method based on dual clustering algorithm that can reduce the number of sensors to a great extent while satisfying an expected accuracy. Then we identify an optimal physical location for each sensor such that the sensor’s attraction towards steep thermal gradient is maximized.

The effectiveness of our techniques has been evaluated on a sophisticated experimental setup. Experimental results indicate the superiority of our techniques and confirm that our proposed thermal sensor allocation and placement techniques are capable of accurately characterizing the temperature of microprocessors, while requiring the fewest number of thermal sensors. The significance of our techniques will allow dynamic thermal management scheme to implement the accurate temperature monitoring with small number of embedded thermal sensors-a desirable property for microprocessors.

Our future work will focus on investigating the impact of calibration errors in the thermal sensor measurements (Zhang & Srivastava 2009; Zhang & Srivastava 2011) on the results of our proposed methods.

References

Bhattacharya P, Gavrilova ML Proceedings of the 4th International Symposium on Voronoi Diagrams in Science and Engineering. In Voronoi diagram in optimal path planning. Pontypridd, Wales, UK: IEEE Computer Society; 2007:38-47. 10.1109/ISVD.2007.43
Google Scholar
Brooks D, Tiwari V, Martonosi M Proceedings of the 27th International Symposium on Computer Architecture. In Wattch: A Framework for Architectural-Level Power Analysis and Optimizations. Vancouver, BC, Canada: IEEE Computer Society; 2000:83-94. 10.1145/342001.339657
Google Scholar
Brooks D, Dick RP, Joseph R, Shang L: Power, thermal, and reliability modelling in nanometer-scale microprocessors. IEEE Micro 2007, 27(3):49-62. 10.1109/MM.2007.58
Article Google Scholar
Burger DC, Austin TM: The simple scalar tool set, version 2.0. ACM SIGARCH Comput Arch News 1997, 25(3):13-25. 10.1145/268806.268810
Article Google Scholar
Cochran R, Reda S Proceedings of the 46th Annual Design Automation Conference. In Spectral Techniques for High-Resolution Thermal Characterization with Limited Sensor Data. San Francisco, California, USA: ACM; 2009:478-483. 10.1145/1629911.1630037
Google Scholar
Coskun AK, Rosing TS, Gross KC Proceedings of the International Symposium on Low Power Electronics and Design. In Proactive Temperature Management in MPSoCs. Bangalore, India: ACM; 2008:213-218. 10.1145/1393921.1393966
Google Scholar
Hanson H, Keckler SW, Ghiasi S, Rajamani K, Rawson F, Rubio J Proceedings of the 2007 International Symposium on Low Power Electronics and Design. In Thermal Response to DVFS: Analysis with an Intel Pentium M. Portland, Oregon, USA: ACM; 2007:219-224. 10.1145/1283780.1283827
Google Scholar
Henning J: SPEC CPU2000: Measuring CPU performance in the new millennium. IEEE Comput 2000, 33(7):28-35. 10.1109/2.869367
Article Google Scholar
Huang W, Ghosh S, Velusamy S, Sankaranarayanan K, Skadron K, Stan MR: HotSpot: A compact thermal modelling methodology for early-stage VLSI design. IEEE Trans VLSI Syst 2006, 14(5):501-513. 10.1109/TVLSI.2006.876103
Article Google Scholar
Jayaseelan R, Mitra T Proceedings of the 46th Annual Design Automation Conference. In Dynamic Thermal Management via Architectural Adaptation. San Francisco, California, USA: ACM; 2009:484-489. 10.1.1.151.4312
Google Scholar
Jiao LM, Liu YL, Zou B: Self-organizing dual clustering considering spatial analysis and hybrid distance measures. Sci China Earth Sci 2011, 54(8):1268-1278. 0.1007s11430-011-4222-1
Article Google Scholar
Kessler RE: The Alpha 21264 microprocessor. IEEE Micro 1999, 19(2):24-36. 10.1109/40.755465
Article Google Scholar
Li X, Rong M, Liu T, Zhou L: Inverse distance weighting method based on a Dynamic Voronoi Diagram for thermal reconstruction with limited sensor data on multiprocessors. IEICE Trans Electron 2011, E94-C(8):1295-1301. 10.1587/transele.E94.C.1295
Article Google Scholar
Liao W, He L, Lepak KM: Temperature and supply voltage aware performance and power modelling at microarchitecture level. IEEE Trans Comput-Aided Design of Integr Circuits and Syst 2005, 24(7):1042-1053. 10.1109/TCAD.2005.850860
Article Google Scholar
Lin SC, Banerjee K: Cool chips: opportunities and implications for power and thermal management. IEEE Trans Electron Devices 2008, 55(1):245-255. 10.1109/TED.2007.911763
Article Google Scholar
Lin CR, Liu KH, Chen MS: Dual clustering: integrating data clustering over optimization and constraint domains. IEEE Trans Knowl Data Eng 2005, 17(5):628-637. 10.1109/TKDE.2005.75
Article Google Scholar
Lin SC, Chrysler G, Mahajan R, De VK, Banerjee K: A self-consistent substrate thermal profile estimation technique for nanoscale ICs-Part II: Implementation and implications for power estimation and thermal management. IEEE Trans Electron Devices 2007, 54(12):3351-3360. 10.1109/TED.2007.909038
Article Google Scholar
Long J, Memik SO, Memik G, Mukherjee R: Thermal monitoring mechanisms for chip multiprocessors. ACM Trans Arch Code Optimization 2008, 2(5):9:1-9:33. 10.1145/1400112.1400114
Google Scholar
Memik SO, Mukherjee R, Ni M, Long J: Optimizing thermal sensor allocation for microprocessors. IEEE Trans Comput-Aided Design Integr Circuits 2008, 27(3):516-527. 10.1109/TCAD.2008.915538
Article Google Scholar
Mukherjee R, Memik SO Proceedings of the 43rd annual Design Automation Conference. In Systematic Temperature Sensor Allocation and Placement for Microprocessors. San Francisco, California, USA: ACM; 2006:542-547. 10.1145/1146909.1147051
Google Scholar
Nowroz AN, Cochran R, Reda S Proceedings of the 47th Design Automation Conference. In Thermal Monitoring of Real Processors: Techniques for Sensor Allocation and Full Characterization. Anaheim, California, USA: ACM; 2010:56-61. 10.1145/1837274.1837291
Google Scholar
Reda S, Cochran R, Nowroz AN: Improved thermal tracking for processors using hard and soft sensor allocation techniques. IEEE Trans Comput 2011, 60(6):841-851. 10.1109/TC.2011.45
Article Google Scholar
Shauly EN: CMOS leakage and power reduction in transistors and circuits: process and layout considerations. J Low Power Electron Appl 2012, 2(1):1-29. 10.3390/jlpea2010001
Article Google Scholar
Wang W Proceedings of the 2009 International Joint Conference on Artificial Intelligence. In Reach on Sobel Operator for Vehicle Recognition. Hainan Island: IEEE Computer Society; 2009:448-451. 10.1109/JCAI.2009.54
Google Scholar
Wilton S, Jouppi NP: CACTI: an enhanced Cache Access and Cycle Time Model. IEEE J Solid State Circuits 1996, 31(5):677-688. 10.1109/4.509850
Article Google Scholar
Zhang Y, Srivastava A Proceedings of the 46th Annual Design Automation Conference. In Accurate Temperature Estimation Using Noisy Thermal Sensors. San Francisco, California, USA: ACM; 2009:472-477. 10.1145/1629911.1630036
Google Scholar
Zhang Y, Srivastava A: Accurate temperature estimation using noisy thermal sensors for Gaussian and non-Gaussian cases. IEEE Trans Very large Scale Integr Syst 2011, 19(9):1617-1626. 10.1109/TVLSI.2010.2051567
Article Google Scholar

Download references

Acknowledgement

This work was supported by the National Basic Research Program of China under Grant 2009CB320206, by the National Natural Science Foundation of China under Grant 60821062 of China.

Author information

Authors and Affiliations

Key Laboratory of Ministry of Education of China for Research of Design and Electromagnetic Compatibility of High Speed Electronic Systems, Shanghai Jiao Tong University, Shanghai, 200240, China
Xin Li, Mengtian Rong, Tao Liu & Liang Zhou

Authors

Xin Li
View author publications
You can also search for this author in PubMed Google Scholar
Mengtian Rong
View author publications
You can also search for this author in PubMed Google Scholar
Tao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Liang Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xin Li.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

XL designed the project, carried out research and drafted the manuscript. MR worked on the related work research and contributed to project design. TL set up the simulation infrastructure and performed the experiments. LZ coordinated the research effort and revised the drafted manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Li, X., Rong, M., Liu, T. et al. Research of thermal sensor allocation and placement based on dual clustering for microprocessors. SpringerPlus 2, 253 (2013). https://doi.org/10.1186/2193-1801-2-253

Download citation

Received: 06 December 2012
Accepted: 21 May 2013
Published: 04 June 2013
DOI: https://doi.org/10.1186/2193-1801-2-253

Research of thermal sensor allocation and placement based on dual clustering for microprocessors

Abstract

Introduction

Related work

Proposed thermal sensor allocation and placement techniques

Thermal gradient calculation

Sensor allocation scheme

Dual clustering

Sensor allocation algorithm

Sensor placement strategies

Experimental results

Simulation infrastructure

Power consumption

Experimental flow

Results

Conclusion

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords