An Efficient Kriging-based Constrained Multiobjective Evolutionary Algorithm for Analog Circuit Synthesis via Self-adaptive Incremental Learning

> Sen Yin, Wenfei Hu, Wenyuan Zhang, Ruitao Wang, Jian Zhang and Yan Wang School of Integrated Circuits, Tsinghua University, China

#### Bio

- Sen Yin
- Tsinghua University



2014.9-2018.6 received the B.S. degree in Wuhan University, Wuhan, China

2018.9-present pursuing the Ph.D. degree at the Institute of Microelectronics, Tsinghua University, Beijing, China.

My current research interests include analog/RF circuit automatic synthesis, RF passive component modeling and the optimization of system-level circuits .

- 1. Introduction
- 2. Motivation
- 3. Proposed SILE algorithm
- 4. Experimental results
- 5. Conclusion



- 1. Introduction
- 2. Motivation
- 3. Proposed SILE algorithm
- 4. Experimental results
- 5. Conclusion





- The key problem in analog circuit synthesis: circuit sizing
- We concentrate on analog circuit sizing, multi-objective optimization.

- Most of the work concentrates on the synthesis of analog circuits at block-level.
   However, few results are reported on the optimization of an analog system.
- The optimization of an analog system



Low efficiency of multi-objective optimization is the bottleneck to optimize the system.

[1] Georges GE Gielen. Modeling and analysis techniques for system-level architectural design of telecom front-ends.IEEE Transactions on Microwave Theory and Techniques, 50(1):360–368, 2002. [2] Fábio Passos, Elisenda Roca, Javier Sieiro, Rafaella Fiorelli, Rafael Castro-López, José Mar 1a López-Villegas, and Francisco V Fernández. A multilevel bottom-up optimization methodology for the automated synthesis of RF systems.IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 39(3):560–571, 2019.

- Multi-objective analog circuit sizing
- 1.model-based methods, 2.simulation-based methods



- Multi-objective analog circuit sizing
- 1.model-based methods, 2.simulation-based methods



• High accuracy. Require a large number of time-consuming simulations.

- Multi-objective analog circuit sizing
- Online model-based methods



• High accuracy with less number of simulations

- 1. Introduction
- 2. Motivation
- 3. Proposed SILE algorithm
- 4. Experimental results
- 5. Conclusion



# Motivation

 Multi-objective Bayesian optimization (MOBO)<sup>[3]</sup> is one of the most representative algorithms in online model-based methods

۲



# Motivation

- There are two problems remains to be solved in online model-based methods.
- 1. The time spent on the model is comparable to or even exceed the simulation time in online model-based methods.
- 2. Most online model-based methods can not deal with constrained problems.



• The motivation of this work is to develop an online model-based method which shortens the time spent on the model.

- 1. Introduction
- 2. Motivation
- 3. Proposed SILE algorithm
- 4. Experimental results
- 5. Conclusion



- Time complexity of Kriging model
- Training process: the goal is to find the fittest hyperparameters  $\hat{\theta}$  for observed points. The time complexity is  $O(T_1 \cdot n^3)$ .  $T_1$  is the number of evaluating (1)

$$\hat{\theta} = \operatorname{argmax} \left( -\frac{n}{2} \ln \hat{\sigma}^2 - \ln |\mathbf{R}| \right)$$

$$\begin{cases} \hat{\mu} = \frac{\mathbf{1}^T \mathbf{R}^{-1} \mathbf{y}}{\mathbf{1}^T \mathbf{R}^{-1} \mathbf{1}} \\ \hat{\sigma} = \frac{(\mathbf{y} - \mathbf{1}\hat{\mu}) \mathbf{R}^{-1} (\mathbf{y} - \mathbf{1}\hat{\mu})}{n} \\ R(\mathbf{x}, \mathbf{x}') = \exp\left( -\Sigma_{i=1}^d \theta_i (x_i - x_i')^2 \right) \end{cases}$$
(1)

• Prediction process: Given estimated  $\hat{\theta}$  and calculated  $R^{-1}$ , one can predict the performances at any untested point. The time complexity is  $O(T_2 \cdot n^2)$ .

$$\begin{cases} \hat{y}(x) = \hat{\mu} + r^T R^{-1} (y - 1\hat{\mu}) \\ \hat{s}^2(x) = \hat{\sigma}^2 [1 - r^T R^{-1} r + \frac{(1 - 1^T R^{-1} r)^2}{1^T R^{-1} 1}] \end{cases}$$

• We propose an efficient Kriging-based constrained multi-objective evolutionary algorithm for analog circuit synthesis via self-adaptive incremental learning (SILE).



- Incremental learning technique
- How to calculate new  $\tilde{R}^{-1}$  from  $R^{-1}$  of the old model?

$$\tilde{R} = \begin{bmatrix} R_{11} & R_{12} & \cdots & R_{1n} & R_{1(n+1)} & R_{2(n+1)} \\ R_{21} & R_{22} & \cdots & R_{2n} & R_{2(n+1)} \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ \frac{R_{n1} & R_{n2} & \cdots & R_{nn} & R_{n(n+1)} \\ R_{(n+1)1} & R_{(n+1)2} & \cdots & R_{(n+1)n} & R_{(n+1)(n+1)} \end{bmatrix}$$
$$= \begin{bmatrix} \frac{R}{A^T} & \frac{A}{B} \end{bmatrix}$$
$$\tilde{R}^{-1} = \begin{bmatrix} R^{-1} + R^{-1} A C^{-1} A^T R^{-1} & -R^{-1} A C^{-1} \\ -C^{-1} A^T R^{-1} & C^{-1} \end{bmatrix}$$
$$C = B - A^T R^{-1} A$$

- C is a 1×1 matrix. The time complexity of computing  $C^{-1}$  is O(1).
- Now that  $R^{-1}$  is known, the time complexity of  $\tilde{R}^{-1}$  is  $O(n^3) \rightarrow O(n^2)$

- $\tilde{R}$  is a symmetry positive definite matrix  $\leftrightarrow \tilde{R} = \tilde{L}\tilde{L}^T$
- How to calculate new  $\tilde{L}$  from L of the old model?



- the time complexity of  $L^{-1}A$  is  $O(n^2)$  by using back substitution method
- The time complexity of  $Chol(B A^T L^{-T} L^{-1} A)$  is O(1).
- Now that *L* is known, the time complexity of  $\tilde{L}^{-1}$  is  $O(n^3) \rightarrow O(n^2)$  17

- Self-adaptive strategy
- 1. In most cases, we build models with incremental Kriging model without updating hyperparameters. We only learn hyperparameters with Kriging model under a specific number of simulations.
- 2. In the early stage of optimization, we need to update hyperparameters more frequently.
- 3. In the later stage of optimization, we lower the frequency to update hyperparameters.

- Self-adaptive strategy
- As a result, hyperparameters are updated every *H* generations. *H* is adaptively adjusted based on the number of simulations, *N*

$$H = \left[\frac{N - \lambda}{N_{\max} - \lambda} (H_{\max} - H_{\min})\right] + H_{\min}$$

- $\lambda$ : the initial sample number  $N_{\text{max}}$ : the maximum number of simulations
- $H_{\text{max}}$ : upper bound for H  $H_{\text{min}}$ : lower bound for H
- For  $\lambda = 100$ ,  $N_{\text{max}} = 200$ ,  $H_{\text{min}} = 10$ ,  $H_{\text{max}} = 19$ , hyperparameters only update when N = 100, 110, 121, 133, 146, 160, 175, 192.

• Self-adaptive strategy



- The area under a curve is the total training time in the process of optimization
- At small spikes, we use Kriging model to optimize hyperparameters
- In the rest number of simulations, incremental Kriging model is used

#### The framework of prescreening



- More details
- x is said to constraint-dominate y if the following condition holds:
- 1) if CV(x) = 0 and  $CV(y) = 0, \forall i \in \{1, 2, ..., m\}$  such that  $f_i(x) \le f_i(y)$  and  $\exists j \in \{1, 2, ..., m\}$  such that  $f_i(x) < f_i(y)$
- 2) otherwise, CV(x) < CV(y)

$$CV(x) = \sum_{i=1}^{p} \max(g_i(x), 0)$$

- 1. Introduction
- 2. Motivation
- 3. Proposed SILE algorithm
- 4. Experimental results
- 5. Conclusion



- A two-stage amplifier, 11 design variables, 180nm
- 15 corners: -40°C, 27°C, 85°C and tt, ss, ff, fs, sf

 $\begin{array}{l} minimize(-Gain, -UGBW, -PM)\\ s.t. \ PM > 60^{\circ} \end{array}$ 



• A two-stage amplifier, 11 design variables, 180nm

| Algorithm         | SILE     | MOBO           | NSGA-II  | MOEA/D   |
|-------------------|----------|----------------|----------|----------|
| Max Gain(dB)      | 81.58    | 81.33          | 80.31    | 81.17    |
| Max UGBW(MHz)     | 19.64    | 18.86          | 16.95    | 17.68    |
| Max PM(°)         | 93.10    | 92.71          | 92.84    | 85.99    |
| Mean HV           | 14821    | 13951          | 13880    | 13709    |
| Median HV         | 14726    | 14038          | 14231    | 13582    |
| Max HV            | 16268    | 14624          | 15209    | 16190    |
| Min HV            | 13484    | 12526          | 10597    | 12224    |
| N <sub>max</sub>  | 400      | 400            | 4000     | 4000     |
| Training time/s   | 56.54    | <b>1183.76</b> | N/A      | N/A      |
| Prediction time/s | 2.53     | 798.26         | N/A      | N/A      |
| Simulation time/s | 1423.399 | <b>.7%</b>     | 15051.81 | 15170.12 |
| Total time/s      | 1490.35  | 3476.15        | 15052.09 | 15172.64 |
|                   |          | <b>10X</b>     |          |          |

 Compared with MOBO, SILE reduce the training time by 95% and the prediction time by 99.7%. SILE shows a speedup of 10X in terms of the total time while achieving better results.

• A fully differential operational amplifier , 21 design variables, 65nm



 $\begin{array}{ll} \mbox{minimize} & (-Gain, -GBW) \\ \mbox{s.t.} & PM_{dm} > 60^{\circ} \\ & PM_{cm} > 50^{\circ} \\ & GBW_{cm} > 1.2GBW_{dm} \\ & SR > 100V/\mu s \\ & overshoot_c < 15\% \\ & tset_c < 50ns \\ & overshoot_r < 15\% \\ & tset_r < 50ns \end{array}$ 

• A fully differential operational amplifier , 21 design variables, 65nm



SILE reduces the training time by 95%, the prediction time by 99.8% while achieving much better PF. There is a 6X speedup over NSGA-II and MOEA/D regarding the total time.

- 1. Introduction
- 2. Motivation
- 3. Proposed SILE algorithm
- 4. Experimental results
- 5. Conclusion



# Conclusion

• We propose an efficient Kriging-based constrained multi-objective evolutionary algorithm for analog circuit synthesis via self-adaptive incremental learning.



 Experimental results on two real-world circuits demonstrate that compared with MOBO, our method can reduce the training time of Kriging model by 95% and the prediction time by 99.7%. Compared with NSGA-II and MOEA/D, the proposed method can achieve up to 10X speed up.

#### Thanks for your attention!