#### An 8.69 Mvertices/s 278 Mpixels/s Tile-based 3D Graphics SoC HW/SW Development for Consumer Electronics

Liang-Bi Chen, Ruei-Ting Gu, Wei-Sheng Huang, Chien-Chou Wang, Wen-Chi Shiue, Tsung-Yu Ho, Yun-Nan Chang, Shen-Fu Hsaio, Chung-Nan Lee, and Ing-Jer Huang

Department of Computer Science and Engineering, National Sun Yat-Sen University, Kaohsiung, TAIWAN, R.O.C.

#### Presenter: Liang-Bi Chen





## Block diagram of proposed 3DG SoC



IEEE/ACM ASP-DAC'09@Yokohama, Japan, Jan 19-22 2009



# Block diagram of GE and RE





## Chip specification and characteristics





| Chip Spec.                                          | Process Technology                                                                               | TSMC 0.18µm CMOS 1P6M                                                                                                                     |  |  |  |
|-----------------------------------------------------|--------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|
|                                                     | Supply Voltage                                                                                   | 1.8V for Core 3.3V for I/O Cells                                                                                                          |  |  |  |
|                                                     | Power Consumption                                                                                | 476.6mW (core power)                                                                                                                      |  |  |  |
|                                                     | Gate Count                                                                                       | 986.5K gates                                                                                                                              |  |  |  |
|                                                     | Die Size                                                                                         | 3.96×3.96mm <sup>2</sup>                                                                                                                  |  |  |  |
|                                                     | Package                                                                                          | PBGA 256 pin                                                                                                                              |  |  |  |
|                                                     | Operating Frequency                                                                              | 139MHz                                                                                                                                    |  |  |  |
| 3DG Engine                                          | 3DG Performance 8.69M vertices/s for GE, 278M<br>pixels/s for RE, 278M texels/s for<br>Texture   |                                                                                                                                           |  |  |  |
|                                                     | Texture Mapping                                                                                  | Bilinear perspective correction                                                                                                           |  |  |  |
|                                                     | 3DG Operation                                                                                    | Fully support OpenGL-ES 1.0<br>functions                                                                                                  |  |  |  |
|                                                     | Target Resolution                                                                                | Up to 640×480                                                                                                                             |  |  |  |
| Embedded<br>Debugging/<br>Performance<br>Monitoring | Bus Tracer                                                                                       | Provide 4 signal/timing abstraction<br>levels<br>Real-time compression                                                                    |  |  |  |
|                                                     | Performance Counting<br>Module                                                                   | Provide GE/RE/System bus<br>performance information, include<br>triangle rate, pixel rate, bus<br>idle/utilization/transfer rate and etc. |  |  |  |
| Engine                                              | Protocol Checker                                                                                 | Rule-based AHB protocol checker,<br>total 73 rules                                                                                        |  |  |  |
| Software<br>Features                                | OpenGL-ES API for provided hardware<br>Device driver for Linux 2.6.11<br>Nano-X Windowing System |                                                                                                                                           |  |  |  |
| Prototype                                           | ARM Versatile platform baseboard for ARM926ES-S+Logic Tile<br>FPGA                               |                                                                                                                                           |  |  |  |

IEEE/ACM ASP-DAC'09@Yokohama, Japan, Jan 19-22 2009



### Comparison of 3DG Chip performance

| 3DG Chips                      | Imai et al.<br>ISSCC2004         | Sohn et al.<br>ISSCC2005                      | Kim et al.<br>IEEE JSSC2006                                     | Kim et al.<br>ISSCC2007 | Nam et al.<br>ISSCC2007                                                 | Our Design                               |
|--------------------------------|----------------------------------|-----------------------------------------------|-----------------------------------------------------------------|-------------------------|-------------------------------------------------------------------------|------------------------------------------|
| Triangle rate<br>(Mvertices/s) | 4.7<br>(parallel<br>projection)  | 3.6<br>(full<br>pipeline<br>with<br>lighting) | 7.55<br>(single<br>specular<br>lighting)<br>(5.45) <sup>+</sup> | NA                      | 141<br>(peak for model<br>transform, no<br>lighting perf.<br>available) | 8.69<br>(full pipeline<br>with lighting) |
| Pixels rate<br>(Mpixels/s)     | 150<br>(bilinear)                | 50                                            | 166 (120)+                                                      | NA                      | 50                                                                      | 278                                      |
| Tixel rate<br>(Mtexels/s)      | 600                              | 200                                           | 1300 (939)+                                                     | 400                     | 200                                                                     | 278                                      |
| Chip Area<br>(mm²)             | 75.43<br>(25.5025 for<br>3DG IP) | 36<br>(core 23.04)                            | 49.7<br>(19.4 for 3DG)<br>(95.3/37.2)+                          | 25                      | 17.2                                                                    | 15.68<br>(11 for 3DG<br>Engine)          |
| Power (mW)                     | 109.5                            | 155                                           | 407<br>(780)+                                                   | 379                     | 52.4                                                                    | 472.6                                    |
| Frequency<br>(Mhz)             | 75                               | 50-200/<br>12.5-50<br>(VS/RE)                 | 166/333<br>(3DG/RISC)<br>(120/240) <sup>+</sup>                 | 50                      | 89-200<br>(RISC,VS)<br>20-50 (RE)                                       | 139                                      |
| Silicon proc.<br>(µm)          | 0.18                             | 0.18                                          | 0.13<br>(Normalized<br>to 0.18) <sup>+</sup>                    | 0.18                    | 0.18                                                                    | 0.18                                     |
| Special features               |                                  |                                               |                                                                 | 3D Vision               | Dynamic Volt.<br>Freq. Scaling<br>(DVFS)                                | Embedded<br>debug /perf.<br>Monitoring   |





# Conclusion

- We develop a tile-based 3DG SoC and its related development tools.
  - In hardware design
    - Has better (compared with previous works) performances in the vertex rate and pixel rate, and is the smallest in hardware size.
  - Special features
    - On-chip bus integration
    - Real-time profiling
    - Debugging
    - Performance measurement/tuning

#### Future Work

- Further improve power consumption and texture performance
- Upgrade to OpenGL ES 2.0

IEEE/ACM ASP-DAC'09@Yokohama, Japan, Jan 19-22 2009

