SHEN Zhi-bo, CHEN Jie, LIU Jian, WANG Yun, ZHU Yue. Design of Multithreaded Vertex Processor for Embedded Applications[J]. Microelectronics & Computer, 2012, 29(7): 76-80.

Design of Multithreaded Vertex Processor for Embedded Applications

Abstract: A programmable multithreaded vertex processor is designed for 3D graphics rendering in embedded applications. It uses a single-instruction, multiple-data (SIMD) architecture in which one instruction processes four single-precision floating-point numbers simultaneously. Its multithreaded datapath supports concurrent execution of four threads, which reduces the number of pipeline stall cycles caused by read-after-write data hazards and improves processing efficiency. Compared with a single-threaded architecture, the four-thread vertex processor achieves a 2.1 to 2.8 times performance improvement at a relatively small hardware cost. The vertex processor supports OpenGL ES 1.1 and Vertex Shader Model 1.1. Implemented with a 90 nm CMOS process library, it reaches an operating frequency of 200 MHz and a geometry performance of 50 Mvertices/s.
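The stall-hiding effect of multithreading described in the abstract can be illustrated with a toy model. The sketch below is not the authors' design: the function name `cycles_to_issue`, the single-issue pipeline, the round-robin thread selection, the 4-cycle result latency, and the worst-case assumption that every instruction depends on the previous instruction of its own thread are all hypothetical choices made for illustration only.

```python
def cycles_to_issue(num_threads, instrs_per_thread, latency):
    """Count cycles for a single-issue pipeline to issue every instruction.

    Toy assumption: each instruction reads the result of the previous
    instruction of the same thread, and that result becomes available
    `latency` cycles after the producing instruction was issued.
    """
    remaining = [instrs_per_thread] * num_threads
    ready_at = [0] * num_threads  # earliest cycle at which each thread may issue
    cycle = 0
    next_thread = 0
    while any(remaining):
        # Round-robin search for a thread whose operands are ready.
        for offset in range(num_threads):
            t = (next_thread + offset) % num_threads
            if remaining[t] and ready_at[t] <= cycle:
                remaining[t] -= 1
                ready_at[t] = cycle + latency
                next_thread = (t + 1) % num_threads
                break
        # If no thread was ready, this cycle is a stall.
        cycle += 1
    return cycle


# Same total work (32 instructions), assumed 4-cycle result latency.
single = cycles_to_issue(num_threads=1, instrs_per_thread=32, latency=4)
multi = cycles_to_issue(num_threads=4, instrs_per_thread=8, latency=4)
print(single, multi)  # 125 32
```

In this idealized worst case the four-thread configuration issues one instruction per cycle, because another thread is always ready while one waits on its own result. Real shader programs contain fewer back-to-back dependencies and other overheads, so this toy ratio should not be read as a reproduction of the 2.1 to 2.8 times improvement reported in the abstract.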

     
