In-Memory Deep Learning Accelerator

Apr 27, 2016

Deep learning has shown exciting successes in performing classification, feature extraction, pattern matching, etc. In many real-time applications, machine learning models are typically pre-trained in the cloud and then deployed in edge devices, such as mobile phones and Internet of Things (IoT) devices, for fast and energy-efficient local inference. Because of the very limited computing resource and energy budget, specialized real-time, yet low-power inference hardware is in urgent needs. Leveraging our expertise in analog and mixed-signal designs, we are exploring mixed-signal computing paradigms for deep learning and statistical machine learning models.

Precise and Programmable In-SRAM Computing [JSSC2021] [ISLPED2021] [CICC2022]

We explore novel circuit topologies to enable accurate and programmable-bitwidth deep learning accelerators and overcome the limitations of existing in-memory computing designs. We are particularly interested in high-performance low-power mixed-signal circuits (DAC, ADC, etc. ) specifically designed for mixed-signal computing systems, which are largely overlooked and optimistically assumed in the literature. Together with our collaborators, we further seek architecture designs and training methods co-optimized with in-SRAM computing circuits.

Mixed-Signal Computing

Publications

Ziyuan Wen, Rongqing Cong, Hanlin Zhu, Jiaao Zhang, Chong Xie, Kaiyuan Yang. A 28nm Online Spike Sorting Processor based on Multi-Channel Template Matching. IEEE Symposium on VLSI Technology and Circuits (VLSI), 2025.

PDF Project DOI

Zhiyu Chen, Ziyuan Wen, Weier Wan, Akhil Pakala, Yiwei Zou, Wei-Chen Wei, Zengyi Li, Yubei Chen, Kaiyuan Yang. PICO-RAM: A PVT-Insensitive Analog Compute-In-Memory SRAM Macro With In Situ Multi-Bit Charge Computing and 6T Thin-Cell-Compatible Layout. IEEE Journal of Solid-State Circuits (JSSC), 2024.

PDF Project DOI

Yi Huang, Lingkun Kong, Dibei Chen, Zhiyu Chen, Xiangyu Kong, Jianfeng Zhu, Konstantinos Mamouras, Shaojun Wei, Kaiyuan Yang, Leibo Liu. CASA: An Energy-Efficient and High-Speed CAM-based SMEM Seeding Accelerator for Genome Alignment. Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023.

PDF Project DOI

Zhiyu Chen, Qing Jin, Zhanghao Yu, Yanzhi Wang, Kaiyuan Yang. DCT-RAM: A Driver-Free Process-In-Memory 8T SRAM Macro with Multi-Bit Charge-Domain Computation and Time-Domain Quantization. IEEE Custom Integrated Circuits Conference (CICC), 2022.

PDF Project DOI

Zhiyu Chen, Qing Jin, Jingyu Wang, Yanzhi Wang, Kaiyuan Yang. MC2-RAM: An In-8T-SRAM Computing Macro Featuring Multi-Bit Charge-Domain Computing and ADC-Reduction Weight Encoding. 2021 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED), 2021.

PDF Project DOI

Zhiyu Chen, Zhanghao Yu, Qing Jin, Yan He, Jingyu Wang, Sheng Lin, Dai Li, Yanzhi Wang, Kaiyuan Yang. CAP-RAM: A Charge-Domain In-Memory Computing 6T-SRAM for Accurate and Precision-Programmable CNN Inference. IEEE Journal of Solid-State Circuits (JSSC), 2021.

PDF Project DOI

Jongyup Lim, Myungjoon Choi, Bowen Liu, Taewook Kang, Ziyun Li, Zhehong Wang, Yiqun Zhang, Kaiyuan Yang, David Blaauw, Hun-Seok Kim, Dennis Sylvester. AA-ResNet: Energy Efficient All-Analog ResNet Accelerator. 2020 IEEE 63rd International Midwest Symposium on Circuits and Systems (MWSCAS), 2020.