| 1. |
|
mtGEMM: An Efficient GEMM Library for Modern Multi-Core DSPs (IEEE Transactions on Parallel and Distributed Systems) |
|
Apr 1, 2026 by Jianbin Fang, Kainan Yu, Peng Zhang, Dezun Dong +4 more
| discuss | doi src S2, crossref
|
|
| 2. |
|
VMXDOTP: A RISC-V Vector ISA Extension for Efficient Microscaling (MX) Format Acceleration |
|
Mar 5, 2026 by Max Wipfli, Gamze .Islamouglu, Navaneeth Kunhi Purayil, Angelo Garofalo +1 more
| discuss src S2
|
|
| 3. |
|
Practicality of MAGIC NOR in a 1T1M Crossbar Array for In-memory Computing (Journal of Circuits, Systems and Computers) |
|
Mar 5, 2026 by F. Lalchhandama, Kamalika Datta, Rolf Drechsler, Sandip Chakraborty +1 more
| discuss | doi src S2, crossref
|
|
| 4. |
|
Joint Hardware-Workload Co-Optimization for In-Memory Computing Accelerators |
|
Mar 4, 2026 by O. Krestinskaya, M. E. Fouda, A. Eltawil, Khaled N. Salama
| discuss src S2
|
|
| 5. |
|
Variable Bit-Width All-Optical Content-Addressable Memory Enabled by Sb2Se3 for Similarity Search (Photonics) |
|
Mar 3, 2026 by Yi Guo, Xinmeng Hao, Yibo Zhang, Guangsong Yuan +4 more
| discuss | doi src S2, crossref
|
|
| 6. |
|
Heterogeneously integrated IGZTO-CMOS neuromorphic CIM achieving 34.8 TOPS/W and 87.7-fJ write energy (Japanese Journal of Applied Physics) |
|
Mar 2, 2026 by zhaolong He, Kaichang Chen, Hanfeng Wang, Yaolei Guo +6 more
| discuss | doi src S2, crossref
|
|
| 7. |
|
A 319.3-TOPS/W high-efficient computing-in-memory engine with 55nm high-density multi-level NOR-flash array and event-driven neuromorphic architecture (Japanese Journal of Applied Physics) |
|
Mar 2, 2026 by Yue Cheng, Yaolei Guo, Hanfeng Wang, zhaolong He +6 more
| discuss | doi src S2, crossref
|
|
| 8. |
|
Self-Calibrating Analog Circuitry for Softmax-Scaled Function With Analog Computing-In-Memory (IEEE Transactions on Very Large Scale Integration (VLSI) Systems) |
|
Mar 1, 2026 by Linjun Jiang, Yitong Zhou, He Zhang, Wang Kang
| discuss | doi src S2, crossref
|
|
| 9. |
|
Reconfigurable FPU With Precision Auto-Tuning for Next-Generation Transprecision Computing (IEEE Transactions on Circuits and Systems Part 1: Regular Papers) |
|
Mar 1, 2026 by Guilherme Dias, Luís Crespo, Timo Schlachter, M. A. Heller +4 more
| discuss | doi src S2, crossref
|
|
| 10. |
|
Implementation of Boolean Logic Operation in Embedded Flash Memory for In-Memory Computing Applications (IEEE Transactions on Electron Devices) |
|
Mar 1, 2026 by Jinhyeok Kim, Sung Jin Bang, Min Sung Kim, Minsuk Koo +1 more
| discuss | doi src S2, crossref
|
|
| 11. |
|
End-to-End Design Flow for Resistive Neural Accelerators (IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems) |
|
Mar 1, 2026 by Max Uhlmann, T. Rizzi, J. Wen, E. Quesada +7 more
| discuss | doi src S2, crossref, openalex
|
|
| 12. |
|
A T8T-SRAM Computing-in-Memory Macro for Ternary Deep Neural Networks and Boolean Logic Computations (IEEE Transactions on Very Large Scale Integration (VLSI) Systems) |
|
Mar 1, 2026 by Chenghu Dai, Jianhao Zhang, Zihua Ren, Lian Liu +6 more
| discuss | doi src S2, crossref
|
|
| 13. |
|
A 40 nm Buffer-Free 7T-SRAM Analog Charge-Domain CIM Macro With Merging Timing Based On Time-Row Division Strategy (IEEE Transactions on Very Large Scale Integration (VLSI) Systems) |
|
Mar 1, 2026 by Linjun Jiang, Jianxin Wu, Sifan Sun, Changyu Li +2 more
| discuss | doi src S2, crossref
|
|
| 14. |
|
A 28 nm 1.3 TFLOPS/mm2 Floating-Point SRAM-Based CIM Macro With Asynchronous Normalization and Parallel Sorting Alignment for AI-Edge Chip (IEEE Transactions on Very Large Scale Integration (VLSI) Systems) |
|
Mar 1, 2026 by Zhiting Lin, Miao Long, Yang Yang, Xin Wang +6 more
| discuss | doi src S2, crossref
|
|
| 15. |
|
Latency and Resource Trade-off Analysis of AXI-DMA and BRAM Integration Approaches on SoC-FPGA (Firat University Journal of Experimental and Computational Engineering) |
|
Feb 28, 2026 by G. Tatar
| discuss | doi src S2, crossref
|
|
| 16. |
|
FPGA-Based Implementation and Software Verification of a 3×3 Convolution Core for Edge AI Systems (Science and Technology of Engineering, Chemistry and Environmental Protection) |
|
Feb 28, 2026 by Hanwen Zhang
| discuss | doi src S2, crossref
|
|
| 17. |
|
A Highly-Parallel AI Accelerator Architecture for Convolution and Activation, Implemented in Verilog (Science and Technology of Engineering, Chemistry and Environmental Protection) |
|
Feb 28, 2026 by Qilin Xie, Jingyu Zhang
| discuss | doi src S2, crossref
|
|
| 18. |
|
A FORTIFIED FOURTEEN-TRANSISTOR STATIC MEMORY CELL WITH ENHANCED RESISTANCE TO RADIATION-INDUCED PERTURBATIONS (International Journal of Data Science and IoT Management System) |
|
Feb 28, 2026 by G. Kumar, C. Manoj, Bezawada Siva Ganesh, Veeranki Sasidhar +2 more
| discuss | doi src S2, crossref
|
|
| 19. |
|
Shifting in-DRAM |
|
Feb 27, 2026 by William C. Tegge, Alex K. Jones
| discuss src S2
|
|
| 20. |
|
Optimized fault-tolerant data processing module for high-reliability CNN accelerator. (PLoS ONE) |
|
Feb 27, 2026 by Sungkwang Yoon, Seung-Han Lee, Juhyeong Jo, Young-woo Lee
| discuss | doi src S2, crossref
|
|
| 21. |
|
Multi-Bit Floating-Gate Memory with an Ultrawide Programmable Window. (Small) |
|
Feb 27, 2026 by Ce Li, Ning Lin, Dongliang Yang, Tianze Yu +3 more
| discuss | doi src S2
|
|
| 22. |
|
GenDRAM:Hardware-Software Co-Design of General Platform in DRAM |
|
Feb 27, 2026 by Tsung-Han Lu, Weihong Xu, T. Rosing
| discuss src S2
|
|
| 23. |
|
A methodology for accurate benchmarking of neural network accelerators using a high-level synthesis-based hardware generator. (Philosophical transactions. Series A, Mathematical, physical, and engineering sciences) |
|
Feb 26, 2026 by Kartik Prabhu, Jeffrey Yu, X. Pan, Priyanka Raina
| discuss | doi src S2, crossref
|
|
| 24. |
|
TOM: A Ternary Read-only Memory Accelerator for LLM-powered Edge Intelligence |
|
Feb 24, 2026 by Hongyi Guan, Yijia Zhang, Wenqiang Wang, Yizhao Gao +3 more
| discuss src S2
|
|
| 25. |
|
S-TRAC: An Algorithm–Hardware Co-design of Sparsity-aware Threshold Adjustment for Accelerator-based RISC-V ISA Extensions (ACM Transactions on Parallel Computing) |
|
Feb 24, 2026 by Yueting Li, Terry Tao Ye, Wanshuang Lin, Wendong Xu +2 more
| discuss | doi src S2, crossref
|
|
| 26. |
|
Heterogeneous Memory Design Exploration for AI Accelerators with a Gain Cell Memory Compiler |
|
Feb 24, 2026 by Xinxin Wang, Lixian Yan, Shuhan Liu, Luke R. Upton +9 more
| discuss src S2
|
|
| 27. |
|
Oxide semiconductor gain cell-embedded memory: materials and integration strategies for next generation on-chip memory (Communications Engineer) |
|
Feb 23, 2026 by Sang Won Chung, S. Yoon, J. Jeong
| discuss | doi src S2, crossref, openalex
|
|
| 28. |
|
A Logic-Reuse Approach to Nibble-based Multiplier Design for Low Power Vector Computing |
|
Feb 22, 2026 by Md. Rownak Hossain Chowdhury, Mostafizur Rahman
| discuss src S2
|
|
| 29. |
|
TeLLMe: An Efficient End-to-End Ternary LLM Prefill and Decode Accelerator with Table-Lookup Matmul on Edge FPGAs (Symposium on Field Programmable Gate Arrays) |
|
Feb 21, 2026 by Ye Qiao, Zhiheng Chen, Yifan Zhang, Yian Wang +1 more
| discuss | doi src S2, crossref
|
|
| 30. |
|
FARE: A Fine-grained Pipelined Reconfigurable FlashAttention Kernel (Symposium on Field Programmable Gate Arrays) |
|
Feb 21, 2026 by Kaushikkumar S. Rathva, A. Alam, S. Srinivasan, S. K. Mandal
| discuss | doi src S2, crossref
|
|