The field of computer architecture is witnessing a significant shift towards memory-centric computing, with a focus on improving performance, energy efficiency, and security. Recent research has explored the potential of 3D-stacked High-Bandwidth Memory (HBM) architectures, compute-in-SRAM devices, and processing-in-memory (PIM) paradigms to address the memory wall challenge. Additionally, there is a growing concern about thermal vulnerabilities in these architectures, which can be exploited to launch thermal performance degradation attacks. To mitigate these risks, researchers are developing innovative solutions, such as bandwidth-optimized oblivious map accelerators and scalable frameworks for resilient memory design. Noteworthy papers in this area include:
- BOLT, which achieves significant speedups in initialization and query time over state-of-the-art OMAPs,
- IM-PIR, which proposes the first PIM-based architecture for multi-server private information retrieval,
- BitROM, which enables practical and efficient Large Language Model inference at the edge through co-design with BitNet's 1.58-bit quantization model.