A 0.96pJ/SOP, 30.23K-neuron/mm^2 Heterogeneous Neuromorphic Chip With Fullerene-like Interconnection Topology for Edge-AI Computing

A 0.96pJ/SOP, 30.23K-neuron/mm^2 Heterogeneous Neuromorphic Chip With Fullerene-like Interconnection Topology for Edge-AI Computing

Abstract

This work introduces a heterogeneous neuromorphic system-on-chip (SoC) designed for highly efficient Edge-AI computing, achieving an impressive energy consumption of 0.96 pJ/SOP. The architecture integrates a tightly coupled RISC-V CPU and a neuromorphic processor utilizing a novel 'fullerene-like' interconnection topology for enhanced decentralized communication. Fabricated in 55 nm CMOS, the chip boasts a high neuron density of 30.23 K/mm² and a power density reduction of 67.5% compared to related works.

Report

Key Highlights

  • Record Energy Efficiency: Achieves ultra-low energy efficiency of 0.96 pJ/SOP (Synaptic Operation).
  • High Density: Features a high neuron density of 30.23 K-neuron/mm².
  • Novel Topology: Utilizes a 'fullerene-like' Network-on-Chip (NoC) topology, significantly boosting node connectivity and decentralization.
  • Low Power Footprint: Demonstrates a low power density of 0.52 mW/mm², resulting in a 67.5% reduction compared to existing architectures.
  • Heterogeneous Integration: Tightly couples a standard RISC-V CPU with a specialized neuromorphic processor core on the same die.

Technical Details

  • Fabrication Technology: The chip is fabricated using 55 nm CMOS technology.
  • Die Specifications: The total die area is compact, measuring 5.42 mm².
  • Neuromorphic Core Optimizations: Energy efficiency within the core is augmented through the implementation of sparse computing, partial membrane potential updates, and non-uniform weight quantization.
  • Fullerene-like NoC: This novel topology, built from multiple neuromorphic cores and multi-mode routers, increases the average degree of communication nodes by 32% over traditional topologies, while maintaining a minimal degree variance of 0.93 for advanced decentralized communication.
  • Scalability: The NoC design allows for straightforward scaling through the use of extended off-chip high-level router nodes.

Implications

  • New Benchmarks for Edge-AI: The achieved energy efficiency (0.96 pJ/SOP) sets a highly competitive standard for neuromorphic processing, making complex AI tasks feasible in extremely power-constrained or battery-operated Edge-AI devices.
  • Validation of RISC-V in Heterogeneous Systems: The successful tight coupling of the specialized neuromorphic processor with a RISC-V CPU reinforces RISC-V's viability and growing dominance as the flexible control plane (or GPP) necessary to manage and configure domain-specific accelerators.
  • Advanced Architectural Design: The introduction of the 'fullerene-like' interconnection topology provides a crucial proof-of-concept for creating highly connected, robust, and decentralized communication fabrics, which are essential for scaling up future large-scale spiking neural networks (SNNs) and addressing communication bottlenecks in conventional grids.
  • Area and Power Density Efficiency: Achieving high neuron density and dramatically reduced power density in a mature 55 nm process demonstrates that significant architectural innovations, rather than just process node scaling, can drive breakthroughs in hardware efficiency.
lock-1

Technical Deep Dive Available

This public summary covers the essentials. The Full Report contains exclusive architectural diagrams, performance audits, and deep-dive technical analysis reserved for our members.

Read Full Report →