Empirical Evaluation & Neuromorphic Viability

Empirical Evaluation & Neuromorphic Viability#

To establish the viability of the Spiking Decision Transformer (SNN-DT), we conduct a rigorous ablation study isolating our core neuromorphic components across four standard Gym control tasks: CartPole-v1, MountainCar-v0, Acrobot-v1, and Pendulum-v1.

Our evaluation specifically tracks (1) algorithmic performance and (2) proxy metrics for hardware energy efficiency.

Downstream Validation Accuracy#

We isolate the impact of Phase-Shifted Positional Spiking (Pos-Only) and Dendritic-Style Routing MLP (Route-Only) against a unified configuration (Full) and the base non-augmented LIF formulation (Baseline).

Offline Loss Validation Curves

Figure 1: Ablation validation loss trajectories. The Full model natively achieves the fastest convergence towards the error floor by exploiting highly diverse temporal encoding and responsive gating.

Environment

Baseline

Pos-Only

Route-Only

Full (SNN-DT)

CartPole-v1

\(452.3 \pm 11.7\)

\(474.1 \pm 7.9\)

\(479.2 \pm 6.2\)

\(\mathbf{492.3 \pm 6.8}\)

MountainCar-v0

\(-120.2 \pm 9.4\)

\(-111.5 \pm 7.2\)

\(-109.8 \pm 6.9\)

\(\mathbf{-102.4 \pm 5.5}\)

Acrobot-v1

\(-87.1 \pm 3.2\)

\(-72.0 \pm 3.6\)

\(-68.3 \pm 3.9\)

\(\mathbf{-59.7 \pm 2.7}\)

Pendulum-v1

\(-155.3 \pm 5.1\)

\(-140.0 \pm 4.7\)

\(-135.4 \pm 4.4\)

\(\mathbf{-130.5 \pm 4.2}\)

RL Performance plot

Figure 2: Performance distributions evaluated over the target environments tracking downstream RL validation. The density directly reflects tighter policy resilience in continuous evaluations.

Note: SNN-DT matches the expressivity capabilities of state-of-the-art dense Decision Transformers while stabilizing sequence variance observed physically out-of-distribution across seeds.

Energy Profiling & CPU Overhead#

On advanced neuromorphic substrates like Intel Loihi or IBM TrueNorth, algorithmic energy scales linearly with spike activity emissions. We compute absolute spike counts during test batches as an energy proxy.

Spike Emission Distribution

Figure 3: Histograms of localized sparse spike activity. SNN-DT networks suppress superfluous event spikes effectively limiting output variance beneath the 10-spike barrier compared to unrestricted formulations.

Ablation Mode

Spikes / Inference

CPU Latency (ms)

Baseline

12,000

15.2

Pos-Only

11,000

14.8

Router-Only

9,000

13.5

Full SNN-DT

8,000

12.1

Projected Neuromorphic Efficiency#

The integrated structure produces a significant efficiency win. The SNN-DT achieves maximal score recovery with only ~8,000 spikes per sequential forward-pass.

Assuming a standardized metric of \(E_{spike} \approx 5 \text{ pJ}\) observed on dedicated hardware, the projected energy cost sits around \(40 \text{ nJ}\) per decision inference step:

\[ E_{decision} \approx \bar{S} \times E_{spike} \approx 8,000 \times 5\text{ pJ} = 40\text{ nJ} \]

This sub-microjoule boundary unlocks unprecedented application potential for transformer-based inference protocols operating on autonomous drone clusters or wearables edge systems.