Learning Rules in Spiking Neural Networks: A Survey

Highlights
Research highlight 1
Guided by a hierarchical classification of SNN learning rules, we present a
comprehensive survey of learning rules in spiking neural networks.
Research highlight 2
We discuss and compare their characteristics, advantages, limitations, and
performance on several popular datasets.
Research highlight 3
We review practical applications of SNNs to better present the SNN research landscape.
Figure: A three-level hierarchical taxonomy of learning rules in SNNs.
Learning Rules in Spiking Neural Networks: A Survey
Zexiang Yi (a), Jing Lian (b), Qidong Liu (c), Hegui Zhu (d), Dong Liang (e), Jizhao Liu (a,*)
(a) School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000, Gansu, China
(b) School of Electronics and Information Engineering, Lanzhou Jiaotong University, Lanzhou, 730070, Gansu, China
(c) School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, 450001, Henan, China
(d) College of Sciences, Northeastern University, Shenyang, 450001, Liaoning, China
(e) Department of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, Jiangsu, China
* Corresponding author: liujz@lzu.edu.cn (Jizhao Liu)
Abstract
Spiking neural networks (SNNs) are a promising energy-efficient alternative
to artificial neural networks (ANNs) due to their rich dynamics, capability to
process spatiotemporal patterns, and low power consumption. The complex
intrinsic properties of SNNs give rise to a diversity of learning rules, which are essential to functional SNNs. This paper is aimed at presenting a
comprehensive overview of learning rules in SNNs. Firstly, we introduce the
basic concepts of SNNs and commonly used neuromorphic datasets. Then,
guided by a hierarchical classification of SNN learning rules, we present a
comprehensive survey of these rules with discussions on their characteristics,
advantages, limitations, and performance on several datasets. Moreover,
we review practical applications of SNNs, including event-based vision and
audio signal processing. Finally, we conclude this survey with a discussion
on challenges and promising future research directions in this area.
Keywords: spiking neural networks, pulse-coupled neural networks,
neuromorphic computing, learning rules, image classification.
1. Introduction
The human brain is the best-known intelligent system, performing func-
tions such as perception, reasoning, and control with a power consumption
of nearly 20W[1]. There have been many artificial intelligence (AI) models
inspired by the brain. Rosenblatt [2] proposed the perceptron to realize bi-
nary classification of input patterns. Lecun et al. [3] applied convolutional
neural networks (CNNs) to handwritten digit recognition. Recently, neural
networks based on attention mechanisms[4] have invoked a new wave of re-
search. Although ANNs have achieved great success in many fields[5], they
have the following drawbacks: 1) the training and inference of ANNs con-
sume huge amounts of energy[6]; 2) they suffer from limitations such as poor robustness and catastrophic forgetting[7]. However, these problems do not exist in the brain
where neurons have complex spatiotemporal dynamics, communicating and
processing information through discrete spikes. Additionally, neurons are
connected in a hierarchical way, forming different functional neural networks
with diverse plasticity. How to introduce the above characteristics to con-
struct more energy-efficient and robust AI models is an open problem in AI
research.
1.1. Unique characteristics of spiking neural networks
Spiking neural networks (SNNs) mimic the way the human brain processes information by taking advantage of discrete and asynchronous spikes; thus
they are believed to have the capability to process spatiotemporal information
efficiently[8, 9, 10]. The basic building blocks of SNNs are spiking neurons.
Because spike generation can be described at different levels of bio-fidelity, there is a diversity of spiking neuron models, such as the Hodgkin-Huxley (H-H) model[11] and the Izhikevich model (IM)[12]. Additionally, by utilizing the
time dimension, spiking neurons can represent information in a sparse and
robust way[13].
SNNs are bridges between brain science (BS) and AI. On the one hand,
neuroscientists use SNNs to simulate biological neural networks to deepen
their understanding of the brain[14, 15, 16]; on the other hand, AI re-
searchers draw inspiration from BS to build energy-efficient and robust neural
networks[17, 18]. However, due to the non-differentiable nature of spikes, train-
ing efficient and high-performance SNNs has remained a major difficulty[19,
20, 21].
1.2. Motivation
In recent years, SNNs have attracted enormous research interest. There
has been an upward trend in SNN-related papers. To visualize this trend, we
present Fig.1, which illustrates the number of SNN papers that are available
on the Web of Science since 2015.
Learning is an essential part of SNNs that adapts a network to perform
specific tasks, such as classification or object detection. Fig.2 shows some of
the important learning rules over the past two decades.
Figure 1: The number of SNN papers published after 2015, broken down into journal, conference, and review articles.
SpikeProp[22] is the
earliest spike-based backpropagation algorithm for multilayer SNNs. Tempotron[23] can perform binary classification tasks in analogy to the perceptron. The remote supervised method (ReSuMe)[24] and spike pattern association neuron (SPAN)[25] are classical spike sequence learning rules. Masquelier and Thorpe [26] applied spike-timing-dependent plasticity (STDP) to multilayer neural networks inspired by ventral visual pathways to enable unsupervised feature learning. Since 2015, SNN research has been dominated by deep networks. Cao et al. [27] proposed converting a pre-trained ANN to an SNN. Kheradpisheh et al. [28] used STDP to train deep spiking CNNs layer by layer. SuperSpike[29] and spatio-temporal backpropagation (STBP)[30] train multilayer SNNs via surrogate gradients. Spike-element-wise (SEW) ResNet[19] was proposed to combat the degradation problem in deep SNNs. Bu et al. [31] realized ultra-low-latency inference in ANN-converted SNNs.
Figure 2: The evolution of learning rules in SNNs, from shallow SNNs (e.g., SpikeProp, Tempotron, ReSuMe, SPAN, and STDP with shallow networks) to deep SNNs (e.g., ANN conversion, STDP with deep networks, surrogate gradients, SEW ResNet, and low-latency conversion).
Several survey papers [32, 33, 34, 35, 36, 37, 38, 39, 40, 41] have so far reviewed recent advances in SNNs. However, some of these papers have a
limited scope of learning rules; for instance, [36, 34, 41] focus on supervised learning, while [32, 33] emphasize learning rules in multilayer SNNs.
Moreover, most of the above surveys only cover the papers published un-
til 2021. Nonetheless, many important breakthroughs in learning rules in
SNNs have occurred since 2022. Additionally, none of the surveys cover
pulse-coupled neural networks (PCNNs), which are cortex models exhibit-
ing synchronous oscillation behavior and have been widely applied to image
processing[42, 43, 44].
1.3. Contribution
This paper surveys advances in learning rules in SNNs, providing insights into both techniques and performance in a systematic way.
The key contributions of this article are summarized as follows. First, we
provide a systematic review of the evolution of learning rules in SNNs, many of which have not been reviewed in previous surveys, and present com-
parisons between the state-of-the-art using results reported on several pop-
ular datasets. Second, we review practical applications of SNNs, including
event-based vision and audio signal processing. Third, we discuss a number
of challenges and promising future research directions.
1.4. Organization
The rest of this survey is structured as follows. We start by providing
the basic concepts of SNNs including neuron and network models, synaptic
plasticity, and neural coding. Next, in Section 3, we review the commonly
used neuromorphic datasets. Section 4 presents a hierarchical classification
of learning rules in SNNs and analyzes the research trend and their character-
istics, advantages, limitations, and performance on several datasets. Section
5 reviews practical applications of SNNs. Finally, Section 6 discusses some
challenges and directions of this field.
2. Basic concepts of SNNs
SNNs involve more neuroscience-related concepts than ANNs. To better
analyze the learning rules in SNNs, basic concepts of SNNs are introduced in
this section, including neuron and network models, synaptic plasticity, and
neural coding.
2.1. Neuron models
There are billions of neurons in the human brain which have a basic
structure as shown in Fig.3. Dendrites are the input terminals. The cell body
integrates incoming spikes received by different branches of dendrites and
emits a spike when its membrane potential reaches the threshold. Spikes
travel along the axons to other neurons via synapses.
Figure 3: Diagram of a neuron. It can be divided into three parts: dendrite, cell body, and axon.
To emulate the generation of spikes with different levels of bio-fidelity
and computational cost, a variety of spiking neuron models have been pro-
posed. For simplicity and mathematical tractability, leaky integrate-and-fire
(LIF)[45], spike response model (SRM)[46] and PCNN neuron[14] models are
widely used in SNNs. To better formalize these models, Tab.1 summarizes
the main notations used in the following equations.
The LIF model dates back to 1907[45], when the mechanism of spike generation had not yet been revealed, so the neuron was modeled as a parallel circuit of a resistance R and a capacitance C, as shown in Fig.4(a). When an input current I is injected into the capacitor, its voltage V rises. Meanwhile, charge on the capacitor leaks through the resistor. A spike is emitted whenever V reaches the threshold V_th, after which V is reset to the resting voltage V_rest. Fig.4(b) depicts the dynamics of an LIF neuron under constant input, which can formally be described in differential form as:

$$\tau_m \frac{dV}{dt} = -(V - V_{rest}) + RI + S(t)(V_{rest} - V_{th}) \quad (1)$$

$$S(t) = \sum_{t_s^j \le t} \delta(t - t_s^j) \quad (2)$$

where τ_m = RC is the membrane time constant. When working with differential equations, it is convenient to denote a spike as a Dirac delta function δ(t), so the postsynaptic spike train S(t) can be represented as a sum of Dirac functions at the output spike times t_s^j.

Table 1: Notation list.
Notation | Description
τ, R, C | Time constant, input resistance, and capacitance of a neuron, respectively
F | Feeding input of a neuron
L | Linking input of a neuron
V | Membrane potential of a neuron
E | Dynamic threshold of a neuron
S | Output spike train of a neuron
t_s^j | Time of the jth spike of the output spike train
S_i | Input spike train from the ith synapse
t_i^j | Time of the jth spike from the ith synapse

Figure 4: The LIF model. (a) Diagram of the LIF model circuit. (b) Neuronal dynamics of an LIF neuron under a constant current.
A synapse transmits spikes via neurotransmitters, acting like a low-pass filter scaled by the synaptic weight. The dendrite integrates the weighted and filtered synaptic currents, yielding the total input current I. The dynamics of these operations are given by:

$$\tau_s \frac{dI}{dt} = -I + \sum_i w_i S_i(t) \quad (3)$$

$$S_i(t) = \sum_{t_i^j \le t} \delta(t - t_i^j) \quad (4)$$

where τ_s is the synapse time constant, t_i^j denotes the presynaptic spike times of the ith afferent, and the sum runs over all presynaptic neurons i. w_i and S_i(t) are the corresponding synaptic weights and presynaptic spike trains, respectively.
It is customary to simulate SNNs in discrete time using Euler’s method.
To reduce computational costs, the synaptic filter effect is often ignored, but
there are also researchers[47] who incorporate the filter dynamics to improve the convergence of SNN training. In addition, there are some variants of the LIF model, such as the quadratic integrate-and-fire (QIF) and integrate-and-fire
(IF) models.
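As a concrete illustration of this discrete-time simulation, the sketch below integrates Eqs. (1)-(4) with Euler's method in Python. It is a minimal example under assumed, arbitrary parameter values (time step, time constants, threshold), not a prescription from any particular work.

```python
import numpy as np

def simulate_lif(spikes_in, w, dt=1.0, tau_m=20.0, tau_s=5.0,
                 R=1.0, v_rest=0.0, v_th=1.0):
    """Euler integration of Eqs. (1)-(4): synaptic filtering plus LIF dynamics.

    spikes_in: (T, N) binary array of presynaptic spikes; w: (N,) synaptic weights.
    Returns the membrane potential trace and the output spike train.
    """
    T = spikes_in.shape[0]
    I, v = 0.0, v_rest
    v_trace, spikes_out = np.zeros(T), np.zeros(T)
    for t in range(T):
        # Eq. (3): leaky synaptic current driven by weighted input spikes
        I += dt / tau_s * (-I + np.dot(w, spikes_in[t]))
        # Eq. (1): leaky integration of the membrane potential
        v += dt / tau_m * (-(v - v_rest) + R * I)
        if v >= v_th:              # threshold crossing -> emit a spike
            spikes_out[t] = 1.0
            v = v_rest             # reset to the resting potential
        v_trace[t] = v
    return v_trace, spikes_out

# Example: 100 time steps, 10 Bernoulli spike inputs with random weights
rng = np.random.default_rng(0)
x = (rng.random((100, 10)) < 0.1).astype(float)
v, s = simulate_lif(x, rng.normal(0.5, 0.1, 10))
```

Here the reset is applied explicitly rather than through the S(t)(V_rest - V_th) term of Eq. (1); the two formulations are equivalent in discrete time.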
The SRM uses spike response kernels to model a neuron's membrane potential in response to its input and output spikes[46]. A commonly used form is given as follows:

$$V(t) = \sum_i w_i \sum_{t_i^j \le t} \varepsilon(t - t_i^j) + \sum_{t_s^j \le t} \eta(t - t_s^j) + V_{rest} \quad (5)$$
where ε and η are the input and output spike response kernels, respectively. SRM neurons are appealing as other features can be added simply by embedding them into the kernels. In addition, since the membrane potential is explicitly expressed, the simulation of SRM models is often event-based, and thus less costly and time-consuming than that of LIF models.
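For illustration, the sketch below evaluates Eq. (5) in an event-driven fashion with a commonly used double-exponential input kernel and an exponential refractory kernel; the kernel shapes and constants are assumptions made for this example, not the only choices used in the literature.

```python
import numpy as np

def eps_kernel(s, tau_m=10.0, tau_s=2.5):
    """Double-exponential input kernel epsilon(s), zero for s < 0."""
    s = np.asarray(s, dtype=float)
    return np.where(s > 0, np.exp(-s / tau_m) - np.exp(-s / tau_s), 0.0)

def eta_kernel(s, v_reset=-5.0, tau_r=10.0):
    """Refractory (output) kernel eta(s): after-spike hyperpolarization."""
    s = np.asarray(s, dtype=float)
    return np.where(s > 0, v_reset * np.exp(-s / tau_r), 0.0)

def srm_potential(t, w, input_spikes, output_spikes, v_rest=0.0):
    """Eq. (5): V(t) as a sum of kernel responses to past input and output spikes."""
    v = v_rest
    for w_i, times in zip(w, input_spikes):                 # input contributions
        v += w_i * eps_kernel(t - np.asarray(times)).sum()
    v += eta_kernel(t - np.asarray(output_spikes)).sum()    # refractory contribution
    return v

# Two afferents with a few spikes each, and one previous output spike
print(srm_potential(30.0, w=[1.2, 0.8],
                    input_spikes=[[5.0, 12.0], [20.0]],
                    output_spikes=[15.0]))
```

Because only spike times are stored, the potential can be evaluated on demand, which is what makes event-based SRM simulation cheap.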
The PCNN neuron is a two-compartment model[42], as shown in Fig.5.
Figure 5: Schematic of a PCNN neuron. It consists of three parts: dendrite tree, modulation, and spike generator.
The dendritic tree has two distinct inputs: the primary input, termed the feeding input F, and the auxiliary input, termed the linking input L. They are represented as leaky integrators given by
$$\tau_f \frac{dF}{dt} = -F + I_f \quad (6)$$

$$\tau_l \frac{dL}{dt} = -L + I_l \quad (7)$$

where τ_f, τ_l, I_f, and I_l are the time constants and synaptic currents of the feeding and linking inputs, respectively. The synaptic currents can be spikes,
constants, analog time-varying signals, or any combination.
The linking input modulates the feeding input by multiplication, resulting in the membrane potential V:

$$V = F(1 + \beta L) \quad (8)$$

where β is the linking strength.
The spike generator emits a spike whenever the membrane potential crosses the threshold E. Unlike LIF models, which reset the membrane potential, the PCNN neuron feeds back to the threshold, which is another leaky integrator given by

$$\tau_e \frac{dE}{dt} = -E + V_E S(t) \quad (9)$$

where S(t) denotes the output spike train of the PCNN neuron, τ_e is the time constant, and V_E is the amplitude gain.
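A minimal discrete-time sketch of Eqs. (6)-(9) for a single PCNN neuron is given below; the time constants, linking strength, and gain are illustrative assumptions, and the feeding and linking currents are arbitrary external signals.

```python
import numpy as np

def simulate_pcnn_neuron(I_f, I_l, dt=1.0, tau_f=10.0, tau_l=1.0,
                         tau_e=5.0, beta=0.5, V_E=20.0):
    """Euler integration of a single PCNN neuron, Eqs. (6)-(9)."""
    T = len(I_f)
    F = L = E = 0.0
    spikes = np.zeros(T)
    for t in range(T):
        F += dt / tau_f * (-F + I_f[t])      # Eq. (6): feeding compartment
        L += dt / tau_l * (-L + I_l[t])      # Eq. (7): linking compartment
        V = F * (1.0 + beta * L)             # Eq. (8): multiplicative modulation
        S = 1.0 if V > E else 0.0            # spike generator: fire when V crosses E
        E += dt / tau_e * (-E + V_E * S)     # Eq. (9): dynamic threshold feedback
        spikes[t] = S
    return spikes

# Constant feeding input and weak linking input for 100 steps
spk = simulate_pcnn_neuron(np.full(100, 0.5), np.full(100, 0.1))
```

Because the threshold E decays between spikes and jumps by V_E after each spike, the neuron fires periodically under constant input, which is the basis of the synchronous oscillations PCNNs are known for.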
2.2. Network models
Neurons are connected to form neural networks following some connec-
tion patterns, as shown in Fig.6. There are three types of feedforward con-
nections, convolutional, local, and full connection. Convolutional and local
connections both have local visual receptive fields, but in convolutional con-
nection, the weights are shared between all receptive fields, while in local
connection, each receptive field has its own set of weights, which is more
biologically plausible[48]. Full connection layers are often used to classify ex-
tracted features. Recurrent connections consist of self-recurrent, lateral, and
feedback connections. Zhang and Li [49] added self-recurrent connections to
implement local memory. Diehl and Cook [50] used feedback connections to
implement winner-take-all mechanism. Cheng et al. [17] introduced lateral
connections to improve the recognition robustness against noise. The last
type of connection is the residual connection which can solve the degrada-
tion problem in deep ANNs and is also the key to deep SNNs[19, 20].
Figure 6: Connection patterns in SNNs, including feedforward (convolutional, local, and full), recurrent (self-recurrent, lateral, and feedback), and residual connections.
Network models can be constructed using the above connection patterns.
We review three types of network models as follows. The PCNN is a cortex model originating from the simulation of synchronous oscillation behavior in the primary visual cortex of the cat[42]. The classical PCNN and its variants, such as the spiking cortical model (SCM)[51], have been widely used in image segmentation[52, 53], fusion[54, 55, 56], enhancement[57, 58, 59], invariant texture retrieval[60], and many other image processing tasks[61, 62, 63]. Although a vast body of work utilizes PCNNs for image processing, less attention has been paid to learning rules for PCNNs, as we will see in the following. Based on the characteristics of the ventral visual stream, Riesenhuber and Poggio [64] proposed the HMAX model. Serre et al. [65] extended HMAX to the field of computer vision. Masquelier and Thorpe [26] proposed a spiking version of HMAX equipped with STDP for unsupervised feature learning.
Single-layer networks (SLNs), multilayer perceptrons (MLPs), recurrent neural networks (RNNs), and convolutional neural networks (CNNs) are network models widely used in deep learning[66]. VGG and ResNet are popular architectures for deeper CNNs and have been adopted by a large number of high-performance SNNs.
2.3. Synaptic plasticity
Synaptic plasticity refers to the modulation of synaptic weights[67]. In
1949, Hebb [68] proposed Hebb's postulate, which can be simply stated as "neurons that fire together, wire together"[69]. The subsequent finding of long-term potentiation (LTP)[70] provided experimental evidence for Hebb's postulate. LTP together with long-term depression (LTD) regulates synaptic weights bidirectionally, serving as the synaptic basis for learning and memory. The induction of LTP and LTD is spike-timing dependent. Studies[71, 72] demonstrated that the relative timing of the pre- and postsynaptic spikes determines the direction and magnitude of synaptic modification. This phenomenon is
known as spike-timing-dependent plasticity (STDP)[73]. Fig.7(a) shows two
neurons connected by a synapse, and Fig.7(b) illustrates the relationship be-
tween the amount of change in synaptic weights and the timing of pre and
postsynaptic spikes.
Fitting experimental data with exponential functions, we can formalize STDP as

$$\Delta w = \begin{cases} A_+ \exp(-\Delta t/\tau_+), & \Delta t > 0 \\ -A_- \exp(\Delta t/\tau_-), & \Delta t < 0 \end{cases} \quad (10)$$

where A_+ and A_- are the modulation magnitudes for LTP and LTD, respectively, τ_+ and τ_- are the corresponding time constants of the learning window, and Δt = t_post - t_pre is the time difference between a pair of pre- and postsynaptic spikes.
Figure 7: STDP. (a) Two neurons connected by a synapse equipped with STDP. (b) Relationship between spike timing and synaptic weight change. In a time window of tens of milliseconds, when the presynaptic spike is earlier (later) than the postsynaptic spike, the weight increases (decreases), resulting in LTP (LTD).
In STDP modeling studies, Song et al. [73] found that synapses modified by STDP compete with one another, resulting in a bimodal distribution of synaptic weights. Guyonneau et al. [74] showed that a neuron with STDP-modified synapses, when stimulated by a repeatedly presented spike pattern, becomes selective to it and decreases its response latency. STDP also
enables neurons to learn visual features in an unsupervised way[26, 50, 75].
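To make Eq. (10) concrete, the sketch below implements a standard trace-based (all-to-all pair) STDP update in discrete time; the trace formulation is a common numerical equivalent of the exponential windows, and all parameter values are placeholders chosen for illustration.

```python
import numpy as np

def stdp_pair_based(pre_spikes, post_spikes, w0=0.5,
                    A_plus=0.01, A_minus=0.012, tau_plus=20.0, tau_minus=20.0):
    """All-to-all pair-based STDP via exponentially decaying spike traces (Eq. 10)."""
    T = len(pre_spikes)
    w, x_pre, x_post = w0, 0.0, 0.0
    for t in range(T):
        # Decay the presynaptic and postsynaptic traces
        x_pre *= np.exp(-1.0 / tau_plus)
        x_post *= np.exp(-1.0 / tau_minus)
        if pre_spikes[t]:
            x_pre += 1.0
            w -= A_minus * x_post   # post before pre -> LTD branch of Eq. (10)
        if post_spikes[t]:
            x_post += 1.0
            w += A_plus * x_pre     # pre before post -> LTP branch of Eq. (10)
    return w

# Pre spike at t=10 repeatedly followed by a post spike at t=15 -> net potentiation
pre = np.zeros(100); post = np.zeros(100)
pre[10::20] = 1; post[15::20] = 1
print(stdp_pair_based(pre, post))
```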
Increasing experimental observations demonstrate that neuromodulators
play a vital role in synaptic plasticity. They can change the polarity[76] or
adjust the time window of STDP[77]. Frémaux and Gerstner [78] proposed
three-factor rules to incorporate the influence of neuromodulators. Inspired
by these observations, researchers model neuromodulator effects to imple-
ment bio-plausible supervised learning[79] and reinforcement learning[80] for
image recognition.
2.4. Neural coding
Stimuli, such as light or odors, are converted into spikes for neural processing, a process known as neural coding. Currently, there are three main
coding methods applied in SNNs: rate, temporal and direct coding.
Rate coding converts input stimulus into Poisson-distributed spike trains,
with firing rates proportional to the input intensity. To reduce the compu-
tational cost, the binomial distribution is commonly used instead of Poisson
distribution[17].
As rate coding represents information via firing rate, i.e. spike count over
a short time, a single spike contains little information. temporal coding is
concerned with the timing of a spike. A common temporal coding method is
time-to-first-spike (TTFS) coding[81] in which a larger input intensity corre-
sponds to an earlier spike. Therefore, TTFS coding requires fewer spikes to
encode information than rate coding, resulting in a lower power consumption
and inference latency[82].
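To make the two schemes concrete, the sketch below encodes normalized intensities with Bernoulli (rate) coding and with TTFS (temporal) coding; the number of time steps and the linear intensity-to-latency mapping are illustrative assumptions for this example.

```python
import numpy as np

def rate_encode(x, T=20, rng=None):
    """Bernoulli rate coding: per-step spike probability proportional to x in [0, 1]."""
    rng = rng or np.random.default_rng()
    return (rng.random((T,) + x.shape) < x).astype(np.uint8)

def ttfs_encode(x, T=20):
    """Time-to-first-spike coding: larger intensity -> earlier (single) spike."""
    t_spike = np.round((1.0 - x) * (T - 1)).astype(int)   # x = 1 fires at t = 0
    spikes = np.zeros((T,) + x.shape, dtype=np.uint8)
    np.put_along_axis(spikes, t_spike[None, ...], 1, axis=0)
    return spikes

x = np.array([0.1, 0.5, 0.9])         # three normalized intensities
print(rate_encode(x).sum(axis=0))     # spike counts grow with intensity
print(ttfs_encode(x).argmax(axis=0))  # first-spike times shrink with intensity
```

The example makes the trade-off visible: rate coding spends many spikes per value, while TTFS spends exactly one spike and carries the value in its timing.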
Rueckauer et al. [83] suggested that the variability in rate coding impairs
the performance of SNNs. So many studies[84, 85] used a trainable spik-
ing neuron layer to convert analog inputs, which can be regarded as synaptic currents, into output spike trains; this is called direct coding[86].
3. Neuromorphic datasets
The performances of SNNs are often evaluated on existing ANN-oriented
datasets, for example CIFAR-10[87] and ImageNet[88], which are static-
image datasets containing no temporal information. Before feeding them
into SNNs, researchers often convert such frame-based data to spike trains
using coding methods described in Section 2.4. However, these ANN-oriented
datasets can’t exploit the spatiotemporal processing capability of SNNs[89].
To this end, researchers have gathered neuromorphic datasets inspired by the
biological visual system.
Neuromorphic datasets are recorded by dynamic vision sensors (DVS)
which capture changes in the sensing field using two channels: the On channel for intensity increases and the Off channel for intensity decreases.
Currently, neuromorphic datasets can be divided into two categories: DVS-
converted and DVS-captured[90]. DVS-converted datasets are converted
from traditional datasets, such as MNIST and CIFAR-10. Researchers use a
DVS camera to record static images by moving the image or camera. Both
N-MNIST[91] and CIFAR10-DVS[92] are acquired by this method. In con-
trast, DVS-captured datasets are recorded via real-world motion. DVS128
Gesture[93] is a typical example. We present an overview of the main char-
acteristics of well-known neuromorphic datasets in Tab.2. In the following,
we review these datasets in detail.
Table 2: Summary of well-known neuromorphic datasets.
Dataset | Year | Data category | #Classes | URL
N-MNIST | 2015 | DVS-converted | 10 | http://www.garrickorchard.com/datasets
N-Caltech101 | 2015 | DVS-converted | 101 | http://www.garrickorchard.com/datasets
CIFAR10-DVS | 2017 | DVS-converted | 10 | https://figshare.com/s/d03a91081824536f12a8
DVS128 Gesture | 2019 | DVS-captured | 11 | http://research.ibm.com/dvsgesture
ASL-DVS | 2019 | DVS-captured | 24 | https://github.com/PIX2NVS/NVS2Graph
N-MNIST: The N-MNIST dataset[91] is converted from the MNIST
dataset. Researchers first displayed MNIST examples on an LCD monitor,
and then moved the ATIS sensor mounted on a pan-tilt unit to record the
image. It includes a training set with 60,000 samples and a test set with
10,000 samples.
N-Caltech101: The N-Caltech101 dataset[91] is a neuromorphic version of the Caltech101 dataset. It was recorded using the same method as the
N-MNIST dataset and contains 9146 images in 101 classes.
CIFAR10-DVS: The CIFAR10-DVS dataset[92] is a spiking version of
the CIFAR-10 dataset. The dataset was recorded by moving images in front
of a DVS camera. It consists of 10,000 examples in 10 classes, with 1000
examples in each class.
DVS128 Gesture: The DVS128 Gesture dataset[93] was recorded by a
DVS128 camera and contains 11 kinds of hand gestures from 29 subjects
under 3 kinds of illumination conditions.
ASL-DVS: The ASL-DVS dataset[94] was recorded in an office environ-
ment with low environmental noise and constant illumination. It contains 24
classes of gestures corresponding to 24 English letters.
4. Learning rules in SNNs
In this section, we first introduce the hierarchical classification of SNN
learning rules which gives an overview of these rules. Then, we analyze the
research trends in SNN learning rules. Finally, we review these rules in a hier-
archical way and summarize the state-of-the-art performance for comparison
on several datasets.
4.1. Hierarchical classification of SNN learning rules
In order to help illustrate an overall structure for SNN learning rules, we
categorized the existing learning rules hierarchically, which is presented in
Fig.8. Previous papers [35, 95] classify SNN learning rules according to the use of data labels, i.e. supervised and unsupervised learning [35], or the biological realism and plasticity scale of learning rules [95]. We differ from
these papers by considering the working principles of the SNN learning rules.
The details of each group of learning rules will be discussed later.
Figure 8: Illustration of the three-level hierarchical classification of learning rules in SNNs. Bold only, bold italic, and italic only styles denote the first, second, and third levels of the hierarchy, respectively.
4.2. Analysis and trends
Before we dive into individual rules, we summarize the most represen-
tative SNN learning rules in Tab.3, sorted according to their publication
dates, to analyze the evolution and trends in this field. LIF and SRM are the most widely used neuron models owing to their simplicity and mathematical tractability. Direct coding has become popular due to its accuracy advantage over rate coding[86]. Temporal coding has recently been limited to certain rules, such as timing gradient methods. Notably, there has been a clear trend toward formulating efficient
learning rules such as ANN pre-training and activation gradient for deeper
SNNs (>10 layers) to perform challenging tasks, such as image classification
on ImageNet and CIFAR-100.
Table 3: Summary of some representative learning rules in SNNs
Ref. Year Venue Neuron Model Coding Method Learning Rule Network Model Dataset
[22] 2002 Neurocomp. SRM Temporal Timing gradient MLP Iris
[23] 2006 Nat. Neurosci. LIF Voltage gradient SLN
[96] 2006 Neural Comput. SRM Likelihood gradient SLN
[26] 2007 PLoS Comput. Biol. IF Temporal STDP-based HMAX Caltech-101
[97] 2009 Neural Netw. SRM Temporal Timing gradient MLP Iris
[24] 2010 Neural Comput. LIF; H-H; IM W-H rule-based SLN
[98] 2012 PLoS One SRM W-H rule-based SLN N/A
[99] 2013 PLoS One LIF; IM W-H rule-based SLN N/A
[100] 2014 Neurocomp. SRM Temporal STDP-based MLP Iris
[50] 2015 Front. Comput. Neurosci. LIF Rate STDP-based SLN MNIST
[101] 2015 Neural Comput. SRM Likelihood gradient MLP N/A
[27] 2015 IJCV IF Rate ANN pre-training CNN CIFAR-10; Neovision2
[102] 2015 IJCNN IF Rate ANN pre-training CNN MNIST
[103] 2015 IJCNN LIF Voltage gradient SLN
[104] 2016 Science LIF Voltage gradient SLN TIDIGITS
[105] 2016 arXiv IF Direct ANN pre-training CNN CIFAR-10
[106] 2016 Neurocomp. IF Temporal STDP-based HMAX 3D-Object
[107] 2016 Front. Neurosci. LIF Rate Voltage gradient CNN MNIST; N-MNIST
[83] 2017 Front. Neurosci. LIF Direct Conversion method VGG-16 MNIST; CIFAR-10; ImageNet
[108] 2018 IEEE T-NNLS IF Temporal Timing gradient MLP MNIST
[109] 2018 NeurIPS IF; QIF Activation gradient RNN
[28] 2018 Neural Netw. IF Temporal STDP-based CNN Caltech-101; ETH-80; MNIST
[80] 2018 IEEE T-NNLS IF Temporal STDP-based HMAX Caltech-101; ETH-80;
[110] 2018 Neural Netw. LIF Direct Voltage gradient CNN MNIST
[30] 2018 Front. Neurosci. LIF Rate Activation gradient CNN MNIST; N-MNIST
[85] 2019 AAAI LIF Direct Activation gradient CNN N-MNIST; CIFAR10-DVS; CIFAR-10
[111] 2019 IEEE T-Cyb. LIF Voltage gradient SLN
[112] 2019 IEEE T-CDS LIF Rate STDP-based CNN Caltech-101; MNIST
[48] 2019 Neural Netw. LIF Rate STDP-based SLN MNIST
[113] 2019 Front. Neurosci. IF Rate ANN pre-training VGG-16; ResNet-20/34 CIFAR-10; ImageNet
[82] 2020 Int. J. Neural Syst. IF Temporal Timing gradient MLP Caltech-101; MNIST
[114] 2020 ICASSP SRM Temporal Timing gradient MLP MNIST
[115] 2020 CVPR IF Rate ANN pre-training VGG-16; ResNet-20/34 CIFAR-10; CIFAR-100; ImageNet
[116] 2020 ICLR IF Rate ANN pre-training VGG-16; ResNet-20/34 CIFAR-10; CIFAR-100; ImageNet
[117] 2021 IEEE T-NNLS IF Direct ANN-coupled training CifarNet; AlexNet CIFAR-10; ImageNet
[17] 2021 IJCAI LIF Rate Activation gradient CNN MNIST; Fashion-MNIST
[84] 2021 ICCV LIF Direct Activation gradient CNN CIFAR10; CIFAR10-DVS; DVS128 Gesture
[118] 2021 AAAI IF Temporal Timing gradient VGG-16; GoogleNet MNIST; CIFAR-10; ImageNet
[119] 2021 ICLR IF Direct ANN pre-training VGG-16; ResNet-20 CIFAR-10; CIFAR-100; ImageNet
[120] 2021 ICML IF Direct ANN pre-training VGG-16; ResNet-20/34; RegNetX CIFAR-10; CIFAR-100; ImageNet
[121] 2021 IEEE T-NNLS IF Direct ANN pre-training ResNet-50/110 CIFAR-10; CIFAR-100; ImageNet
[122] 2021 IJCAI IF Direct ANN pre-training VGG-16; PreActResNet-18/34 MNIST; CIFAR-10; CIFAR-100
[123] 2021 Sci. Adv. LIF Rate Other bio-plau. algo. MLP MNIST; NETtalk; DVS128 Gesture
[19] 2021 NeurIPS LIF; IF Direct Activation gradient ResNet-18/34/50/101/152 ImageNet; DVS128 Gesture; CIFAR10-DVS
[20] 2021 arXiv LIF Direct Activation gradient ResNet-104/482 CIFAR10-DVS; ImageNet
[124] 2022 AAAI IF Direct ANN pre-training VGG-16; ResNet-18/20 CIFAR-10; CIFAR-100; ImageNet
[31] 2022 ICLR IF Direct ANN pre-training VGG-16; ResNet-18/20/34 CIFAR-10; CIFAR-100; ImageNet
[125] 2022 ICLR LIF Direct Activation gradient VGG-11;ResNet-19/34 CIFAR-100; ImageNet; CIFAR10-DVS
4.3. Bio-plausible rules
Driven by considerations of biological plausibility, bio-plausible rules pri-
marily focus on implementation via experimentally observed biological phe-
nomena such as STDP, coherent oscillation[126], and self-backpropagation[127].
We classify these rules into STDP-based, Widrow-Hoff (W-H) rule-based[128]
and other bio-plausible rules according to the underlying principles they are
derived from.
4.3.1. STDP-based rules
As mentioned in Section 2.3, STDP observed in biology experiments can
enable a neuron to learn visual features in an unsupervised way. Thus STDP-
based rules have been explored in many studies. Masquelier and Thorpe [26]
proposed an HMAX-based SNN trained with unsupervised STDP to extract
visual features from a temporal coded image. Many subsequent works were
based on this method. Tavanaei and Maida [129] incorporated probabilis-
tic STDP to improve performance. Kheradpisheh et al. [106] expanded this
method to perform robust invariant object recognition tasks. Unsupervised
STDP can extract repeated features; however, it has difficulty in detect-
ing rare but diagnostic features. To this end, Mozafari et al. [80] introduced reward signals into STDP, called reward-modulated STDP (R-STDP), to im-
prove feature extraction ability.
There are several ways to formalize supervised STDP. Wang et al. [100]
combined STDP and anti-STDP to implement supervised learning for the
output layer of a two-layer SNN. When a neuron emits spikes correctly, STDP
is applied, otherwise, anti-STDP is applied. Beyeler et al. [130] used super-
visory neurons to send excitatory signals to target output neurons, making
them spike at the desired firing rate. Illing et al. [131] implemented super-
vised learning via a target postsynaptic spike trace. Synaptic weights are updated at every presynaptic spike time: when the actual postsynaptic trace is lower than the target value, the corresponding weight increases; otherwise, it decreases.
Hao et al. [79] introduced dopamine-modulated STDP (DA-STDP) combined
with synaptic scaling to realize supervised learning.
In early work on digit recognition on MNIST using unsupervised STDP, Querlioz et al. [132, 133] proposed a single-layer network with lateral inhibition and a dynamic threshold, yielding an accuracy of 93.5% with
300 neurons. They used memristors as synapses. Although the rectangular
STDP time window was used to modulate synaptic weights in these works, memristive devices can easily implement the biological STDP learning rule[134, 135]. Diehl and Cook [50] increased the network scale and im-
proved its biological plausibility, yielding an accuracy of 95%. It was further
improved by Saunders et al. [48, 136] with local connections, resulting in
reduced parameters and training time. To take advantage of the feature ex-
traction capabilities of CNNs, Xu et al. [137] proposed deep CovDenseSNN
which uses spiking neurons to learn features extracted by CNNs.
Recent works adopted CNN-like neural architecture for SNNs. Kherad-
pisheh et al. [28] used STDP to train a three-layer spiking CNN layer-by-
layer. The complexity of learned features increases along the network hierar-
chy. A support vector machine (SVM) was used for feature classification and
achieved an accuracy of 98.6% on MNIST. Mozafari et al. [138] improved the
biological plausibility of this model by replacing the SVM with a layer of decision-
making neurons trained with R-STDP. Unlike previous works, SpiCNN [112]
used rate coding with Poisson distribution and an output layer trained with
supervised STDP.
4.3.2. W-H rule-based rules
The W-H rule[128] is a classical learning rule proposed for ANN neurons
defined as

$$\Delta w_i = \eta x_i (y_d - y_o) \quad (11)$$

where x_i is the input of the ith synapse, y_o and y_d are the actual and desired outputs of the neuron, respectively, and η is the learning rate. The W-H rule requires no gradient calculation, so a number of SNN learning rules present a spiking analogy to this rule.
ReSuMe[24] interprets the W-H rule through two biological processes:
STDP and anti-STDP, which is illustrated in Fig.9. It updates the synaptic
weights according to
$$\frac{dw_i(t)}{dt} = [S_d(t) - S_o(t)]\left[a + \int_0^{+\infty} T(s)\, S_i(t-s)\, ds\right] \quad (12)$$

where S_d(t) and S_o(t) are the desired and actual output spike trains, respectively, a is a constant used to speed up the convergence of learning, and T(s) is the STDP-like learning window. The convolution of the learning window and the ith input spike train, $\int_0^{+\infty} T(s)\, S_i(t-s)\, ds$, represents the trace of the spike train.
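A discrete-time sketch of Eq. (12) is given below, using an exponentially decaying input trace as the convolution with the learning window; the learning rate, constant a, and trace time constant are assumed values for illustration only.

```python
import numpy as np

def resume_update(w, s_in, s_des, s_out, lr=0.01, a=0.05, tau=20.0):
    """One pass of the ReSuMe rule, Eq. (12), in discrete time.

    w: (N,) weights; s_in: (T, N) input spike trains;
    s_des, s_out: (T,) desired and actual output spike trains.
    """
    T, N = s_in.shape
    trace = np.zeros(N)                     # convolution of inputs with the learning window
    w = w.copy()
    for t in range(T):
        trace = trace * np.exp(-1.0 / tau) + s_in[t]
        err = s_des[t] - s_out[t]           # +1: missed desired spike (STDP), -1: extra spike (anti-STDP)
        w += lr * err * (a + trace)
    return w

# Toy example: strengthen weights of recently active inputs when a desired spike was missed at t = 30
rng = np.random.default_rng(1)
s_in = (rng.random((50, 5)) < 0.2).astype(float)
s_des = np.zeros(50); s_des[30] = 1
s_out = np.zeros(50)
print(resume_update(np.full(5, 0.1), s_in, s_des, s_out))
```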
Unlike the spike-driven update of the synaptic weights in ReSuMe, the
perceptron-based spiking neuron learning rule (PBSNLR)[139] transforms
a spiking neuron to a perceptron at fixed points in time, where the mem-
brane potential in the misclassification intervals is utilized for synaptic up-
dating. Inspired by the biological property that the synaptic delay is not
Figure 9: Illustration of the ReSuMe rule[24]. The amount of modification of an excitatory synaptic weight is proportional to the trace induced by the convolution of input spikes and the learning window. The STDP process is triggered by the desired output spikes, while the anti-STDP process is triggered by the actual output spikes.
constant, Taherkhani et al. [140] proposed a synaptic delay learning rule,
delay-learning ReSuMe (DL-ReSuMe), which can improve the learning accu-
racy and convergence speed. To overcome the one-way adjustment problem
in DL-ReSuMe, Zhang et al. [141] proposed synaptic weight-delay plasticity
for ReSuMe (ReSuMe-DW) and PBSNLR (PBSNLR-DW) that can decrease
the delay to increase the membrane potential at desired spike times.
Inspired by ReSuMe, the chronotron I-learning[98] is a heuristic learning
rule, which adjusts the synaptic weights proportionally to their corresponding
synaptic currents. SPAN[25] uses convolving kernels to convert input, actual,
and desired spike trains into analog signals. Unlike SPAN, precise-spike-
driven (PSD) synaptic plasticity[99] only convolves input spike trains.
These rules make neurons fire spikes at desired times. By emitting differ-
ent spike trains, neurons can classify input spike patterns. However, they do
not extend well to deep networks which are vital for solving complex tasks.
4.3.3. Other bio-plausible rules
STDP modifies synaptic weights depending on the pre and postsynaptic
neuronal activity, which lacks global teaching signals for the whole network.
To this end, researchers use neural activity-encoded errors to drive synaptic
changes that can be backpropagated to upstream layers. Zhang et al. [123]
introduced self-backpropagation (SBP) into a three-layer SNN to reduce the
computational cost without affecting accuracy. Payeur et al. [142] proposed
burst-dependent synaptic plasticity to achieve hierarchical credit assignment
in multi-layer SNNs.
Xie et al. [143] found that STDP cannot be applied to the recurrent synaptic connections in PCNNs, which obstructs learning in PCNNs. Inspired by the neural activity-dependent property of STDP, they proposed the spike-
synchronization-dependent plasticity (SSDP) rule to improve the spike syn-
chronization. Experimental results showed that SSDP-based PCNNs achieve better segmentation performance. In addition, they designed a novel
memristor-based circuit model of SSDP.
4.4. Gradient-based rules
Application of the powerful gradient descent formalism to SNNs is complicated by the hard nonlinearity of the spike generation mechanism: small changes in synaptic weights can cause large changes in the output spike train, i.e. spike times or counts. Specifically, the gradient of the output spike train with respect to the synaptic weights, $\partial S(t)/\partial w$, is zero except at spike times, where it is ill-defined[29]. According to the insight used to circumvent this problem, gradient-based rules can be classified into three categories: direct
training, ANN pre-training, and ANN-coupled training.
4.4.1. Direct training
Direct training approaches can be categorized into likelihood, voltage,
timing, and activation gradient approaches according to the state variable
used for the optimization of the objective function.
Likelihood gradient. The threshold nonlinearity can be smoothed via stochas-
ticity which makes it possible to perform gradient descent to maximize the
likelihood of generating desired output spike trains. There are various meth-
ods to introduce stochasticity, such as a stochastic threshold[96] or stochastic synapses[144]. Pfister et al. [96] used the likelihood gradient to train single-layer networks with
temporal coding. This approach has recently been extended to multilayer
networks[145, 101, 144, 146]. However, it has not been applied to deep net-
works for complex tasks.
Voltage gradient. The voltage of a neuron at a given time instant is differentiable with respect to its synaptic weights under certain conditions or approximations, which facilitates the formulation of gradient-based rules.
Tempotron[23] learns to classify binary spike patterns by minimizing, on error trials, the distance between the firing threshold and the maximal membrane potential. It changes the synaptic weights according to

$$\Delta w_i = \eta\, (y - \hat{y}) \sum_{t_i^j < t_{max}} K\!\left(t_{max} - t_i^j\right) \quad (13)$$

where η is the learning rate, t_max denotes the time of the maximal membrane potential value, t_i^j denotes the jth spike time of the ith synapse, and K(t) is the normalized postsynaptic potential (PSP), which is a double exponential. y ∈ {0, 1} is the label of the input pattern, and ŷ denotes the prediction of the neuron, which is determined by

$$\hat{y} = \begin{cases} 1, & V(t_{max}) \ge V_{th} \\ 0, & V(t_{max}) < V_{th} \end{cases} \quad (14)$$
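The sketch below illustrates one tempotron update following Eqs. (13)-(14): the membrane potential is computed with a double-exponential PSP kernel, and on an error trial the weights of all afferents that spiked before t_max are nudged toward the correct decision. The kernel constants, learning rate, and initial weights are illustrative assumptions.

```python
import numpy as np

def psp_kernel(s, tau_m=15.0, tau_s=3.75):
    """Double-exponential PSP K(s), numerically peak-normalized to 1; zero for s < 0."""
    s = np.asarray(s, dtype=float)
    raw = lambda u: np.exp(-u / tau_m) - np.exp(-u / tau_s)
    peak = raw(np.linspace(0.01, 5 * tau_m, 1000)).max()
    return np.where(s > 0, raw(s) / peak, 0.0)

def tempotron_update(w, spike_times, label, v_th=1.0, lr=0.05, T=100.0, dt=1.0):
    """One tempotron step: apply Eq. (13) at the time of maximal potential on error trials."""
    ts = np.arange(0.0, T, dt)
    # V(t) = sum_i w_i sum_j K(t - t_i^j)
    v = sum(w_i * psp_kernel(ts[:, None] - np.asarray(times)).sum(axis=1)
            for w_i, times in zip(w, spike_times))
    t_max = ts[np.argmax(v)]
    y_hat = int(v.max() >= v_th)                      # Eq. (14): prediction
    if y_hat != label:                                # learn only on error trials
        for i, times in enumerate(spike_times):
            contrib = psp_kernel(t_max - np.asarray(times)).sum()
            w[i] += lr * (label - y_hat) * contrib    # Eq. (13)
    return w, y_hat

w = np.full(3, 0.3)
w, pred = tempotron_update(w, spike_times=[[10.0], [20.0, 40.0], [55.0]], label=1)
```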
The limitation of tempotron to binary classifications was overcome by
multi-spike tempotron (MST)[104] which solves multi-classification problems
by mapping input spike patterns to desired numbers of output spikes. MST
introduces the spike-threshold-surface (STS) function, which maps critical
thresholds to the numbers of emitted spikes, to formulate a continuous ob-
jective function differentiable with respect to neuron’s synaptic weights.
To reduce the computational costs of MST, Yu et al. [111] utilized the
linear assumption for threshold crossing[22] to derive the efficient threshold-
driven plasticity (TDP) algorithm. Subsequent works, such as efficient multispike learning (EML)[147] and joint weight-delay plasticity (TDP-DL)[148], further improve
MST from various perspectives. Unlike the rules in Section 4.3.2, these multi-
spike learning rules can train a neuron to emit a desired number of spikes
which is proportional to the number of underlying clues, without specifying
the spike times. Thus this type of learning is termed aggregate-label learning
[104]. In contrast to MST and TDP, membrane-potential driven aggregate-
label learning (MPD-AL) [149] constructs the error functions based on the
membrane potential instead of the critical thresholds.
The above-mentioned rules can’t be applied to multilayer networks. To fa-
cilitate image classification tasks via these rules, a common practice is encod-
ing static images into sparse spike representations, such as S1C1-SNN[150],
SCNN[151], CNN-TDP[152] and UMP-TDP[152]. This approach can achieve
comparable performance to deep SNNs on small-scale datasets, such as MNIST
and Fashion-MNIST.
Normalized approximate descent (NormAD)[103] was derived under the assumption of sparse spike trains, approximating the gradient of the membrane potential at a given time instant with respect to the synaptic weights.
This approximation method has been effectively applied to multilayer SNNs
with a weight-fixed convolutional layer[110], and spiking convolutional auto-
encoders which yield 99.08% accuracy on MNIST dataset[153]. Similarly,
Zhang et al. [95, 154] force the neuron to reset only at desired output spike
times[139, 155] thus enabling easy calculation of voltage gradients. Unlike
obtaining voltage gradient at specific times, Lee et al. [107] ignored discon-
tinuities of membrane potential at spike times and treated the output of a
neuron as a linear function of its inputs, which have been filtered by the membrane.
They trained a spiking CNN, achieving an accuracy of 99.31%.
Timing gradient. The timing gradient approaches focus on the neuron’s out-
put spike times and compute the gradients with respect to the neuron’s
synaptic weights. Due to the event-driven nature of spikes, these approaches
allow for efficient event-based network simulation.
SpikeProp[22] is the first algorithm applying backpropagation to temporally coded networks of spiking neurons firing at most one spike. Its formula is as follows:

$$\Delta w_{ij}^l = -\eta \frac{\partial L}{\partial t_j^l} \frac{\partial t_j^l}{\partial V_j^l(t_j^l)} \frac{\partial V_j^l(t_j^l)}{\partial w_{ij}^l} \quad (15)$$

where η is the learning rate, a positive constant, L is the loss function, and V_j^l and t_j^l denote the membrane potential and spike time of neuron j in layer l, respectively. w_{ij}^l is the synaptic weight from neuron i in layer l-1 to neuron j in layer l. The key challenge is to solve the partial derivative $\partial t_j^l / \partial V_j^l(t_j^l)$ in Eq. 15. SpikeProp assumes that the membrane potential of a neuron increases linearly in a small enough region around the firing time, as shown in Fig.10. Thus, $\partial t_j^l / \partial V_j^l(t_j^l)$ can be expressed as

$$\frac{\partial t_j^l}{\partial V_j^l(t_j^l)} = \frac{-1}{\partial V_j^l(t_j^l)/\partial t_j^l} = \frac{-1}{\sum_i w_{ij}^l\, \partial K(t_j^l - t_i^{l-1}) / \partial t_j^l} \quad (16)$$

where K(t) describes the PSP of neuron j generated by the input spike at t_i^{l-1}, which denotes the output spike time of neuron i in layer l-1.
Figure 10: Illustration of the linear approximation for threshold crossing[22]. It assumes that the membrane potential of a neuron increases linearly in a small enough region around the firing time. The solid line denotes the membrane potential at threshold crossing.
Multi-SpikeProp[97] overcomes the limitation of one spike in SpikeProp
and improves performance. Without approximation, Mostafa [108] used IF
neurons to obtain an analytical expression relating input and output spike
times. Comsa et al. [114, 156] used SRM with biologically realistic alpha
function for membrane dynamics and derived exact gradients with respect
to input spike times and synaptic weights. Kheradpisheh and Masquelier
[82] found that the first spike time of an IF neuron can approximate ReLU
activation; they thus derived a new learning algorithm for multilayer SNNs
with TTFS coding and achieved an accuracy of 97.4% on MNIST, which is
comparable to [108, 114].
Recent works extend the timing gradient approach to train deeper SNNs.
To overcome the gradient explosion and dead neuron problems encountered in timing gradient-based methods, Zhang et al. [157] proposed a rectified linear
postsynaptic potential function (ReL-PSP) based neuron model where mem-
brane potential increases linearly prior to postsynaptic spike time. They
trained convolutional SNNs, yielding 99.4% and 90.1% accuracy on MNIST
and Fashion-MNIST, respectively. Zhou et al. [118] applied the method
proposed by [108] to spiking VGG and GoogleNet with deep ANN training
techniques such as batch normalization (BN), achieving an accuracy of 68.6%
on ImageNet.
Activation gradient. Activation gradient approaches smooth the hard nonlinearity caused by the all-or-none response of spiking neurons via modification of their activation function in the forward or backward path.
The hard thresholds of spiking neurons can be replaced with soft ones in the forward propagation of SNNs. Huh and Sejnowski [109] introduced an active
zone where the synaptic current is activated gradually. They trained recur-
rent SNNs with backpropagation through time (BPTT) to perform dynamic
tasks such as predictive coding. Liu et al. [158] found that all PCNN models
can’t explain the phenomenon that biological neurons stimulated by periodic
signals exhibit chaotic behavior. They thus proposed a continuous-coupled
neural network (CCNN) model to solve the problem. CCNN is a mean-field
model where the step function used in PCNN is replaced by the sigmoid
function which facilitates standard BP for training. CCNN also achieves
better results in image segmentation tasks than state-of-the-art visual cortex
models. These methods modify the binary output of spiking neurons, which
is less friendly for low-power hardware implementation.
An alternative approach is to use surrogate gradients for backpropaga-
tion. SuperSpike[29] and STBP[30] were early works applying surrogate gradients to train multilayer SNNs. STBP uses the explicitly iterative LIF
model. An SNN with iterative LIF can be unrolled in time as an RNN, which
facilitates utilizing BPTT for the weight update. STBP updates the synaptic
weights as follows:

$$\Delta w_{ij}^l = -\eta \sum_t \frac{\partial L}{\partial S_j^{t,l}} \frac{\partial S_j^{t,l}}{\partial V_j^{t,l}} \frac{\partial V_j^{t,l}}{\partial I_j^{t,l}} \frac{\partial I_j^{t,l}}{\partial w_{ij}^l} \quad (17)$$

where η is the learning rate, L is the loss function, and $S_j^{t,l}$, $V_j^{t,l}$, and $I_j^{t,l}$ denote the output spike, the membrane potential, and the input current of neuron j at time t in layer l, respectively. $w_{ij}^l$ is the synaptic weight from neuron i in layer l-1 to neuron j in layer l. In Eq. 17, the derivative of the spike function $\partial S_j^{t,l} / \partial V_j^{t,l}$ is not well defined. To enable gradient descent, a surrogate derivative is used as an approximation. A common choice of surrogate gradient is the rectangular function[30] given by

$$h(V) = \frac{1}{a}\, \mathrm{sign}\!\left(|V - V_{th}| < \frac{a}{2}\right) \quad (18)$$

where a is a hyper-parameter determining the shape of the gradient estimation.
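In practice, the rectangular surrogate of Eq. (18) is typically implemented as a custom autograd function: the forward pass keeps the hard Heaviside spike, while the backward pass substitutes the rectangular window. A minimal PyTorch-style sketch is shown below, assuming PyTorch is available; the window width a and the LIF step details are arbitrary illustrative choices, not a specific published configuration.

```python
import torch

class RectSurrogateSpike(torch.autograd.Function):
    """Heaviside spike in the forward pass, rectangular surrogate (Eq. 18) in the backward pass."""
    a = 1.0  # window width of the rectangular surrogate

    @staticmethod
    def forward(ctx, v_minus_th):
        ctx.save_for_backward(v_minus_th)
        return (v_minus_th >= 0).float()          # hard spike S = Theta(V - V_th)

    @staticmethod
    def backward(ctx, grad_output):
        (v_minus_th,) = ctx.saved_tensors
        # dS/dV ~ (1/a) * 1{|V - V_th| < a/2}
        surrogate = (v_minus_th.abs() < RectSurrogateSpike.a / 2).float() / RectSurrogateSpike.a
        return grad_output * surrogate

def lif_step(v, x, tau=2.0, v_th=1.0):
    """One iterative LIF step usable inside BPTT: decay, integrate, spike, soft reset."""
    v = v + (x - v) / tau
    s = RectSurrogateSpike.apply(v - v_th)
    v = v - s * v_th                              # soft reset by subtraction
    return v, s

# Toy check: gradients flow through the spike despite the hard threshold
x = torch.randn(4, 8, requires_grad=True)
v = torch.zeros(4, 8)
v, s = lif_step(v, x)
s.sum().backward()
print(x.grad.abs().sum())
```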
Shrestha and Orchard [159] considered the temporal dependence between
spikes and interpreted the surrogate gradient as the probability density func-
tion. Wu et al. [85] extended STBP to deep convolutional SNNs with neuron
normalization (NeuNorm). Zheng et al. [160] proposed threshold-dependent
batch normalization (tdBN) for alleviating the gradient vanishing or explo-
sion problem and maintaining the firing rate. Combined with STBP, tdBN re-
alizes direct training of spiking ResNet-50 on ImageNet. As a time-dependent
variant of tdBN, temporal effective batch normalization (TEBN) [161] can
regularize the temporal distribution. To overcome the degradation problem
in deep SNNs, Fang et al. [19] proposed SEW ResNet, which can realize identity mapping. Feng et al. [162] proposed a multi-level firing (MLF) unit to
combat the gradient vanishing problem. Hu et al. [20] scaled SNNs up to
104 layers on ImageNet and 482 layers on CIFAR-10, achieving 76.02% and
91.9% accuracy, respectively. The above studies have fully demonstrated the
scalability of the surrogate gradient method.
The shape and smoothness of surrogate gradients have an impact on per-
formance. To avoid the heuristic choice of surrogate gradients, Li et al. [163]
proposed differentiable spike (Dspike) that can find the optimal shape and
smoothness for surrogate gradients. Deng et al. [125] proposed temporal effi-
cient training (TET) to solve the SNN generalization problem which results
from incorrect surrogate gradients, obtaining a remarkable accuracy of 83%
on CIFAR10-DVS.
Neuronal heterogeneity is a critical property of biological neural networks that can be incorporated into SNNs. Fang et al. [84] proposed a
parametric LIF (PLIF) model which has learnable time constants. [164] pro-
posed a learnable threshold scheme during training. Yao et al. [165] proposed
a unified gated LIF (GLIF) neuron, wherein bio-features in different neural
behaviors are fused by the gating factor. These methods can improve the
learning of SNNs, thus obtaining better accuracy with fewer time steps.
Combined with BPTT, the surrogate gradient approach can achieve spa-
tiotemporal credit assignment in very deep SNNs[20]. However, the large
computational graph from the unrolled networks requires tremendous hard-
ware resources during training. Local learning schemes provide an alternative way to assign spatiotemporal credit in SNNs. Deep continuous local learn-
ing (DECOLLE) [166] uses layer-wise local readouts with random and fixed
weights. Ma et al. [167] proposed a local learning scheme, wherein each layer
is trained with a local auxiliary classifier. Inspired by the teacher-student
learning approach, Yang et al. [168] proposed local tandem learning (LTL)
to transfer the feature representation of a teacher ANN to a student SNN
through a layer-wise loss function. These local learning rules provide competitive performance while incurring lower computational costs than BPTT.
4.4.2. ANN pre-training
Leveraging the fact that the firing rate of IF models can approximate
the activation value of ReLU functions, ANN pre-training approaches first
train an ANN with constraints, then convert it into an SNN with or without
post-conversion techniques that fine-tune the parameters.
The first ANN pre-training method was proposed by Cao et al. [27] which
imposes constraints on the CNN, such as using the ReLU activation function and
removing biases to avoid negative outputs. Diehl et al. [102] suggested us-
ing weight normalization or threshold balancing to reduce conversion error,
which are mathematically equivalent[113]. Rueckauer et al. [105, 83]
proposed spike subtraction, also called soft reset, to alleviate information loss
caused by resetting and implemented spiking equivalents of common opera-
tions such as BN which allow conversion of deeper CNNs including VGG-16
and GoogLeNet Inception-V3. Sengupta et al. [113] argued that removing
the constraints in ANN training[83] suffers significant accuracy loss in the
conversion process and proposed spike-norm to extend converted SNNs to
residual architectures. RMP-SNN[115] used soft reset and a threshold bal-
ancing method that alleviates the firing rate vanishing problem. Hu et al.
[121] proposed a compensation mechanism to reduce discretization errors and
firstly built a converted SNN with more than 100 layers. Ding et al. [122]
replaced ReLU with a rate norm layer (RNL) for ANN training, enabling direct
conversion without setting thresholds manually.
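A minimal sketch of the basic conversion recipe (copy the weights of a ReLU network, then set each spiking layer's threshold to the maximum pre-activation observed on calibration data, i.e. threshold balancing, and run IF neurons with soft reset) is shown below. It is a simplified illustration of the general approach rather than any specific published pipeline; layer sizes, data, and the scaling of spike amplitudes are placeholder assumptions.

```python
import numpy as np

def convert_and_run_snn(weights, x_calib, x_test, T=100):
    """Convert a ReLU MLP (list of weight matrices) into a rate-coded SNN and run it.

    Thresholds are balanced to the maximum pre-activation seen on calibration data;
    IF neurons with soft reset are then simulated for T steps, and the output firing
    rates approximate the ReLU activations of the source ANN.
    """
    # 1) Threshold balancing: record the maximum pre-activation of each layer
    thresholds, a = [], x_calib
    for W in weights:
        z = a @ W
        thresholds.append(z.max())
        a = np.maximum(z, 0.0)                    # ReLU activation of the source ANN

    # 2) Simulate IF neurons with soft reset (spike subtraction)
    v = [np.zeros((x_test.shape[0], W.shape[1])) for W in weights]
    counts = np.zeros_like(v[-1])
    for _ in range(T):
        inp = x_test                              # constant-current (direct) input coding
        for l, W in enumerate(weights):
            v[l] += inp @ W
            spikes = (v[l] >= thresholds[l]).astype(float)
            v[l] -= spikes * thresholds[l]        # soft reset by subtraction
            inp = spikes * thresholds[l]          # threshold-scaled spikes feed the next layer
        counts += spikes
    return counts / T                             # output firing rates

rng = np.random.default_rng(0)
weights = [rng.normal(0, 0.3, (16, 32)), rng.normal(0, 0.3, (32, 4))]
rates = convert_and_run_snn(weights, rng.random((64, 16)), rng.random((8, 16)))
```

The longer the simulation window T, the closer the firing rates approach the ANN activations, which is exactly the accuracy-versus-latency trade-off the recent conversion works discussed next try to shorten.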
In recent works, most studies focus on a theoretical analysis of conversion
errors, which facilitates methods that reduce them and thus the inference latency of the converted SNNs. Deng and Gu [119] added a threshold and shift to the ReLU
activation function to reduce conversion error. Li et al. [120] proposed SNN
calibration which calibrates the parameters in converted SNN to match the
activations in ANN. Bu et al. [124] used optimal initialization of membrane
potentials to implement expected error-free conversion. Bu et al. [31] went
deeper into error analysis and proposed a quantization clip-floor-shift activa-
tion function to replace the ReLU, achieving ultra-low latency (4 time steps)
of converted SNNs.
There are also some interesting works that explore post-conversion fine-
tuning of converted SNNs to reduce latency and increase accuracy. Rathi
et al. [116] proposed a spike-timing-dependent surrogate function to train
a converted SNN which converges within a few epochs and requires fewer
time steps. Rathi and Roy [169] jointly optimized membrane leak and fir-
ing threshold along with synaptic weights to reduce latency and increase
activation sparsity. Wu et al. [21] proposed a progressive tandem learning
(PTL) framework that compensates the conversion errors layer-wise by tan-
dem learning (TL)[117] with an adaptive training scheduler.
The ANN pre-training approach leverages the superior performance of ANNs while avoiding the tremendous hardware overhead of directly training SNNs, and thus enables quicker implementation on low-power neuromorphic hardware. However, this approach so far only works for static datasets and cannot exploit the temporal dynamics of SNNs, which is what makes them the third generation of neural networks[170].
4.4.3. ANN-coupled training
Unlike the ANN pre-training approach, which copies the weights from a well-trained ANN, ANN-coupled training consists of an SNN and an ANN with shared weights. The SNN feeds forward spike trains, while the ANN backpropagates errors to update the shared weights. Wu et al. [117] first proposed this idea, termed tandem learning, in which the networks are coupled layer-wise. They demonstrated its effectiveness on both static and neuromorphic datasets. Kheradpisheh et al. [171] used a proxy ANN to backpropagate the errors of an SNN on the basis of rate-coded IF neurons approximating ReLU. They outperformed [117] on the CIFAR-10 dataset with an accuracy
of 93.11%.
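The sketch below illustrates the weight-sharing idea in a simplified, straight-through style: the forward value of each layer comes from a rate-based spiking branch, while gradients flow through a ReLU proxy with the same weights. This is an assumed minimal illustration of ANN-coupled training, not the exact tandem learning algorithm of [117]; layer sizes, T, and the loss are placeholders.

```python
import torch
import torch.nn as nn

class CoupledLayer(nn.Module):
    """One layer trained in an ANN-coupled fashion: the forward value comes from the
    spiking (rate) branch, while gradients flow through the ReLU proxy sharing its weights."""
    def __init__(self, n_in, n_out, T=10, v_th=1.0):
        super().__init__()
        self.fc, self.T, self.v_th = nn.Linear(n_in, n_out), T, v_th

    def snn_rate(self, x):
        """IF neurons with soft reset driven by a constant current; returns firing rates."""
        i = self.fc(x)
        v = torch.zeros_like(i)
        count = torch.zeros_like(i)
        for _ in range(self.T):
            v = v + i
            s = (v >= self.v_th).float()
            v = v - s * self.v_th
            count = count + s
        return count / self.T

    def forward(self, x):
        proxy = torch.relu(self.fc(x))                 # ANN branch (differentiable)
        with torch.no_grad():
            rate = self.snn_rate(x)                    # SNN branch (non-differentiable)
        # Forward value = SNN rate; backward gradient = ANN proxy gradient
        return proxy + (rate - proxy).detach()

net = nn.Sequential(CoupledLayer(784, 128), CoupledLayer(128, 10))
x, y = torch.rand(32, 784), torch.randint(0, 10, (32,))
loss = nn.functional.cross_entropy(net(x), y)
loss.backward()   # updates the shared weights through the ANN path
```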
4.5. Performance Comparison
So far, the majority of works on SNNs have used image classification
datasets as benchmarks. To shed more light on the performance of SNN
learning rules, we summarize the state-of-the-art results of existing methods
tested on two types of datasets, i.e. static and neuromorphic datasets, which
are presented in Tab.4 and 5, respectively.
The static datasets in Tab.4 include MNIST, CIFAR-10/100, and Im-
ageNet. Due to the simplicity of MNIST, the recent accuracies reported
on MNIST are very high (>99%). Therefore, CIFAR-10/100 and Ima-
geNet have become more popular to evaluate deep SNNs. Recent results
reported on these datasets are dominated by ANN pre-training[31, 120] and
activation gradient approaches[20, 125]. Though ANN pre-training methods achieve higher accuracy than activation gradient-based methods, they need more time steps for inference.
Concerning the neuromorphic datasets, we include N-MNIST, CIFAR10-
DVS, and DVS128 Gesture in Tab.5. Similar to MNIST, the accuracies
reported on N-MNIST are above 99% in recent publications. The CIFAR10-
DVS is a challenging dataset. TEBN [161] reported the best accuracy of
84.90% on CIFAR10-DVS with neuromorphic data augmentation[172]. No-
tably, almost all methods are based on activation gradient since they can
exploit the temporal dynamics of the neuromorphic datasets.
Table 4: State-of-the-art results on the static datasets, including MNIST, CIFAR-10/100, and ImageNet. '-' denotes that the number of time steps is not reported in the paper or is not applicable to the method. T denotes the number of time steps. '*' denotes local learning rules.
Dataset Reference Year Learning Rule Accuracy T
MNIST
Diehl’s method [50] 2015 STDP-based 95.00
Lee’s method [107] 2016 Voltage gradient 98.77
SDNN [28] 2018 STDP-based 98.40
STBP [30] 2018 Activation gradient 99.42
Zhou’s method [118] 2021 Timing gradient 99.33
RNL [122] 2021 ANN pre-training 99.46
PLIF [84] 2021 Activation gradient 99.72 8
STDBP [157] 2022 Timing gradient 99.40
CIFAR-10
NeuNorm [85] 2019 Activation gradient 90.53 12
RMP-SNN [115] 2020 ANN pre-training 93.39 512
TL [117] 2021 ANN-coupled training 90.98 8
tdBN [160] 2021 Activation gradient 93.16 6
Zhou’s method [118] 2021 Timing gradient 92.68
PLIF [84] 2021 Activation gradient 93.50 8
Dspike [89] 2021 Activation gradient 94.25 6
Calibration [120] 2021 ANN pre-training 95.79 256
Proxy [171] 2022 ANN-coupled training 93.11 60
TET [125] 2022 Activation gradient 94.50 6
QCFS [31] 2022 ANN pre-training 96.08 32
MLF [162] 2022 Activation gradient 94.25 4
TEBN [161] 2022 Activation gradient 95.60 6
LTL [168] 2022 Activation gradient* 95.28 32
CIFAR-100
RMP-SNN [115] 2020 ANN pre-training 70.58 1024
DIET-SNN [169] 2021 ANN pre-training 69.67 5
S-ResNet [121] 2021 ANN pre-training 70.62 350
RNL [122] 2021 ANN pre-training 75.10
Calibration [120] 2021 ANN pre-training 77.30 128
TET [125] 2022 Activation gradient 74.72 6
QCFS [31] 2022 ANN pre-training 79.62 32
TEBN [161] 2022 Activation gradient 78.76 6
LTL [168] 2022 Activation gradient* 76.08 32
ImageNet
RMP-SNN [115] 2020 ANN pre-training 73.09 2048
S-ResNet [121] 2021 ANN pre-training 73.77 350
tdBN [160] 2021 Activation gradient 67.05 6
Zhou’s method [118] 2021 Timing gradient 68.80
SEW-ResNet [19] 2021 Activation gradient 69.26 4
MS-ResNet [20] 2021 Activation gradient 76.02 5
Calibration [120] 2021 ANN pre-training 77.50 256
TET [125] 2022 Activation gradient 68.00 4
QCFS [31] 2022 ANN pre-training 74.22 256
TEBN [161] 2022 Activation gradient 68.28 4
GLIF [165] 2022 Activation gradient 60.09 6
5. Applications of SNNs
SNNs have potential advantages in representing and processing spatiotem-
poral patterns due to their inherent temporal dynamics. Thus, a vast body
of studies has explored the applications of SNNs in spatiotemporal tasks, such as
Table 5: State-of-the-art results on the neuromorphic datasets, including N-MNIST,
CIFAR10-DVS, and DVS128 Gesture. T denotes the number of time steps; an empty
entry in the T column means the number of time steps is not reported in the paper.
† denotes results obtained with neuromorphic data augmentation.
Dataset Reference Year Learning Rule Accuracy (%) T
N-MNIST
Lee’s Method [107] 2018 Voltage gradient 98.74
STBP [30] 2018 Activation gradient 98.78
SLAYER [159] 2018 Activation gradient 99.20
NeuNorm [85] 2019 Activation gradient 99.53
TL [117] 2021 ANN-coupled training 99.31
PLIF [84] 2021 Activation gradient 99.61 10
LTMD [164] 2022 Activation gradient 99.65 15
CIFAR10-DVS
NeuNorm [85] 2019 Activation gradient 60.50
TL [117] 2021 ANN-coupled training 65.59
tdBN [160] 2021 Activation gradient 67.80 10
PLIF [84] 2021 Activation gradient 74.80 20
Dspike [89] 2021 Activation gradient 75.40 10
SEW-ResNet [19] 2021 Activation gradient 74.40 16
TET [125] 2022 Activation gradient 77.33 / 83.17† 10
TEBN [161] 2022 Activation gradient 84.90† 10
GLIF [165] 2022 Activation gradient 78.10 16
DVS128 Gesture
SLAYER [159] 2018 Activation gradient 93.64
tdBN [160] 2021 Activation gradient 96.87 40
PLIF [84] 2021 Activation gradient 97.57 20
SEW-ResNet [19] 2021 Activation gradient 97.92 16
MLF[162] 2022 Activation gradient 97.29 40
event-based vision and audio signal processing. In this section, we introduce
the applications of SNNs in these two fields.
Compared to conventional frame cameras, brain-inspired event cameras
have several advantages, such as low power consumption and high temporal
resolution [9]. SNNs are well-suited for processing the spatiotemporal data
generated by event cameras due to their temporal dynamics. Orchard et al.
[173] proposed an SNN for visual motion estimation, in which LIF neurons
with synaptic delays implement motion-sensitive receptive fields. Haessig
et al. [174] proposed an SNN-based direction-sensitive (DS) unit for optical
flow estimation. Paredes-Valles et al. [10] incorporated STDP learning into
a hierarchical SNN that exhibits motion selectivity after training. Beyond
these low-level vision applications, SNNs can also perform high-level vision
tasks, such as recognition. The HMAX-inspired HFirst model [173] utilizes
the spike timing provided by event cameras for character recognition. Xiao
et al. [175] improved HFirst by integrating a tempotron classifier. Recent
works [159, 84, 30, 117] apply spike-based backpropagation to train deep
SNNs, obtaining high accuracy on complex neuromorphic datasets created by
event sensors [91, 92].
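As a small illustration of how such event-camera data can be prepared for a deep SNN, the sketch below bins a raw event stream (timestamps, pixel coordinates, and polarities) into a fixed number of two-channel count frames. The sensor resolution, number of time bins, and count-based accumulation are illustrative assumptions and do not reproduce any specific pipeline from the works above.

```python
import numpy as np

def events_to_frames(t, x, y, p, T=10, height=128, width=128):
    """t: timestamps, x/y: pixel coordinates, p: polarity in {0, 1}; all 1-D arrays."""
    frames = np.zeros((T, 2, height, width), dtype=np.float32)
    # Assign each event to one of T equal-duration time bins.
    duration = t.max() - t.min() + 1e-9
    bins = np.clip(((t - t.min()) / duration * T).astype(int), 0, T - 1)
    # Accumulate event counts per bin, polarity channel, and pixel.
    np.add.at(frames, (bins, p.astype(int), y.astype(int), x.astype(int)), 1.0)
    return frames   # shape (T, 2, H, W), one input frame per SNN time step
```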
SNN-based acoustic models have shown great potential for energy-efficient
and high-performance auditory information processing tasks, such as auto-
matic sound classification (ASC) [176, 177, 178], automatic speech recogni-
tion (ASR) [179], and sound source localization (SSL) [180]. Several works
[176, 177, 178] have successfully applied tempotron-based learning to ASC
tasks. In these works, conventional feature extraction methods, such as
mel-frequency cepstral coefficients (MFCC) and self-organizing maps (SOM),
are used to generate spike patterns, which are then classified by SNNs trained
with the tempotron-based learning rule. Tavanaei and Maida [181, 182] applied
STDP to feature extraction from raw speech signals. The extracted spike
features are then post-processed into real-valued feature vectors and classified
by traditional classifiers, such as SVMs. The biological plausibility of these
models is further improved by [183], wherein a fully SNN-based ASC framework
is presented by combining competitive STDP learning and tempotron-based
classification. Recent studies apply deep SNNs to auditory tasks. Pan
et al. [180] utilized surrogate gradient learning for SSL tasks. Deep SNNs
trained with the tandem learning rule have been explored for speech separation
[21] and large-vocabulary ASR [179].
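For illustration, the sketch below shows a generic latency encoding that turns a normalized acoustic feature vector (e.g., MFCC coefficients) into one spike time per input neuron, with stronger features firing earlier. It is a simplified stand-in for the MFCC/SOM front ends used in [176, 177, 178]; the encoding window and normalization are assumptions.

```python
import numpy as np

def latency_encode(features, t_max=100.0):
    """Map each feature value in [0, 1] to a single spike time in [0, t_max] ms."""
    features = np.clip(features, 0.0, 1.0)
    return (1.0 - features) * t_max       # larger feature value -> earlier spike

mfcc_frame = np.random.rand(13)            # 13 hypothetical MFCC coefficients for one frame
spike_times = latency_encode(mfcc_frame)   # input spike pattern for a tempotron-style classifier
```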
6. Conclusion and future research directions
We presented a comprehensive survey of SNN learning rules, in which
we reviewed the most representative learning rules in SNNs and provided
discussions on their characteristics, advantages, limitations, and performance
on several popular datasets. In addition, we introduced practical applications of
SNNs in event-based vision and audio signal processing. Here, we further
discuss a few challenges and promising research directions which may foster
real-world applications in the field.
1. Learn lessons from deep learning. As this survey shows, the perfor-
mance of SNNs has improved substantially by utilizing deep learning tech-
niques, such as BPTT and BN. Even though there is still a gap between
ANNs and SNNs in terms of accuracy, deep learning techniques have great
potential to further improve the performance of SNNs. On the one hand,
the unique characteristics of SNNs, which may increase their performance,
can be explored via deep learning techniques. For example, recent works
show that neural architecture search (NAS) [184] can be exploited to find
better SNN architectures [185, 186]. On the other hand, the temporal
dimension and spike representation of SNNs should be considered when
applying deep learning training techniques to SNNs. For example, training
deep SNNs using BPTT is constrained to working on dense tensors, limiting
training speed and efficiency [187], and standard BN in SNNs does not scale
well to large-scale datasets [188].
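As a concrete instance of the activation gradient idea referred to above, the following minimal PyTorch sketch defines a spike function whose forward pass is the non-differentiable threshold operation and whose backward pass uses a surrogate derivative. The rectangular surrogate and its width are illustrative choices rather than a specific method from the surveyed works.

```python
import torch

class SurrogateSpike(torch.autograd.Function):
    @staticmethod
    def forward(ctx, v):
        ctx.save_for_backward(v)
        return (v >= 0.0).float()             # spike if the membrane potential exceeds threshold

    @staticmethod
    def backward(ctx, grad_output):
        (v,) = ctx.saved_tensors
        surrogate = (v.abs() < 0.5).float()   # box-shaped approximation of d(spike)/dv
        return grad_output * surrogate

spike_fn = SurrogateSpike.apply   # can be unrolled over T time steps and trained with BPTT
```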
2. Draw inspiration from brain science. Biological observations and
mechanisms of the human brain are natural references for creating intelli-
gence. For example, burst-dependent plasticity can solve the credit assign-
ment problem in hierarchical networks [142]. Dendritic spines [189] can
facilitate weight optimization for pruning [190]. LIF neurons with lateral
interactions [17] have shown improved robustness to noisy inputs. However,
current SNNs mainly utilize single-neuron dynamics such as LIF, while
network-level dynamics, for instance, coherent oscillations [126] and chaos
[191], are less explored in SNNs [158]. Although PCNNs are more biologically
plausible [14] and their coupling mechanism has the potential for robust
pattern recognition [17, 158], the learning rules for them, as this survey
revealed, have not been well developed. We expect that exploring learning
rules in PCNNs may help us understand how the brain works and further
improve the robustness of existing methods.
3. Algorithm-hardware co-design for energy-efficient neuromorphic sys-
tems. In this methodology, design goals are achieved by exploiting the syn-
ergy between algorithms and hardware platforms. SNNs are spatiotemporal
networks that are not well-suited to simulation on conventional hardware,
so it is necessary to explore efficient implementations of SNNs and their
learning rules. On the one hand, spike-based computation in SNNs can be
exploited by low-power neuromorphic chips [192, 193]. On the other hand,
novel devices may enable more brain-like systems owing to their unique
merits. For instance, the memristor is nonlinear, non-volatile, and compatible
with CMOS technology [194]. It can emulate the complex nonlinear dynamics
of biological neurons and synapses [135, 195, 196, 134] and support learning
in SNNs [197, 198]. We expect this paradigm to gain increased popularity in
the near future.
Declaration of competing interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This study is supported by the Natural Science Foundation of Gansu
Province (No. 21JR7RA510) and the National Natural Science Foundation
of China (No. 61906174).
References
[1] D. Cox, T. Dean, Neural Networks and Neuroscience-Inspired Com-
puter Vision, Current Biology 24 (2014) R921–R929.
[2] F. Rosenblatt, The perceptron: A probabilistic model for information
storage and organization in the brain., Psychological Review 65 (1958)
386–408.
[3] Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning
applied to document recognition, Proceedings of the IEEE 86 (1998)
2278–2324.
[4] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N.
Gomez, Ł. Kaiser, I. Polosukhin, Attention is all you need, in:
Advances in Neural Information Processing Systems, volume 30, Cur-
ran Associates, Inc., 2017.
[5] Y. Bengio, Y. Lecun, G. Hinton, Deep learning for AI, Commun. ACM
64 (2021) 58–65.
[6] E. Strubell, A. Ganesh, A. McCallum, Energy and policy considera-
tions for deep learning in NLP, in: Proceedings of the 57th Annual
Meeting of the Association for Computational Linguistics, Association
for Computational Linguistics, Florence, Italy, 2019, pp. 3645–3650.
[7] A. Chakraborty, M. Alam, V. Dey, A. Chattopadhyay, D. Mukhopad-
hyay, A survey on adversarial attacks and defences, CAAI Transactions
on Intelligence Technology 6 (2021) 25–45.
[8] L. Zhu, S. Dong, J. Li, T. Huang, Y. Tian, Ultra-high Temporal Resolu-
tion Visual Reconstruction from a Fovea-like Spike Camera via Spiking
Neuron Model, IEEE Transactions on Pattern Analysis and Machine
Intelligence (2022) 1–1.
[9] G. Gallego, T. Delbruck, G. Orchard, C. Bartolozzi, B. Taba, A. Censi,
S. Leutenegger, A. J. Davison, J. Conradt, K. Daniilidis, D. Scara-
muzza, Event-Based Vision: A Survey, IEEE Transactions on Pattern
Analysis and Machine Intelligence 44 (2022) 154–180.
[10] F. Paredes-Valles, K. Y. W. Scheper, G. C. H. E. de Croon, Unsuper-
vised Learning of a Hierarchical Spiking Neural Network for Optical
Flow Estimation: From Events to Global Motion Perception, IEEE
Transactions on Pattern Analysis and Machine Intelligence 42 (2020)
2051–2064.
[11] A. L. Hodgkin, A. F. Huxley, A quantitative description of membrane
current and its application to conduction and excitation in nerve, The
Journal of Physiology 117 (1952) 500–544.
[12] E. Izhikevich, Simple model of spiking neurons, IEEE Transactions on
Neural Networks 14 (2003) 1569–1572.
[13] R. VanRullen, R. Guyonneau, S. J. Thorpe, Spike times make sense,
Trends in Neurosciences 28 (2005) 1–4.
[14] R. Eckhorn, H. J. Reitboeck, M. Arndt, P. Dicke, Feature Linking via
Synchronization among Distributed Assemblies: Simulations of Results
from Cat Visual Cortex, Neural Computation 2 (1990) 293–307.
[15] N. K. Kasabov, Neucube: A spiking neural network architecture for
mapping, learning and understanding of spatio-temporal brain data,
Neural Networks 52 (2014) 62–76.
[16] Z. Yu, J. K. Liu, S. Jia, Y. Zhang, Y. Zheng, Y. Tian, T. Huang,
Toward the next generation of retinal neuroprosthesis: Visual compu-
tation with spikes, Engineering 6 (2020) 449–461.
[17] X. Cheng, Y. Hao, J. Xu, B. Xu, LISNN: Improving Spiking Neu-
ral Networks with Lateral Interactions for Robust Object Recognition,
in: Proceedings of the Twenty-Ninth International Joint Conference
on Artificial Intelligence, International Joint Conferences on Artificial
Intelligence Organization, Yokohama, Japan, 2020, pp. 1519–1525.
[18] X. She, Y. Long, D. Kim, S. Mukhopadhyay, ScieNet: Deep learning
with spike-assisted contextual information extraction, Pattern Recog-
nition 118 (2021) 108002.
[19] W. Fang, Z. Yu, Y. Chen, T. Huang, T. Masquelier, Y. Tian, Deep
Residual Learning in Spiking Neural Networks, in: Advances in Neural
Information Processing Systems, volume 34, Curran Associates, Inc.,
2021, pp. 21056–21069.
[20] Y. Hu, Y. Wu, L. Deng, G. Li, Advancing Residual Learning towards
Powerful Deep Spiking Neural Networks, 2021. ArXiv:2112.08954 [cs].
[21] J. Wu, C. Xu, X. Han, D. Zhou, M. Zhang, H. Li, K. C. Tan, Pro-
gressive Tandem Learning for Pattern Recognition with Deep Spiking
Neural Networks, IEEE Transactions on Pattern Analysis and Machine
Intelligence (2021) 1–1.
[22] S. M. Bohte, J. N. Kok, H. La Poutre, Error-backpropagation in tempo-
rally encoded networks of spiking neurons, Neurocomputing 48 (2002)
17–37.
[23] R. Gütig, H. Sompolinsky, The tempotron: a neuron that learns spike
timing–based decisions, Nature Neuroscience 9 (2006) 420–428.
[24] F. Ponulak, A. Kasiński, Supervised Learning in Spiking Neural Net-
works with ReSuMe: Sequence Learning, Classification, and Spike
Shifting, Neural Computation 22 (2010) 467–510.
[25] A. Mohemmed, S. Schliebs, S. Matsuda, N. Kasabov, Span: Spike
pattern association neuron for learning spatio-temporal spike patterns,
International journal of neural systems 22 (2012) 1250012.
[26] T. Masquelier, S. J. Thorpe, Unsupervised Learning of Visual Features
through Spike Timing Dependent Plasticity, PLoS Computational Bi-
ology 3 (2007) e31.
[27] Y. Cao, Y. Chen, D. Khosla, Spiking Deep Convolutional Neural Net-
works for Energy-Efficient Object Recognition, International Journal
of Computer Vision 113 (2015) 54–66.
[28] S. R. Kheradpisheh, M. Ganjtabesh, S. J. Thorpe, T. Masquelier,
STDP-based spiking deep convolutional neural networks for object
recognition, Neural Networks 99 (2018) 56–67.
[29] F. Zenke, S. Ganguli, SuperSpike: Supervised Learning in Multilayer
Spiking Neural Networks, Neural Computation 30 (2018) 1514–1541.
[30] Y. Wu, L. Deng, G. Li, J. Zhu, L. Shi, Spatio-Temporal Backpropaga-
tion for Training High-Performance Spiking Neural Networks, Frontiers
in Neuroscience 12 (2018) 331.
[31] T. Bu, W. Fang, J. Ding, P. DAI, Z. Yu, T. Huang, Optimal ANN-SNN
Conversion for High-accuracy and Ultra-low-latency Spiking Neural
Networks, in: International Conference on Learning Representations,
2022.
[32] A. Tavanaei, M. Ghodrati, S. R. Kheradpisheh, T. Masquelier,
A. Maida, Deep learning in spiking neural networks, Neural Networks
111 (2019) 47–63.
[33] E. O. Neftci, H. Mostafa, F. Zenke, Surrogate Gradient Learning in
Spiking Neural Networks: Bringing the Power of Gradient-Based Op-
timization to Spiking Neural Networks, IEEE Signal Processing Mag-
azine 36 (2019) 51–63.
[34] H. Jang, O. Simeone, B. Gardner, A. Gruning, An Introduction to
Probabilistic Spiking Neural Networks: Probabilistic Models, Learning
Rules, and Applications, IEEE Signal Processing Magazine 36 (2019)
64–77.
[35] K. Roy, A. Jaiswal, P. Panda, Towards spike-based machine intelligence
with neuromorphic computing, Nature 575 (2019) 607–617.
[36] X. Wang, X. Lin, X. Dang, Supervised learning in spiking neural
networks: A review of algorithms and evaluations, Neural Networks
125 (2020) 258–280.
[37] A. Taherkhani, A. Belatreche, Y. Li, G. Cosma, L. P. Maguire,
T. McGinnity, A review of learning in biologically plausible spiking
neural networks, Neural Networks 122 (2020) 253–272.
[38] J. L. Lobo, J. Del Ser, A. Bifet, N. Kasabov, Spiking neural networks
and online learning: An overview and perspectives, Neural Networks
121 (2020) 88–100.
[39] A. Javanshir, T. T. Nguyen, M. A. P. Mahmud, A. Z. Kouzani, Ad-
vancements in Algorithms and Neuromorphic Hardware for Spiking
Neural Networks, Neural Computation 34 (2022) 1289–1328.
[40] D. Zhang, T. Zhang, S. Jia, Q. Wang, B. Xu, Recent Advances and New
Frontiers in Spiking Neural Networks, 2022. ArXiv:2204.07050 [cs].
[41] S. Wang, T. H. Cheng, M. H. Lim, A hierarchical taxonomic survey of
spiking neural networks, Memetic Computing 14 (2022) 335–354.
[42] J. Johnson, M. Padgett, PCNN models and applications, IEEE Trans-
actions on Neural Networks 10 (1999) 480–498.
[43] Z. Wang, Y. Ma, F. Cheng, L. Yang, Review of pulse-coupled neural
networks, Image and Vision Computing 28 (2010) 5–13.
[44] K. Zhan, J. Shi, H. Wang, Y. Xie, Q. Li, Computational Mechanisms of
Pulse-Coupled Neural Networks: A Comprehensive Review, Archives
of Computational Methods in Engineering 24 (2017) 573–588.
[45] L. Abbott, Lapicque’s introduction of the integrate-and-fire model
neuron (1907), Brain Research Bulletin 50 (1999) 303–304.
[46] W. Gerstner, W. M. Kistler, Spiking Neuron Models: Single Neurons,
Populations, Plasticity, 1 ed., Cambridge University Press, 2002.
[47] H. Fang, A. Shrestha, Z. Zhao, Q. Qiu, Exploiting Neuron and Synapse
Filter Dynamics in Spatial Temporal Learning of Deep Spiking Neural
Network, in: Proceedings of the Twenty-Ninth International Joint
Conference on Artificial Intelligence, International Joint Conferences
on Artificial Intelligence Organization, Yokohama, Japan, 2020, pp.
2799–2806.
[48] D. J. Saunders, D. Patel, H. Hazan, H. T. Siegelmann, R. Kozma, Lo-
cally connected spiking neural networks for unsupervised feature learn-
ing, Neural Networks 119 (2019) 332–340.
[49] W. Zhang, P. Li, Skip-Connected Self-Recurrent Spiking Neural Net-
works With Joint Intrinsic Parameter and Synaptic Weight Training,
Neural Computation 33 (2021) 1886–1913.
[50] P. U. Diehl, M. Cook, Unsupervised learning of digit recognition using
spike-timing-dependent plasticity, Frontiers in Computational Neuro-
science 9 (2015).
[51] K. Zhan, H. Zhang, Y. Ma, New spiking cortical model for invariant
texture retrieval and image processing, IEEE Transactions on Neural
Networks 20 (2009) 1980–1986.
[52] J. Lian, Z. Yang, J. Liu, W. Sun, L. Zheng, X. Du, Z. Yi, B. Shi,
Y. Ma, An Overview of Image Segmentation Based on Pulse-Coupled
Neural Network, Archives of Computational Methods in Engineering
28 (2021) 387–403.
[53] K. Zhan, J. Shi, Q. Li, J. Teng, M. Wang, Image segmentation using
fast linking SCM, in: 2015 International Joint Conference on Neural
Networks (IJCNN), IEEE, Killarney, 2015, pp. 1–8.
[54] Z. Wang, Y. Ma, J. Gu, Multi-focus image fusion using PCNN, Pattern
Recognition 43 (2010) 2003–2016.
[55] W. Huang, Z. Jing, Multi-focus image fusion using pulse coupled neural
network, Pattern Recognition Letters 28 (2007) 1123–1132.
[56] M. Li, W. Cai, Z. Tan, A region-based multi-sensor image fusion scheme
using pulse-coupled neural network, Pattern Recognition Letters 27
(2006) 1948–1956.
[57] J. Lian, J. Liu, Z. Yang, Y. Qi, H. Zhang, M. Zhang, Y. Ma, A
Pulse-Number-Adjustable MSPCNN and Its Image Enhancement Ap-
plication, IEEE Access 9 (2021) 161069–161086.
[58] K. Zhan, J. Shi, J. Teng, Q. Li, M. Wang, F. Lu, Linking synaptic
computation for image enhancement, Neurocomputing 238 (2017) 1–
12.
[59] K. Zhan, J. Teng, J. Shi, Q. Li, M. Wang, Feature-Linking Model for
Image Enhancement, Neural Computation 28 (2016) 1072–1100.
[60] K. Zhan, J. Teng, Y. Ma, Spiking cortical model for rotation and scale
invariant texture retrieval., J. Inf. Hiding Multim. Signal Process. 4
(2013) 155–165.
[61] H. Li, X. Jin, N. Yang, Z. Yang, The recognition of landed aircrafts
based on PCNN model and affine moment invariants, Pattern Recog-
nition Letters 51 (2015) 23–29.
[62] K. Waldemark, T. Lindblad, V. Bečanović, J. L. Guillen, P. L.
Klingner, Patterns from the sky, Pattern Recognition Letters 21 (2000)
227–237.
[63] X. Deng, Y. Ma, M. Dong, A new adaptive filtering method for re-
moving salt and pepper noise based on multilayered PCNN, Pattern
Recognition Letters 79 (2016) 8–17.
[64] M. Riesenhuber, T. Poggio, Hierarchical models of object recognition
in cortex, Nature Neuroscience 2 (1999) 1019–1025.
[65] T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, T. Poggio, Robust
Object Recognition with cortex-like Mechanisms, IEEE Transactions
on Pattern Analysis and Machine Intelligence 29 (2007) 411–426.
[66] J. Schmidhuber, Deep learning in neural networks: An overview, Neu-
ral Networks 61 (2015) 85–117.
[67] W. Gerstner, W. M. Kistler, R. Naud, L. Paninski, Neuronal Dynam-
ics: From Single Neurons to Networks and Models of Cognition, 1 ed.,
Cambridge University Press, 2014.
[68] D. Hebb, The Organization of Behavior, Psychology Press, 2005.
[69] C. J. Shatz, The Developing Brain, Scientific American 267 (1992)
60–67.
[70] T. V. P. Bliss, T. Lømo, Long-lasting potentiation of synaptic trans-
mission in the dentate area of the anaesthetized rabbit following stim-
ulation of the perforant path, The Journal of Physiology 232 (1973)
331–356.
[71] G.-q. Bi, M.-m. Poo, Synaptic Modifications in Cultured Hippocampal
Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsy-
naptic Cell Type, The Journal of Neuroscience 18 (1998) 10464–10472.
[72] W. Levy, O. Steward, Temporal contiguity requirements for long-term
associative potentiation/depression in the hippocampus, Neuroscience
8 (1983) 791–797.
[73] S. Song, K. D. Miller, L. F. Abbott, Competitive Hebbian learning
through spike-timing-dependent synaptic plasticity, Nature Neuro-
science 3 (2000) 919–926.
[74] R. Guyonneau, R. VanRullen, S. J. Thorpe, Neurons Tune to the
Earliest Spikes Through STDP, Neural Computation 17 (2005) 859–
879.
[75] P. Falez, P. Tirilly, I. M. Bilasco, P. Devienne, P. Boulet, Unsupervised
visual feature learning with spike-timing-dependent plasticity: How far
are we from traditional feature learning approaches?, Pattern Recog-
nition 93 (2019) 418–429.
[76] G. H. Seol, J. Ziburkus, S. Huang, L. Song, I. T. Kim, K. Takamiya,
R. L. Huganir, H.-K. Lee, A. Kirkwood, Neuromodulators control
the polarity of spike-timing-dependent synaptic plasticity, Neuron 55
(2007) 919–929.
[77] F. Nadim, D. Bucher, Neuromodulation of neurons and synapses, Cur-
rent Opinion in Neurobiology 29 (2014) 48–56.
[78] N. Frémaux, W. Gerstner, Neuromodulated Spike-Timing-Dependent
Plasticity, and Theory of Three-Factor Learning Rules, Frontiers in
Neural Circuits 9 (2016).
[79] Y. Hao, X. Huang, M. Dong, B. Xu, A biologically plausible supervised
learning method for spiking neural networks using the symmetric STDP
rule, Neural Networks 121 (2020) 387–395.
[80] M. Mozafari, S. R. Kheradpisheh, T. Masquelier, A. Nowzari-
Dalini, M. Ganjtabesh, First-Spike-Based Visual Categorization Using
Reward-Modulated STDP, IEEE Transactions on Neural Networks
and Learning Systems 29 (2018) 6178–6190.
[81] R. V. Rullen, S. J. Thorpe, Rate Coding Versus Temporal Order Cod-
ing: What the Retinal Ganglion Cells Tell the Visual Cortex, Neural
Computation 13 (2001) 1255–1283.
[82] S. R. Kheradpisheh, T. Masquelier, Temporal backpropagation for
spiking neural networks with one spike per neuron, International Jour-
nal of Neural Systems 30 (2020) 2050027.
[83] B. Rueckauer, I.-A. Lungu, Y. Hu, M. Pfeiffer, S.-C. Liu, Conversion of
Continuous-Valued Deep Networks to Efficient Event-Driven Networks
for Image Classification, Frontiers in Neuroscience 11 (2017) 682.
[84] W. Fang, Z. Yu, Y. Chen, T. Masquelier, T. Huang, Y. Tian, Incor-
porating Learnable Membrane Time Constant to Enhance Learning of
Spiking Neural Networks, in: 2021 IEEE/CVF International Confer-
ence on Computer Vision (ICCV), IEEE, Montreal, QC, Canada, 2021,
pp. 2641–2651.
[85] Y. Wu, L. Deng, G. Li, J. Zhu, Y. Xie, L. Shi, Direct Training for
Spiking Neural Networks: Faster, Larger, Better, Proceedings of the
AAAI Conference on Artificial Intelligence 33 (2019) 1311–1318.
[86] Y. Kim, H. Park, A. Moitra, A. Bhattacharjee, Y. Venkatesha,
P. Panda, Rate coding or direct coding: Which one is better for accu-
rate, robust, and energy-efficient spiking neural networks?, in: ICASSP
2022 - 2022 IEEE International Conference on Acoustics, Speech and
Signal Processing (ICASSP), 2022, pp. 71–75.
[87] A. Krizhevsky, V. Nair, G. Hinton, The CIFAR-10 dataset, online:
http://www.cs.toronto.edu/kriz/cifar.html 55 (2014).
[88] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, L. Fei-Fei, ImageNet:
A large-scale hierarchical image database, in: 2009 IEEE Conference
on Computer Vision and Pattern Recognition, IEEE, Miami, FL, 2009,
pp. 248–255.
[89] L. Deng, Y. Wu, X. Hu, L. Liang, Y. Ding, G. Li, G. Zhao, P. Li,
Y. Xie, Rethinking the performance comparison between SNNs and
ANNs, Neural Networks 121 (2020) 294–307.
[90] W. He, Y. Wu, L. Deng, G. Li, H. Wang, Y. Tian, W. Ding, W. Wang,
Y. Xie, Comparing SNNs and RNNs on neuromorphic vision datasets:
Similarities and differences, Neural Networks 132 (2020) 108–120.
[91] G. Orchard, A. Jayawant, G. K. Cohen, N. Thakor, Converting Static
Image Datasets to Spiking Neuromorphic Datasets Using Saccades,
Frontiers in Neuroscience 9 (2015).
[92] H. Li, H. Liu, X. Ji, G. Li, L. Shi, CIFAR10-DVS: An Event-Stream
Dataset for Object Classification, Frontiers in Neuroscience 11 (2017)
309.
[93] A. Amir, B. Taba, D. Berg, T. Melano, J. McKinstry, C. Di Nolfo,
T. Nayak, A. Andreopoulos, G. Garreau, M. Mendoza, J. Kusnitz,
M. Debole, S. Esser, T. Delbruck, M. Flickner, D. Modha, A Low
Power, Fully Event-Based Gesture Recognition System, in: 2017 IEEE
Conference on Computer Vision and Pattern Recognition (CVPR),
IEEE, Honolulu, HI, 2017, pp. 7388–7397.
[94] Y. Bi, A. Chadha, A. Abbas, E. Bourtsoulatze, Y. Andreopoulos,
Graph-Based Object Classification for Neuromorphic Vision Sensing,
in: 2019 IEEE/CVF International Conference on Computer Vision
(ICCV), IEEE, Seoul, Korea (South), 2019, pp. 491–501.
[95] M. Zhang, H. Qu, A. Belatreche, X. Xie, EMPD: An Efficient Mem-
brane Potential Driven Supervised Learning Algorithm for Spiking
Neurons, IEEE Transactions on Cognitive and Developmental Systems
10 (2018) 151–162.
[96] J.-P. Pfister, T. Toyoizumi, D. Barber, W. Gerstner, Optimal spike-
timing-dependent Plasticity for Precise Action Potential Firing in Su-
pervised Learning, Neural Computation 18 (2006) 1318–1348.
[97] S. Ghosh-Dastidar, H. Adeli, A new supervised learning algorithm
for multiple spiking neural networks with application in epilepsy and
seizure detection, Neural Networks 22 (2009) 1419–1431.
[98] R. V. Florian, The Chronotron: A Neuron That Learns to Fire Tem-
porally Precise Spike Patterns, PLoS ONE 7 (2012) e40233.
[99] Q. Yu, H. Tang, K. C. Tan, H. Li, Precise-Spike-Driven Synaptic Plas-
ticity: Learning Hetero-Association of Spatiotemporal Spike Patterns,
PLoS ONE 8 (2013) e78318.
[100] J. Wang, A. Belatreche, L. Maguire, T. M. McGinnity, An online
supervised learning method for spiking neural networks with adaptive
structure, Neurocomputing 144 (2014) 526–536.
[101] B. Gardner, I. Sporea, A. Grüning, Learning Spatiotemporally En-
coded Pattern Transformations in Structured Spiking Neural Networks,
Neural Computation 27 (2015) 2548–2586.
[102] P. U. Diehl, D. Neil, J. Binas, M. Cook, S.-C. Liu, M. Pfeiffer, Fast-
classifying, high-accuracy spiking deep networks through weight and
threshold balancing, in: 2015 International Joint Conference on Neural
Networks (IJCNN), IEEE, Killarney, Ireland, 2015, pp. 1–8.
[103] N. Anwani, B. Rajendran, NormAD - Normalized Approximate De-
scent based supervised learning rule for spiking neurons, in: 2015
International Joint Conference on Neural Networks (IJCNN), IEEE,
Killarney, Ireland, 2015, pp. 1–8.
[104] R. Gütig, Spiking neurons can discover predictive features by
aggregate-label learning, Science 351 (2016) aab4113.
[105] B. Rueckauer, I.-A. Lungu, Y. Hu, M. Pfeiffer, Theory and Tools for
the Conversion of Analog to Spiking Convolutional Neural Networks,
2016. ArXiv:1612.04052 [cs, stat].
[106] S. R. Kheradpisheh, M. Ganjtabesh, T. Masquelier, Bio-inspired un-
supervised learning of visual features leads to robust invariant object
recognition, Neurocomputing 205 (2016) 382–392.
[107] J. H. Lee, T. Delbruck, M. Pfeiffer, Training Deep Spiking Neural
Networks Using Backpropagation, Frontiers in Neuroscience 10 (2016).
[108] H. Mostafa, Supervised Learning Based on Temporal Coding in Spiking
Neural Networks, IEEE Transactions on Neural Networks and Learning
Systems (2017) 1–9.
[109] D. Huh, T. J. Sejnowski, Gradient Descent for Spiking Neural Net-
works, in: Advances in Neural Information Processing Systems
31 (NeurIPS 2018), volume 31, Curran Associates, Inc., Montréal,
Canada, 2018.
[110] S. R. Kulkarni, B. Rajendran, Spiking neural networks for handwrit-
ten digit recognition—Supervised learning and network optimization,
Neural Networks 103 (2018) 118–127.
[111] Q. Yu, H. Li, K. C. Tan, Spike Timing or Rate? Neurons Learn to
Make Decisions for Both Through Threshold-Driven Plasticity, IEEE
Transactions on Cybernetics 49 (2019) 2178–2189.
[112] C. Lee, G. Srinivasan, P. Panda, K. Roy, Deep spiking convolutional
neural network trained with unsupervised spike-timing-dependent plas-
ticity, IEEE Transactions on Cognitive and Developmental Systems 11
(2018) 384–394.
[113] A. Sengupta, Y. Ye, R. Wang, C. Liu, K. Roy, Going Deeper in Spik-
ing Neural Networks: VGG and Residual Architectures, Frontiers in
Neuroscience 13 (2019) 95.
[114] I. M. Comsa, K. Potempa, L. Versari, T. Fischbacher, A. Gesmundo,
J. Alakuijala, Temporal coding in spiking neural networks with alpha
synaptic function, in: ICASSP 2020-2020 IEEE International Con-
ference on Acoustics, Speech and Signal Processing (ICASSP), IEEE,
2020, pp. 8529–8533.
[115] B. Han, G. Srinivasan, K. Roy, RMP-SNN: Residual Membrane Po-
tential Neuron for Enabling Deeper High-Accuracy and Low-Latency
Spiking Neural Network, in: 2020 IEEE/CVF Conference on Computer
Vision and Pattern Recognition (CVPR), 2020, pp. 13555–13564.
[116] N. Rathi, G. Srinivasan, P. Panda, K. Roy, Enabling deep spiking
neural networks with hybrid conversion and spike timing dependent
backpropagation, in: International Conference on Learning Represen-
tations, 2020.
[117] J. Wu, Y. Chua, M. Zhang, G. Li, H. Li, K. C. Tan, A tandem learning
rule for effective training and rapid inference of deep spiking neural net-
works, IEEE Transactions on Neural Networks and Learning Systems
(2021) 1–15.
[118] S. Zhou, X. Li, Y. Chen, S. T. Chandrasekaran, A. Sanyal, Temporal-
Coded Deep Spiking Neural Network with Easy Training and Robust
Performance, Proceedings of the AAAI Conference on Artificial In-
telligence 35 (2021) 11143–11151.
[119] S. Deng, S. Gu, Optimal Conversion of Conventional Artificial Neural
Networks to Spiking Neural Networks, in: International Conference on
Learning Representations, 2021.
[120] Y. Li, S. Deng, X. Dong, R. Gong, S. Gu, A Free Lunch From ANN:
Towards Efficient, Accurate Spiking Neural Networks Calibration, in:
Proceedings of the 38th International Conference on Machine
Learning, volume 139 of Proceedings of Machine Learning Research,
PMLR, 2021, pp. 6316–6325.
[121] Y. Hu, H. Tang, G. Pan, Spiking Deep Residual Networks, IEEE
Transactions on Neural Networks and Learning Systems (2021) 1–6.
[122] J. Ding, Z. Yu, Y. Tian, T. Huang, Optimal ANN-SNN Conversion for
Fast and Accurate Inference in Deep Spiking Neural Networks, in: Pro-
ceedings of the Thirtieth International Joint Conference on Artificial
Intelligence, International Joint Conferences on Artificial Intelligence
Organization, Montreal, Canada, 2021, pp. 2328–2336.
[123] T. Zhang, X. Cheng, S. Jia, M.-m. Poo, Y. Zeng, B. Xu, Self-
backpropagation of synaptic modifications elevates the efficiency of
spiking and artificial neural networks, Science Advances 7 (2021)
eabh0146.
[124] T. Bu, J. Ding, Z. Yu, T. Huang, Optimized Potential Initialization
for Low-Latency Spiking Neural Networks, Proceedings of the AAAI
Conference on Artificial Intelligence 36 (2022) 11–20.
[125] S. Deng, Y. Li, S. Zhang, S. Gu, Temporal Efficient Training of Spiking
Neural Network via Gradient Re-weighting, in: International Confer-
ence on Learning Representations, 2022.
[126] R. Eckhorn, R. Bauer, W. Jordan, M. Brosch, W. Kruse, M. Munk,
H. J. Reitboeck, Coherent oscillations: A mechanism of feature linking
in the visual cortex?: Multiple electrode and correlation analyses in
the cat, Biological Cybernetics 60 (1988) 121–130.
[127] R. M. Fitzsimonds, H.-j. Song, M.-m. Poo, Propagation of activity-
dependent synaptic depression in simple neural networks, Nature 388
(1997) 439–448.
[128] B. Widrow, M. E. Hoff, Adaptive switching circuits, Technical Report,
Stanford Univ Ca Stanford Electronics Labs, 1960.
[129] A. Tavanaei, A. Maida, BP-STDP: Approximating backpropagation
using spike timing dependent plasticity, Neurocomputing 330 (2019)
39–47.
[130] M. Beyeler, N. D. Dutt, J. L. Krichmar, Categorization and decision-
making in a neurobiologically plausible spiking network using a STDP-
like learning rule, Neural Networks 48 (2013) 109–124.
[131] B. Illing, W. Gerstner, J. Brea, Biologically plausible deep learning
But how far can we go with shallow networks?, Neural Networks 118
(2019) 90–101.
[132] D. Querlioz, O. Bichler, P. Dollfus, C. Gamrat, Immunity to Device
Variations in a Spiking Neural Network With Memristive Nanodevices,
IEEE Transactions on Nanotechnology 12 (2013) 288–295.
[133] D. Querlioz, O. Bichler, C. Gamrat, Simulation of a memristor-based
spiking neural network immune to device variations, in: The 2011
International Joint Conference on Neural Networks, IEEE, San Jose,
CA, USA, 2011, pp. 1775–1781.
[134] X. Fang, D. Liu, S. Duan, L. Wang, Memristive LIF Spiking Neuron
Model and Its Application in Morse Code, Frontiers in Neuroscience
16 (2022) 853010.
[135] X. Fang, S. Duan, L. Wang, Memristive FHN spiking neuron model and
brain-inspired threshold logic computing, Neurocomputing 517 (2023)
93–105.
[136] D. J. Saunders, H. T. Siegelmann, R. Kozma, M. Ruszinkó, STDP
Learning of Image Patches with Convolutional Spiking Neural Net-
works, in: 2018 International Joint Conference on Neural Networks
(IJCNN), IEEE, Rio de Janeiro, 2018, pp. 1–7.
[137] Q. Xu, J. Peng, J. Shen, H. Tang, G. Pan, Deep CovDenseSNN: A
hierarchical event-driven dynamic framework with spiking neurons in
noisy environment, Neural Networks 121 (2020) 512–519.
[138] M. Mozafari, M. Ganjtabesh, A. Nowzari-Dalini, S. J. Thorpe,
T. Masquelier, Bio-inspired digit recognition using reward-modulated
spike-timing-dependent plasticity in deep convolutional networks, Pat-
tern Recognition 94 (2019) 87–95.
[139] Y. Xu, X. Zeng, S. Zhong, A New Supervised Learning Algorithm for
Spiking Neurons, Neural Computation 25 (2013) 1472–1511.
[140] A. Taherkhani, A. Belatreche, Y. Li, L. P. Maguire, DL-ReSuMe: A
Delay Learning-Based Remote Supervised Method for Spiking Neu-
rons, IEEE Transactions on Neural Networks and Learning Systems
26 (2015) 3137–3149.
[141] M. Zhang, J. Wu, A. Belatreche, Z. Pan, X. Xie, Y. Chua, G. Li,
H. Qu, H. Li, Supervised learning in spiking neural networks with
synaptic delay-weight plasticity, Neurocomputing 409 (2020) 103–118.
[142] A. Payeur, J. Guerguiev, F. Zenke, B. A. Richards, R. Naud, Burst-
dependent synaptic plasticity can coordinate learning in hierarchical
circuits, Nature Neuroscience 24 (2021) 1010–1019.
[143] X. Xie, S. Wen, Z. Yan, T. Huang, Y. Chen, Designing pulse-coupled
neural networks with spike-synchronization-dependent plasticity rule:
image segmentation and memristor circuit application, Neural Com-
puting and Applications 32 (2020) 13441–13452.
[144] H. Mostafa, G. Cauwenberghs, A Learning Framework for Winner-
Take-All Networks with Stochastic Synapses, Neural Computation 30
(2018) 1542–1572.
[145] J. Brea, W. Senn, J.-P. Pfister, Matching recall and storage in se-
quence learning with spiking neural networks, Journal of Neuroscience
33 (2013) 9565–9575.
[146] D. Jimenez Rezende, W. Gerstner, Stochastic variational learning in
recurrent spiking networks, Frontiers in Computational Neuroscience
8 (2014).
[147] Q. Yu, S. Li, H. Tang, L. Wang, J. Dang, K. C. Tan, Toward Efficient
Processing and Learning With Spikes: New Approaches for Multispike
Learning, IEEE Transactions on Cybernetics 52 (2022) 1364–1376.
[148] Q. Yu, J. Gao, J. Wei, J. Li, K. C. Tan, T. Huang, Improving Multispike
Learning With Plastic Synaptic Delays, IEEE Transactions on Neural
Networks and Learning Systems (2022) 1–12.
[149] M. Zhang, J. Wu, Y. Chua, X. Luo, Z. Pan, D. Liu, H. Li, Mpd-al:
an efficient membrane potential driven aggregate-label learning algo-
rithm for spiking neurons, in: Proceedings of the AAAI conference on
artificial intelligence, volume 33, 2019, pp. 1327–1334.
[150] Q. Yu, H. Tang, K. C. Tan, H. Li, Rapid feedforward computation by
temporal encoding and learning with spiking neurons, IEEE transac-
tions on neural networks and learning systems 24 (2013) 1539–1552.
[151] Q. Xu, Y. Qi, H. Yu, J. Shen, H. Tang, G. Pan, CSNN: An Augmented
Spiking based Framework with Perceptron-Inception, in: Proceedings
of the Twenty-Seventh International Joint Conference on Artificial In-
telligence, International Joint Conferences on Artificial Intelligence Or-
ganization, Stockholm, Sweden, 2018, pp. 1646–1652.
[152] Q. Yu, S. Song, C. Ma, J. Wei, S. Chen, K. C. Tan, Temporal Encod-
ing and Multispike Learning Framework for Efficient Recognition of
Visual Patterns, IEEE Transactions on Neural Networks and Learning
Systems (2021) 1–13.
[153] P. Panda, K. Roy, Unsupervised regenerative learning of hierarchical
features in spiking deep networks for object recognition, in: 2016 In-
ternational Joint Conference on Neural Networks (IJCNN), 2016, pp.
299–306.
[154] M. Zhang, H. Qu, A. Belatreche, Y. Chen, Z. Yi, A Highly Effective and
Robust Membrane Potential-Driven Supervised Learning Method for
Spiking Neurons, IEEE Transactions on Neural Networks and Learning
Systems 30 (2019) 123–137.
[155] R.-M. Memmesheimer, R. Rubin, B. Ölveczky, H. Sompolinsky, Learn-
ing Precisely Timed Spikes, Neuron 82 (2014) 925–938.
[156] I.-M. Comsa, K. Potempa, L. Versari, T. Fischbacher, A. Gesmundo,
J. Alakuijala, Temporal Coding in Spiking Neural Networks With Al-
pha Synaptic Function: Learning With Backpropagation, IEEE Trans-
actions on Neural Networks and Learning Systems (2021) 1–14.
[157] M. Zhang, J. Wang, J. Wu, A. Belatreche, B. Amornpaisannon,
Z. Zhang, V. P. K. Miriyala, H. Qu, Y. Chua, T. E. Carlson, H. Li,
Rectified Linear Postsynaptic Potential Function for Backpropagation
in Deep Spiking Neural Networks, IEEE Transactions on Neural Net-
works and Learning Systems 33 (2022) 1947–1958.
[158] J. Liu, J. Lian, J. C. Sprott, Q. Liu, Y. Ma, The Butterfly Effect in
Primary Visual Cortex, IEEE Transactions on Computers (2022) 1–1.
[159] S. B. Shrestha, G. Orchard, Slayer: Spike layer error reassignment in
time, in: Advances in Neural Information Processing Systems, vol-
ume 31, Curran Associates, Inc., 2018.
[160] H. Zheng, Y. Wu, L. Deng, Y. Hu, G. Li, Going Deeper With Directly-
Trained Larger Spiking Neural Networks, Proceedings of the AAAI
Conference on Artificial Intelligence 35 (2021) 11062–11070.
[161] C. Duan, J. Ding, S. Chen, Z. Yu, T. Huang, Temporal effective batch
normalization in spiking neural networks, in: Advances in Neural In-
formation Processing Systems, 2022.
[162] L. Feng, Q. Liu, H. Tang, D. Ma, G. Pan, Multi-Level Firing with Spik-
ing DS-ResNet: Enabling Better and Deeper Directly-Trained Spiking
Neural Networks, in: Proceedings of the Thirty-First International
Joint Conference on Artificial Intelligence, International Joint Confer-
ences on Artificial Intelligence Organization, Vienna, Austria, 2022,
pp. 2471–2477.
[163] Y. Li, Y. Guo, S. Zhang, S. Deng, Y. Hai, S. Gu, Differentiable Spike:
Rethinking Gradient-Descent for Training Spiking Neural Networks,
in: Advances in Neural Information Processing Systems, volume 34,
Curran Associates, Inc., 2021, pp. 23426–23439.
[164] S. Wang, T. H. Cheng, M.-H. Lim, LTMD: Learning improvement
of spiking neural networks with learnable thresholding neurons and
moderate dropout, in: Advances in Neural Information Processing
Systems, 2022.
[165] X. Yao, F. Li, Z. Mo, J. Cheng, GLIF: A unified gated leaky integrate-
and-fire neuron for spiking neural networks, in: Advances in Neural
Information Processing Systems, 2022.
[166] J. Kaiser, H. Mostafa, E. Neftci, Synaptic Plasticity Dynamics for Deep
Continuous Local Learning (DECOLLE), Frontiers in Neuroscience 14
(2020) 424.
[167] C. Ma, R. Yan, Z. Yu, Q. Yu, Deep Spike Learning With Local Clas-
sifiers, IEEE Transactions on Cybernetics (2022) 1–13.
[168] Q. Yang, J. Wu, M. Zhang, Y. Chua, X. Wang, H. Li, Training spiking
neural networks with local tandem learning, in: Advances in Neural
Information Processing Systems, 2022.
[169] N. Rathi, K. Roy, Diet-snn: A low-latency spiking neural network with
direct input encoding and leakage and threshold optimization, IEEE
Transactions on Neural Networks and Learning Systems (2021) 1–9.
[170] W. Maass, Networks of spiking neurons: The third generation of neural
network models, Neural Networks 10 (1997) 1659–1671.
[171] S. R. Kheradpisheh, M. Mirsadeghi, T. Masquelier, Spiking Neural
Networks Trained via Proxy, IEEE Access 10 (2022) 70769–70778.
[172] Y. Li, Y. Kim, H. Park, T. Geller, P. Panda, Neuromorphic data
augmentation for training spiking neural networks, arXiv preprint
arXiv:2203.06145 (2022).
[173] G. Orchard, C. Meyer, R. Etienne-Cummings, C. Posch, N. Thakor,
R. Benosman, HFirst: A Temporal Approach to Object Recognition,
IEEE Transactions on Pattern Analysis and Machine Intelligence 37
(2015) 2028–2040.
[174] G. Haessig, A. Cassidy, R. Alvarez, R. Benosman, G. Orchard, Spiking
Optical Flow for Event-Based Sensors Using IBM’s TrueNorth Neu-
rosynaptic System, IEEE Transactions on Biomedical Circuits and
Systems 12 (2018) 860–870.
[175] R. Xiao, H. Tang, Y. Ma, R. Yan, G. Orchard, An Event-Driven Cat-
egorization Model for AER Image Sensors Using Multispike Encoding
and Learning, IEEE Transactions on Neural Networks and Learning
Systems 31 (2020) 3649–3657.
[176] J. Wu, Y. Chua, M. Zhang, H. Li, K. C. Tan, A Spiking Neural Network
Framework for Robust Sound Classification, Frontiers in Neuroscience
12 (2018) 836.
[177] J. Wu, Y. Chua, H. Li, A Biologically Plausible Speech Recognition
Framework Based on Spiking Neural Networks, in: 2018 International
Joint Conference on Neural Networks (IJCNN), IEEE, Rio de Janeiro,
2018, pp. 1–8.
[178] R. Xiao, R. Yan, H. Tang, K. C. Tan, A Spiking Neural Network Model
for Sound Recognition, in: Cognitive Systems and Signal Processing,
volume 710, Springer Singapore, Singapore, 2017, pp. 584–594.
[179] J. Wu, E. Yılmaz, M. Zhang, H. Li, K. C. Tan, Deep Spiking Neural
Networks for Large Vocabulary Automatic Speech Recognition, Fron-
tiers in Neuroscience 14 (2020) 199.
[180] Z. Pan, M. Zhang, J. Wu, J. Wang, H. Li, Multi-Tone Phase Coding of
Interaural Time Difference for Sound Source Localization With Spiking
Neural Networks, IEEE/ACM Transactions on Audio, Speech, and
Language Processing 29 (2021) 2656–2670.
[181] A. Tavanaei, A. S. Maida, A spiking network that learns to extract
spike signatures from speech signals, Neurocomputing 240 (2017) 191–
199.
[182] A. Tavanaei, A. Maida, Bio-inspired Multi-layer Spiking Neural Net-
work Extracts Discriminative Features from Speech Signals, in: Neural
Information Processing, volume 10639, Springer International Publish-
ing, Cham, 2017, pp. 899–908.
[183] J. Wu, M. Zhang, H. Li, Y. Chua, Competitive STDP-based Fea-
ture Representation Learning for Sound Event Classification, in: 2019
International Joint Conference on Neural Networks (IJCNN), IEEE,
Budapest, Hungary, 2019, pp. 1–8.
[184] T. Elsken, J. H. Metzen, F. Hutter, Neural architecture search: A
survey, Journal of Machine Learning Research 20 (2019) 1–21.
[185] B. Na, J. Mok, S. Park, D. Lee, H. Choe, S. Yoon, AutoSNN: Towards
energy-efficient spiking neural networks, in: Proceedings of the 39th
International Conference on Machine Learning, volume 162 of Proceed-
ings of Machine Learning Research, PMLR, 2022, pp. 16253–16269.
[186] Y. Kim, Y. Li, H. Park, Y. Venkatesha, P. Panda, Neural Architecture
Search for Spiking Neural Networks, 2022. ArXiv:2201.10355 [cs, eess].
[187] N. Perez-Nieves, D. F. M. Goodman, Sparse spiking gradient descent,
in: Advances in Neural Information Processing Systems, 2021.
[188] Y. Kim, P. Panda, Revisiting Batch Normalization for Training Low-
Latency Deep Spiking Neural Networks From Scratch, Frontiers in
Neuroscience 15 (2021) 773954.
[189] R. Yuste, Dendritic Spines and Distributed Circuits, Neuron 71 (2011)
772–781.
[190] Y. Chen, Z. Yu, W. Fang, Z. Ma, T. Huang, Y. Tian, State transition of
dendritic spines improves learning of sparse spiking neural networks, in:
Proceedings of the 39th International Conference on Machine Learning,
volume 162 of Proceedings of Machine Learning Research, PMLR, 2022,
pp. 3701–3715.
[191] R. Siegel, Non-linear dynamical system theory and primary visual
cortical processing, Physica D: Nonlinear Phenomena 42 (1990) 385–
395.
[192] P. A. Merolla, J. V. Arthur, R. Alvarez-Icaza, A. S. Cassidy, J. Sawada,
F. Akopyan, B. L. Jackson, N. Imam, C. Guo, Y. Nakamura, B. Brezzo,
I. Vo, S. K. Esser, R. Appuswamy, B. Taba, A. Amir, M. D. Flickner,
W. P. Risk, R. Manohar, D. S. Modha, A million spiking-neuron in-
tegrated circuit with a scalable communication network and interface,
Science 345 (2014) 668–673.
[193] M. Davies, N. Srinivasa, T.-H. Lin, G. Chinya, Y. Cao, S. H. Choday,
G. Dimou, P. Joshi, N. Imam, S. Jain, Y. Liao, C.-K. Lin, A. Lines,
R. Liu, D. Mathaikutty, S. McCoy, A. Paul, J. Tse, G. Venkatara-
manan, Y.-H. Weng, A. Wild, Y. Yang, H. Wang, Loihi: A neuromor-
phic manycore processor with on-chip learning, IEEE Micro 38 (2018)
82–99.
[194] S. Duan, X. Hu, Z. Dong, L. Wang, P. Mazumder, Memristor-Based
Cellular Nonlinear/Neural Network: Design, Analysis, and Applica-
tions, IEEE Transactions on Neural Networks and Learning Systems
26 (2015) 1202–1213.
[195] X. Fang, S. Duan, L. Wang, Memristive Izhikevich Spiking Neuron
Model and Its Application in Oscillatory Associative Memory, Fron-
tiers in Neuroscience 16 (2022) 885322.
[196] X. Fang, S. Duan, L. Wang, Memristive Hodgkin-Huxley Spiking Neu-
ron Model for Reproducing Neuron Behaviors, Frontiers in Neuro-
science 15 (2021) 730566.
[197] J. Li, Z. Dong, L. Luo, S. Duan, L. Wang, A novel versatile win-
dow function for memristor model with application in spiking neural
network, Neurocomputing 405 (2020) 239–246.
[198] N. Zheng, P. Mazumder, Learning in Memristor Crossbar-Based Spik-
ing Neural Networks Through Modulation of Weight-Dependent Spike-
Timing-Dependent Plasticity, IEEE Transactions on Nanotechnology
17 (2018) 520–532.
74
... Numerous well-known methods exist to encode continuousvalued time-series data, including well-known rate, latency, and delta-modulation methods [12]. Encoding speech data, in particular, is greatly relevant to radio astronomy data since methods such as Speech2Spikes [13] rely on first transforming audio recordings into spectrograms, a step not needed for already spectrotemporal visibility data. ...
Preprint
Full-text available
Radio Frequency Interference (RFI) poses a significant challenge in radio astronomy, arising from terrestrial and celestial sources, disrupting observations conducted by radio telescopes. Addressing RFI involves intricate heuristic algorithms, manual examination, and, increasingly, machine learning methods. Given the dynamic and temporal nature of radio astronomy observations, Spiking Neural Networks (SNNs) emerge as a promising approach. In this study, we cast RFI detection as a supervised multi-variate time-series segmentation problem. Notably, our investigation explores the encoding of radio astronomy visibility data for SNN inference, considering six encoding schemes: rate, latency, delta-modulation, and three variations of the step-forward algorithm. We train a small two-layer fully connected SNN on simulated data derived from the Hydrogen Epoch of Reionization Array (HERA) telescope and perform extensive hyper-parameter optimization. Results reveal that latency encoding exhibits superior performance, achieving a per-pixel accuracy of 98.8% and an f1-score of 0.761. Remarkably, these metrics approach those of contemporary RFI detection algorithms, notwithstanding the simplicity and compactness of our proposed network architecture. This study underscores the potential of RFI detection as a benchmark problem for SNN researchers, emphasizing the efficacy of SNNs in addressing complex time-series segmentation tasks in radio astronomy.
... Even slight changes to the input data can result in misclassification or incorrect outputs. This makes it limited to be implemented in some domains [28]. They require extensive data and computing resources, limiting their scalability and applicability in specific settings [29]. ...
Article
In the last few years, Neural Networks have become more common in different areas due to their ability to learn intricate patterns and provide precise predictions. Nonetheless, creating an efficient neural network model is a difficult task that demands careful thought of multiple factors, such as architecture, optimization method, and regularization technique. This paper aims to comprehensively overview the state-of-the-art artificial neural network (ANN) generation and highlight key challenges and opportunities in machine learning applications. It provides a critical analysis of current neural network model design methodologies, focusing on the strengths and weaknesses of different approaches. Also, it explores the use of different deep neural networks (DNN) in image recognition, natural language processing, and time series analysis. In addition, the text explores the advantages of selecting optimal values for various components of an Artificial Neural Network (ANN). These components include the number of input/output layers, the number of hidden layers, the type of activation function used, the number of epochs, and the model type selection. Setting these components to their ideal values can help enhance the model's overall performance and generalization. Furthermore, it identifies some common pitfalls and limitations of existing design methodologies, such as overfitting, lack of interpretability, and computational complexity. Finally, it proposes some directions for future research, such as developing more efficient and interpretable neural network architectures, improving the scalability of training algorithms, and exploring the potential of new paradigms, such as Spiking Neural Networks, quantum neural networks, and neuromorphic computing.
... Additionally, SNNs can be generated by converting pre-trained ANNs, exploiting the advantages of ANN methodologies while leveraging the energy efficiency and time-varying dynamics of SNNs at inference at the cost of foregoing their unique capabilities during training. Yi et al. (2023) provide an excellent review of such methods. ...
Article
Full-text available
Detecting and mitigating Radio Frequency Interference (RFI) is critical for enabling and maximising the scientific output of radio telescopes. The emergence of machine learning methods capable of handling large datasets has led to their application in radio astronomy, particularly in RFI detection. Spiking Neural Networks (SNNs), inspired by biological systems, are well-suited for processing spatio-temporal data. This study introduces the first exploratory application of SNNs to an astronomical data-processing task, specifically RFI detection. We adapt the nearest-latent-neighbours (NLN) algorithm and auto-encoder architecture proposed by previous authors to SNN execution by direct ANN2SNN conversion, enabling simplified downstream RFI detection by sampling the naturally varying latent space from the internal spiking neurons. Our subsequent evaluation aims to determine whether SNNs are viable for future RFI detection schemes. We evaluate detection performance with the simulated HERA telescope and hand-labelled LOFAR observation dataset the original authors provided. We additionally evaluate detection performance with a new MeerKAT-inspired simulation dataset that provides a technical challenge for machine-learnt RFI detection methods. This dataset focuses on satellite-based RFI, an increasingly important class of RFI and is an additional contribution. Our SNN approach remains competitive with the original NLN algorithm and AOFlagger in AUROC, AUPRC and F1 scores for the HERA dataset but exhibits difficulty in the LOFAR and Tabascal datasets. However, our method maintains this accuracy while completely removing the compute and memory-intense latent sampling step found in NLN. This work demonstrates the viability of SNNs as a promising avenue for machine-learning-based RFI detection in radio telescopes by establishing a minimal performance baseline on traditional and nascent satellite-based RFI sources and is the first work to our knowledge to apply SNNs in astronomy.
... Additionally, choosing suitable representation methods is an important part of transforming these theories into implementations. Ref. [61] analyzes the characteristics of different types of artificial neural networks, while [62] investigates research on spiking neural networks that are more inclusive of time series. Finally, after determining the plans for all the above aspects, the integration framework needs to select an appropriate training strategy for reinforcement learning according to task requirements. ...
Article
Full-text available
Extensive research has been carried out on reinforcement learning methods. The core idea of reinforcement learning is to learn methods by means of trial and error, and it has been successfully applied to robotics, autonomous driving, gaming, healthcare, resource management, and other fields. However, when building reinforcement learning solutions at the edge, not only are there the challenges of data-hungry and insufficient computational resources but also there is the difficulty of a single reinforcement learning method to meet the requirements of the model in terms of efficiency, generalization, robustness, and so on. These solutions rely on expert knowledge for the design of edge-side integrated reinforcement learning methods, and they lack high-level system architecture design to support their wider generalization and application. Therefore, in this paper, instead of surveying reinforcement learning systems, we survey the most commonly used options for each part of the architecture from the point of view of integrated application. We present the characteristics of traditional reinforcement learning in several aspects and design a corresponding integration framework based on them. In this process, we show a complete primer on the design of reinforcement learning architectures while also demonstrating the flexibility of the various parts of the architecture to be adapted to the characteristics of different edge tasks. Overall, reinforcement learning has become an important tool in intelligent decision making, but it still faces many challenges in the practical application in edge computing. The aim of this paper is to provide researchers and practitioners with a new, integrated perspective to better understand and apply reinforcement learning in edge decision-making tasks.
... The imperfections of the SAM algorithm in the field of medical image segmentation are mainly connected to insufficient training data. In [129], the authors proposed applying the Med SAM Adapter to overcome the above limitations. Pretraining methods such as masked autoencoder (MAE), contrastive embedding mix-up (e-Mix), and shuffled embedding prediction (ShED), were applied. ...
Article
Full-text available
Recently, artificial intelligence (AI)-based algorithms have revolutionized the medical image segmentation processes. Thus, the precise segmentation of organs and their lesions may contribute to an efficient diagnostics process and a more effective selection of targeted therapies, as well as increasing the effectiveness of the training process. In this context, AI may contribute to the automatization of the image scan segmentation process and increase the quality of the resulting 3D objects, which may lead to the generation of more realistic virtual objects. In this paper, we focus on the AI-based solutions applied in medical image scan segmentation and intelligent visual content generation, i.e., computer-generated three-dimensional (3D) images in the context of extended reality (XR). We consider different types of neural networks used with a special emphasis on the learning rules applied, taking into account algorithm accuracy and performance, as well as open data availability. This paper attempts to summarize the current development of AI-based segmentation methods in medical imaging and intelligent visual content generation that are applied in XR. It concludes with possible developments and open challenges in AI applications in extended reality-based solutions. Finally, future lines of research and development directions of artificial intelligence applications, both in medical image segmentation and extended reality-based medical solutions, are discussed.
Article
This paper discusses various machine learning techniques such as decision trees, the Kohonen map neural network method, and correlation analysis. The neural networks were trained, and a comparative analysis carried out, using a real-estate price-segment classification dataset. The overall quality of the data collected in the dataset was evaluated using correlation analysis, while the other methods were used to predict the target variable. The obtained data were summarized in a comparative table. As a result, relatively high accuracy was obtained with a large number of parameters for almost all methods; the only exception was the neural network method, which did not perform correctly in the selected software product.
Article
The aim of the study is to investigate the features of synchronization in ensembles of nonidentical neuron-like FitzHugh–Nagumo oscillators interacting via memristor-based coupling. Methods. The collective dynamics in a ring of FitzHugh–Nagumo oscillators connected via memristive coupling was studied numerically and experimentally. The nonidentity of the oscillators was achieved by detuning them either in the threshold parameter responsible for the excitation of an oscillator, or in the parameter characterizing the ratio of time scales, whose value determines the natural frequency of an oscillator. We investigated the synchronization of memristively coupled FitzHugh–Nagumo oscillators as a function of the magnitude of the coupling coefficient, the initial conditions of all variables, and the number of oscillators in the ensemble. As a measure of synchronization, we used a coefficient characterizing the closeness of the oscillator trajectories. Results. It is shown that with memristive coupling of FitzHugh–Nagumo oscillators, their synchronization depends not only on the magnitude of the coupling coefficient, but also on the initial states of both the oscillators themselves and the variables responsible for the memristive coupling. We compared the synchronization features of nonidentical FitzHugh–Nagumo oscillators with memristive and diffusive couplings. It is shown that, in contrast to the case of diffusive coupling, in the case of memristive coupling an increase in the coupling strength can destroy the regime of completely synchronous in-phase oscillations, which is replaced by a regime of out-of-phase oscillations. Conclusion. The obtained results can be used in solving problems of synchronization control in ensembles of neuron-like oscillators, in particular for achieving or destroying the regime of in-phase synchronization of oscillations in an ensemble of coupled oscillators.
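As a concrete illustration of the kind of trajectory-closeness measure mentioned above, the following minimal NumPy sketch scores how similar two sampled oscillator trajectories are; the normalization and the exact coefficient used in the study are assumptions made for this example.

```python
import numpy as np

def synchronization_error(x1, x2):
    """Closeness of two oscillator trajectories (0 means perfectly synchronized).

    x1, x2: arrays of shape (num_steps, num_vars) holding sampled state
    variables of two coupled FitzHugh-Nagumo oscillators. Here the measure is
    the time-averaged Euclidean distance between the trajectories, normalized
    by their overall spread; the coefficient in the study may be defined
    differently.
    """
    x1, x2 = np.asarray(x1, float), np.asarray(x2, float)
    dist = np.linalg.norm(x1 - x2, axis=1).mean()
    spread = np.linalg.norm(np.vstack([x1, x2]).std(axis=0)) + 1e-12
    return dist / spread
```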
Article
We propose a new learning algorithm to train spiking neural networks (SNNs) using conventional artificial neural networks (ANNs) as a proxy. We couple an SNN and an ANN, made of integrate-and-fire (IF) and ReLU neurons respectively, with the same network architecture and shared synaptic weights. The forward passes of the two networks are totally independent. By treating the rate-coded IF neuron as an approximation of the ReLU, we backpropagate the error of the SNN through the proxy ANN to update the shared weights, simply by replacing the ANN's final output with that of the SNN. We applied the proposed proxy learning to deep convolutional SNNs and evaluated it on two benchmark datasets, Fashion-MNIST and CIFAR-10, reaching 94.56% and 93.11% classification accuracy, respectively. The proposed networks outperform other deep SNNs trained with tandem learning or surrogate gradient learning, or converted from deep ANNs. Converted SNNs require long simulation times to reach reasonable accuracies, while our proxy learning leads to efficient SNNs with much shorter simulation times. The source code of the proposed method is publicly available at https://github.com/SRKH/ProxyLearning .
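To make the proxy-learning idea more concrete, here is a minimal PyTorch sketch under our own assumptions (layer sizes, time window T, soft-reset IF dynamics); it is not the authors' implementation, whose code is linked above. The key step is that the SNN output value replaces the ANN output while gradients still flow through the ANN graph.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

T = 20        # simulation time steps for rate coding (assumed value)
V_TH = 1.0    # IF firing threshold (assumed value)

class SharedMLP(nn.Module):
    """Two-layer network whose weights are shared by the ANN and SNN passes."""
    def __init__(self, d_in=784, d_hid=256, d_out=10):
        super().__init__()
        self.fc1 = nn.Linear(d_in, d_hid)
        self.fc2 = nn.Linear(d_hid, d_out)

    def ann_forward(self, x):
        # Conventional differentiable ReLU forward pass (the proxy).
        return self.fc2(F.relu(self.fc1(x)))

    @torch.no_grad()
    def snn_forward(self, x):
        # Independent rate-coded IF forward pass with the same weights.
        v = torch.zeros(x.size(0), self.fc1.out_features)
        out = torch.zeros(x.size(0), self.fc2.out_features)
        for _ in range(T):
            v = v + self.fc1(x)           # integrate constant-current input
            s = (v >= V_TH).float()       # spike when the threshold is crossed
            v = v - s * V_TH              # soft reset
            out = out + self.fc2(s)       # accumulate the readout
        return out / T

def proxy_loss(model, x, target):
    ann_out = model.ann_forward(x)
    snn_out = model.snn_forward(x)
    # Replace the ANN output value with the SNN output while keeping the ANN
    # graph, so the SNN error is backpropagated through the proxy ANN.
    out = ann_out + (snn_out - ann_out).detach()
    return F.cross_entropy(out, target)
```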
Article
The Artificial Neural Network (ANN) has served as an important pillar of machine learning and played a crucial role in fueling the robust artificial intelligence (AI) revival experienced in the last few years. Inspired by the biological brain architecture of living things, ANNs have shown widespread success in pattern recognition, data analysis, and classification tasks. Among the many models of neural networks conceptualized and developed over the years, the Spiking Neural Network (SNN), initiated in 1996, has shown great promise in the current push towards compact embedded AI. By combining both spatial and temporal information as features in the training and testing process, many inherent shortcomings of traditional ANNs can be overcome. With temporal features and event-driven updating of the network, SNNs hold the potential to improve computational and energy efficiency. In SNNs, the most basic signal carrier is the spike, bringing about a revolution in how network weights are updated compared to the traditional methods widely applied in ANNs. Numerous SNN weight-update algorithms have been developed in the literature in recent years. With the active and dynamic research on SNNs, a consolidation of state-of-the-art SNN research is beneficial. This paper is aimed at reviewing and surveying the current status of research pertaining to SNNs, in particular highlighting the various novel SNN training techniques together with an objective comparison of these techniques. SNN applications and associated neuromorphic hardware systems are also covered, along with thoughts on the challenges in developing new SNN training algorithms and a discussion of potential future research trends.
Chapter
Spiking Neural Networks (SNNs) have gained huge attention as a potential energy-efficient alternative to conventional Artificial Neural Networks (ANNs) due to their inherent high-sparsity activation. However, most prior SNN methods use ANN-like architectures (e.g., VGG-Net or ResNet), which could provide sub-optimal performance for temporal sequence processing of binary information in SNNs. To address this, in this paper, we introduce a novel Neural Architecture Search (NAS) approach for finding better SNN architectures. Inspired by recent NAS approaches that find the optimal architecture from activation patterns at initialization, we select the architecture that can represent diverse spike activation patterns across different data samples without training. Moreover, to further leverage the temporal information among the spikes, we search for feed-forward connections as well as backward connections (i.e., temporal feedback connections) between layers. Interestingly, SNASNet found by our search algorithm achieves higher performance with backward connections, demonstrating the importance of designing SNN architecture for suitably using temporal information. We conduct extensive experiments on three image recognition benchmarks where we show that SNASNet achieves state-of-the-art performance with significantly lower timesteps (5 timesteps). Code is available on GitHub. Keywords: Spiking Neural Networks, Neural architecture search, Neuromorphic computing.
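As a rough illustration of training-free scoring from spike activation patterns, the sketch below builds a NASWOT-style Hamming kernel over the binary spike codes of a mini-batch and scores it with a log-determinant; the kernel actually used by SNASNet may differ, so this is only a sketch of the general idea.

```python
import numpy as np

def spike_pattern_score(spike_codes: np.ndarray) -> float:
    """Score an untrained SNN candidate by the diversity of its spike patterns.

    spike_codes: (num_samples, num_units) array of 0/1 spike indicators
    collected from a candidate architecture at initialization. A kernel is
    built from pairwise Hamming distances; a larger log-determinant indicates
    more distinguishable activation patterns across samples.
    """
    n_units = spike_codes.shape[1]
    diff = spike_codes[:, None, :] != spike_codes[None, :, :]
    hamming = diff.sum(-1)                     # pairwise Hamming distances
    kernel = n_units - hamming                 # similar patterns -> large entries
    sign, logdet = np.linalg.slogdet(kernel.astype(np.float64))
    return logdet if sign > 0 else -np.inf

# Usage sketch: evaluate candidates and keep the highest-scoring one.
rng = np.random.default_rng(0)
candidate_codes = rng.integers(0, 2, size=(32, 512))   # stand-in spike patterns
print(spike_pattern_score(candidate_codes))
```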
Article
Spiking neural networks (SNNs) are promising, but their development has fallen far behind that of conventional deep neural networks (DNNs) because of difficult training. To resolve the training problem, we analyze the closed-form input-output response of spiking neurons and use the response expression to build abstract SNN models for training. This avoids calculating the membrane potential during training and makes the direct training of SNNs as efficient as that of DNNs. We show that the non-leaky integrate-and-fire neuron with single-spike temporal coding is the best choice for direct-train deep SNNs. We develop an energy-efficient phase-domain signal processing circuit for the neuron and propose a direct-train deep SNN framework. Thanks to easy training, we train deep SNNs under weight quantization to study their robustness on low-cost neuromorphic hardware. Experiments show that our direct-train deep SNNs achieve the highest CIFAR-10 classification accuracy among SNNs, reach ImageNet classification accuracy within 1% of the DNN of equivalent architecture, and are robust to weight quantization and noise perturbation.
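For context, one common closed form for the non-leaky IF neuron with single-spike temporal coding assumes each input spike switches on a constant current, giving a piecewise-linear membrane potential; the sketch below solves for the first output spike time under that assumption. The paper's phase-domain formulation may use a different response expression.

```python
import numpy as np

def if_first_spike_time(w, t_in, theta=1.0):
    """Closed-form first-spike time of a non-leaky IF neuron.

    Assumes each presynaptic spike at time t_in[j] switches on a constant
    current of magnitude w[j], so the membrane potential is piecewise linear:
        V(t) = sum_j w[j] * max(t - t_in[j], 0).
    The neuron fires when V(t) reaches theta; we scan the candidate causal
    sets in order of arrival and return the earliest consistent solution.
    """
    order = np.argsort(t_in)
    w, t_in = np.asarray(w, float)[order], np.asarray(t_in, float)[order]
    w_sum, wt_sum = 0.0, 0.0
    for j in range(len(w)):
        w_sum += w[j]
        wt_sum += w[j] * t_in[j]
        if w_sum <= 0:
            continue                      # potential is not rising yet
        t_out = (theta + wt_sum) / w_sum  # solve sum_j w[j]*(t_out - t_in[j]) = theta
        next_t = t_in[j + 1] if j + 1 < len(w) else np.inf
        if t_in[j] <= t_out <= next_t:    # spike occurs before the next input arrives
            return t_out
    return np.inf                         # neuron never reaches threshold

# Example: V(1.0) = 0.6*1.0 + 0.8*0.5 = 1.0 = theta, so the output spike is at t = 1.0.
print(if_first_spike_time(w=[0.6, 0.8], t_in=[0.0, 0.5], theta=1.0))
```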
Article
Spiking neural networks (SNNs) are promising for bio-plausible coding of spatio-temporal information and event-driven signal processing, which is well suited to energy-efficient implementation in neuromorphic hardware. However, the unique working mode of SNNs makes them more difficult to train than traditional networks. Currently, there are two main routes to training deep SNNs with high performance. The first is to convert a pre-trained ANN model to its SNN version, which usually requires a long coding window for convergence and cannot exploit spatio-temporal features during training to solve temporal tasks. The other is to directly train SNNs in the spatio-temporal domain. However, due to the binary spike activity of the firing function and the problem of vanishing or exploding gradients, current methods are restricted to shallow architectures and therefore have difficulty harnessing large-scale datasets (e.g., ImageNet). To this end, we propose a threshold-dependent batch normalization (tdBN) method based on the emerging spatio-temporal backpropagation, termed "STBP-tdBN", enabling the direct training of very deep SNNs and the efficient implementation of their inference on neuromorphic hardware. With the proposed method and an elaborated shortcut connection, we significantly extend directly trained SNNs from a shallow structure (...)
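A minimal sketch of the threshold-dependent normalization idea as we understand it: pre-activations are normalized jointly over the time, batch, and spatial dimensions and rescaled by alpha * V_th so the input statistics match the firing threshold. Running statistics for inference and other details of STBP-tdBN are omitted, so treat this as an assumption-laden simplification.

```python
import torch
import torch.nn as nn

class ThresholdDependentBN(nn.Module):
    """Simplified threshold-dependent batch normalization (tdBN) sketch.

    Input of shape (T, B, C, H, W) is normalized jointly over time, batch,
    and spatial dimensions, then rescaled by alpha * v_th per channel.
    """
    def __init__(self, channels, v_th=1.0, alpha=1.0, eps=1e-5):
        super().__init__()
        self.gamma = nn.Parameter(torch.ones(channels))
        self.beta = nn.Parameter(torch.zeros(channels))
        self.v_th, self.alpha, self.eps = v_th, alpha, eps

    def forward(self, x):                       # x: (T, B, C, H, W)
        dims = (0, 1, 3, 4)                     # all dims except channels
        mean = x.mean(dim=dims, keepdim=True)
        var = x.var(dim=dims, unbiased=False, keepdim=True)
        x_hat = self.alpha * self.v_th * (x - mean) / torch.sqrt(var + self.eps)
        g = self.gamma.view(1, 1, -1, 1, 1)
        b = self.beta.view(1, 1, -1, 1, 1)
        return g * x_hat + b
```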
Article
The FitzHugh-Nagumo (FHN) spiking model has rich dynamic behaviors and can imitate the firing process of a neuron. The memristor is a nonvolatile, resistance-tunable device that has gradually become a potential candidate for performing dynamic behaviors and implementing neuromorphic computation in the nervous system. In this paper, the memristive FitzHugh-Nagumo (MFHN) spiking model is presented. Firstly, we introduce a memristor into the FHN spiking model to build the MFHN spiking model and analyze the phase-plane trajectory of the MFHN model. Secondly, we couple two MFHN models with a memristor and experimentally simulate the dynamic behaviors of the two coupled neurons; their synchronous and asynchronous phenomena are discussed. Thirdly, the feasibility of the MFHN spiking model is demonstrated by the realization of binary logical operations and the implementation of binary adders, and the MFHN binary adder is compared with the FHN binary adder. The simulation results illustrate that the proposed model efficiently exhibits rich dynamic behaviors and a higher firing frequency. The coupled MFHN models show the effectiveness of a memristor acting as an electrical coupling synapse. Threshold logic computation can be completed efficiently by the MFHN spiking model, and the MFHN binary adders reproduce the computing functions and behaviors of the biological neuron.
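The following NumPy sketch integrates two standard FitzHugh-Nagumo neurons coupled through a flux-controlled memristor; the cubic memductance model, parameter values, and simple Euler step are illustrative assumptions rather than the MFHN formulation used in the article.

```python
import numpy as np

# Standard FitzHugh-Nagumo dynamics; parameter values are illustrative only.
A, B, EPS, I_EXT = 0.7, 0.8, 0.08, 0.5
ALPHA, BETA = 0.1, 0.02          # memductance coefficients (assumed cubic model)
DT, STEPS = 0.05, 20000

def fhn_rhs(v, w, i_syn):
    dv = v - v**3 / 3.0 - w + I_EXT + i_syn
    dw = EPS * (v + A - B * w)
    return dv, dw

# Two FHN neurons coupled through a flux-controlled memristor:
# coupling current = W(phi) * (v2 - v1), with memductance W(phi) = ALPHA + 3*BETA*phi**2
v = np.array([0.1, -0.1]); w = np.zeros(2); phi = 0.0
trace = np.empty((STEPS, 2))
for k in range(STEPS):
    mem_w = ALPHA + 3.0 * BETA * phi**2          # memductance
    i_c = mem_w * (v[1] - v[0])                  # current flowing into neuron 1
    dv0, dw0 = fhn_rhs(v[0], w[0], +i_c)
    dv1, dw1 = fhn_rhs(v[1], w[1], -i_c)
    phi += DT * (v[1] - v[0])                    # flux across the memristor
    v += DT * np.array([dv0, dv1]); w += DT * np.array([dw0, dw1])
    trace[k] = v

# Rough diagnostic of how closely the two membrane potentials track each other.
print("mean |v1 - v2|:", np.abs(trace[:, 0] - trace[:, 1]).mean())
```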
Article
Backpropagation has been successfully generalized to optimize deep spiking neural networks (SNNs), where, nevertheless, gradients need to be propagated back through all layers, resulting in a massive consumption of computing resources and an obstacle to the parallelization of training. A biologically motivated scheme of local learning provides an alternative for efficiently training deep networks but often suffers from low accuracy on practical tasks. Thus, how to train deep SNNs with a local learning scheme to achieve both efficient and accurate performance remains an important challenge. In this study, we focus on a supervised local learning scheme where each layer is independently optimized with an auxiliary classifier. Accordingly, we first propose a spike-based efficient local learning rule that considers only the direct dependencies at the current time step. We then propose two variants that additionally incorporate temporal dependencies through a backward and a forward process, respectively. The effectiveness and performance of our proposed methods are extensively evaluated on six mainstream datasets. Experimental results show that our methods can successfully scale up to large networks and substantially outperform spike-based local learning baselines on all studied benchmarks. Our results also reveal that gradients with temporal dependencies are essential for high performance on temporal tasks, while they have negligible effects on rate-based tasks. Our work is significant as it brings the performance of spike-based local learning to a new level while retaining its computational benefits.
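To sketch the supervised local learning scheme with an auxiliary classifier per layer (and only current-time dependencies), here is a simplified PyTorch block under our own assumptions: a non-leaky IF neuron, a rectangular surrogate gradient, and detached membrane potentials so that no gradient flows across time steps or into the next layer.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SurrogateSpike(torch.autograd.Function):
    """Heaviside spike with a rectangular surrogate gradient around the threshold."""
    @staticmethod
    def forward(ctx, v):
        ctx.save_for_backward(v)
        return (v >= 1.0).float()

    @staticmethod
    def backward(ctx, grad_out):
        (v,) = ctx.saved_tensors
        return grad_out * ((v - 1.0).abs() < 0.5).float()

class LocalSNNBlock(nn.Module):
    """One spiking layer plus its auxiliary classifier, trained with a local loss."""
    def __init__(self, d_in, d_out, n_classes):
        super().__init__()
        self.fc = nn.Linear(d_in, d_out)
        self.aux = nn.Linear(d_out, n_classes)

    def forward(self, x_seq, target):             # x_seq: (T, B, d_in)
        v = torch.zeros(x_seq.size(1), self.fc.out_features)
        spikes, logits = [], 0.0
        for x in x_seq:
            # Detach the previous potential: gradients only use the direct
            # dependencies at the current time step.
            v = v.detach() + self.fc(x)
            s = SurrogateSpike.apply(v)
            v = v - s                              # soft reset (unit threshold)
            spikes.append(s)
            logits = logits + self.aux(s)          # accumulate auxiliary readout
        local_loss = F.cross_entropy(logits / x_seq.size(0), target)
        # Detach the output so no gradient crosses into the next block.
        return torch.stack(spikes).detach(), local_loss
```

In use, each block's local loss would be minimized by its own optimizer; stacking several blocks and summing their local losses keeps the layers independent, which is what makes this scheme cheaper to parallelize than full backpropagation through all layers.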
Conference Paper
We present Bayesian Team Imitation Learner (BTIL), an imitation learning algorithm to model the behavior of teams performing sequential tasks in Markovian domains. In contrast to existing multi-agent imitation learning techniques, BTIL explicitly models and infers the time-varying mental states of team members, thereby enabling learning of decentralized team policies from demonstrations of suboptimal teamwork. Further, to allow for sample- and label-efficient policy learning from small datasets, BTIL employs a Bayesian perspective and is capable of learning from semi-supervised demonstrations. We demonstrate and benchmark the performance of BTIL on synthetic multi-agent tasks as well as a novel dataset of human-agent teamwork. Our experiments show that BTIL can successfully learn team policies from demonstrations despite the influence of team members' (time-varying and potentially misaligned) mental states on their behavior.
Conference Paper
In recent years, spiking neural networks (SNNs) have received extensive attention in brain-inspired intelligence due to their rich spatio-temporal dynamics, various encoding methods, and event-driven characteristics that naturally fit neuromorphic hardware. With the development of SNNs, brain-inspired intelligence, an emerging research field inspired by brain science achievements and aiming at artificial general intelligence, is becoming a hot topic. This paper reviews recent advances and discusses new frontiers in SNNs across five major research topics, including essential elements (i.e., spiking neuron models, encoding methods, and topology structures), neuromorphic datasets, optimization algorithms, software, and hardware frameworks. We hope our survey can help researchers understand SNNs better and inspire new work to advance this field.
Article
Spiking Neural Networks (SNNs) have attracted great attention due to their distinctive properties of low power consumption, biological plausibility, and adversarial robustness. The most effective way to train deep SNNs is through ANN-to-SNN conversion, which has yielded the best performance on deep network structures and large-scale datasets. However, there is a trade-off between accuracy and latency: to achieve accuracy as high as the original ANNs, a long simulation time is needed to match the firing rate of a spiking neuron with the activation value of an analog neuron, which impedes the practical application of SNNs. In this paper, we aim to achieve high-performance converted SNNs with extremely low latency (fewer than 32 time steps). We start by theoretically analyzing ANN-to-SNN conversion and show that scaling the thresholds plays a role similar to weight normalization. Instead of introducing constraints that facilitate ANN-to-SNN conversion at the cost of model capacity, we apply a more direct approach by optimizing the initial membrane potential to reduce the conversion loss in each layer. Moreover, we demonstrate that optimal initialization of membrane potentials can achieve expected error-free ANN-to-SNN conversion. We evaluate our algorithm on the CIFAR-10 and CIFAR-100 datasets and achieve state-of-the-art accuracy using fewer time steps; for example, we reach a top-1 accuracy of 93.38% on CIFAR-10 with 16 time steps. Moreover, our method can be applied to other ANN-SNN conversion methodologies and markedly improves performance when the number of time steps is small.
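The sketch below simulates one converted IF layer with the membrane potential initialized to a fraction of the threshold (here half), the kind of initialization the paper analyzes for reducing the expected conversion error at low latency; the layer sizes, soft-reset dynamics, and the check against the ANN activation are illustrative assumptions, not the authors' code.

```python
import torch

def if_layer_rate(x, weight, v_th, t_steps, v_init_ratio=0.5):
    """Simulate one converted IF layer and return its average firing rate.

    x:      (B, d_in) analog-valued input rates from the previous layer
    weight: (d_out, d_in) weights copied from the source ANN layer
    The membrane potential starts at v_init_ratio * v_th (half the threshold
    here) instead of zero, which reduces the expected conversion error.
    """
    v = torch.full((x.size(0), weight.size(0)), v_init_ratio * v_th)
    spike_count = torch.zeros_like(v)
    i_in = x @ weight.t()                      # constant input current per step
    for _ in range(t_steps):
        v = v + i_in
        s = (v >= v_th).float()
        v = v - s * v_th                       # soft reset keeps residual charge
        spike_count += s
    return spike_count * v_th / t_steps        # ~ ReLU(x @ W.T), clipped at v_th

# Quick check against the ANN activation the converted layer should approximate.
torch.manual_seed(0)
x, w = torch.rand(4, 8), torch.randn(16, 8) * 0.5
ann = torch.relu(x @ w.t())
snn = if_layer_rate(x, w, v_th=1.0, t_steps=16)
print("mean conversion error:", (ann - snn).abs().mean().item())
```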