Investigating Time-Scales in Deep Echo State Networks for Natural Language Processing

Published in ICANN, 2025

Abstract

Reservoir Computing (RC) enables efficiently trained deep Recurrent Neural Networks (RNNs) by removing the need to train the hierarchy of representations of the input sequences. In this paper, we analyze the performance and the dynamical behavior of RC models, specifically Deep Bidirectional Echo State Networks (Deep-BiESNs), applied to Natural Language Processing (NLP) tasks. We compare Deep-BiESNs against fully trained NLP baselines on six common NLP tasks: three sequence-to-vector tasks for sequence-level classification and three sequence-to-sequence tasks for token-level labeling. Experimental results demonstrate that Deep-BiESNs achieve performance comparable or superior to these baselines. We then adapt the class activation mapping explainability technique to analyze the dynamical properties of these deep RC models, highlighting how the hierarchy of representations across the Deep-BiESN layers contributes to forming the class prediction in the different NLP tasks. Investigating time-scales in deep RNN layers is highly relevant for NLP, because language inherently involves dependencies unfolding over multiple temporal horizons. The findings not only underscore the potential of Deep ESNs as a competitive and efficient alternative for NLP applications, but also contribute to a deeper understanding of how to effectively model such architectures for addressing other NLP challenges.
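
To make the RC principle above concrete, here is a minimal sketch of a single-layer Echo State Network on a toy sequence-to-vector task: the input and recurrent weights are generated randomly and left untrained, and only a linear readout is fit in closed form via ridge regression. All sizes, hyperparameters, and the synthetic data are illustrative assumptions, not the paper's setup; the Deep-BiESNs studied in the paper additionally stack multiple bidirectional reservoir layers.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes and hyperparameters (assumptions, not from the paper)
n_in, n_res, n_out = 20, 300, 2
rho, leak, ridge = 0.9, 0.5, 1e-4

# Fixed (untrained) reservoir: random input and recurrent weights
W_in = rng.uniform(-0.1, 0.1, (n_res, n_in))
W = rng.uniform(-1.0, 1.0, (n_res, n_res))
W *= rho / max(abs(np.linalg.eigvals(W)))  # rescale to spectral radius rho

def last_state(seq):
    """Run the reservoir over a (T, n_in) sequence; return the final state."""
    x = np.zeros(n_res)
    for u in seq:
        # Leaky integration: the leak rate sets the reservoir's time-scale
        x = (1 - leak) * x + leak * np.tanh(W_in @ u + W @ x)
    return x

# Toy sequence-to-vector task: classify sequences by the sign of their mean
seqs = [rng.normal(loc=rng.choice([-0.5, 0.5]), size=(30, n_in)) for _ in range(200)]
labels = np.array([int(s.mean() > 0) for s in seqs])

X = np.stack([last_state(s) for s in seqs])  # reservoir embeddings
Y = np.eye(n_out)[labels]                    # one-hot targets

# The only trained component: a linear readout, fit via ridge regression
W_out = np.linalg.solve(X.T @ X + ridge * np.eye(n_res), X.T @ Y)

acc = (np.argmax(X @ W_out, axis=1) == labels).mean()
print(f"training accuracy: {acc:.2f}")
```

In this sketch the leak rate is the knob that sets the reservoir's time-scale; in a deep ESN, varying such dynamics across layers yields the hierarchy of temporal representations that the paper investigates.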

Recommended citation: C. Baccheschi, A. Bondielli, A. Lenci, A. Micheli, L. Passaro, M. Podda, D. Tortorella (2025). "Investigating Time-Scales in Deep Echo State Networks for Natural Language Processing." Artificial Neural Networks and Machine Learning. ICANN 2025 International Workshops and Special Sessions, LNCS vol. 16072, pp. 188-200.