RNNs

Recurrent neural networks (RNNs) are a class of neural networks designed to process sequential data by maintaining a hidden state that carries information across time. At each time step t, the network receives an input x_t and updates its hidden state h_t based on the previous state h_{t-1} and the current input. The same parameters are applied at every time step, enabling the network to handle sequences of varying length and to model temporal dependencies.

The standard or vanilla RNN computes h_t = φ(W_hh h_{t-1} + W_xh x_t + b_h), where φ is an elementwise activation such as tanh or ReLU. The output at step t is typically y_t = W_hy h_t + b_y. During training, the network is unrolled in time and optimized with backpropagation through time (BPTT), which propagates gradients across many time steps.
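
As a concrete illustration, the recurrence above can be written as a short loop. The NumPy sketch below is illustrative only: the function name rnn_forward and the toy dimensions are assumptions, not part of the text, but the update matches the equations h_t = tanh(W_hh h_{t-1} + W_xh x_t + b_h) and y_t = W_hy h_t + b_y, with the same weights applied at every step.

```python
import numpy as np

def rnn_forward(xs, h0, W_hh, W_xh, W_hy, b_h, b_y):
    """Unroll a vanilla RNN over a sequence.

    xs : array of shape (T, input_dim), one input vector per time step
    h0 : initial hidden state of shape (hidden_dim,)
    Returns the hidden states and outputs for every time step.
    """
    h = h0
    hs, ys = [], []
    for x_t in xs:
        # h_t = tanh(W_hh h_{t-1} + W_xh x_t + b_h)  -- same parameters at every step
        h = np.tanh(W_hh @ h + W_xh @ x_t + b_h)
        # y_t = W_hy h_t + b_y
        y = W_hy @ h + b_y
        hs.append(h)
        ys.append(y)
    return np.stack(hs), np.stack(ys)

# Toy usage with illustrative dimensions: 10 steps, 4-dim inputs, 8-dim hidden state
rng = np.random.default_rng(0)
T, input_dim, hidden_dim, output_dim = 10, 4, 8, 3
xs = rng.normal(size=(T, input_dim))
params = dict(
    W_hh=rng.normal(size=(hidden_dim, hidden_dim)) * 0.1,
    W_xh=rng.normal(size=(hidden_dim, input_dim)) * 0.1,
    W_hy=rng.normal(size=(output_dim, hidden_dim)) * 0.1,
    b_h=np.zeros(hidden_dim),
    b_y=np.zeros(output_dim),
)
hs, ys = rnn_forward(xs, np.zeros(hidden_dim), **params)
```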

Plain RNNs face challenges with long-range dependencies due to vanishing and exploding gradients, limiting their ability to capture information from distant inputs. To mitigate this, gated architectures such as the Long Short-Term Memory (LSTM) and the Gated Recurrent Unit (GRU) were developed. They introduce memory cells and gates to regulate the flow of information.
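
To show what "gates" mean in practice, here is a minimal sketch of a single GRU update step, assuming the standard gate formulation (the parameter names W_z, U_z, etc. and the toy dimensions are placeholders, not from the text).

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x_t, h_prev, p):
    """One GRU update.

    z (update gate) controls how much of the previous state is kept;
    r (reset gate) controls how much of the previous state feeds the candidate.
    """
    z = sigmoid(p["W_z"] @ x_t + p["U_z"] @ h_prev + p["b_z"])
    r = sigmoid(p["W_r"] @ x_t + p["U_r"] @ h_prev + p["b_r"])
    h_tilde = np.tanh(p["W_h"] @ x_t + p["U_h"] @ (r * h_prev) + p["b_h"])
    # Interpolate between the previous state and the candidate state.
    return z * h_prev + (1.0 - z) * h_tilde

# Toy usage with illustrative dimensions
rng = np.random.default_rng(1)
d_in, d_h = 4, 8
p = {k: rng.normal(size=(d_h, d_h if k.startswith("U") else d_in)) * 0.1
     for k in ("W_z", "U_z", "W_r", "U_r", "W_h", "U_h")}
p.update(b_z=np.zeros(d_h), b_r=np.zeros(d_h), b_h=np.zeros(d_h))
h = gru_step(rng.normal(size=d_in), np.zeros(d_h), p)
```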

Variants include bidirectional RNNs, which process sequences in both forward and backward directions, and encoder–decoder or sequence-to-sequence models, where an encoder maps an input sequence to a fixed representation and a decoder generates an output sequence. These configurations underpin many applications in natural language processing, speech recognition, time-series forecasting, and video analysis.
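
To make the bidirectional idea concrete, the sketch below runs one pass left-to-right and one right-to-left and concatenates the two hidden states at each position. It reuses the simple tanh recurrence from above; the function name, parameter names, and dimensions are illustrative assumptions.

```python
import numpy as np

def bidirectional_rnn(xs, params_f, params_b, hidden_dim):
    """Run two independent vanilla RNNs, one over the sequence as given and one
    over its reverse, then concatenate the two states at each position."""
    def run(seq, p):
        h = np.zeros(hidden_dim)
        out = []
        for x_t in seq:
            h = np.tanh(p["W_hh"] @ h + p["W_xh"] @ x_t + p["b_h"])
            out.append(h)
        return out

    forward = run(xs, params_f)
    backward = run(xs[::-1], params_b)[::-1]  # re-align with the original order
    return [np.concatenate([f, b]) for f, b in zip(forward, backward)]

# Illustrative dimensions: 5 steps, 3-dim inputs, 6-dim hidden state per direction
rng = np.random.default_rng(2)
make = lambda: dict(W_hh=rng.normal(size=(6, 6)) * 0.1,
                    W_xh=rng.normal(size=(6, 3)) * 0.1,
                    b_h=np.zeros(6))
states = bidirectional_rnn(rng.normal(size=(5, 3)), make(), make(), hidden_dim=6)
```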

RNNs emerged from work in the late 1980s and early 1990s, including Elman's simple recurrent networks and the development of backpropagation through time. They remain a foundational tool for sequence modeling, though transformers have largely supplanted them in large-scale tasks due to better parallelization and performance.
