Insertion-based

Insertion-based is a term used in natural language processing to describe a family of sequence generation methods in which the output is constructed by iteratively inserting tokens into a partially built sequence, rather than predicting tokens strictly from left to right. The approach enables non-sequential generation and can leverage parallel insertions, potentially speeding up decoding while maintaining competitive output quality.

The best-known example is the Insertion Transformer, introduced as an alternative to left-to-right autoregressive decoding. Insertion-based models generate text by inserting tokens at chosen positions in the current sequence rather than extending it one token at a time from the end.
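
For intuition, consider the sentence "the cat sat on the mat" built in the balanced-tree order used by the Insertion Transformer, where every slot can receive an insertion in parallel at each step (a hypothetical trace, not output from a real model):

step 0: (empty)
step 1: sat
step 2: the sat the
step 3: the cat sat on the mat

Left-to-right decoding would need six steps; parallel insertion completes the sentence in three, which is where the potential speedup comes from.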
How it works: the generation process starts from a seed sequence, often just a start token. The model proposes insertions: a set of positions between existing tokens, the tokens to be inserted there, and their probabilities. In a single decoding step, multiple tokens can be inserted, and after several steps the final sequence emerges.
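
In code, the decoding loop might look like the following minimal Python sketch. All names here are hypothetical: propose_insertions stands in for a trained model that returns one (token, probability) proposal per slot, where slot i is the gap before position i of the canvas and a special END marker means "insert nothing here".

END = "<none>"  # hypothetical marker: the model predicts "insert nothing" for a slot

def decode(propose_insertions, max_steps=32, threshold=0.5):
    canvas = []  # seed sequence; could also start from a start token
    for _ in range(max_steps):
        # One (token, probability) proposal per slot; there are len(canvas) + 1 slots.
        proposals = propose_insertions(canvas)
        chosen = [(slot, tok) for slot, (tok, p) in enumerate(proposals)
                  if tok != END and p >= threshold]
        if not chosen:
            break  # every slot predicts "insert nothing": the sequence is final
        # Apply all chosen insertions in one step, right to left so that
        # earlier slot indices are not shifted by later insertions.
        for slot, tok in sorted(chosen, reverse=True):
            canvas.insert(slot, tok)
    return canvas

The threshold rule is one simple way to trade parallelism against accuracy: a high threshold inserts only confident tokens at each step, while a low one approaches fully parallel decoding.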
Training typically uses teacher forcing to align predicted insertions with the target sequence, with the model learning an insertion plan or order. Variants explore different planners or hierarchical schemes to organize insertions.
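
As a sketch of what that alignment looks like, one common recipe (an assumption here, loosely following the Insertion Transformer's training setup) samples a subsequence of the target as the canvas and supervises each slot with the target tokens missing from that gap:

import random

def make_training_example(target):
    # Keep a random sorted subset of target positions as the current canvas.
    k = random.randint(0, len(target))
    kept = sorted(random.sample(range(len(target)), k))
    canvas = [target[i] for i in kept]
    # Gold insertions: slot j is the gap before canvas[j] (the last slot is
    # the gap at the end); its targets are the tokens missing in that gap.
    # An empty list means the model should predict "insert nothing" there.
    bounds = [-1] + kept + [len(target)]
    gold = {j: [target[i] for i in range(bounds[j] + 1, bounds[j + 1])]
            for j in range(len(canvas) + 1)}
    return canvas, gold

# Example: with target "the cat sat on the mat".split() and kept == [1, 4],
# canvas is ["cat", "the"] and gold is {0: ["the"], 1: ["sat", "on"], 2: ["mat"]}.

The insertion plan or order then decides which of a slot's missing tokens the loss rewards first, for example uniformly over all of them, or weighted toward the middle token of the gap to encourage balanced-tree decoding.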
Advantages and limitations: insertion-based methods can offer faster decoding due to parallel insertions and can better capture non-local dependencies or alternative token orders. They also allow flexible output lengths. However, they involve more complex training and inference machinery, require reliable insertion plans, and on some tasks may underperform left-to-right baselines without careful design and tuning.

Applications and scope: these models have been explored for language generation, machine translation, summarization, and code generation, among other sequence-generation tasks. Related concepts include non-autoregressive generation and traditional left-to-right autoregressive models.