
GPT

GPT stands for Generative Pre-trained Transformer, a family of autoregressive language models developed by OpenAI. Built on the Transformer architecture, GPT models are trained in two stages: pre-training on large corpora to learn language structure, followed by fine-tuning or instruction-following alignment to improve task performance. The models generate coherent text by predicting the next token in a sequence, using a context window that determines how much prior text influences output.
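
In practice, autoregressive generation is a loop: truncate the input to the context window, predict the next token, append it, and repeat. The sketch below is an illustrative toy, with a hard-coded bigram lookup standing in for a real Transformer; it is not OpenAI's implementation.

    CONTEXT_WINDOW = 8  # maximum number of prior tokens the model attends to

    def toy_next_token(tokens):
        # Stand-in for a Transformer forward pass: return a next token
        # based only on the last token seen (a bigram lookup).
        bigrams = {"the": "model", "model": "predicts", "predicts": "the"}
        return bigrams.get(tokens[-1], "token")

    def generate(prompt_tokens, max_new_tokens=5):
        tokens = list(prompt_tokens)
        for _ in range(max_new_tokens):
            context = tokens[-CONTEXT_WINDOW:]      # only the window influences output
            tokens.append(toy_next_token(context))  # append the prediction and repeat
        return tokens

    print(generate(["the"]))  # ['the', 'model', 'predicts', 'the', 'model', 'predicts']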

The first version, GPT, released in 2018, demonstrated the viability of unsupervised pre-training for language tasks. Successive iterations (GPT-2 in 2019, GPT-3 in 2020, and GPT-4 in 2023) scaled model size and data, enabling few-shot and zero-shot learning, in which tasks are performed with minimal or no task-specific training.
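
Few-shot and zero-shot use differ only in how the prompt is constructed. The sketch below, with invented review data, illustrates the distinction; it does not reflect any particular OpenAI prompt format.

    # Zero-shot: the task is described, but no worked examples are given.
    zero_shot = (
        "Classify the sentiment of the following review as positive or negative.\n"
        "Review: The battery dies within an hour.\n"
        "Sentiment:"
    )

    # Few-shot: a handful of in-prompt examples demonstrate the task first.
    examples = [
        ("Great screen and very fast.", "positive"),
        ("Stopped working after two days.", "negative"),
    ]
    few_shot = "Classify the sentiment of each review as positive or negative.\n"
    for review, label in examples:
        few_shot += f"Review: {review}\nSentiment: {label}\n"
    few_shot += "Review: The battery dies within an hour.\nSentiment:"

    print(zero_shot)
    print(few_shot)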

Technical notes: GPT models are trained on diverse datasets drawn from publicly available text and licensed sources. They use byte-pair encoding or similar subword tokenization. GPT-3 and later employ instruction tuning and reinforcement learning from human feedback (RLHF) to improve alignment with user intent, safety, and usefulness. Some versions are multimodal, accepting image inputs in addition to text.
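
Byte-pair encoding builds a subword vocabulary by repeatedly merging the most frequent adjacent symbol pair. The following is a toy trainer over a made-up word-frequency table; production GPT tokenizers operate on bytes with large learned merge lists, so this is only a sketch of the idea.

    from collections import Counter

    def pair_counts(vocab):
        # Count adjacent symbol pairs across all words, weighted by word frequency.
        counts = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                counts[(a, b)] += freq
        return counts

    def merge_pair(pair, vocab):
        # Replace every occurrence of the pair with a single merged symbol.
        merged = {}
        for symbols, freq in vocab.items():
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                    out.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            merged[tuple(out)] = freq
        return merged

    # Words as tuples of characters, mapped to their corpus frequency (toy data).
    vocab = {tuple("lower"): 5, tuple("lowest"): 2, tuple("newer"): 6, tuple("wider"): 3}
    for step in range(4):
        best = pair_counts(vocab).most_common(1)[0][0]
        vocab = merge_pair(best, vocab)
        print(f"merge {step + 1}: {best}")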

Applications: chatbots, content generation, code completion, translation, summarization, research assistance, and more, typically via API access or platform integrations. The models have influenced AI tooling, prompting a shift toward larger, instruction-following systems.
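
API access typically looks like the sketch below, which assumes the openai Python client (v1-style interface), an OPENAI_API_KEY set in the environment, and a model name available to the account; exact model names and client details change over time.

    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    response = client.chat.completions.create(
        model="gpt-4o",  # example model name; availability varies by account and date
        messages=[
            {"role": "system", "content": "You are a concise assistant."},
            {"role": "user", "content": "Summarize byte-pair encoding in one sentence."},
        ],
    )
    print(response.choices[0].message.content)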

Limitations and governance: GPT models can produce incorrect or fabricated outputs, known as hallucinations, and their responses depend heavily on how prompts are worded. They may also reflect biases in the training data. OpenAI and others implement safety filters, usage policies, and monitoring; wider concerns include misinformation, manipulation, and energy use. Access to the weights of most GPT models is restricted, and earlier open releases were partial.
