GYAFC
GYAFC, short for Grammarly Yahoo Answers Formality Corpus, is a benchmark dataset used in natural language processing for formality style transfer. It provides a parallel corpus of informal and formal sentence variants to support training and evaluating models that convert text from informal to formal register and vice versa.
The dataset is built from Yahoo Answers posts and curated rewrites produced by Grammarly editors to reflect
GYAFC has been widely adopted as a standard resource for researching formality transfer because it offers
The dataset is publicly available for academic use and is accompanied by splits for training, validation, and