MNLI

MNLI stands for Multi-Genre Natural Language Inference. It is a large-scale benchmark dataset designed to evaluate a model's ability to determine whether a premise entails, contradicts, or is neutral with respect to a given hypothesis. The data is drawn from multiple genres of text to test robustness to domain variation and linguistic style.

The dataset includes premises and hypotheses paired from about ten genres, such as fiction, government, travel, telephone conversations, and user-generated web text. Each example is labeled with one of three categories: entailment, neutral, and contradiction. The MNLI corpus provides two separate evaluation splits: matched (in-domain) and mismatched (out-of-domain).
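
One common way to work with the corpus is through the Hugging Face datasets library. The sketch below assumes that package is installed; the "multi_nli" identifier, split names, and field names follow that library's conventions rather than anything defined by MNLI itself.

```python
# Minimal sketch: load MNLI and inspect one example (assumes `pip install datasets`).
from datasets import load_dataset

mnli = load_dataset("multi_nli")

# The library exposes the two evaluation splits described above.
matched = mnli["validation_matched"]        # in-domain genres
mismatched = mnli["validation_mismatched"]  # out-of-domain genres

# Each example carries a premise, a hypothesis, a genre tag, and an integer
# label (0 = entailment, 1 = neutral, 2 = contradiction).
example = matched[0]
print(example["premise"])
print(example["hypothesis"])
print(example["genre"])
print(matched.features["label"].int2str(example["label"]))
```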

When released by Williams, Nangia, and Bowman in 2018, MNLI quickly became a standard benchmark for natural language inference and transfer learning. It is widely used to train and evaluate neural models, especially those based on deep learning and transformer architectures. It has influenced the development of evaluation protocols in related benchmarks and is frequently used alongside other datasets in broader language-understanding suites such as GLUE.

The dataset's scale and genre coverage make it a challenging test for cross-domain reasoning, but it has also exposed biases and annotation quirks common in large NLI corpora. Researchers continue to use MNLI as a baseline for model robustness and to motivate new approaches to reasoning and generalization.