MMbbld
MMbbld, short for Multi-Modal Balanced Binary Labeling Dataset, is a benchmark dataset proposed for evaluating multi-modal machine learning models on binary classification tasks. It is designed to assess how effectively models fuse information from diverse modalities while preserving balanced class representations across the dataset.
Dataset composition and modalities: Each example includes an image (RGB, 224 by 224), a short text caption,
Annotation and quality control: Labels are produced through a crowd-sourced workflow with multiple independent annotators per
Data creation and licensing: Content is assembled from publicly available sources and carefully scrubbed for privacy.
Usage and benchmarks: MMbbld serves as a testbed for multi-modal fusion approaches, including early fusion, late