coflags
Coflags are a conceptual framework used to represent the aggregated outcome of multiple flagging signals about a piece of content or user behavior. The term combines co- (together) with flags (indicators of potential policy violations) to denote a consolidated status that reflects contributions from diverse sources, such as user reports, automated detectors, and human moderators.
Data model and terminology. Each coflag includes a set of dimension flags (for example, violence, harassment,
Types and aggregation. Coflags come in several forms: simple coflags (binary indicators across dimensions), composite coflags
Workflow and applications. In practice, new signals trigger the computation of coflags, which in turn determine
Limitations and governance. Challenges include potential bias in sources, signal drift, privacy concerns, latency, and interpretability