Moderationsbias
Moderationsbias is a term used to describe systematic distortions in content moderation on online platforms, where decisions by human moderators or automated systems diverge from normative expectations and disproportionately affect certain kinds of content, groups, or viewpoints. It encompasses biases that arise from the moderation process itself, rather than from the underlying policies alone.
Causes of moderationsbias include the cognitive and cultural biases of individual moderators, ambiguous or evolving guidelines,
The manifestations of moderationsbias vary, ranging from over-enforcement of certain topics (for example, political or sensitive
Research on moderationsbias uses content analysis, experiment designs, and platform data audits to identify patterns of
See also: algorithmic bias, content moderation, platform governance.