corrigible
Corrigible is an adjective describing something that is capable of being corrected, amended, or improved. It can refer to a person who is receptive to feedback and willing to adjust beliefs or actions, or to a system or process that is designed to be modified in light of errors or new information. The term derives from Latin corrigere, meaning “to correct,” with the suffix -ible indicating possibility or capability.
In broader usage, corrigible emphasizes openness to refinement and learning from mistakes. In technical and philosophical
Key attributes typically associated with corrigibility in AI safety discussions include deference to human preferences, acceptance
See also: AI safety, AI alignment, control problem, value alignment.