numbermarked
Numbermarked is a data annotation scheme used in natural language processing to label numeric expressions in text with structured metadata. The metadata lets downstream systems treat each number as a typed value rather than as an uninterpreted string of characters.
An annotation typically records attributes such as type (integer, decimal, fraction, percentage, currency, date-time), value, sign,
In practice, numbermarked can be encoded in formats such as JSON, XML, or IOB tagging. For example,
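As a minimal sketch of two of these encodings, the Python snippet below builds one annotation record, serializes it to JSON, and derives IOB tags for the same span. The class name `NumberMark`, the field names, the `NUM` label, and the helper `iob_tags` are all hypothetical, chosen for illustration only; they are not taken from any published numbermarked specification.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class NumberMark:
    """Hypothetical annotation record; field names are illustrative assumptions."""
    text: str     # surface form as it appears in the source text
    start: int    # character offset where the span begins
    end: int      # character offset just past the end of the span
    type: str     # e.g. "integer", "decimal", "percentage", "currency"
    value: float  # numeric value of the expression
    sign: str     # "+" or "-"


def iob_tags(tokens, span_start, span_end, label="NUM"):
    """Tag tokens[span_start:span_end] with B-/I- labels and all others with O."""
    tags = []
    for i in range(len(tokens)):
        if i == span_start:
            tags.append(f"B-{label}")      # first token of the numeric span
        elif span_start < i < span_end:
            tags.append(f"I-{label}")      # continuation of the numeric span
        else:
            tags.append("O")               # outside any numeric span
    return tags


sentence = "The rate rose to 3.5 percent"
surface = "3.5 percent"
start = sentence.index(surface)

mark = NumberMark(
    text=surface,
    start=start,
    end=start + len(surface),
    type="percentage",
    value=3.5,
    sign="+",
)

# JSON encoding of the single annotation record.
encoded = json.dumps(asdict(mark))

# IOB encoding of the same span over whitespace tokens (token indices 4 and 5).
tokens = sentence.split()
tags = iob_tags(tokens, 4, 6)

print(encoded)
print(list(zip(tokens, tags)))
```

The JSON form keeps the full attribute set alongside character offsets, while the IOB form records only span boundaries per token, so a pipeline that needs the value or type would have to carry them separately.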
Uses include information extraction, financial analytics, and question answering, where explicit numeric data improves accuracy and
Relation to standards: numbermarked draws on ideas from numeric normalization and unit tagging, and can be
Limitations: language variability, ambiguous units, and privacy concerns when numbers reveal sensitive information. Language coverage and
See also: numeric normalization, unit annotation, named-entity recognition, information extraction.