SpanSn
SpanSn is a software library for span-based text annotation and processing. It provides a formal representation of contiguous spans within token sequences and offers utilities for creating, validating, and comparing spans. The library is intended to support natural language processing tasks that rely on span-level annotations, such as named entity recognition, event extraction, and coreference resolution.
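As a rough illustration of what creating, validating, and comparing spans over a token sequence can look like, the following Python sketch defines a small span type. It is not SpanSn's actual API: the class name `Span`, the methods `is_valid_for`, `overlaps`, `contains`, and `text`, and the half-open indexing convention (the end index points one past the last token) are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True, order=True)
class Span:
    """Hypothetical span over a token sequence (not SpanSn's real class)."""

    start: int  # index of the first token in the span
    end: int    # index one past the last token (half-open; an assumption)

    def __post_init__(self) -> None:
        # Creation-time validation: reject negative or empty spans.
        if self.start < 0 or self.end <= self.start:
            raise ValueError(f"invalid span bounds: [{self.start}, {self.end})")

    def is_valid_for(self, tokens: Sequence[str]) -> bool:
        # A span is valid for a sequence if it does not run past its end.
        return self.end <= len(tokens)

    def overlaps(self, other: "Span") -> bool:
        # Two half-open intervals overlap if each starts before the other ends.
        return self.start < other.end and other.start < self.end

    def contains(self, other: "Span") -> bool:
        # True if `other` lies entirely inside this span.
        return self.start <= other.start and other.end <= self.end

    def text(self, tokens: Sequence[str]) -> List[str]:
        # Return the tokens covered by the span.
        return list(tokens[self.start:self.end])


tokens = ["Ada", "Lovelace", "visited", "London", "."]
person = Span(0, 2)  # covers "Ada Lovelace"
city = Span(3, 4)    # covers "London"
```

With these definitions, `person.text(tokens)` returns the two name tokens, `person.overlaps(city)` is false because the spans are disjoint, and a span such as `Span(4, 6)` fails `is_valid_for(tokens)` because it extends past the end of the sequence. The frozen dataclass makes spans hashable, so they can be stored in sets or used as dictionary keys.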
Core concepts include the Span object, which records a start and end index, and the SpanSet or
SpanSn provides bindings for several programming languages and can be integrated with common data formats used
Applications range from annotation tooling and corpus management to model training for span-based labeling tasks. The