Home

textswhere

Textswhere is a framework and software platform for embedding precise location metadata within textual content to support context-aware search, annotation, and retrieval across diverse document sources. It is designed to connect text spans to their physical or digital locations, enabling users to reference exact passages by source, edition, and structure.

The core concept is the text where relation, which maps a text span to a source location

Origins of the approach are rooted in digital humanities and information retrieval research that highlights precise

Implementations typically include libraries that extract and annotate text, a database schema for fragment identifiers, and

Use cases span scholarly editions, digital libraries, legal document discovery, and media transcripts synchronized to time-based

See also: Open Annotation, fragment identifiers, provenance metadata.

such
as
a
page,
paragraph,
line,
or
a
digital
fragment
with
a
stable
identifier.
This
enables
reproducible
referencing
and
supports
multi-source
comparisons
by
linking
textual
content
to
its
provenance
and
surrounding
context.
The
model
emphasizes
interoperability,
using
standard
formats
such
as
JSON-LD
and
aligning
with
annotation
ecosystems
like
the
Open
Annotation
Data
Model
to
describe
provenance,
edition,
and
multiplicity
of
sources.
citational
anchors
as
an
improvement
over
generic
search
results.
Textswhere
prototypes
and
concepts
have
circulated
in
scholarly
and
technical
discussions
about
how
best
to
represent,
store,
and
query
location-bound
text.
APIs
for
querying
by
location
and
content.
The
approach
supports
indexing
strategies
that
combine
textual
search
with
structural
and
provenance
metadata,
enabling
refined
queries
such
as
“find
all
references
to
a
passage
within
chapter
3
of
edition
X.”
media
or
page
references.
Challenges
include
maintaining
anchor
stability
across
revisions,
performance
on
large
corpora,
and
the
need
for
community
standards
to
ensure
cross-system
compatibility.