TEIbased

TEIbased describes text encoding projects, tools, or content that follows the Text Encoding Initiative (TEI) Guidelines. TEI is a widely used standard in digital humanities for representing literary, linguistic, and historical texts in machine-readable XML. TEIbased materials aim to facilitate scholarly editing, annotation, search, preservation, and interchange by encoding not only the text but also metadata, variants, structures, and annotations according to TEI conventions.

Overview: The TEI Guidelines define a modular, extensible schema for encoding features such as document structure, bibliographic metadata, apparatus for textual variants, linguistic annotation, and critical notes. TEI P5 is the current edition; it provides dedicated modules for prose, poetry, drama, names and dates, dictionary entries, and more.

Structure: A TEI document typically includes a <teiHeader> with bibliographic metadata and project information, and a <text> element that contains the encoded content. Inside <text>, <body> holds the primary text, with divisions such as <div> and <p>, and, for verse, <l> (line) and <lg> (line group). TEI also supports markup such as <note>, <app> for critical apparatus, <handNote> and <handShift> for manuscript hands, and <w> or <pc> for token-level annotation.

Customization and processing: TEI can be customized via ODD ("One Document Does it all") to create project-specific schemas. TEI documents can be validated against these schemas, which are commonly expressed in RELAX NG, and transformed with XSLT into HTML, PDF, or other TEI-conformant outputs. TEI-based data interoperates with related standards such as EAD and METS, and it is often used with tools like the oXygen XML Editor, TEI Publisher, and the TEI Stylesheets.

Impact and usage: TEIbased encoding is common in digital editions, linguistic corpora, manuscript studies, and heritage documentation. It emphasizes interoperability, reproducibility, and long-term preservation, though it requires training and governance to maintain consistency and quality.
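
Example: The header/text structure and token-level markup described above can be illustrated with a minimal sketch. The sample document, its title, and the helper name summarize are invented here for illustration and do not come from any real edition; the code uses only Python's standard library and does not validate the document against a TEI schema.

```python
import xml.etree.ElementTree as ET

# TEI P5 elements live in this XML namespace.
NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical sample document, invented for illustration only.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Sample Poem</title></titleStmt>
      <publicationStmt><p>Unpublished illustration.</p></publicationStmt>
      <sourceDesc><p>Born digital.</p></sourceDesc>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <div type="poem">
        <lg type="stanza">
          <l n="1"><w>Roses</w> <w>are</w> <w>red</w><pc>,</pc></l>
          <l n="2"><w>violets</w> <w>are</w> <w>blue</w><pc>.</pc></l>
        </lg>
      </div>
    </body>
  </text>
</TEI>"""


def summarize(tei_xml: str) -> dict:
    """Return the header title and the <w> tokens of each verse line."""
    root = ET.fromstring(tei_xml)

    # Metadata comes from <teiHeader>.
    title = root.findtext(
        "tei:teiHeader/tei:fileDesc/tei:titleStmt/tei:title",
        namespaces=NS,
    )

    # Encoded content comes from <text>/<body>; collect every verse line <l>.
    lines = []
    for line in root.iterfind("tei:text/tei:body//tei:l", NS):
        words = [w.text for w in line.iterfind("tei:w", NS)]
        lines.append((line.get("n"), words))

    return {"title": title, "lines": lines}


if __name__ == "__main__":
    print(summarize(SAMPLE))
```

Because every element is namespace-qualified, the queries pass an explicit prefix mapping; omitting it is a common reason that searches over TEI files return nothing. A production workflow would more likely validate against a project schema and transform with XSLT, as described above.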