Home

NewsArchiven

NewsArchiven is a web‑based platform designed to collect, preserve, and provide searchable access to news articles from a wide range of publications. Launched in 2017 by a consortium of academic institutions and media organisations, the service aims to create a long‑term digital repository that mitigates the loss of journalistic content due to website redesigns, paywalls, or the closure of news outlets.

The system archives full‑text articles, metadata, and associated multimedia, employing web crawling, publisher feeds, and user

Key features include full‑text search, advanced filtering by date, source, geographic region, and editorial topic, as

Since its inception, NewsArchiven has indexed millions of articles from newspapers, magazines, and online news portals

submissions.
Content
is
stored
in
a
standardized
format
that
supports
multiple
languages
and
complies
with
the
Open
Archival
Information
System
(OAIS)
model.
To
respect
copyright,
NewsArchiven
operates
under
a
combination
of
licenses:
public
domain
and
Creative
Commons‑licensed
material
are
openly
accessible,
while
copyrighted
items
are
available
to
verified
researchers
and
subscribing
institutions
under
controlled‑use
agreements.
well
as
an
API
that
enables
integration
with
scholarly
tools
and
data‑analysis
platforms.
The
platform
also
offers
citation
tools
that
generate
appropriate
references
in
common
academic
styles.
A
community‑driven
governance
model
allows
participating
libraries
and
news
organisations
to
propose
policy
changes,
contribute
metadata
standards,
and
curate
thematic
collections.
across
more
than
50
countries.
It
is
widely
used
by
historians,
media
scholars,
and
fact‑checking
organisations.
Independent
evaluations
have
praised
its
reliability
and
the
transparency
of
its
preservation
practices,
while
some
critics
have
raised
concerns
about
the
sustainability
of
funding
and
the
challenges
of
balancing
open
access
with
publishers’
rights.
Ongoing
development
focuses
on
expanding
coverage
of
non‑English
sources
and
enhancing
machine‑learning
tools
for
automated
content
classification.