Home

miningsuch

Miningsuch is a term used in corpus linguistics and information extraction to describe a data-mining approach that concentrates on extracting exemplars from text by exploiting the linguistic cue "such as" and its equivalents in other languages. The term is a recent neologism and is not yet standardized across disciplines; usages vary across papers and projects.

In practice, miningsuch involves locating sentences that contain a cue such as "such as" or its multilingual

Applications of miningsuch include constructing knowledge graphs, populating lexical resources, aiding summarization, and improving information retrieval

Limitations include reliance on the presence of explicit cues, variations in phrasing, cross-linguistic differences, and potential

Example: In the sentence "There are many programming languages, such as Python, Java, and C++, that are

See also: information extraction, list extraction, pattern mining, natural language processing.

equivalents,
then
parsing
the
following
material
to
identify
a
list
of
items.
The
method
typically
includes
tokenization,
part-of-speech
tagging,
and
dependency
parsing
to
separate
items
from
introductory
phrases,
handling
punctuation
and
conjunctions,
and
normalizing
multiword
expressions.
and
question
answering
by
turning
qualitative
lists
into
structured
data.
extraction
noise
from
ambiguous
or
incomplete
lists.
It
is
often
used
in
combination
with
broader
pattern-mining
or
relation-extraction
methods.
popular
for
education,"
miningsuch
would
extract
the
list
{Python,
Java,
C++}.