chunker
A chunker is a natural language processing component or model that performs shallow parsing: it groups adjacent words into chunks such as noun phrases (NPs) and verb phrases (VPs). Unlike a full syntactic parser, a chunker does not attempt to build a complete hierarchical parse tree; instead, it identifies flat, non-overlapping spans of text and labels each with a local syntactic category.
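As an illustration of the flat output described above, the following minimal sketch groups adjacent tokens into noun-phrase spans rather than building a tree. It is not taken from any particular chunker: the function name chunk_noun_phrases, the Penn Treebank-style tags (DT, JJ, NN), and the single hand-written pattern (optional determiner, any adjectives, one or more nouns) are all assumptions chosen for the example, and the input is assumed to be already part-of-speech tagged.

```python
from typing import List, Tuple

TaggedToken = Tuple[str, str]  # (word, part-of-speech tag); tag set is an assumption
Chunk = Tuple[str, int, int]   # (label, start index, end index exclusive)


def chunk_noun_phrases(tagged: List[TaggedToken]) -> List[Chunk]:
    """Return flat NP spans matching: optional DT, any JJ*, one or more NN*.

    Hypothetical toy rule for illustration only; real chunkers use richer
    patterns or learned models.
    """
    chunks: List[Chunk] = []
    i = 0
    n = len(tagged)
    while i < n:
        j = i
        # Optional determiner.
        if j < n and tagged[j][1] == "DT":
            j += 1
        # Zero or more adjectives (JJ, JJR, JJS).
        while j < n and tagged[j][1].startswith("JJ"):
            j += 1
        # One or more nouns (NN, NNS, NNP, NNPS) are required.
        noun_start = j
        while j < n and tagged[j][1].startswith("NN"):
            j += 1
        if j > noun_start:
            chunks.append(("NP", i, j))  # record a flat span, no nesting
            i = j
        else:
            i += 1  # no noun found here; move on one token
    return chunks


sentence = [
    ("the", "DT"), ("quick", "JJ"), ("fox", "NN"), ("jumped", "VBD"),
    ("over", "IN"), ("the", "DT"), ("lazy", "JJ"), ("dog", "NN"),
]
for label, start, end in chunk_noun_phrases(sentence):
    print(label, " ".join(word for word, _ in sentence[start:end]))
```

For the sample sentence, the sketch reports two NP spans, "the quick fox" and "the lazy dog", and leaves "jumped" and "over" outside any chunk. The result is a list of labeled spans over the token sequence, with no structure relating one chunk to another.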
Typically, chunkers operate on text that has been part-of-speech tagged. They can be rule-based, using regular
Common chunking tasks emphasize noun phrase chunking, though verb phrases and other categories may also be
Data sets such as the CoNLL-2000 NP chunking corpus have driven development and evaluation, using precision,