Tasks of Natural language processing
Anaphora (linguistics)
In linguistics, an anaphora is a type of expression whose reference depends on another referential element.
In linguistics, an anaphora is a type of expression whose reference depends on another referential element.
Asia Online
Asia Online is a privately owned company backed by individual investors and institutional venture capital.
Asia Online is a privately owned company backed by individual investors and institutional venture capital.
Automatic summarization
Automatic summarization is the creation of a shortened version of a text by a computer program.
Automatic summarization is the creation of a shortened version of a text by a computer program.
Collocation extraction
Collocation extraction is the task of extracting collocations automatically from a corpus using a computer.
Collocation extraction is the task of extracting collocations automatically from a corpus using a computer.
ETAP-3
ETAP-3 is a linguistic processing system focusing on English and Russian.
ETAP-3 is a linguistic processing system focusing on English and Russian.
Language identification
Language identification is the process of determining which natural language given content is in.
Language identification is the process of determining which natural language given content is in.
Lemmatisation
Lemmatisation in linguistics, is the process of grouping together the different inflected forms of a word so they can be analysed as a single item.
Lemmatisation in linguistics, is the process of grouping together the different inflected forms of a word so they can be analysed as a single item.
Linguistic empathy
Linguistic empathy in theoretical linguistics is the “point of view” in an anaphoric utterance by which a participant is bound with or in the event or state that he/she describes in that sentence.
Linguistic empathy in theoretical linguistics is the “point of view” in an anaphoric utterance by which a participant is bound with or in the event or state that he/she describes in that sentence.
Machine translation
Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation MAHT and interac...
Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation MAHT and interac...
Named entity recognition
Named entity recognition is a subtask of information extraction that seeks to locate and classify atomic elements in text into predefined categories such as the names of persons, organizations, ...
Named entity recognition is a subtask of information extraction that seeks to locate and classify atomic elements in text into predefined categories such as the names of persons, organizations, ...
Named-entity recognition
Named entity recognition (NER) (also known as entity identification and entity extraction) is a subtask of information extraction that seeks to locate and classify atomic elements in...
Named entity recognition (NER) (also known as entity identification and entity extraction) is a subtask of information extraction that seeks to locate and classify atomic elements in...
Part-of-speech tagging
In corpus linguistics, part-of-speech tagging, also called grammatical tagging or word-category disambiguation, is the process of marking up a word in a text as corresponding to a pa...
In corpus linguistics, part-of-speech tagging, also called grammatical tagging or word-category disambiguation, is the process of marking up a word in a text as corresponding to a pa...
Phrase chunking
Phrase chunking is a natural language process that separates and segments sentences into their subconstituents, such as noun, verb, and prepositional phrases.
Phrase chunking is a natural language process that separates and segments sentences into their subconstituents, such as noun, verb, and prepositional phrases.
Relationship extraction
A relationship extraction task requires the detection and classification of semantic relationship mentions within a set of artifacts, typically from text or XML documents.
A relationship extraction task requires the detection and classification of semantic relationship mentions within a set of artifacts, typically from text or XML documents.
Semantic role labeling
Semantic role labeling is a task in natural language processing consisting of the detection of the semantic arguments associated with the predicate or verb of a sentence and their classification...
Semantic role labeling is a task in natural language processing consisting of the detection of the semantic arguments associated with the predicate or verb of a sentence and their classification...
Sentence boundary
Sentence boundary disambiguation (SBD), also known as sentence breaking, is the problem in natural language processing of deciding where sentences begin and end.
Sentence boundary disambiguation (SBD), also known as sentence breaking, is the problem in natural language processing of deciding where sentences begin and end.
Shallow parsing
Shallow parsing (also chunking, "light parsing") is an analysis of a sentence which identifies the constituents (noun groups, verbs, verb groups, etc.), but does not specify their internal...
Shallow parsing (also chunking, "light parsing") is an analysis of a sentence which identifies the constituents (noun groups, verbs, verb groups, etc.), but does not specify their internal...
Stemming
In linguistic morphology and information retrieval, stemming is the process for reducing inflected words to their stem, base or root form—generally a written word form.
In linguistic morphology and information retrieval, stemming is the process for reducing inflected words to their stem, base or root form—generally a written word form.
Terminology extraction
Terminology mining, term extraction, term recognition, or glossary extraction, is a subtask of information extraction.
Terminology mining, term extraction, term recognition, or glossary extraction, is a subtask of information extraction.
Text segmentation
Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics.
Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics.
Textual entailment
Textual entailment in natural language processing is a directional relation between text fragments.
Textual entailment in natural language processing is a directional relation between text fragments.
Tokenization
Tokenization is the process of breaking a stream of text up into words, phrases, symbols, or other meaningful elements called tokens.
Tokenization is the process of breaking a stream of text up into words, phrases, symbols, or other meaningful elements called tokens.
Truecasing
Truecasing is the problem in natural language processing (NLP) of determining the proper capitalization of words where such information is unavailable.
Truecasing is the problem in natural language processing (NLP) of determining the proper capitalization of words where such information is unavailable.
Settings