DuckDuckGo
 

Natural language processing

(DDG Topics List)
A B C D E F G H I K L M N O P Q R S T W
A  [top]
 
AFNLP - AFNLP is the organization for coordinating the natural language processing related activities and events in the Asia-Pacific region.
 
Aggregation (linguistics) - Aggregation is a subtask of Natural language generation, which involves merging syntactic constituents together.
 
Attensity - Attensity Group is a software company formed by a 2009 merger of Germany's Empolis with Attensity Corp. founded in 2000, in Palo Alto, California.
 
Automatic summarization - Automatic summarisation is the creation of a shortened version of a text by a computer program.
 
B  [top]
 
Bag of words model - The bag-of-words model is a simplifying assumption used in natural language processing and information retrieval.
 
Bigram - Bigrams or digrams are groups of two written letters, two syllables, or two words, and are very commonly used as the basis for simple statistical analysis of text.
 
Brill tagger - The Brill tagger is a method for doing part-of-speech tagging.
 
C  [top]
 
Calais (Reuters Product) - Calais is a Thomson Reuters initiative to encourage the wide deployment of semantic technologies in the information and content marketplaces.
 
ChaSen - ChaSen is a morphological parser for the Japanese language.
 
ClearForest - ClearForest is a software company that develops and markets text analytics and text mining solutions.
 
CMU Pronouncing Dictionary - The CMU Pronouncing Dictionary is a public domain pronouncing dictionary created by Carnegie Mellon University.
 
Computational semantics - Computational semantics is the study of how to automate the process of constructing and reasoning with meaning representations of natural language expressions.
 
Computerized Speech Lab - The Computerized Speech Lab is a speech and signal processing computer workstation used for research and clinical speech therapy.
 
Concept mining - Concept mining is an activity that results in the extraction of concepts from artifacts.
 
Controlled natural language - Controlled natural languages are subsets of natural languages, obtained by restricting the grammar and vocabulary in order to reduce or eliminate ambiguity and complexity.
 
Conversational agent - Conversational agents are communication technologies that use natural language and computational linguistic techniques to engage users in human-like, Web-based “dialogs.” They can support a bro...
 
Cross-language information retrieval - Cross-language information retrieval is a subfield of information retrieval dealing with retrieving information written in a language different from the language of the user's query.
 
D  [top]
 
DATR - DATR is a language for lexical knowledge representation.
 
Discourse relation - A discourse relation is a description of how two segments of discourse are logically connected to one another.
 
Document classification - Document classification/categorization is a problem in information science.
 
Document-term matrix - A document-term matrix or term-document matrix is a mathematical matrix that describes the frequency of terms that occur in a collection of documents.
 
E  [top]
 
ETBLAST - eTBLAST is a text similarity search engine currently offering access to the MEDLINE database, the National Institutes of Health CRISP database, the Institute of Physics database, and the NASA te...
 
Example-based machine translation - The example-based machine translation approach to machine translation is often characterized by its use of a bilingual corpus with parallel texts as its main knowledge base, at run-time.
 
F  [top]
 
Filtered-popping recursive transition network - A filtered-popping recursive transition network, or simply filtered-popping network, is a recursive transition network extended with a map of states to keys where returning from a subrouti...
 
Foreign language writing aid - A foreign language writing aid is a computer program that assists a non-native language user in writing decently in their target language.
 
G  [top]
 
General Architecture for Text Engineering - General Architecture for Text Engineering or GATE is a Java suite of tools originally developed at the University of Sheffield beginning in 1995 and now used worldwide by a wide community ...
 
GeneRIF - A GeneRIF or Gene Reference Into Function is a short statement about the function of a gene.
 
Gorn address - A Gorn address is a method of addressing an interior node within a tree from a phrase structure rule description or parse tree.
 
Grammar checker - A grammar checker in computing terms, is a program, or part of a program, that attempts to verify written text for grammatical correctness.
 
Grammar induction - Grammatical induction, also known as grammatical inference or syntactic pattern recognition, refers to the process in machine learning of learning a formal grammar from a set of observati...
 
H  [top]
 
History of machine translation - The history of machine translation generally starts in the 1950s, although work can be found from earlier periods.
 
History of natural language processing - The history of Natural language processing describes the advances of Natural language processing There is some overlap with the history of machine translation, and the history of artificial int...
 
I  [top]
 
iGlue - iGlue is an experimental database with detailed search options, containing entities and information editing tool.
 
Information extraction - Information extraction dates back to the late 1970s in the early days of NLP.
 
International Conference on Language Resources and Evaluation - The International Conference on Language Resources and Evaluation is a biennial conference organised by the European Language Resources Association with the support of institutions and organisa...
 
K  [top]
 
L  [top]
 
Language engineering - Language engineering is the creation of natural language processing systems whose cost and outputs are measurable and predictable as well as establishment of language regulators, such as formal ...
 
Language identification - Language identification is the process of determining which natural language given content is in.
 
Languageware - LanguageWare is a natural language processing technology developed by IBM, that allows applications to process natural language text.
 
Latent semantic analysis - Latent semantic analysis is a technique in natural language processing, in particular in vectorial semantics, of analyzing relationships between a set of documents and the terms they contain by ...
 
Latent semantic mapping - Latent semantic mapping is a data-driven framework to model globally meaningful relationships implicit in large volumes of data.
 
Legal information retrieval - Legal information retrieval is the science of information retrieval applied to legal text, including legislation, case law, and scholarly works.
 
Lesk algorithm - The Lesk algorithm is a classical algorithm for word sense disambiguation introduced by Michael E. Lesk in 1986.
 
Lexalytics - Lexalytics, Inc. provides enterprise and hosted text analytics software to transform unstructured text into structured data.
 
Lexical choice - Lexical choice is a subtask of Natural language generation, which involves choosing the content words in a generated text.
 
Lexical Markup Framework - ISO 24613:2008, Language resource management - Lexical markup framework, is the ISO International Organization for Standardization ISO/TC37 standard for natural language processing and machi...
 
Lexical substitution - Lexical substitution is the task of identifying a substitute for a word in the context of a sentence.
 
Linguistic Issues in Language Technology - Linguistic Issues in Language Technology is an open-access journal that, according to its web page, "focusses on relationships between linguistic insights, which can prove valuable to language t...
 
LKB - Linguistic Knowledge Builder is a free and open source grammar engineering environment for creating grammars and lexicons of natural languages.
 
Logic form - Logic forms are simple, first-order logic knowledge representations of natural language sentences formed by the conjunction of concept predicates related through shared arguments.
 
M  [top]
 
MAREC - The MAtrixware REsearch Collection is a standardised patent data corpus available for research purposes.
 
METEOR - METEOR is a metric for the evaluation of machine translation output.
 
Microsoft text-to-speech voices - The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API.
 
Modular Audio Recognition Framework - Modular Audio Recognition Framework is an open-source research platform and a collection of voice, sound, speech, text and natural language processing algorithms written in Java and arranged int...
 
Morphological pattern - A morphological pattern is a set of associations and/or operations that build the various forms of a lexeme, possibly by inflection, agglutination, compounding or derivation.
 
Multi-document summarization - Multi-document summarization is an automatic procedure aimed at extraction of information from multiple texts written about the same topic.
 
N  [top]
 
N-gram - An n-gram is a subsequence of n items from a given sequence.
 
Natural language - In the philosophy of language, a natural language is any language which arises in an unpremeditated fashion as the result of the innate facility for language possessed by the human intellect.
 
Natural language generation - Natural Language Generation is the natural language processing task of generating natural language from a machine representation system such as a knowledge base or a logical form.
 
Natural language processing - Natural Language processing is a field of computer science and linguistics concerned with the interactions between computers and human languages.
 
Natural Language Toolkit - Natural Language Toolkit or, more commonly, NLTK is a suite of libraries and programs for symbolic and statistical natural language processing for the Python programming language.
 
Natural language user interface - Natural Language User Interfaces are a type of computer human interface where linguistic phenomena such as verbs, phrases and clauses act as UI controls for creating, selecting and modifying dat...
 
News analytics - News analysis refers to the measurement of the various qualitative and quantitative attributes of textual news stories.
 
Noisy text analytics - Noisy text analytics is a process of information extraction whose goal is to automatically extract structured or semistructured information from noisy unstructured text data.
 
O  [top]
 
Ontology learning - Ontology learning is a subtask of information extraction.
 
Open domain question answering - In information retrieval, an open domain question answering system aims at returning an answer in response to the user’s question.
 
P  [top]
 
Paco Nathan - Paco Nathan is a computer scientist, author, and performance art show producer from San Luis Obispo, California, who established much of his career in Austin, Texas.
 
Powerset (company) - Powerset is a company based in San Francisco, California that is developing a natural language search engine for the Internet.
 
Production (computer science) - A production or production rule in computer science is a rewrite rule specifying a symbol substitution that can be recursively performed to generate new symbol sequences.
 
PropBank - PropBank is a corpus that is annotated with verbal propositions and their arguments—a "proposition bank".
 
Q  [top]
 
Question answering - In information retrieval and natural language processing, question answering is the task of automatically answering a question posed in natural language.
 
R  [top]
 
Realization (linguistics) - Realisation is a subtask of Natural language generation, which involves creating an actual text in a human language from a syntactic representation.
 
Recursive transition network - A recursive transition network is a graph theoretical schematic used to represent the rules of a context free grammar.
 
Referring expression generation - Referring expression generation is a subtask of Natural language generation, which involves creating referring expressions that identify specific entities to the reader.
 
Rewrite rule - In linguistics, a rewrite rule for natural language in generative grammar is a rule of the form A → X where A is a syntactic category label, such as noun phrase or sentence, and X is a sequence ...
 
Robby Garner - Robby Garner is a natural language programmer and software developer.
 
S  [top]
 
Semantic analytics - Semantic analytics is the use of ontologies to analyze content in web resources.
 
Semantic neural network - Semantic neural network is based on John von Neumann's neural network von Neumann, 1966 and Nikolai Amosov M-Network.
 
Sentence extraction - Sentence extraction is a technique used for automatic summarization.
 
Sentiment analysis - Sentiment analysis or opinion mining refers to a broad area of natural language processing, computational linguistics and text mining.
 
SHRDLU - SHRDLU was an early natural language understanding computer program, developed by Terry Winograd at MIT from 1968-1970.
 
Speech segmentation - Speech segmentation is the process of identifying the boundaries between words, syllables, or phonemes in spoken natural languages.
 
SPL notation - SPL stands for Sentence Plan Language.
 
Stemming - In linguistic morphology, stemming is the process for reducing inflected words to their stem, base or root form – generally a written word form.
 
String Kernel - String kernel is a mathematical tool used in large scale data analysis and mining, where sequence data are to be clustered or classified.
 
Studies in NLP - Studies in Natural Language Processing is the book series of the Association for Computational Linguistics, published by Cambridge University Press.
 
Sukhotins Algorithm - Sukhotins' Algorithm is a statistical classification algorithm for classifying characters in a text as vowels or consonants.
 
Synthetix - Synthetix Ltd is a software company based in Cambridge, United Kingdom and founded in 2001.
 
T  [top]
 
T9 (predictive text) - T9, which stands for Text on 9 keys, is a patented predictive text technology for mobile phones, originally developed by Tegic Communications, now part of Nuance Communications.
 
Teragram Corporation - Teragram Corporation is a fully owned subsidiary of SAS Institute, a major producer of statistical analysis software, headquartered in Cary, North Carolina, USA. Teragram is based in Cambridge,...
 
Terminology extraction - Terminology mining, term extraction, term recognition, or glossary extraction, is a subtask of information extraction.
 
Text analytics - The term text analytics describes a set of linguistic, statistical, and machine learning techniques that model and structure the information content of textual sources for business intelligence,...
 
Text Retrieval Conference - The Text REtrieval Conference is an on-going series of workshops focusing on a list of different information retrieval research areas, or tracks.
 
Text simplification - Text simplification is an operation used in natural language processing to modify, enhance, classify or otherwise process an existing corpus of human-readable text in such a way that the gramma...
 
Text-to-voice - Text to Voice or Text to Speech is a Firefox extension developed by Vikram Joshi, an under-graduate from IIT Delhi.
 
TipTop Technologies - TipTop Technologies offers a real-time web, social search engine with a unique platform for semantic analysis of natural language.
 
Transderivational search - Transderivational search is a psychological and cybernetics term, meaning when a search is being conducted for a fuzzy match across a broad field.
 
Trigram - Trigrams are a special case of the N-gram, where N is 3.
 
Triphone - In linguistics, a triphone is a sequence of three phonemes.
 
W  [top]
 
w-shingling - In natural language processing a w-shingling is a set of unique "shingles"—contiguous subsequences of tokens in a document—that can be used to gauge the similarity of two documents.
 
Word sense - In computational linguistics, word sense disambiguation is an open problem of natural language processing, which governs the process of identifying which sense of a word is used in a sentence, w...
 
Word sense induction - In computational linguistics, word sense induction or discrimination is an open problem of natural language processing, which concerns the automatic identification of the senses of a word.
 
WYSIWYM (Meant) - What You See Is What You Meant allows users to create abstract knowledge representations such as those required by the Semantic Web using a natural language i...
 
Web Links  [top]
Try search on: