Natural language processing
Affix grammar over a finite lattice
In linguistics, the affix grammars over a finite lattice (AGFL) formalism is a notation for context-free grammars with finite set-valued features, acceptable to linguists of many different schools.
In linguistics, the affix grammars over a finite lattice (AGFL) formalism is a notation for context-free grammars with finite set-valued features, acceptable to linguists of many different schools.
AFNLP
AFNLP (Asian Federation of Natural Language Processing Associations) is the organization for coordinating the natural language processing related activities and events in the Asia-Pacific region.
AFNLP (Asian Federation of Natural Language Processing Associations) is the organization for coordinating the natural language processing related activities and events in the Asia-Pacific region.
Aggregation (linguistics)
Aggregation is a subtask of Natural language generation, which involves merging syntactic constituents (such as sentences and phrases) together.
Aggregation is a subtask of Natural language generation, which involves merging syntactic constituents (such as sentences and phrases) together.
Askbot
Askbot is an open source question and answer oriented Internet forum similar to Stack Overflow or Yahoo Answers.
Askbot is an open source question and answer oriented Internet forum similar to Stack Overflow or Yahoo Answers.
Attensity
Attensity provides social analytics and engagement applications for software for Social Customer Relationship Management.
Attensity provides social analytics and engagement applications for software for Social Customer Relationship Management.
Automated essay scoring
Automated essay scoring is the use of specialized computer programs to assign grades to essays written in an educational setting.
Automated essay scoring is the use of specialized computer programs to assign grades to essays written in an educational setting.
Automatic acquisition of lexicon
Automatic acquisition of lexicon is a computerized process used for the development of a complex morphological lexicon of a language.
Automatic acquisition of lexicon is a computerized process used for the development of a complex morphological lexicon of a language.
Automatic summarization
Automatic summarization is the creation of a shortened version of a text by a computer program.
Automatic summarization is the creation of a shortened version of a text by a computer program.
Bag of words model
The bag-of-words model is a simplifying assumption used in natural language processing and information retrieval.
The bag-of-words model is a simplifying assumption used in natural language processing and information retrieval.
Bigram
A bigram or digram is every sequence of two adjacent elements in a string of tokens, which are typically letters, syllables, or words; they are n-grams for n=2.
A bigram or digram is every sequence of two adjacent elements in a string of tokens, which are typically letters, syllables, or words; they are n-grams for n=2.
Brill tagger
The Brill tagger is a method for doing part-of-speech tagging.
The Brill tagger is a method for doing part-of-speech tagging.
Cache language model
A cache language model is a type of statistical language model that contains a cache component and that assigns relatively high probabilities to words or word sequences that occur elsewhere in a...
A cache language model is a type of statistical language model that contains a cache component and that assigns relatively high probabilities to words or word sequences that occur elsewhere in a...
Calais (Reuters Product)
Calais is a Thomson Reuters initiative to encourage the wide deployment of semantic technologies in the information and content marketplaces.
Calais is a Thomson Reuters initiative to encourage the wide deployment of semantic technologies in the information and content marketplaces.
Calais (Reuters product)
Calais is a service by Thomson Reuters that automatically extracts semantic information from web pages in a format that can be used on the semantic web.
Calais is a service by Thomson Reuters that automatically extracts semantic information from web pages in a format that can be used on the semantic web.
ClearForest
ClearForest is a software company that develops and markets text analytics and text mining solutions.
ClearForest is a software company that develops and markets text analytics and text mining solutions.
CMU Pronouncing Dictionary
The CMU Pronouncing Dictionary is a public domain pronouncing dictionary created by Carnegie Mellon University.
The CMU Pronouncing Dictionary is a public domain pronouncing dictionary created by Carnegie Mellon University.
Computational semantics
Computational semantics is the study of how to automate the process of constructing and reasoning with meaning representations of natural language expressions.
Computational semantics is the study of how to automate the process of constructing and reasoning with meaning representations of natural language expressions.
Computerized Speech Lab
The Computerized Speech Lab is a speech and signal processing computer workstation used for research and clinical speech therapy.
The Computerized Speech Lab is a speech and signal processing computer workstation used for research and clinical speech therapy.
Concept mining
Concept mining is an activity that results in the extraction of concepts from artifacts.
Concept mining is an activity that results in the extraction of concepts from artifacts.
Content determination
Content determination is a subtask of Natural language generation, which involves deciding the on the information communicated in a generated text.
Content determination is a subtask of Natural language generation, which involves deciding the on the information communicated in a generated text.
Controlled natural language
Controlled natural languages (CNLs) are subsets of natural languages, obtained by restricting the grammar and vocabulary in order to reduce or eliminate ambiguity and complexity.
Controlled natural languages (CNLs) are subsets of natural languages, obtained by restricting the grammar and vocabulary in order to reduce or eliminate ambiguity and complexity.
Cross-language information retrieval
Cross-language information retrieval (CLIR) is a subfield of information retrieval dealing with retrieving information written in a language different from the language of the user's query.
Cross-language information retrieval (CLIR) is a subfield of information retrieval dealing with retrieving information written in a language different from the language of the user's query.
DATR
DATR is a language for lexical knowledge representation.
DATR is a language for lexical knowledge representation.
DELPH-IN
DEep Linguistic Processing with HPSG - INitiative is a collaboration where computational linguists worldwide develops natural language processing tools for deep linguisti...
DEep Linguistic Processing with HPSG - INitiative is a collaboration where computational linguists worldwide develops natural language processing tools for deep linguisti...
Discourse relation
A discourse relation (or rhetorical relation) is a description of how two segments of discourse are logically connected to one another.
A discourse relation (or rhetorical relation) is a description of how two segments of discourse are logically connected to one another.
Document classification
Document classification or document categorization is a problem in both library science, information science and computer science.
Document classification or document categorization is a problem in both library science, information science and computer science.
Document structuring
Document Structuring is a subtask of Natural language generation, which involves deciding the order and grouping (for example into paragraphs) of sentences in a generated text.
Document Structuring is a subtask of Natural language generation, which involves deciding the order and grouping (for example into paragraphs) of sentences in a generated text.
Document-term matrix
A document-term matrix or term-document matrix is a mathematical matrix that describes the frequency of terms that occur in a collection of documents.
A document-term matrix or term-document matrix is a mathematical matrix that describes the frequency of terms that occur in a collection of documents.
Dragomir R. Radev
Dragomir R. Radev is a University of Michigan computer science professor working on natural language processing and information retrieval.
Dragomir R. Radev is a University of Michigan computer science professor working on natural language processing and information retrieval.
ETBLAST
eTBLAST is a free text similarity service search engine currently offering access to the MEDLINE database, the National Institutes of Health CRISP database, the Institute of Physics database, Wi...
eTBLAST is a free text similarity service search engine currently offering access to the MEDLINE database, the National Institutes of Health CRISP database, the Institute of Physics database, Wi...
Example-based machine translation
The example-based machine translation (EBMT) approach to machine translation is often characterized by its use of a bilingual corpus with parallel texts as its main knowledge base, at ...
The example-based machine translation (EBMT) approach to machine translation is often characterized by its use of a bilingual corpus with parallel texts as its main knowledge base, at ...
Filtered-popping recursive transition network
A filtered-popping recursive transition network (FPRTN), or simply filtered-popping network (FPN), is a recursive transition network (RTN) extended with a map of states to keys...
A filtered-popping recursive transition network (FPRTN), or simply filtered-popping network (FPN), is a recursive transition network (RTN) extended with a map of states to keys...
GeneRIF
A GeneRIF or Gene Reference Into Function is a short statement about the function of a gene.
A GeneRIF or Gene Reference Into Function is a short statement about the function of a gene.
Gorn address
A Gorn address (Gorn, 1967) is a method of addressing an interior node within a tree from a phrase structure rule description or parse tree.
A Gorn address (Gorn, 1967) is a method of addressing an interior node within a tree from a phrase structure rule description or parse tree.
Grammar checker
A grammar checker in computing terms, is a program, or part of a program, that attempts to verify written text for grammatical correctness.
A grammar checker in computing terms, is a program, or part of a program, that attempts to verify written text for grammatical correctness.
Grammar induction
Grammatical induction, also known as grammatical inference or syntactic pattern recognition, refers to the process in machine learning of learning a formal grammar (usually in the form of...
Grammatical induction, also known as grammatical inference or syntactic pattern recognition, refers to the process in machine learning of learning a formal grammar (usually in the form of...
Grammatik
Grammatik was the first grammar checking program developed for home computer systems.
Grammatik was the first grammar checking program developed for home computer systems.
History of machine translation
The history of machine translation generally starts in the 1950s, although work can be found from earlier periods.
The history of machine translation generally starts in the 1950s, although work can be found from earlier periods.
History of natural language processing
The history of natural language processing describes the advances of natural language processing.
The history of natural language processing describes the advances of natural language processing.
iGlue
iGlue is an experimental database with detailed search options, containing entities and information editing tool.
iGlue is an experimental database with detailed search options, containing entities and information editing tool.
Information extraction
Information extraction is the task of automatically extracting structured information from unstructured and/or semi-structured machine-readable documents.
Information extraction is the task of automatically extracting structured information from unstructured and/or semi-structured machine-readable documents.
Information retrieval
Information retrieval is the area of study concerned with searching for documents, for information within documents, and for metadata about documents, as well as that of searching structured sto...
Information retrieval is the area of study concerned with searching for documents, for information within documents, and for metadata about documents, as well as that of searching structured sto...
International Conference on Language Resources and Evaluation
The International Conference on Language Resources and Evaluation (LREC) is a biennial conference organised by the European Language Resources Association with the support of institutions and o...
The International Conference on Language Resources and Evaluation (LREC) is a biennial conference organised by the European Language Resources Association with the support of institutions and o...
Jean-Philippe de Lespinay
Jean-Philippe de Lespinay is a French inventor.
Jean-Philippe de Lespinay is a French inventor.
Kleene star
In mathematical logic and computer science, the Kleene star (or Kleene operator or Kleene closure) is a unary operation, either on sets of strings or on sets of symbols or characters.
In mathematical logic and computer science, the Kleene star (or Kleene operator or Kleene closure) is a unary operation, either on sets of strings or on sets of symbols or characters.
Language Computer Corporation
Language Computer Corporation (LCC) is a natural language processing research company based in Richardson, Texas.
Language Computer Corporation (LCC) is a natural language processing research company based in Richardson, Texas.
Language guessing
Language identification or language guessing is the process of automatically determining the (natural) language a document or piece of text is written in.
Language identification or language guessing is the process of automatically determining the (natural) language a document or piece of text is written in.
Language identification
Language identification is the process of determining which natural language given content is in.
Language identification is the process of determining which natural language given content is in.
Languageware
LanguageWare is a natural language processing (NLP) technology developed by IBM, that allows applications to process natural language text.
LanguageWare is a natural language processing (NLP) technology developed by IBM, that allows applications to process natural language text.
Latent semantic analysis
Latent semantic analysis (LSA) is a technique in natural language processing, in particular in vectorial semantics, of analyzing relationships between a set of documents and the terms they...
Latent semantic analysis (LSA) is a technique in natural language processing, in particular in vectorial semantics, of analyzing relationships between a set of documents and the terms they...
Latent semantic mapping
Latent semantic mapping (LSM) is a data-driven framework to model globally meaningful relationships implicit in large volumes of (often textual) data.
Latent semantic mapping (LSM) is a data-driven framework to model globally meaningful relationships implicit in large volumes of (often textual) data.
Legal information retrieval
Legal information retrieval is the science of information retrieval applied to legal text, including legislation, case law, and scholarly works.
Legal information retrieval is the science of information retrieval applied to legal text, including legislation, case law, and scholarly works.
Lesk algorithm
The Lesk algorithm is a classical algorithm for word sense disambiguation introduced by Michael E. Lesk in 1986.
The Lesk algorithm is a classical algorithm for word sense disambiguation introduced by Michael E. Lesk in 1986.
Lessac Technologies
Lessac Technologies, Inc. is a domestic firm incorporated in Delaware.
Lessac Technologies, Inc. is a domestic firm incorporated in Delaware.
Lexalytics
Lexalytics, Inc. provides enterprise and hosted text analytics software to transform unstructured text into structured data.
Lexalytics, Inc. provides enterprise and hosted text analytics software to transform unstructured text into structured data.
Lexical choice
Lexical choice is a subtask of Natural language generation, which involves choosing the content words (nouns, verbs, adjectives, adverbs) in a generated text.
Lexical choice is a subtask of Natural language generation, which involves choosing the content words (nouns, verbs, adjectives, adverbs) in a generated text.
Lexical Markup Framework
ISO 24613:2008, Language resource management - Lexical markup framework, is the ISO International Organization for Standardization ISO/TC37 standard for natural language processing and machi...
ISO 24613:2008, Language resource management - Lexical markup framework, is the ISO International Organization for Standardization ISO/TC37 standard for natural language processing and machi...
Lexical substitution
Lexical substitution is the task of identifying a substitute for a word in the context of a sentence.
Lexical substitution is the task of identifying a substitute for a word in the context of a sentence.
Lexxe
Lexxe is an Internet search engine that uses natural language processing for queries (semantic search).
Lexxe is an Internet search engine that uses natural language processing for queries (semantic search).
Linguistic Issues in Language Technology
Linguistic Issues in Language Technology (LiLT) is an open-access journal that, according to its web page, "focusses on relationships between linguistic insights, which can prove valuable to lan...
Linguistic Issues in Language Technology (LiLT) is an open-access journal that, according to its web page, "focusses on relationships between linguistic insights, which can prove valuable to lan...
LKB
Linguistic Knowledge Builder (LKB) is a free and open source grammar engineering environment for creating grammars and lexicons of natural languages.
Linguistic Knowledge Builder (LKB) is a free and open source grammar engineering environment for creating grammars and lexicons of natural languages.
Logic form
Logic forms are simple, first-order logic knowledge representations of natural language sentences formed by the conjunction of concept predicates related through shared arguments.
Logic forms are simple, first-order logic knowledge representations of natural language sentences formed by the conjunction of concept predicates related through shared arguments.
LRE Map
The LRE Map (Language Resources and Evaluation) is a freely accessible large database on resources dedicated to Natural language processing (NLP).
The LRE Map (Language Resources and Evaluation) is a freely accessible large database on resources dedicated to Natural language processing (NLP).
MAREC
The MAtrixware REsearch Collection (MAREC) is a standardised patent data corpus available for research purposes.
The MAtrixware REsearch Collection (MAREC) is a standardised patent data corpus available for research purposes.
METEOR
METEOR (Metric for Evaluation of Translation with Explicit ORdering) is a metric for the evaluation of machine translation output.
METEOR (Metric for Evaluation of Translation with Explicit ORdering) is a metric for the evaluation of machine translation output.
Microsoft text-to-speech voices
The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API.
The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API.
Modular Audio Recognition Framework
Modular Audio Recognition Framework (MARF) is an open-source research platform and a collection of voice, sound, speech, text and natural language processing (NLP) algorithms written in Java and...
Modular Audio Recognition Framework (MARF) is an open-source research platform and a collection of voice, sound, speech, text and natural language processing (NLP) algorithms written in Java and...
Morphological pattern
A morphological pattern is a set of associations and/or operations that build the various forms of a lexeme, possibly by inflection, agglutination, compounding or derivation.
A morphological pattern is a set of associations and/or operations that build the various forms of a lexeme, possibly by inflection, agglutination, compounding or derivation.
Multi-document summarization
Multi-document summarization is an automatic procedure aimed at extraction of information from multiple texts written about the same topic.
Multi-document summarization is an automatic procedure aimed at extraction of information from multiple texts written about the same topic.
Multilingual notation
A Multilingual notation is a representation in a lexical resource that allows the translation between two or more words.
A Multilingual notation is a representation in a lexical resource that allows the translation between two or more words.
N-gram
In the fields of computational linguistics and probability, an n-gram is a contiguous sequence of n items from a given sequence of text or speech.
In the fields of computational linguistics and probability, an n-gram is a contiguous sequence of n items from a given sequence of text or speech.
Naive semantics
Naive semantics is an approach used in computer science for representing basic knowledge about a specific domain, and has been used in applications such as the representation of the meaning of n...
Naive semantics is an approach used in computer science for representing basic knowledge about a specific domain, and has been used in applications such as the representation of the meaning of n...
Natural language
In the philosophy of language, a natural language (or ordinary language) is any language which arises in an unpremeditated fashion as the result of the innate facility for language possess...
In the philosophy of language, a natural language (or ordinary language) is any language which arises in an unpremeditated fashion as the result of the innate facility for language possess...
Natural language processing
Natural language processing is a field of computer science, artificial intelligence, and linguistics concerned with the interactions between computers and human languages.
Natural language processing is a field of computer science, artificial intelligence, and linguistics concerned with the interactions between computers and human languages.
Natural Language Toolkit
Natural Language Toolkit or, more commonly, NLTK is a suite of libraries and programs for symbolic and statistical natural language processing for the Python programming language.
Natural Language Toolkit or, more commonly, NLTK is a suite of libraries and programs for symbolic and statistical natural language processing for the Python programming language.
Natural language understanding
Natural language understanding is a subtopic of natural language processing in artificial intelligence that deals with machine reading comprehension.
Natural language understanding is a subtopic of natural language processing in artificial intelligence that deals with machine reading comprehension.
Natural language user interface
Natural Language User Interfaces are a type of computer human interface where linguistic phenomena such as verbs, phrases and clauses act as UI controls for creating, selecting and modifying dat...
Natural Language User Interfaces are a type of computer human interface where linguistic phenomena such as verbs, phrases and clauses act as UI controls for creating, selecting and modifying dat...
NetBase Solutions, Inc.
NetBase Solutions, Inc. is a Mountain View, CA based developer of natural language processing technology used to analyze social media and other content.
NetBase Solutions, Inc. is a Mountain View, CA based developer of natural language processing technology used to analyze social media and other content.
News analytics
News analysis refers to the measurement of the various qualitative and quantitative attributes of textual (unstructured data) news stories.
News analysis refers to the measurement of the various qualitative and quantitative attributes of textual (unstructured data) news stories.
Noisy text analytics
Noisy text analytics is a process of information extraction whose goal is to automatically extract structured or semistructured information from noisy unstructured text data.
Noisy text analytics is a process of information extraction whose goal is to automatically extract structured or semistructured information from noisy unstructured text data.
NooJ
NooJ is a development environment used to construct large-coverage, formalized descriptions of natural languages and to apply them to large corpora in real time.
NooJ is a development environment used to construct large-coverage, formalized descriptions of natural languages and to apply them to large corpora in real time.
Ontology learning
Ontology learning is a subtask of information extraction.
Ontology learning is a subtask of information extraction.
Open domain question answering
In information retrieval, an open domain question answering system aims at returning an answer in response to the user’s question.
In information retrieval, an open domain question answering system aims at returning an answer in response to the user’s question.
OpenNLP
Apache OpenNLP is a machine learning based toolkit for the processing of natural language text.
Apache OpenNLP is a machine learning based toolkit for the processing of natural language text.
Paco Nathan
Paco Nathan (born 1962) is a computer scientist, author, and performance art show producer from San Luis Obispo, California, who established much of his career in Austin, Texas.
Paco Nathan (born 1962) is a computer scientist, author, and performance art show producer from San Luis Obispo, California, who established much of his career in Austin, Texas.
Phrase structure grammar
The term phrase structure grammar was originally introduced by Noam Chomsky as the term for grammars as defined by phrase structure rules, i.e. rewrite rules of the type studied previously by Em...
The term phrase structure grammar was originally introduced by Noam Chomsky as the term for grammars as defined by phrase structure rules, i.e. rewrite rules of the type studied previously by Em...
Powerset (company)
Powerset is a Microsoft owned company based in San Francisco, California, that, in 2006, was developing a natural language search engine for the Internet.
Powerset is a Microsoft owned company based in San Francisco, California, that, in 2006, was developing a natural language search engine for the Internet.
Production (computer science)
A production or production rule in computer science is a rewrite rule specifying a symbol substitution that can be recursively performed to generate new symbol sequences.
A production or production rule in computer science is a rewrite rule specifying a symbol substitution that can be recursively performed to generate new symbol sequences.
PropBank
PropBank is a corpus that is annotated with verbal propositions and their arguments—a "proposition bank".
PropBank is a corpus that is annotated with verbal propositions and their arguments—a "proposition bank".
Question answering
In information retrieval and, question answering is the task of automatically answering a question posed in natural language.
In information retrieval and, question answering is the task of automatically answering a question posed in natural language.
Realization (linguistics)
Realisation is a subtask of Natural language generation, which involves creating an actual text in a human language (English, French, etc) from a syntactic representation.
Realisation is a subtask of Natural language generation, which involves creating an actual text in a human language (English, French, etc) from a syntactic representation.
Recursive transition network
A recursive transition network ("RTN") is a graph theoretical schematic used to represent the rules of a context free grammar.
A recursive transition network ("RTN") is a graph theoretical schematic used to represent the rules of a context free grammar.
Referring expression generation
Referring expression generation is a subtask of Natural language generation (NLG), which involves creating referring expressions (noun phrases) that identify specific entities to the reader.
Referring expression generation is a subtask of Natural language generation (NLG), which involves creating referring expressions (noun phrases) that identify specific entities to the reader.
Rewrite rule
In linguistics, a rewrite rule for natural language (phrase structure rule, analog of production in formal grammars) in generative grammar is a rule of the form A → X where A is a syntac...
In linguistics, a rewrite rule for natural language (phrase structure rule, analog of production in formal grammars) in generative grammar is a rule of the form A → X where A is a syntac...
Semantic analysis (computational)
Semantic Analysis (computational) is a composite of the "Semantic Analysis" and the "Computational" components.
Semantic Analysis (computational) is a composite of the "Semantic Analysis" and the "Computational" components.
Semantic analytics
Semantic analytics is the use of ontologies to analyze content in web resources.
Semantic analytics is the use of ontologies to analyze content in web resources.
Semantic compression
In natural language processing, semantic compression is a process of compacting a lexicon used to build a textual document by reducing language heterogeneity, while maintaing text semantics.
In natural language processing, semantic compression is a process of compacting a lexicon used to build a textual document by reducing language heterogeneity, while maintaing text semantics.
Semantic neural network
Semantic neural network (SNN) is based on John von Neumann's neural network von Neumann, 1966
And Nikolai Amosov M-Network.
Semantic neural network (SNN) is based on John von Neumann's neural network von Neumann, 1966
And Nikolai Amosov M-Network.
SemEval
SemEval (Semantic Evaluation) is an ongoing series of evaluations of computational semantic analysis systems; it evolved from the Senseval word sense evaluation series.
SemEval (Semantic Evaluation) is an ongoing series of evaluations of computational semantic analysis systems; it evolved from the Senseval word sense evaluation series.
Sentence extraction
Sentence extraction is a technique used for automatic summarization of a text.
Sentence extraction is a technique used for automatic summarization of a text.
Sentiment analysis
Sentiment analysis or opinion mining refers to the application of natural language processing, computational linguistics, and text analytics to identify and extract subjective information ...
Sentiment analysis or opinion mining refers to the application of natural language processing, computational linguistics, and text analytics to identify and extract subjective information ...
SHRDLU
SHRDLU was an early natural language understanding computer program, developed by Terry Winograd at MIT from 1968-1970.
SHRDLU was an early natural language understanding computer program, developed by Terry Winograd at MIT from 1968-1970.
Speech segmentation
Speech segmentation is the process of identifying the boundaries between words, syllables, or phonemes in spoken natural languages.
Speech segmentation is the process of identifying the boundaries between words, syllables, or phonemes in spoken natural languages.
SPL notation
SPL (Sentence Plan Language) is an abstract notation representing the semantics of a sentence in natural language.
SPL (Sentence Plan Language) is an abstract notation representing the semantics of a sentence in natural language.
Stemming
In linguistic morphology and information retrieval, stemming is the process for reducing inflected (or sometimes derived) words to their stem, base or root form—generally a written word form.
In linguistic morphology and information retrieval, stemming is the process for reducing inflected (or sometimes derived) words to their stem, base or root form—generally a written word form.
String kernel
A string kernel is a mathematical tool used in large scale data analysis and mining, where sequence data are to be clustered or classified (concerning especially the popular research fields of t...
A string kernel is a mathematical tool used in large scale data analysis and mining, where sequence data are to be clustered or classified (concerning especially the popular research fields of t...
Studies in NLP
Studies in Natural Language Processing is the book series of the Association for Computational Linguistics, published by Cambridge University Press.
Studies in Natural Language Processing is the book series of the Association for Computational Linguistics, published by Cambridge University Press.
Sukhotins Algorithm
Sukhotins' Algorithm is a statistical classification algorithm for classifying characters in a text as vowels or consonants.
Sukhotins' Algorithm is a statistical classification algorithm for classifying characters in a text as vowels or consonants.
Synthetix
Synthetix Ltd is a software company based in Cambridge, England, founded in 2001.
Synthetix Ltd is a software company based in Cambridge, England, founded in 2001.
T9 (predictive text)
T9, which stands for Text on 9 keys, is a patented predictive text technology for mobile phones, originally developed by Tegic Communications, now part of Nuance Communications.
T9, which stands for Text on 9 keys, is a patented predictive text technology for mobile phones, originally developed by Tegic Communications, now part of Nuance Communications.
Tatoeba
Tatoeba.org is a free online database of example sentences geared towards foreign language learners.
Tatoeba.org is a free online database of example sentences geared towards foreign language learners.
Teragram Corporation
Teragram Corporation is a fully owned subsidiary of SAS Institute, a major producer of statistical analysis software, headquartered in Cary, North Carolina, USA. Teragram is based in Cambridge,...
Teragram Corporation is a fully owned subsidiary of SAS Institute, a major producer of statistical analysis software, headquartered in Cary, North Carolina, USA. Teragram is based in Cambridge,...
Terminology extraction
Terminology mining, term extraction, term recognition, or glossary extraction, is a subtask of information extraction.
Terminology mining, term extraction, term recognition, or glossary extraction, is a subtask of information extraction.
Text analytics
The term text analytics describes a set of linguistic, statistical, and machine learning techniques that model and structure the information content of textual sources for business intelligence,...
The term text analytics describes a set of linguistic, statistical, and machine learning techniques that model and structure the information content of textual sources for business intelligence,...
Text mining
Text mining, sometimes alternately referred to as text data mining, roughly equivalent to text analytics, refers to the process of deriving high-quality information from text.
Text mining, sometimes alternately referred to as text data mining, roughly equivalent to text analytics, refers to the process of deriving high-quality information from text.
Text Retrieval Conference
The Text REtrieval Conference (TREC) is an on-going series of workshops focusing on a list of different information retrieval (IR) research areas, or tracks.
The Text REtrieval Conference (TREC) is an on-going series of workshops focusing on a list of different information retrieval (IR) research areas, or tracks.
Text simplification
Text simplification is an operation used in natural language processing to modify, enhance, classify or otherwise process an existing corpus of human-readable text in such a way that the gramma...
Text simplification is an operation used in natural language processing to modify, enhance, classify or otherwise process an existing corpus of human-readable text in such a way that the gramma...
Text to Voice
Text to Voice or Text to Speech is a Firefox extension developed by Vikram Joshi, an under-graduate from IIT Delhi.
Text to Voice or Text to Speech is a Firefox extension developed by Vikram Joshi, an under-graduate from IIT Delhi.
Text-to-voice
Text to Voice or Text to Speech is a Firefox extension (add-on) developed by Vikram Joshi, an under-graduate from IIT Delhi.
Text to Voice or Text to Speech is a Firefox extension (add-on) developed by Vikram Joshi, an under-graduate from IIT Delhi.
Textual entailment
Textual Entailment in natural language processing is a directional relation between text fragments.
Textual Entailment in natural language processing is a directional relation between text fragments.
TipTop Technologies
TipTop Technologies offers a real-time web, social search engine with a unique platform for semantic analysis of natural language.
TipTop Technologies offers a real-time web, social search engine with a unique platform for semantic analysis of natural language.
TMC Corpus
TMC is a large-scale Persian monolingual corpus.
TMC is a large-scale Persian monolingual corpus.
Transderivational search
Transderivational search (often abbreviated to TDS) is a psychological and cybernetics term, meaning when a search is being conducted for a fuzzy match across a broad field.
Transderivational search (often abbreviated to TDS) is a psychological and cybernetics term, meaning when a search is being conducted for a fuzzy match across a broad field.
Trigram
Trigrams are a special case of the N-gram, where N is 3.
Trigrams are a special case of the N-gram, where N is 3.
Triphone
In linguistics, a triphone is a sequence of three phonemes.
In linguistics, a triphone is a sequence of three phonemes.
w-shingling
In natural language processing a w-shingling is a set of unique "shingles"—contiguous subsequences of tokens in a document—that can be used to gauge the similarity of two documents.
In natural language processing a w-shingling is a set of unique "shingles"—contiguous subsequences of tokens in a document—that can be used to gauge the similarity of two documents.
William Aaron Woods
William A. Woods (born 1942), generally known as Bill Woods, is a researcher in natural language processing, continuous speech understanding, knowledge representation, and knowledge-based ...
William A. Woods (born 1942), generally known as Bill Woods, is a researcher in natural language processing, continuous speech understanding, knowledge representation, and knowledge-based ...
Word sense
In computational linguistics, word sense disambiguation is an open problem of natural language processing, which governs the process of identifying which sense of a word is used in a sentence, w...
In computational linguistics, word sense disambiguation is an open problem of natural language processing, which governs the process of identifying which sense of a word is used in a sentence, w...
Word sense induction
In computational linguistics, word sense induction (WSI) or discrimination is an open problem of natural language processing, which concerns the automatic identification of the senses of a...
In computational linguistics, word sense induction (WSI) or discrimination is an open problem of natural language processing, which concerns the automatic identification of the senses of a...
Word-sense
In computational linguistics, word sense disambiguation (WSD) is an open problem of natural language processing, which governs the process of identifying which sense of a word (i.e.
In computational linguistics, word sense disambiguation (WSD) is an open problem of natural language processing, which governs the process of identifying which sense of a word (i.e.
Word-sense induction
In computational linguistics, word-sense induction or discrimination is an open problem of natural language processing, which concerns the automatic identification of the senses of a word.
In computational linguistics, word-sense induction or discrimination is an open problem of natural language processing, which concerns the automatic identification of the senses of a word.
Settings