Internet search algorithms
Distributed web crawling
Distributed web crawling is a distributed computing technique whereby Internet search engines employ many computers to index the Internet via web crawling.
Distributed web crawling is a distributed computing technique whereby Internet search engines employ many computers to index the Internet via web crawling.
Federated search
Federated search is an information retrieval technology that allows the simultaneous search of multiple searchable resources.
Federated search is an information retrieval technology that allows the simultaneous search of multiple searchable resources.
Focused crawler
A focused crawler or topical crawler is a web crawler that attempts to download only web pages that are relevant to a pre-defined topic or set of topics.
A focused crawler or topical crawler is a web crawler that attempts to download only web pages that are relevant to a pre-defined topic or set of topics.
Hilltop algorithm
The Hilltop algorithm is an algorithm created by Krishna Bharat while he was at Compaq Systems Research Center and George A. Mihăilă, then at the University of Toronto.
The Hilltop algorithm is an algorithm created by Krishna Bharat while he was at Compaq Systems Research Center and George A. Mihăilă, then at the University of Toronto.
Image meta search
Image meta search is a type of search engine specialised on finding pictures, images, animations etc.
Image meta search is a type of search engine specialised on finding pictures, images, animations etc.
Index (search engine)
An alternate name for the process in the context of search engines designed to find web pages on the Internet is Web indexing.
An alternate name for the process in the context of search engines designed to find web pages on the Internet is Web indexing.
PageRank
PageRank is a link analysis algorithm, named after Larry Page and used by the Google Internet search engine, that assigns a numerical weighting to each element of a hyperlinked set of documents,...
PageRank is a link analysis algorithm, named after Larry Page and used by the Google Internet search engine, that assigns a numerical weighting to each element of a hyperlinked set of documents,...
Proximity search (text)
In text processing, a proximity search looks for documents where two or more separately matching term occurrences are within a specified distance, where distance is the number of intermediate wo...
In text processing, a proximity search looks for documents where two or more separately matching term occurrences are within a specified distance, where distance is the number of intermediate wo...
Search engine indexing
An alternate name for the process in the context of search engines designed to find web pages on the Internet is Web indexing.
An alternate name for the process in the context of search engines designed to find web pages on the Internet is Web indexing.
URL normalization
URL normalization (or URL canonicalization) is the process by which URLs are modified and standardized in a consistent manner.
URL normalization (or URL canonicalization) is the process by which URLs are modified and standardized in a consistent manner.
Web crawler
A Web crawler is a computer program that browses the World Wide Web in a methodical, automated manner or in an orderly fashion.
A Web crawler is a computer program that browses the World Wide Web in a methodical, automated manner or in an orderly fashion.
Settings