Search engine

From Wikipedia, the free encyclopedia

Jump to: navigation, search
This is a generic overview only. For Web searches, see: Web search engine,
or for a particular search engine, see: List of search engines.

A search engine is an information retrieval system designed to help find information stored on a computer system. Search engines help to minimize the time required to find information and the amount of information which must be consulted, akin to other techniques for managing information overload.[citation needed]

The most public, visible form of a search engine is a Web search engine which searches for information on the World Wide Web.

[edit] How search engines work

Main article: Web search engine

Search engines provide an interface to a group of items that enables users to specify criteria about an item of interest and have the engine find the matching items. The criteria are referred to as a search query. In the case of text search engines, the search query is typically expressed as a set of words that identify the desired concept that one or more documents may contain.[1]

There are several styles of search query syntax that vary in strictness. Where as some text search engines require users to enter two or three words separated by white space, other search engines may enable users to specify entire documents, pictures, sounds, and various forms of natural language. Some search engines apply improvements to search queries to increase the likelihood of providing a quality set of items through a process known as query expansion.

Image:Search-engine-diagram-en.svg
index-based search engine

The list of items that meet the criteria specified by the query is typically sorted, or ranked, in some regard so as to place the most relevant items first. Ranking items by relevance (from highest to lowest) reduces the time required to find the desired information. Probabilistic search engines rank items based on measures of similarity and sometimes popularity or authority. Boolean search engines typically only return items which match exactly without regard to order.

To provide a set of matching items quickly, a search engine will typically collect metadata about the group of items under consideration beforehand through a process referred to as indexing. The index typically requires a smaller amount of computer storage, and provides a basis for the search engine to calculate item relevance. The search engine may store a copy of each item in a cache so that users can see the state of the item at the time it was indexed or for archive purposes or to make repetitive processes work more efficiently and quickly.

Notably, some search engines do not store an index. Crawler, or spider type search engines may collect and assess items at the time of the search query. Meta search engines simply reuse the index or results of one or more other search engines.

[edit] See also

[edit] References

  1. ^ Voorhees, E.M. Natural Language Processing and Information Retrieval. National Institute of Standards and Technology. March 2000.
ar:محرك بحث

bs:Internet pretraživač bg:Търсачка ca:Motor de recerca cs:Internetový vyhledávač da:Søgemaskine de:Suchmaschine el:Μηχανή αναζήτησης es:Motor de búsqueda eo:Serĉilo eu:Bilaketa motor fa:موتور جستجو fr:Moteur de recherche gl:Motor de busca ko:검색 엔진 hr:Tražilica id:Mesin pencari ia:Mechanismo de recerca it:Motore di ricerca he:מנוע חיפוש lv:Meklētājprogramma hu:Keresőmotor ms:Enjin gelintar nl:Zoekmachine ja:検索エンジン no:Søkemotor nn:Søkemotor pl:Wyszukiwarka internetowa pt:Motor de busca ro:Motor de căutare ru:Поисковая система scn:Muturi di circata simple:Search engine sr:Интернет претраживач fi:Hakukone sv:Söktjänst ta:தேடுபொறி th:เครื่องมือค้นหาบนอินเทอร์เน็ต vi:Máy truy tìm dữ liệu tr:Arama motoru uk:Пошукова машина zh:搜索引擎

Views
Personal tools

Toolbox