This search engine glossary was originally created from a number of sources including:
- a list created by members of Relevance Slack
- the glossary section of Search Insights 2022 from The Search Network
Our thanks to everyone who contributed definitions. We aim to keep the glossary correct and up to date, so please let us know if you spot any errors. A form for submitting corrections, comments, or terms you think need defining is at the bottom of this page and on each term page.
Do you have a term that should be added to the Glossary? Let us know.
Search Engine Glossary
- Absolute boosting
- Access control list
- Advanced search
- Aggregated search
- Apache
- Appliance
- Artificial intelligence
- AI
- Auto-categorisation
- Auto-classification
- Average response time
- Baseline Search
- Hello Search
- BERT
- Best bets
- BM25
- Default Similarity
- Boolean operators
- Boosting
- Categorisation
- Chatbot
- Clustering
- Co-occurrence
- Cognitive search
- Collection
- Computational linguistics
- Concept
- Entity
- Concept extraction
- Connector
- Controlled vocabulary
- Conversational search
- COTS
- Crawler
- Cross-language search
- Data/Schema Modelling
- Fields Engineering
- Deep learning
- Description
- Diversity
- Document
- Document processing
- Document repository
- Early binding
- Elasticsearch
- Elastic
- Embedding
- Word Vector
- Paragraph Vector
- Document Vector
- Enrichment
- Entity extraction
- Exact match
- Explicit judgements
- Exploratory query
- Broad search
- Exploratory search
- Facet
- Fallout
- Feature
- Federated search
- Field query
- Filter
- Findability
- Freshness
- Fuzzy search
- Gating
- Golden set
- Guided search
- Hit
- Hypothesis-driven
- Experiment-driven
- Idiom Strategy
- Implicit Judgements
- Index
- Index file
- Indexing
- Information Need
- User Intent
- Ingestion rate
- Interleaving
- Inverse document frequency (IDF)
- Inverted file
- Inverted Index
- Search Index
- Jaccard index
- Judgements
- Ratings
- Keyword
- Keyword search
- Knowledge Graph
- Semantic Graph
- KPI
- Label
- Language detection
- Large Language Model
- LLM
- Late binding
- Learning to Rank
- LTR
- Lemmatisation
- Lexical analysis
- Linguistic indexing
- Linguistics
- Long tail
- Lucene
- Apache Lucene
- Machine learning
- Meta tag
- Metadata
- Morphologic analysis
- Natural language processing
- NLP
- Natural language query
- Neural IR
- Ontology
- OpenSearch
- Parametric search
- Parsing
- Pattern matching
- Personalisation
- Phrase extraction
- Phrase search
- Playbook
- Precision
- Precision and Recall
- Preferred Label (pref label)
- Professional search
- Proximity search
- Query by example
- More Like This
- MLT
- Query Intent
- Query transformation
- Ranking
- Recall
- Reindex
- Relevance
- Relevance engineer
- Relevance Score
- Reranking
- Search Experience
- Search Metric
- Search Performance
- Search Quality
- Search Relevance
- Search results
- Search terms
- Searchandising
- Active Search Management
- Semantic analysis
- Semantic Search
- Cognitive Search
- Sentiment analysis
- SERPs
- Session
- Signal
- Signal Modelling
- SKOS
- Snippet
- Solr
- Apache Solr
- Soundex search
- spaCy
- Spider
- Stemming
- Stop list
- Stop words
- Stopping distance
- Structured data
- Summarisation
- Survey query
- Synonym
- Synonym expansion
- Syntactic analysis
- Targeted query
- Navigational search
- Taxonomy
- Ontology
- Dictionary
- Vocabulary
- Synonym list
- Term frequency
- TF-IDF
- Classic Similarity
- Thesaurus
- Thumbnail
- Tiering
- Token
- Tokenising
- Transformer
- Transliteration
- Truncation
- Unstructured information
- Vector space
- Weighting
- Wildcard
- Word exclusion
- xAI
- Zero Search Results