Fundamentals of query rewriting (part 1): introduction to query expansion
Johannes Peter introduces query expansion in the first part of a series on fundamentals of query rewriting
Johannes Peter introduces query expansion in the first part of a series on fundamentals of query rewriting
How the combined_fields query introduces better term-centric scoring in Elasticsearch with BM25F
As a contributor to AI Powered Search I’m struck by how often organizational, rather than technical, challenges stymie machine learning adoption. Let’s explore one of those challenges: silos between…
I wrote this back in 2012 for version 1.5.2-incubating, and never published it. So I’m updating it for the October 2018 version of OpenNLP, 1.9.0. Visit http://opennlp.apache.org/ and you…
In April I went on a pilgrimage to Enterprise Data World to encourage my colleagues in the Data world who typically are focused on issues of Data Governance, Data…
I am a big proponent of Solr and Elasticsearch search engines BUT… … they are frameworks. A bag of tools. I can teach you how they fit together in…
Thanks so much to community members who let us know that Elasticsearch Learning to Rank did not work with Elastic Cloud/XPack Security. Today Elastic Learning to Rank 1.0.1 is…
I screwed up! Mea Culpa. Let he who is without synonyms throw the first rock! In my talk Taxonomical Semantical Magical Search, I presented an index-time hypernym/hyponym expansion solution….
As we learn more about Vespa, we wanted to give our initial impressions when comparing to Lucene-based search (Solr/Elasticsearch). This is based on initial passes with Vespa and our…
Responding to queries takes CPU time, memory, and in unfortunate cases, wall time as well. Increasing the power of a cluster helps, over-provisioning can be very expensive. Caching is…
Searching for non-ASCII characters can be a challenge. There are a number of reasons for doing so, even in a primarily English corpus: Accented characters in names and words…
As part of the London hack days Diego Ceccarelli started a BM25F implementation. I began to continue it at Lucene Revolution’s Lucene hackathon. I realized though that when you…