Elastic DC Meetup

Doug Turnbull, March 29, 2017

Doug will highlight the Elasticsearch Learning to Rank plugin OSC has developed. Jason Kowalewski will discuss use cases related to personalized job search & recs.


Future of Data Philly Meetup

Doug Turnbull, March 22, 2017

Doug will speak on 'Elasticsearch as a General Purpose Recommendation System', highlighting how Elastic Graph's capabilities can support a real-time recommendation system.



Doug Turnbull, February 20, 2017

Doug will be attending ElasticOn and sharing OSC's new Elasticsearch Learning to Rank Plugin with the conference attendees.

Our Solution to Solr Multiterm Synonyms: The Match Query Parser

OpenSource Connections — January 23, 2017

You have probably heard us talk about Solr Multiterm Synonyms a lot. This article demonstrates a technique that solves the problem, with fewer performance and relevance issues than other approaches we've used.

BM25F in Lucene with BlendedTermQuery

Doug Turnbull, October 19, 2016

As part of the London hack days, Diego Ceccarelli started a BM25F implementation, and I picked it up at the Lucene Revolution hackathon. I realized, though, that when you break the problem down, BM25F can be implemented using existing Lucene pieces, including BM25Similarity and BlendedTermQuery.



October 11, 2016

*The* conference focused on Solr. We've made the pilgrimage since 2010.


A One-Two Punch of Speakers on Testing and Relevancy

Eric Pugh, September 27, 2016

Doug Turnbull (OpenSource Connections), ‘Test Driven Relevancy -- How to Work with Content Experts to Optimize and Maintain Search Relevancy’ (Doug will join us via video); René Kriegler (Freelance Search Consultant), ‘Managing Search Relevancy in Solr using the Querqy open source library’.



Eric Pugh, September 16, 2016

If you're a geek in or around the Charlottesville metroplex or even if you're merely tech-curious, this is the event you don't want to miss. beCamp is Charlottesville's version of the BarCamp unconference phenomenon: organized on the fly by attendees, for attendees.

High-Quality Recommendation Systems with Elasticsearch

Doug Turnbull, September 9, 2016

Let's explore how to deliver great recommendations with Elasticsearch. In this article, we dive into an aggregations-based method for Elasticsearch recommendations and work through the mechanics and assumptions of the underlying JLH scoring method.


Cassandra Summit

Eric Pugh, September 7, 2016

Eric will be giving a short talk on how to break the Cassandra data modeling straitjacket with DSE Search.


How to Practice Search Relevancy

Matt Overstreet, August 14, 2016

How to build a search practice, with Doug Turnbull, Matt Overstreet, and Scott Stultz. Doug released a companion to Relevant Search about how to actually practice good search, and we discuss it.

Content expert's guide to diagnosing site search relevance problems

Doug Turnbull, August 8, 2016

In this series of articles, I want to give you, the content person, a very practical and straightforward guide to managing site search. We'll start by discussing diagnosis -- how to find problems. We'll use a simple, free analytics tool (Google Analytics). We'll make a few naive assumptions about these analytics that act as a good starting point.


Relevant Search Book With Doug Turnbull and John Berryman

Matt Overstreet, July 1, 2016

A conversation with Doug Turnbull and John Berryman, authors of Relevant Search from Manning Publications, about: the future for search; what it’s like to write a technical book, and how they got started doing it; what is different about their book compared to what has been written before, including some book recommendations; and search problems they wish they could solve. If you’ve listened to this podcast before, you know Doug, the search relevancy lead at OpenSource Connections. John Berryman is an OpenSource Connections alumnus who works on the search and discovery team at Eventbrite. Relevant Search is available now from Manning Publications: “Relevant Search demystifies relevance work. Using Elasticsearch, it teaches you how to return engaging search results to your users, helping you understand and leverage the internals of Lucene-based search engines.”

Deploying AACT Oracle Dump File into the Cloud with Docker

Eric Pugh, June 30, 2016

ClinicalTrials.gov is a wealth of information. But the only database format they support is an Oracle dmp file. Follow along as I help our data science intern answer hard questions about ethnic diversity in clinical trials by deploying Oracle using Docker Cloud.

Top 7 Mistakes Organizations Make With Search

Doug Turnbull, June 29, 2016

After much sweat and tears, our book Relevant Search is out! Relevant Search reflects the wisdom we've acquired over the years helping many clients improve search. I thought it would be an appropriate time to recap where many organizations get stuck with search.