Tika Tuesdays: Using Tika and Tesseract as an API exposed by Solr
Don’t want to deploy a separate Tika server? But need Tika server-like capabilities and you already have Solr? Then this is the solution for you! What I am going…
Don’t want to deploy a separate Tika server? But need Tika server-like capabilities and you already have Solr? Then this is the solution for you! What I am going…
Extracting content from file formats using Tika as a standalone service is the traditional architectural approach, and what my most recent project is built around. You can try out…
What is Tika Tuesdays? Over the past few months I’ve finally accomplished the long time personal goal of being able to easily search PDF documents with in context hit…
Edismax is the query parser-of-choice for many Solr applications. The default behaviors are correct for a wide range of use cases. The syntax has become familiar. While edismax has…
Search Insights 2019, a new report from The Search Network As many of you will know I’ve now joined OSC as a Managing Consultant – so here’s my first…
Welcome, dear reader, to my first OSC blog post. Let’s dive in! While search relevance is often equated with ensuring customers find what they need, that is only part…
Background As a search relevancy engineer at OpenSource Connections (OSC), when I work on a client’s search application, I use Quepid every day! Quepid is a “Test-Driven Relevancy Dashboard”…
In product meetings, when I advocate for customer retention – regular, repeat business from the customer over months/years – I sometimes get blank stares. Teams want to do everything…
I recently completed a fairly straight-forward search project in which we were replacing a legacysystem with Solr. The goal for the first release was to just make sure that…
Our mission is to empower the world’s best search teams. Search teams ultimately generate value for their organizations through better, smarter search. That is: relevance. Sadly relevance remains maddening!…
Replacing an enterprise search product? Use this three-step process to create actionable requirements for your team. The inherent challenge in replacing existing enterprise products is that their functionality and…
Heatmap facets are a powerful demonstration of Solr’s geospatial capabilities. Given a corpus of indexed shapes, a heatmap facet will show you where the results of your query fit…