Tika Tuesday: Using Tika and Tesseract outside of Solr
Extracting content from file formats using Tika as a standalone service is the traditional architectural approach, and what my most recent project is built around. You can try out…
Extracting content from file formats using Tika as a standalone service is the traditional architectural approach, and what my most recent project is built around. You can try out…
What is Tika Tuesdays? Over the past few months I’ve finally accomplished the long time personal goal of being able to easily search PDF documents with in context hit…
Edismax is the query parser-of-choice for many Solr applications. The default behaviors are correct for a wide range of use cases. The syntax has become familiar. While edismax has…
Search Insights 2019, a new report from The Search Network As many of you will know I’ve now joined OSC as a Managing Consultant – so here’s my first…
Welcome, dear reader, to my first OSC blog post. Let’s dive in! While search relevance is often equated with ensuring customers find what they need, that is only part…
Background As a search relevancy engineer at OpenSource Connections (OSC), when I work on a client’s search application, I use Quepid every day! Quepid is a “Test-Driven Relevancy Dashboard”…
In product meetings, when I advocate for customer retention – regular, repeat business from the customer over months/years – I sometimes get blank stares. Teams want to do everything…
I recently completed a fairly straight-forward search project in which we were replacing a legacysystem with Solr. The goal for the first release was to just make sure that…
Our mission is to empower the world’s best search teams. Search teams ultimately generate value for their organizations through better, smarter search. That is: relevance. Sadly relevance remains maddening!…
Replacing an enterprise search product? Use this three-step process to create actionable requirements for your team. The inherent challenge in replacing existing enterprise products is that their functionality and…
Heatmap facets are a powerful demonstration of Solr’s geospatial capabilities. Given a corpus of indexed shapes, a heatmap facet will show you where the results of your query fit…
Trey is SVP of Engineering at Lucidworks and the co-author of Solr in Action. He sat down with Doug Turnbull and Matt Overstreet in December of last year to…