I Became a Solr Committer in 4662 Days. Here’s how you can do it faster!
On April 6th, 2020 I was invited to become a committer on the Apache Solr project. My journey to becoming a committer started in earnest 4662 days before that! …
On April 6th, 2020 I was invited to become a committer on the Apache Solr project. My journey to becoming a committer started in earnest 4662 days before that! …
couple your generators with context managers! As part of my AI Powered Search chapters, I’m cleaning up Hello LTR’s Python API to make the code more readable as book examples. A big part…
Now that you are using Quepid to collaborate with members of your team by sharing your cases with other members of your Team, you need a way to view…
Are you migrating from Google Search Appliance, Endeca, or some other legacy search solution to Solr or Elasticsearch? Want to prove the migration succeeded without worrying about lost customers,…
Tika was originally built as a pure library that you would embed in other applications, but soon people wanted to access it over HTTP as a standalone service. In…
It’s been six months since we open sourced Quepid, and it looks like momentum is growing. Since we flipped the switch, we’ve had two minor point releases. We’ve also…
As search professionals, we’re often lost in our everyday minutiae. We don’t appreciate the impact of our work on everyday lives. The data shows that when users encounter non-trivial…
In our first couple of posts, we ended up doing a lot of the processing work outside of Solr. It gave me a chance to polish my PowerShell skills,…
Five years ago, I wrote an article called What is Search Relevance?. Back then, I had to shout to convince people to even notice whether search results were accurate…
Tesseract 4 is a major upgrade to this venerable OCR library, incorporating neural networks and lots of other great improvements, but not everyone has upgraded to it (including one…
Intro I joined OSC a month ago as their first data-scientist, so I’ve been drinking from a firehose trying to get up to speed on Solr. After breezing through…
In a search engine, the “document” is the basic unit of indexing and retrieval. It’s the “result” on the search results screen when a user enters a query. Many…