• Quepid: Write Tests Against Your Search Results

    Doug Turnbull — December 9, 2014 | 0 Comments | Filed in: solr

    Quepid is our “Test Driven Search Relevancy” workbench product actively used by several clients. What do we mean by test-driven relevancy? We want to give you the ability to iterate quickly when creating a search solution. Sometimes the correctness of search results is fuzzy — based on how users or domain experts grade search results. […]

  • Title Search: when relevancy is only skin deep

    Doug Turnbull — December 8, 2014 | 0 Comments | Filed in: solr

    How do users judge that articles, books, and blog posts are relevant to what they’re searching for? What about you? If you’re searching for an article on ‘Socrates’, what might be relevant to you? A lot of search relevancy work with Solr or Elasticsearch focusses on getting really deep into written prose–the type that occurs […]

  • Apache Sentry. So close, and yet nothing.

    Eric Pugh — December 2, 2014 | 2 Comments | Filed in: solr

    Security, it’s always been the bug a boo of Solr. There is a wide sense that security isn’t a concern of the Solr community, and that isn’t quite accurate. How to secure Solr is pretty simple. It’s just that there isn’t any one “blessed” approach that is wrapped into the codebase as each organizations needs […]

  • Stepwise Date Boosting in Solr

    Doug Turnbull — November 26, 2014 | 0 Comments | Filed in: solr

    When you want to boost on recency of content (ie more recently published documents before older ones), the Solr function query documentation gives you a basic date boost: boost=recip(ms(NOW,mydatefield),3.16e-11,1,1) This will give you a nicely curving multiplicative downboost where the latest documents come first (multiplying the relevancy score by X for stuff “NOW”), slowly sloping […]

  • Two Search Conferences in Two Weeks Was Too Informative

    Eric Pugh — November 25, 2014 | 0 Comments | Filed in: Conference, Opinion, solr

    This year I experienced the conference equivalent of a lunar eclipse: two search conferences in two weeks located two hours away from my home town of Charlottesville, Virginia! Enterprise Search Summit (ESS) and LuceneRevolution (LR) share many similarities. Both have changed their names in the last year, Enterprise Search Summit expanding it’s focus to be […]

  • Playing with Thoth

    Eric Pugh — | 0 Comments | Filed in: solr

    At LuceneRevolution last week, one of the sessions that got me really excited was about Thoth, presented by Damiano Braga and Praneet Mhatre. It was very nicely done, especially considering a 30 minute timeslot! Thoth is a new Solr monitoring solution open sourced by Trulia. Ho hum I can hear you saying, yet another logging […]

  • When click scoring can hurt search relevance — towards better signals processing in search

    Doug Turnbull — October 8, 2014 | 0 Comments | Filed in: solr

    Have you heard of “click scoring” or “click tracking”? In the context of search click scoring is the method whereby you collect statistics on where users click in their search results, then use that information to prefer that search result for the queried text. Consider, Virginia Decoded. A set of Virginia’s state laws For example, […]

  • Let’s Stop Saying “NoSQL”

    Doug Turnbull — September 27, 2014 | 22 Comments | Filed in: solr

    I say the word “NoSQL” a lot. When I say NoSQL, I tend to talk about denormalized and hierarchical document/row-based data stores like Cassandra, Mongo, Couch, or HBase. But its a terrible way to use that term. Because there are also graph databases that feel even more normalized than traditional relational databases. Then there are […]

Developed in Charlottesville, VA | ©2013 – OpenSource Connections, LLC