…between a snippet highlight and a payload highlight: HELOCs|NiAzNTEgNTAyIDQwNSA1Mjc= Learn more by looking at the Solr Payload Component from https://github.com/o19s/payload-component and the Offset Highlighter Component from https://github.com/o19s/offset-hl-formatter. Read other posts…
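As a rough illustration of the payload highlight shown above: the part after the pipe appears to be Base64-encoded text, so a minimal Python sketch can split the token and decode it. The function name `decode_payload_highlight` is hypothetical and is not part of either component, and the assumption that the payload is a space-separated list of integers comes only from decoding this one example. What the integers represent is not stated in this excerpt.

```python
import base64


def decode_payload_highlight(token: str) -> tuple[str, list[int]]:
    """Split a 'term|base64payload' token and decode the payload.

    Assumption: the payload is Base64-encoded UTF-8 text holding
    space-separated integers, as in the single example above.
    """
    term, _, encoded = token.partition("|")
    decoded = base64.b64decode(encoded).decode("utf-8")
    return term, [int(part) for part in decoded.split()]


term, numbers = decode_payload_highlight("HELOCs|NiAzNTEgNTAyIDQwNSA1Mjc=")
print(term, numbers)
```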
…included dictionaries) then Paoding is an excellent choice. Resources and References: http://www.zein.se/patrick/3000char.html: The most common Chinese characters in order of frequency. http://translate.google.com/: A fantastic way to quickly translate a…
…web pages including www.wikipedia.org and www.digg.com. Bob also discussed another web semantics technology, the Web Ontology Language (OWL). OWL is an extension of RDF Schema, which is a…
…from time to time to compare the two, and the quick answer is that Elasticsearch is “Logs, Logs, Logs.” The Elasticsearch search engine itself is general-purpose, but Elastic the company has put…
…have seasonal habits. We act nicer in December to our fellow man (or woman), unless it comes to taking the last Tickle Me Elmo off the shelf; then it's every person for…
We built the Elasticsearch Learning to Rank (LTR) plugin with the Wikimedia Foundation several years ago and we’ve been pleased to see how it’s been adopted by many companies wanting…
…Since January 2015, David Meikle has maintained a great Docker image for running Tika Server, available from https://hub.docker.com/r/logicalspark/docker-tikaserver. The project source is at https://github.com/LogicalSpark/docker-tikaserver. He’s done a yeoman’s job of…
…being run. The UI for Spark has really improved since last fall when I last did Spark work. I then triggered a commit on the Solr collection: http://localhost:8983/solr/collection1/update?commit=true, and then…
…tool: https://developers.google.com/webmasters/structured-data/testing-tool/ as well as a nice tool for visually marking up existing HTML content with the microdata tags: https://www.google.com/webmasters/markup-helper. One disappointment I had is that I couldn’t get the…
…this JavaScript code: http://www.michaelapproved.com/articles/daylight-saving-time-dst-detect/ and while it seemed very promising, it still wasn't quite giving me what I wanted. Finally, I realized it…. By arbitrarily grabbing the first time…
…do we present the remaining results? Finally, calculating the non-dominated frontier is computationally much more complex than simply sorting the documents by some scalar score. This complexity, however, might be…
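To make the cost contrast above concrete, here is a minimal sketch that compares a scalar sort with a naive non-dominated (Pareto) frontier scan. The document names, the two criteria, and the score values are all made up for illustration, and the pairwise scan is just one simple way to compute a frontier, not necessarily the approach the post has in mind.

```python
def dominates(a, b):
    """True if a is at least as good as b on every criterion and
    strictly better on at least one (higher is better)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def non_dominated(scores):
    """Naive frontier: compare every document against every other, O(n^2)."""
    return [
        doc
        for doc, s in scores.items()
        if not any(dominates(t, s) for other, t in scores.items() if other != doc)
    ]


# Hypothetical (relevance, recency) scores per document.
scores = {
    "A": (0.9, 0.2),
    "B": (0.5, 0.8),
    "C": (0.4, 0.7),
    "D": (0.9, 0.1),
}

# Scalar ranking: a single O(n log n) sort on one combined number.
by_sum = sorted(scores, key=lambda doc: sum(scores[doc]), reverse=True)

# Frontier: documents that no other document beats on every criterion.
frontier = non_dominated(scores)
print(by_sum, frontier)
```

Here C is dominated by B and D is dominated by A, so only A and B remain on the frontier, while the scalar sort still returns all four documents in a single order.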
…and dicing of the query string to manually construct whatever I need to (http://www.supermind.org/blog/1134/custom-solr-queryparsers-for-fun-and-profit). One thing I often do is repeat the functionality of Elasticsearch's match query. Elasticsearch's match query…