• The Semantic Web up and coming – impressions of SEMANTiCS 2014

    René Kriegler — September 19, 2014 | 0 Comments | Filed in: Conference, Natural Language Processing, Technologies

    When you hear someone say about a technology that ‘it only works in theory’, ‘it is too labour-intensive’ and ‘it is not industry-ready’, chances are that they are talking about semantic web technologies. As my experience has been different in a semantic search project for OpenSource Connections, I went to the SEMANTiCS 2014 conference in […]

  • September Chock Full of Talks (Dougtember?)

    Doug Turnbull — August 28, 2014 | 0 Comments | Filed in: solr

    I somehow managed to line up a speaking gig for every week in September! I hope you’ll join me on this insane marathon. I’ll be talking about topics key to what we care about at OSC: search as a datastructure, search relevancy, and search/big data at performance and scale. Don’t hesitate to *protected email* if […]

  • Introducing Splainer — The Open Source Search Sandbox That Tells You Why

    Doug Turnbull — August 18, 2014 | 0 Comments | Filed in: solr

    One piece of feedback that has consistently come with our Quepid search testing tool is the need to understand “why” search results come back the order they do. In plain English, what factors influence search the most? Why does my search engine think a document about “water bottles” is more relevant than “baby bottles” for […]

  • Using Quepid to Improve Relevancy of Advance Auto Parts Intranet Search

    Doug Turnbull — July 24, 2014 | 0 Comments | Filed in: solr

    Recently, Advance Auto Parts contacted OSC to improve the search relevancy of their intranet application, Starting Line. Starting Line serves as the knowledge base for every store employee, so having relevant internal search results helps keeps employees connected with resources and company news. Through our two day Quepid relevancy assessment, we helped bring together content […]

  • Improving The Camel Solr Component

    Doug Turnbull — July 15, 2014 | 0 Comments | Filed in: solr

    We’ve been using Apache Camel a fair amount recently as our ingestion pipeline of choice. It presents a fairly nice DSL for wiring together different data sources, performing transformations, and finally sending data to Solr. Using the normal Solr component, you can write code that looks like this: from(“file://foo?fileName=input.csv”) .unmarshall().csv() .split(body()) .to(“bean:convertToSolrDoc”) .setHeader(SolrConstants.OPERATION, SolrConstants.INSERT) .to(“solr://localhost:8983/solr/collection1”) […]

  • Reindexing Collections with Solr’s Cursor Support

    Doug Turnbull — July 13, 2014 | 1 Comment | Filed in: solr

    When a Solr schema changes, us Solr devs know what’s next — a large reindex of all of our data to capture any changes to index-time analysis. When we deliver solutions to our customers, we frequently need to build this in as a feature. Many cases, we can’t easily access the source system to reindex. […]

  • Quepid : Athena Release

    Jonathan Thompson — July 11, 2014 | 0 Comments | Filed in: OSC

    As the newest full time developer working on Opensource Connection’s search relevancy tool, Quepid, I’m happy to announce that our newest release, codenamed “Athena”, is now live. This release is the first in a series named after Greek figures in mythology that aims to add powerful new features for our tool. Athena So what does […]

  • RDS is expensive — a cautionary AWS tale

    Eric Pugh — June 30, 2014 | 1 Comment | Filed in: AWS, Cloud Computing, Opinion, System Administration

    I wanted to share with the world a cautionary story related to my by @softwaredoug that reminded me that while Amazon AWS is amazing, it’s also best used in situations where your needs are extremely variable. It’s the natural gas powerplant versus coal powerplant of hosting providers. I looked at our bill out of curiosity. […]

Developed in Charlottesville, VA | ©2013 – OpenSource Connections, LLC