The Rimm-Kaufman Group (RKG) provides a wide range of Data-Driven online marketing solutions to online businesses. RKG clients range from startups to the Fortune 500, and of late business has been growing at an ever increasing rate. This is great! But this also means that ever increasing pressure is placed upon their current MySQL-based infrastructure. Since RKG anticipates their clientele to continue growing larger, RKG decided that now is the time to take action and upgrade their infrastructure. And their technology of choice: Cassandra.
OSC helped us jumpstart our usage of Cassandra. We have been able to speed up our processing of certain data by a factor of 5 simply by moving the storage of some data from MySQL to Cassandra. We look forward to using Cassandra more!
Over the course of our two month project, OpenSource Connections aided RKG in building and deploying at production-ready Cassandra cluster. Highlights of the work are as follows:
- Upon reviewing RKG’s technical and business requirements, OpenSource Connections outlined the hardware specifications and defined the optimal cluster configuration. Based upon this information, new servers were purchased and assembled into the production Cassandra cluster.
- Part of the challenge when moving from SQL to Cassandra is getting your head around the new, and somewhat foreign way that Cassandra expects you to model data. OpenSource Connections worked closely with RKG developers to port their current solutions to Cassandra.
- Testing is almost as important as developing an appropriate data model. OpenSource Connections helped RKG design a Cassandra test framework that fit in well with their current MySQL testing framework.
- Before leaving RKG, OpenSource Connections provided the RKG tech team with a full-day training course in Cassandra development and systems administration. As part of the training, the development team brainstormed solutions to actual in-house development objectives.
By the time that OpenSource Connections left, RKG was up and running with their new 7-node Cassandra cluster – capable of storing terabytes of data and accepting more than 10K writes per second! But more important than this, RKG is now fully able to utilize and maintain their new Cassandra cluster on their own.