Apache Cassandra at Instagram for Messaging
Instagram's Use Case
Instagram’s first deployment with Apache Cassandra was for storing audit information related to site integrity purposes; this was running on Redis in-memory. Implementing Apache Cassandra cut costs, to the point that they were paying around a 1/4 of what we were paying before. Instagram now uses Apache Cassandra for messaging, newsfeed, & fraud detection services. Instagram found Apache Cassandra to be a perfect fit because of its high availability, write throughput, and linear scalability.
"Implementing Cassandra cut our costs to the point where we were paying around a quarter of what we were paying before. Not only that, but it also freed us to just throw data at the cluster because it was much more scalable and we could add nodes whenever needed."
-Rick Branson, Infrastructure Engineer
Apache Cassandra at Nexgate for Fraud Detection
Nexgate's Use Case
Nexgate harvests huge swarms of data out of the social web and stores it in Apache Cassandra; they then classify and action this data based on policies that customers can configure. Nexgate needed the ability to quickly add new columns, and be able to write code that operates on those new columns in a performant manner. Apache Cassandra allows Nexgate to build an almost endless scalable store for all of their social data.
"When we ran our benchmarks, Apache Cassandra won hands down in terms of reliability, ease of use, the speed in which you could scale horizontally. It just won technologically."
-Rich Sutton, Chief Technology Officer
Apache Cassandra at Spotify for Product Catalogs & Playlists
Spotify's Use Case
Spotify uses Cassandra to store data for their entire product catalog and key customer experience capabilities such as playlists, radio stations, notification popups, recommendation engine, and the customized lists of artists. To achieve the level of service, demanded by its 40 million+ active users, Spotify needed Apache Cassandra to keep up with its growth, without performance or availability issues. Spotify initially started out as a PostgreSQL shop and migrated to Apache Cassandra as their big data needs were not being met.
“During the four years that we have used Cassandra, we’ve never lost a single bit of data.”
-Jimmy Mårdell, Tech Product Owner
Apache Cassandra at I2O Water for Internet of Things
I2O Water's Use Case
i2O Water records time-series data for multiple physical channels from their devices in the field, over a GPRS mobile phone network, through the internet and into Cassandra. Prior to using Apache Cassandra, i2O Water had a traditional analysis technology using Microsoft SQL Server. They currently use Apache Cassandra to save over 100 million liters of water per day, for customers across the world.
"The other technologies we looked at were other column stores, both open-source and commercial, and by far and away Cassandra had the best reputation and had the best performance for the testing that we did."
-Mike Williams, Software Director
Apache Cassandra at eBay for Recommendation & Personalization
eBay's Use Case
eBay is storing user activity data in Apache Cassandra, representing it as a graph that is made up of edges between users and items that the user has indicated an interest or disinterest towards. As new behavioral data is recorded, in real time, they update their models about what the user is predicted to like or not. Cassandra is critical for being able to look up historical behavior data quickly, so that they can do update their models with low latency. eBay is storing this data in multiple data centers of their own.
“Cassandra is critical for being able to look up historical behavior data quickly, so that we can do these model updates with low latency.”
-Thomas Pinckney, Senior Director of Engineering