A memorable moment for Erik was seeing the first Redshift petabyte cluster in action: “We had 100 8XL Redshift nodes spun up to hold the Amazon click log–which was 15 months' worth of clicks on. To get more details on the early days of Redshift, I interviewed Erik Selberg, former data warehouse team Director and one of the earliest Redshift users. The Redshift pricing scheme would open the door for smaller companies to have enterprise-level data warehouses. Enterprises used on-premise data warehouses (like Teradata, IBM, and Oracle) that cost millions to run. It's essential to put this within the context of the industry at the time. Werner Vogels, CTO of, explained the experiment in more detail in his 2012 blog post: ’s data warehouse was a multi-million dollar system (consisting of 32 nodes, 128 CPUs, 4.2TB of RAM, and 1.6PB of disk space) that could be replaced with a two 8XL node Redshift cluster, costing less than $32,000 per year. The experiment results: at least 10x faster queries for a fraction of the cost.Īndy Jassy announcing Redshift at the inaugural AWS re:Invent conference The new service was presented to the excited audience as “the first fully managed, petabyte-scale cloud data warehouse” and made available on “limited preview”.ĭuring the talk, Andy explained that a two-billion-row data set was used for the initial pilot at, where the data warehouse team tested Redshift and compared it to their existing on-premise data warehouse. Ten years ago, Andy Jassy – then AWS Senior Vice President – revealed Amazon Redshift at the inaugural AWS re:Invent conference. To offer diverse points of view throughout this retrospective, I interviewed the former Director of the data warehouse team, as well as a Senior Data Engineer who has been a longtime Redshift user. By the end of this post, you will understand how Redshift interacts with other services in the AWS ecosystem and what the future may hold. This blog post will walk you through the evolution of Redshift, from its origin to the present day, detailing the most important architectural shifts that led to the current version of the service. ![]() Without Redshift at its core, The Amazon Web Services (AWS) data platform would be incomplete. Since then, it has survived the rise and fall of Hadoop, been crowned the fastest-growing Amazon service ever, and successfully evolved to suit the needs of a dynamic data analytics world.
0 Comments
Leave a Reply. |
Details
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |