AWS launches Redshift Spectrum, which lets users query data in S3

AWS launches Redshift Spectrum, which lets users query data in S3

By | April 20th, 2017
No Comments

At the AWS Summit in San Francisco today, public cloud infrastructure

provider Amazon Web Services (AWS) announced the launch of Redshift

Spectrum


														
							

The extension of AWS’ Redshift managed data warehousing service that enables querying on data that sits inside of the longstanding AWS S3 storage service.

The introduction of Redshift Spectrum will make certain types of queries on data more economical, because Redshift, which includes computing and storage capabilities, is a more complex and costly service especially for number crunching on lots of data.

“When you issue a query, Redshift rips it apart and generates a query plan that minimizes the amount of S3 data that will be read, taking advantage of both column-oriented formats and data that is partitioned by date or another key,” AWS chief evangelist Jeff Barr wrote in a blog post. “Then Redshift requests Spectrum workers from a large, shared pool and directs them to project, filter, and aggregate the S3 data. The final processing is performed within the Redshift cluster and the results are returned to you.”

AWS introduced Amazon Redshift in 2012. S3 itself dates to 2006.

[Source]

Google
Nisheeth Bhakuni

\devworx in print
  • IBM Open Platform with Apache Hadoop Get access to all data, in Hive, HBase or HDFS; within a single query (Big SQL). Let Bluemix™ enable you to play with IBM’s Analytics for Hadoop. Try it now.
    Click to know more
  • \devworx contests
      • No contests are currently running.