Amazon Redshift

Redshift is a petabyte-scale data warehousing solution. It’s a column-based database designed for analytical workloads. Generally, a relational store like RDS would be used for OLTP workloads (e.g., short transactional queries, inserts, updates, and deletes), and Redshift would be used for OLAP (e.g., retrieval and analytics). Data from multiple source databases is loaded into a data warehouse solution such as Redshift.
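
To make the row-versus-column distinction concrete, here is a minimal Python sketch. It is a toy illustration only and not how Redshift stores data internally; the `rows` sample data and the `to_columns` helper are hypothetical names invented for this example.

```python
# Toy illustration only: hypothetical sample data, not Redshift internals.
rows = [
    {"order_id": 1, "region": "EU", "amount": 120.0},
    {"order_id": 2, "region": "US", "amount": 80.0},
    {"order_id": 3, "region": "EU", "amount": 50.0},
]


def to_columns(records):
    """Pivot a list of row dicts into one list of values per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# OLTP-style access: fetch one complete row by its key.
order = next(r for r in rows if r["order_id"] == 2)

# OLAP-style access: aggregate a single column across every row.
# Only the "amount" values are touched; the other columns are never read.
total_amount = sum(columns["amount"])
```

In the column layout, an aggregate over one field reads only that field's values, which is the property that makes column-based stores a good fit for analytics.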

Amazon Kinesis

Kinesis is a scalable and resilient streaming service from AWS. It is designed to ingest large amounts of data from hundreds, thousands, or even millions of producers. Consumers can access a rolling window of that data, or it can be stored in persistent storage or database products.
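
The "rolling window" idea can be sketched in plain Python as below. This is a simplified model under stated assumptions, not the Kinesis API: the `RollingStream` class, its method names, and the 10-second retention value are all hypothetical and chosen only for illustration.

```python
from collections import deque


class RollingStream:
    """Hypothetical in-memory model of a stream with a retention window."""

    def __init__(self, retention_seconds):
        self.retention_seconds = retention_seconds
        self._records = deque()  # (timestamp, data) pairs, oldest first

    def _expire(self, now):
        # Drop records that have aged out of the retention window.
        while self._records and now - self._records[0][0] > self.retention_seconds:
            self._records.popleft()

    def put(self, timestamp, data):
        """Producer side: append a record and expire old ones."""
        self._records.append((timestamp, data))
        self._expire(timestamp)

    def read(self, now):
        """Consumer side: return the data still inside the window."""
        self._expire(now)
        return [data for _, data in self._records]


stream = RollingStream(retention_seconds=10)
stream.put(0, "a")
stream.put(5, "b")
stream.put(12, "c")  # "a" is now older than 10 seconds and is dropped
window = stream.read(now=12)
```

Records that must outlive the window would have to be copied elsewhere by a consumer, which corresponds to the persistent storage option mentioned above.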

Amazon Elastic MapReduce (EMR)

Amazon Elastic MapReduce (EMR) is a tool for large-scale parallel processing of big data workloads. It’s based on the Apache Hadoop framework and is delivered as a managed cluster using EC2 instances. EMR is used for huge-scale log analysis, indexing, machine learning, financial analysis, simulations, bioinformatics, and many other large-scale applications.
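
As a rough illustration of the MapReduce model that gives the service its name, the following single-process Python sketch counts words in log lines. It assumes the classic map, shuffle, and reduce phases and runs locally; it does not use Hadoop or any EMR API, and all function names are hypothetical.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    # On a real cluster the map and reduce calls would run in parallel
    # across many nodes; here they run sequentially in one process.
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = word_count(["error timeout", "ERROR disk"])
```

Because each map call depends only on its own line and each reduce call only on its own key, the work can be split across machines, which is what makes the model suitable for large-scale log analysis.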
