{"id":665,"date":"2017-10-20T11:01:23","date_gmt":"2017-10-20T11:01:23","guid":{"rendered":"http:\/\/wiki.thomasandsofia.com\/?p=665"},"modified":"2017-10-22T21:17:05","modified_gmt":"2017-10-22T21:17:05","slug":"redshift","status":"publish","type":"post","link":"https:\/\/wiki.thomasandsofia.com\/?p=665","title":{"rendered":"RedShift"},"content":{"rendered":"<p><a href=\"https:\/\/www.udemy.com\/aws-certified-solutions-architect-associate\/learn\/v4\/t\/lecture\/2050702?start=0\" target=\"_blank\" rel=\"noopener\">https:\/\/www.udemy.com\/aws-certified-solutions-architect-associate\/learn\/v4\/t\/lecture\/2050702?start=0<\/a><\/p>\n<p>Redshift is a fast and powerful, fully managed, petabyte-scale data warehouse service in the cloud.\u00a0 Customers can start small for just $0.25 per hour with no commitments or upfront costs and scale to a petabyte or more for $1000\/terabyte per year, less than 1\/10 the cost of most other data warehousing solutions.<\/p>\n<p>Data warehousing databases use a different type of architecture, both from a database perspective and at the infrastructure layer.<\/p>\n<h2>Redshift Configuration<\/h2>\n<ul>\n<li>Start with a single node with up to 160GB of data on that node.<\/li>\n<li>Scale to Multi-Node\n<ul>\n<li>Leader Node (manages client connections and receives queries)<\/li>\n<li>Compute Nodes (store data and perform queries and computations)\n<ul>\n<li>Up to 128 compute nodes.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<h2>10X Faster!<\/h2>\n<h4>Columnar Data Storage<\/h4>\n<p>Instead of storing data as a series of rows, Redshift organizes the data by column.\u00a0 Unlike row-based systems, which are ideal for transaction processing, column-based systems are ideal for data warehousing and analytics, where queries often involve aggregates performed over large data sets.\u00a0 Since only the columns involved in the queries are processed and columnar data is stored sequentially on the storage media, column-based systems require far fewer I\/Os, greatly 
improving query performance.<\/p>\n<h4>Advanced Compression<\/h4>\n<p>Columnar data stores can be compressed much more than row-based data stores because similar data is stored sequentially on the disk.\u00a0 Redshift employs multiple compression techniques and can often achieve significant compression relative to traditional relational data stores.\u00a0 In addition, Redshift doesn&#8217;t require indexes or materialized views and so uses less space than traditional relational database systems.\u00a0 When loading data into an empty table, Redshift automatically samples your data and selects the most appropriate compression scheme.<\/p>\n<h4>Massively Parallel Processing (MPP)<\/h4>\n<p>Redshift automatically distributes data and query load across all nodes.\u00a0 Redshift makes it easy to add nodes to your data warehouse and enables you to maintain fast query performance as your data warehouse grows.<\/p>\n<h2>Pricing<\/h2>\n<ul>\n<li>Compute Node Hours: (Total number of hours you run across all your compute nodes for the billing period.\u00a0 You are billed for 1 unit per node per hour, so a 3-node data warehouse cluster running persistently for an entire 30-day month would incur 2,160 instance hours.\u00a0 You will not be charged for leader node hours &#8211; only compute nodes will incur charges.)<\/li>\n<li>Backup<\/li>\n<li>Data transfer (Only within a VPC, not outside it.)<\/li>\n<\/ul>\n<h2>Security<\/h2>\n<ul>\n<li>Encrypted in transit using SSL<\/li>\n<li>Encrypted at rest using AES-256 encryption<\/li>\n<li>By default, Redshift takes care of key management.\n<ul>\n<li>Manage your own keys through HSM (Hardware Security Modules)<\/li>\n<li>AWS Key Management Service<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<h2>Availability<\/h2>\n<ul>\n<li>Currently only available in 1 AZ\n<ul>\n<li>Not designed for production, so it should not be the end of the world if this is not available all the time.<\/li>\n<\/ul>\n<\/li>\n<li>Can restore snapshots to new AZs in the event of 
an outage.<\/li>\n<\/ul>\n<h2>Exam Tips<\/h2>\n<ul>\n<li>Redshift is a data warehousing service, used primarily for running reports.<\/li>\n<li>Extreme speed due to:\n<ul>\n<li>Columnar Data structure<\/li>\n<li>Data stored sequentially<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/www.udemy.com\/aws-certified-solutions-architect-associate\/learn\/v4\/t\/lecture\/2050702?start=0 Redshift is a fast and powerful, fully managed, petabyte-scale data warehouse service in the cloud.\u00a0 Customers can start small for just $0.25 per hour with no commitments or upfront costs and scale to a petabyte or more for $1000\/terabyte per year, less than 1\/10 the cost of most other data warehousing solutions. Data Warehousing databases use different type ..<\/p>\n<div class=\"clear-fix\"><\/div>\n<p><a href=\"https:\/\/wiki.thomasandsofia.com\/?p=665\" title=\"read more...\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[18,20],"tags":[],"class_list":["post-665","post","type-post","status-publish","format-standard","hentry","category-amazon-web-services-aws","category-aws-databases"],"_links":{"self":[{"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=\/wp\/v2\/posts\/665","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=665"}],"version-history":[{"count":4,"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=\/wp\/v2\/posts\/665
\/revisions"}],"predecessor-version":[{"id":670,"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=\/wp\/v2\/posts\/665\/revisions\/670"}],"wp:attachment":[{"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=665"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=665"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wiki.thomasandsofia.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=665"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}