AWS Notes

Redshift

AWS Notes

Table of Contents
Identity
Identity
Compute
Compute
- Containers
  Containers
  - Basics
  - Ecr
  - Ecs
  - Eks
- Ec2
  Ec2
- Elasticbeanstalk
  Elasticbeanstalk
  - TOC
  - Basics
  - Deployment
  - Docker
- Lambda
  Lambda
  - Basics
  - Execution
  - Networking
  - Stepfunctions
  - TOC
Databases
Databases
- Athena
- Elasticcache
- Redshift
- Aurora
  Aurora
  - Index
- Dynamodb
  Dynamodb
  - TOC
  - Basics
  - Dax
  - Globaltables
  - Indexes
  - Operating
  - Streams
- Rds
  Rds
  - Index
  - Basics
  - Multiaz
  - Backups
  - Readreplicas
  - Rdssecurity
  - Aurorabasics
  - Auroraserverless
  - Rdsproxy
  - Dms
Networking
Networking
- Toc
- Apigateway
  Apigateway
  - Basics
- Hybrid
  Hybrid
- Scaling
  Scaling
  - Asg
  - Loadbalancer
- Vpc
  Vpc
Storage
Storage
- Efs
- Fsx
- Transferfamily
Security
Security
- Cloudhsm
- Guardduty
- Inspector
- Neworkfirewall
- Trustedadvisor
- Waf
- Key management service
  Key management service
  - KMS Basics
Monitoring
Monitoring
- Awsconfig
- Cloudtrail
- Vpcflowlogs
- X ray
- Cloudwatch
  Cloudwatch
  - Basics
  - Cwlogs
  - Toc
Devops
Devops
- Basics
- Cicd
Costmanagement
Costmanagement
- Basics
Bcpdr
Bcpdr
- Disasterrecovery
Eventdriven
Eventdriven
- Architecture
- Eventbridge
- Kinesis
- Sns
- Sqs
- TOC
Machinelearning
Machinelearning
- Rekognition
- Comprehend
- Devicefarm
- Forecast
- Frauddetector
- Glue
- Kendra
- Kinesisvideostreams
- Lexconnect
- Polly
- Sagemaker
- Textract
- Transcribe
- Translate
Tbd

Redshift

Petabyte scale Data warehouse.
It is OLAP and Column based.
Data on S3 can be queried directly without loading into Redshit using Redshift Spectrum
Federated query allows querying of multiple remote databases
It is server based and provisioned
Redshift runs across multiple nodes connected by high speed network. Hence it runs in one AZ. Thus not highly available. It is VPC service
Leader Node - Takes query input, creates execution plan,performs aggregation
Compute Node - Perform actual queries. Each compute node is divided into slices
Slices work in parallel
A node can have 2,4,16 or 32 slices
Enhanced VPC routing needs to be enabled to perform advanced VPC configurations.
Data is automatically replicated across to additional node other than writer, when being written
Automatic backups happen to S3 every 8 hours of 5 GB data written to cluster with 1 day retention by default. Manual snapshots can be created.