Compute

EC2 - Elastic Cloud Compute

Amazon Machine Images - AMI

Large selection, own AMI, market place AMI

Instance types

An instance type simply defines the size of the instance based on a number of different parameters, these being ECUs. This defines:
- vCPUs this is the number of virtual CPUs on the instance.
- Physical processor, this is the process speed used on the instance.
- Clock speed, it's clock speed in gigahertz.
- Memory, the amount of memory associated.
- Instance storage this is the capacity of the local instance store volumes available.
- EBS optimized available, this defines if the instance supports EBS optimized storage or not.
- Network performance, this shows the performance level of rate of data transfer.
- IPV6 support, this simply indicates if the instance type supports IPV6.
- Process architecture this shows the architected type of the processor.
- AES-NI, this stands for advanced encryption standard new instructions and it shows if the instance supports it for enhanced data protection.
- AVX this indicates if the instance supports AVX which is advanced vector extensions, which are primary used for applications focused on audio and video, scientific calculations and 3D modeling analysis.
- And finally Turbo which shows if the instance supports intel turbo boost and AMD turbo core technologies.

### Instance Family * Micro instances - These are ideal for very low throughput use cases such as low traffic websites. * General-purpose - balanced mix of CPU memory and storage making them ideal for small to medium databases, tests and development servers and back-end servers. * Compute optimized - focus on compute. They have the highest performing processes for high-performance front end servers, web servers, high-performance science and engineering applications and video encoding and batch processing. * GPU - GPU stands for graphics processing unit, optimized for graphic intensive applications. * FPGA - field programmable gate arrays. To create application specific hardware accelerations when used with applications that use massively parallel processing power such as genomics and financial computing. * Memory optimized - primarily used for large-scale enterprise class in-memory applications, such as performing real time processing of unstructured data, Microsoft SharePoint. * Storage optimized - optimized for enhanced storage. SSD backed instant storage for low latency and very high I/O, input/output performance, including very high IOPS which is input/output operations per second. And these are great for analytic workloads and no SQL databases. Data file systems and log processing applications.

Instance purchasing options

On-demand instances

unpredictable
testing and dev.

Reserved instances

75% discount
1- 3 years
upfront payment, partial upfront, no up-front.
long-term committed, steady workloads

Scheduled instances

fixed recurring schedule
better rates than on-demand. charged even not used.
scheduled but not continuously running workloads

Spot instances

bid higher than the current spot price
2 minute warning if bid is not met
low cost
suddenly removed - batch jobs and background jobs.

on-demand capacity reservations.

reserve capacity based on different attributes
this can be used with reserved instances plan to save money

Tenancy

Shared tenancy

multiple customers on same host
AWS manages security

Dedicated instances

hardware only be used by one aws account
additional charge
for security and compliance reasons.

Dedicated hosts

additional visibility into hardware
placement of instances can be controlled
socket and core visibility for licensing (per vm license)
compliance, security and regulation requirements

User data

runs onl once during first boot
pull software updates and install during first boot

storage options

Instance Storage

instance backed storage - all data is lost when stopped or terminated. Reboot retains data. Not detachable.
Ephemeral storage / temporary.
No additional cost and included as part of EC2.
very high i/o speed. I3 - 3.3 million read IOPS/1.4 million write IOPS.
Good for cache/buffer.
Not all instances support instance type volumes. The sizes will increase with the size of the EC2 instance.
Same security and policy as the EC2.

EBS Storage

EBS storage - attached by network, persistent, EBS is auto replicated by AWS for resiliency, encryption and backup, size and performance can vary, Detachable and re-attachable.
delete on termination is set to True by default for root volume.

security

Security Group - instance level firewall. Inbound and Outbound control.
- Default
- All inbound traffic from same SG.
- All outbound traffic to any destination.
Key pair - Public and private key pair. Private key is your and public one remains with AWS. Used to decrypt password on windows or ssh for linux. Same key pair can be used across multiple instances.
OS and its patches are your responsibility.

Status Checks

System Status Check - Likely H/W issue, stop and start so that it launches somewhere else. Do not reboot as likely to remain on same server.
Instance Status Check - EC2 instance itself, network issue, corrupted file system etc. Require user intervention.

EC2 Auto Scaling

AWS Auto Scaling can handle scaling for Amazon ECS, DynamoDB and Amazon Aurora.

Why

Automation (scale-in and scale-out) based on metrics and thresholds.
Customer Satisfaction
Cost Reduction

Components

1. Launch Template Configuration

Two Options: Launch Template / Launch Configuration Include: * AMI * Instance Type * Spot instance or not * Public IP Address * user data * storage volume and configuration * Security Groups

Launch Template

Newer version of Launch Configuration

Create a Template: * Template or Version * Source Template * AMI * Instance Type * Key/Pair * VPC/Classic * Security Groups * Storage * Tags * Instance specific configurations (loads of them) * user data

Launch Configuration

Create a Configuration: * AMI * Instance Type * Spot or not * IAM role * Limited instance specific options * user data * Storage * Security Groups

Difference

Single page for Launch Template
Advanced options in Launch Template

2. Auto Scaling Group

Group

Launch Template / Configuration choose.
Adhere to Launch Template for fleet or combine purchase options
Group Size
Network --> VPC and Subnets
Association to Load Balancer
Instance Protection (from scale in)
service-linked role
Keep to size or use policy to scale

Autoscaling Policies

Minimum and Maximum Size
Cooldown - default 300s
Health Check Grace - default 300s
Manual Scaling
Schedule Scaling
Dynamic Scaling
Predictive Scaling

Manual Policy

When there is a known reason to adjust the group size up - for example known campaign with known user surge
When there is a known reason to adjust the group size down - for example over resource of budgeting.

Dynamic Scaling

Step Scaling

Adding or removing instances based on metric tracking.
Lower Bound - 30% CPU Load - Remove instance
Upper Bound - 80% CPU Load - Add Instance
Cooldown Policy - Period where autoscaling is suspended allowing previous scaling activity to take effect.
Threshold Trigger points are based on CloudWatch Alarms that Autoscaling can listen to. Multiple Triggers can be set with various levels of impact such as 50% (add 1), 60% (add 3), 90% (add 6)
While adding using step scaling, Autoscaling accounts for ongoing scaling events and doesn't add on top but accumulates
Scale up aggressively, scale down slowly (have a large cooldown policy).
We are in-charge of when and how scaling happens.

Target Tracking

Put a target on the metric and all the alarms and steps are setup automatically for the system to keep track of the metric and try to hit the target.
Works less well with smaller ASG size and the granularity/impact of each change is large. Better for large ASG sizes.
Do not delete any cloudwatch alarms setup by Target Tracking Policy. The alarms will be automatically remove if the policy is removed.

Predictive Scaling

Machine learning to identify patterns of loads and the ASG is scaled out according to prediction and scaled in when load decreases.
Needs data to run the model - at least 24 hours of historical data. It will look upto 14 days back.
Good for patterns that repeat on some schedule - batch, end of day checking of things etc.
Forecast only mode means it will only show predictions and graphs the predicted performance for review. Enabling scaling will enable the system to do scaling.
The granularity of the predicted scaling is every hour - i.e. the autoscaling happen every hour and not at more granularity than that.
Predictive scaling can be combined with Dynamic Auto Scaling.

Scheduled Scaling

Scale in and out at fixed time
Batch processing using spot instances late night
Turn off Dev/Test after hours at night.
Again can be combined with dynamic scaling.

Note

Associate ASG directly to the Classic Load Balancer or to a Target Group from its configuration. ALB / NLB can then forward traffic to the target group and would automatically pickup all the EC2 instances in the ASG for load balancing.

ECS - Elastic Container Service

No management software needed
No monitoring software needed
Clusters are region specific can't scale across regions.
Clusters act as a resource pool and dynamically scalable
Multiple instance type is possible
Docker Demon and ECS agent on nodes of a cluster allowing ECS commands to be translated to Docker Commands.

Launch Options

AWS Fargate

Configuration

CPU and Memory required
Networking
IAM policies
Package application into containers

Cluster management across EC2 instances. AWS Fargate is an engine used to enable ECS to run containers without having to manage and provision instances and clusters for containers.

Docker is piece of software that allows you to automate the installation and distribution of applications inside Linux Containers. A Container holds everything that an application requires to enable it to run from within it's isolated container package. This may include system libraries, code, system tools, run time, etcetera.

EC2

instance and instance patching
more granularity
security and compliance controls

ECR

Fully managed by AWS for docker imgaes

Registry

Host and store docker images
Your account w/r to images in registry
IAM policies can control access to registry
docker client needs to authorize to the default registry

Authorization Token

AWS CLI get login for authorization. Outputs a docker login command
Docker command to be used in docker client for login
Token lasts for 12 hours.
ecr:getAuthorizationToken API call for an AWS user.

Repository

Grouping of images into repository for storage
IAM and registry policies. IAM policies exist for RegistryFullAccess, RegistryPowerUser, RegistryReadOnly.

Repository Policy

Resource based policies and requires a principal
Created within ERC itself and within the repositories.

EKS - Elastic Container Service for Kubernetes

open source container orchestration tool.
Can run Docker and Rocket containers
AWS takes care of management infrastructure knowns as control plane. Ths is done across multi-AZ for resilience.
User needs to maintain the work nodes.
Control Plane does scheduling, tracking stake of all containers. It is made of:
- Number of APIs
- Kubelet Process
- Kubernetes Master
Nodes - Worker machine. On-Demand EC2 instance and includes software to run containers via specific AMI which ensures Docker and Kublet in installed. Provisioned worker nodes connect to an EKS endpoint.

Working with EKS

EKS Service Role - AmazonEKSServicePolicy and AmazonEKSClusterPolicy. Can be assigned to multiple clusters.
EKS Cluster VPC - use Cloud Formation to run a specific template
Install kubectl - kubernetes command line
Install AWS-IAM-Authenticator - needed to authenticate to EKS Cluster
Create EKS Cluster - use VPC details from step 2.
Configure kubectl for EKS - update-kubeconfig on AWS CLI create a kubeconfig file for EKS cluster
Provision and configure worker nodes - can be done via cloud formation using a template.
Configure the worker node to join the EKS Cluster.

Elastic BeanStalk

AWS Elastic Beanstalk is an AWS managed service that allows you to upload the code of your web application, along with the environment configurations, which will then allow Elastic Beanstalk to automatically provision and deploy the appropriate and necessary resources required within AWS to make the web application operational.

These resources can include other AWS services and features, such as EC2, Auto Scaling, application health-monitoring, and Elastic Load Balancing, in addition to capacity provisioning.

Elastic Beanstalk is able to operate with a variety of different platforms and programming languages, making it a very flexible service for your DevOps teams. There is no cost associated with Elastic Beanstalk, however, any resources that are created on your application's behalf, such as EC2 instances, you will be charged for as per the standard pricing policies at the time of deployment.

Components

An application version is a very specific reference to a section of deployable code. The application version will point typically to S3, simple storage service to where the deployable code may reside.
An environment refers to an application version that has been deployed on AWS resources. This comprises of all the infrastructure provisioned by Elastic Beanstalk along with the code.
An environment configuration is a collection of parameters and settings that dictate how an environment will have its resources provisioned by Elastic Beanstalk and how these resources will behave.
The environment tier. This component reflects on how Elastic Beanstalk provisions resources based on what the application is designed to do. You can deploy your application across one of two different environment tiers, either the web server tier or the worker tier. The web server environment is typically used for standard web applications that operate and serve requests over HTTP port 80. This tier will typically use service and features such as Route 53, Elastic Load Balancing, Auto Scaling, EC2, and Security Groups. The worker environment is slightly different and are used by applications that will have a back-end processing task that will interact with AWS SQS, the Simple Queue Service. This tier typically uses the following AWS resources in this environment, an SQS Queue, an IAM Service Role, Auto Scaling, and EC2.
The configuration template. This is the template that provides the baseline for creating a new, unique environment configuration.
The platform is a culmination of components in which you can build your application upon using Elastic Beanstalk. This is everything that may be used.
Application. Within Elastic Beanstalk, an application is a collection of different elements, such as environments, environment configurations, and application versions.

Compatibility


Node.js	PHP	Python
Single Container Docker	Java SE	Go
Multi Container Docker	Java with Tomcat	Packer Builder
Preconfigured Docker	.NET on Windows Server IIS	Ruby

Environment Tiers

Setting up configuration template for the application requires selection.
Potentially both these can be used together decoupled by a SQS to transfer messages between the environments.
Additional customization can be done via Elastic Beanstalk Configuration files within the application source code written in json or yaml. Saved with .config extension inside .ebextensions folder inside source code.

Web Server Tier

Typically uses the following components: * Route53 - handles the CNAME record and alias for using the DNS name with a ELB * ELB - Should have at least one ELB for load balancing * EC2 Autoscaling Group - Handles scale-in and scale-out * EC2 Instances - Part of the ASG that handles the requests. Host Manager is installed enabling the following: * aid in the deployment of your application, * collates different metrics and different events from the EC2 instance which can then be reviewed from within the console or via the AWS CLI or API, * generates instance-level events, it monitors both the application log files and the application server itself. * It will also patch instance components, * it will manage the log files, allowing them to be published at S3. * Security Group - Provides instance protection for EC2. Will by default allow HTTP traffic/ port 80.

Worker Tier

Back-end processing tasks usually dealing with a SQS. Typical components: * SQS - This will be created if one doesn't exist that is configured * IAM Service Role - For the EC2 instances to be able to access and interact with the SQS * EC2 Autoscaling Group - Handles scale-in and scale-out * EC2 Instances - Does the work of pulling data from SQS. Daemon installed on EC2 to pull data from SQS and send data to application.

Deployment Option of Elastic Beanstalk

All at once (Default)

Default
Rolls out all the resource at the same time. May disrupt if the existing is already in use.

Rolling

Minimizes disruption to existing usage
Deploy application and batches and move forward as batches succeed
Can serve request while the gradual roll out happens.

Rolling with additional batch

Application availability is completely minimized by using additional batches of resources and replaces one batch at a time to reduce the impact on usage

Immutable

A completely second environment is created separately
Doubles the environment until the new environment is setup and ready to serve traffic
Destroys the old environment once all health checks passed.

Monitoring and Health checks

Cloudwatch metrics at 5 min intervals as done by default in other approaches.
Health check is not determined based on these metrics

Status Checks

If ELB exists within the environment, it sends request to EC2 every 10s for health check
If no ELB, then EC2 Instance Status Check is used determine the health

Other Checks

High level check of the environment are also done automatically.
Web Tier - ASG has at least one healthy EC2, Correct settings in Route53 (CNAME redirect to correct ELB), Security Group allows port 80 inbound
Worker Tier - SQS being polled every 3 mins at minimum.

Basic Health Reporting

Gray - Update in progress
Green - Environment passed recent health check and at least one instance available
Yellow - Environment failed one or more health check, some requests failing
Red - Environment failed three of more health check, unavailable and constantly failing.

Enhanced Health Reporting

Ok - Passed all health checks
Warning - extended time to progress and some failures
Degraded - high number of failures
Severe - excessive number of failures
Pending - operations in progress
Unknown - insufficient data
Suspended - Not monitored

Health Agent installed in the AMIs to get deeper level of data from the instances and allows additional metrics. Additional metrics ca be sent to CloudWatch as custom metrics but incurs cost. Agent is a Daemon that gets more deeper data and more frequently (10s).

Lambda

Properties

serverless

How does it work

Write code and upload to Lambda. Code can contain other libraries.
Trigger the lambda to run the code.
Computer power is ascertained and lambda code executes
sub ms billing by aws.

Lambda Function:

Code
Permissions
Environment Variables
CPU/Memory. Choose Memory --> Proportional CPU/Network/DisIO allocated. 128MB - 10240MB.

Programming Languges/Runtime

Java
Go
Powershell
Node.js
C#
Python
Ruby
Custom Runtime feature

Invocation

Directly through AWS Console, AWS SDK, AWS Toolkits, AWS CLI
Function URL
Triggers based on other AWS Service events as source. Even scheduled events.

Invocation Models

Synchronous - Request/Response Model. Trigger responsible for retry if the Lambda invocation fails.
Asynchronous - Parameter during invocation - invocation-type Event. S3 even invocation is an example. No response is sent back without additional business logic. Function handles retries and has a queue for requests. Failed events can be sent to
- Dead Letter Queue - Received only the even content
- Lambda Destination Services - Receives request/response context and payload.
Stream (Poll-based Model) - Lambda service runs a poller on our behalf. Event source mapping is needed to process items from Stream or Queue. Event source mapping needs to be done to link to the Lambda function. Can be done using CLI or SDK. aws lambda create-event-source-mapping --function name my-function --batch-size 500 --maximum-batching-window-in-seconds 5 --starting-position LATEST --event-source-arn arn:aws:danamodb:us-east-2:.... Integration of:
- DynamoDB Streams
- Kinesis Streams
- Amazon SQS

invocation by AWS Services

When invoked by AWS Services, the service decides on the invocation model. This can't be configured.

Components

Lambda application. Lambda function. The Lambda function is compiled of your own code that you want Lambda to invoke as per defined triggers.
Event source. Event sources are AWS services that can be used to trigger your Lambda functions, or put another way, they produce the events that your Lambda function essentially responds to by invoking it.
The Trigger is essentially an operation from an event source that causes the function to invoke. So essentially triggering that function. For example, an Amazon S3 put request could be used as a trigger.
Downstream Resources. These are the resources that are required during the execution of your Lambda function. For example, your function might call upon accessing a specific SNS topic, or a particular SQS queue. So they are not used as the source of the trigger, but instead they are the resources to be used to execute the code within the function upon invocation.
Log streams. In an effort to help you identify issues and troubleshoot issues with your Lambda function, you can add logging statements to help you identify if your code is operating as expected into a log stream.

Monitoring

Automatic monitoring of logs and metrics
Write custom logging statements to be included into the logs
Logs are part of the log streams in CloudWatch
Metrics are also sent to CloudWatch

Types of Metrics

Invocation metrics - Successful / failed invocations
Performance Metrics - Duration, Iterator Age for stream (ms) - How long Lambda too to reach a batch of records since last written to stream.
Concurrency Metrics - Concurrent execution counts

In CloudWatch each lambda function gets a Log Group. Log Streams in the Log Groups get the logs and will also include custom logging statements.

Cost

Amount of requests sent to functions
Charge based on the duration of the function run (nearest ms)
Charge based on provisioned memory

Code Specifics

boto3 Python SDK library
def lambda_handler (event, context) expected lambda entry point
`lambda_function expected file name.
Both the file name and the function handler can be updated in the runtime configuration

Configuration

Both the file name and the function handler can be updated in the runtime configuration
Memory settings
Ephemeral Storage
Timeout storage
IAM Roles (need access to CloudWatch logs as well)
Add Resource-based policies
Environmental Variables - Key, Value pairs.
VPC - by default run in VPC configured by AWS but can be run in our own VPC.
Subnets - Assigned to private/public subnets.
Concurrency - reserved (max number of instances, keeps this in reserve - can't be used by other functions - free) / provisioned (cold start initialization, initialized and warm - cost associated)

Deployment Package

Container images
zip file archives Options on console:
Author from scratch
Use Blueprint
Serverless App Repository from ThirdPart
Container Image

AWS Batch

Batch computing is primarily used in specialist use cases which require a vast amount of compute power across a cluster of compute resources to complete batch processing executing a series of jobs or tasks.

Components

Jobs - A job is classed as a unit of work that is to be run by AWS Batch. For example, this can be a Linux executable file, an application within an ECS cluster or a shell script.
Job definitions - These define specific parameters for the jobs themselves. They dictate how the job will run and with what configuration. Some examples of these may be how many vCPUs to use for the container, which data volume should be used, which IAM role should be used, allowing access for AWS Batch to communicate with other AWS services, and mount points. Can also define instance types and bid for spot instances.
Job queues - Jobs that are scheduled are placed into a job queue until they run. It's also possible to have multiple queues with different priorities if needed.
Job scheduling - The Job Scheduler takes control of when a job should be run and from which Compute Environment.
Compute Environments - These are the environments containing the compute resources to carry out the job. The environment can be defined as managed or unmanaged. Managed environments automates this using ECS. Unmanaged requires manual creation of ECS clusters.

Amazon Lightsail

Its a virtual private server (VPS) much like EC2. Simple, quick, easy to use and low cost point. Host small website and blogs etc. Multiple lightsail can be used and possible to connect to other infrastructure. Lightsail sits outside the management console and configuration is a single page which is a simple version of EC2. Pricing is per usage basis and is very simple to pick based on the requirements.

Compute Savings Plans

Compute Savings Plans provide the most flexibility and help to reduce your costs by up to 66%. These plans automatically apply to EC2 instance usage regardless of instance family, size, AZ, Region, OS or tenancy, and also apply to Fargate or Lambda usage.

EC2 Instance Savings Plans

EC2 Instance Savings Plans provide the lowest prices, offering savings up to 72% in exchange for commitment to usage of individual instance families in a Region (e.g. M5 usage in N. Virginia). This automatically reduces your cost on the selected instance family in that region regardless of AZ, size, OS or tenancy. EC2 Instance Savings Plans give you the flexibility to change your usage between instances within a family in that region. For example, you can move from c5.xlarge running Windows to c5.2xlarge running Linux and automatically benefit from the Savings Plan prices.

AS Serverless Application Model (SAM)

SAM is an open-source framework for building serverless applications in AWS. You can use any programming language that has a runtime currently supported by AWS Lambda. SAM provides a simplified way to define and deploy serverless resources, including functions, APIs, and databases using YAML templates. During the deployment process, SAM will transform these templates into CloudFormation. And from there, the CloudFormation service will provision the resources for your serverless applications.

SAM also includes a command-line interface, the SAM CLI, which provides an environment that allows you to build and test your SAM applications locally, including Lambda, DynamoDB, and API Gateway. You can even debug your Lambda functions locally if they’re written in Node.js, Java, Python, or Go.

Deployment Steps

The sam build command is used to build your application locally and transform your YAML SAM template into a CloudFormation template.
The sam package command will take your CloudFormation template and application code, zip it, and upload it to S3.
The sam deploy command will create and execute a CloudFormation change set, allowing you to deploy your application.