Friday, October 15, 2021

Module 5 - Storage and Database

Learning objectives

In this module, you will learn how to:

Summarize the basic concept of storage and databases.
Describe the benefits of Amazon Elastic Block Store (Amazon EBS).
Describe the benefits of Amazon Simple Storage Service (Amazon S3).
Describe the benefits of Amazon Elastic File System (Amazon EFS).
Summarize various storage solutions.
Describe the benefits of Amazon Relational Database Service (Amazon RDS).
Describe the benefits of Amazon DynamoDB.
Summarize various database services.


Hypervisor 
AWS Host  -  Amazon EC2 Instance (Virtual Machine) => EBS Volume Data
Do not create  

EBS - Elastic Block Store - Virtual hard drive attached to an EC2 instance
Size / type / configuration
Without EBS, any data you put on the EC2 instance itself (instance store) is lost when you stop the instance
Configure the volume and then attach it to the EC2 instance
Snapshots: incremental backups
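
As a rough illustration of what "incremental" means for snapshots, the sketch below keeps only the blocks that changed since the previous snapshot. It is a toy model in plain Python, not the EBS API; the block contents and the `take_snapshot` helper are made up for the example.

```python
def take_snapshot(volume_blocks, previous_state):
    """Return only the blocks that differ from the previously saved state."""
    return {
        index: data
        for index, data in volume_blocks.items()
        if previous_state.get(index) != data
    }


# Hypothetical volume: block index -> block contents.
volume = {0: "boot", 1: "logs-v1", 2: "db-v1"}

# The first snapshot has nothing to compare against, so it copies every block.
first_snapshot = take_snapshot(volume, previous_state={})
state_after_first = dict(volume)

# Only one block changes before the second snapshot.
volume[1] = "logs-v2"
second_snapshot = take_snapshot(volume, previous_state=state_after_first)

print(len(first_snapshot), "blocks in first snapshot")
print(len(second_snapshot), "block in second snapshot")
```

The second snapshot is smaller because it stores only the changed block, which is the idea behind incremental backups.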


Storage 

EC2 - Compute   EC2 Instance (Virtual Machine)
S3  - Storage   (Simple Storage Service) 
      - EBS - Amazon Elastic Block Store
EFS - (Amazon Elastic File System)


3 Types of Storage: EBS, S3, EFS

EBS - Elastic Block Store - It's a hard drive 
  • Volumes attach to EC2 instances
  • Availability Zone-level resource
  • Must be in the same Availability Zone (AZ) as the EC2 instance to attach to it
  • Volumes do not automatically scale
  • Sized up to 16 TiB
  • Can survive termination of their EC2 instance 
  • Solid state (SSD) by default
  • HDD option
  • S3 is object storage / EBS is block storage
  • Read/write: changes update only the affected blocks
  • An Amazon EBS volume stores data in a single Availability Zone. 
  • To attach an Amazon EC2 instance to an EBS volume, both the Amazon EC2 instance and the EBS volume must reside within the same Availability Zone.
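
The same-AZ rule for attaching a volume can be pictured with a small check. This is only a sketch in plain Python; the `Resource` class, the resource names, and the AZ names are assumptions for illustration, not AWS SDK objects.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resource:
    """Hypothetical stand-in for an EC2 instance or an EBS volume."""
    name: str
    availability_zone: str


def can_attach(instance: Resource, volume: Resource) -> bool:
    """An EBS volume can attach only to an instance in the same AZ."""
    return instance.availability_zone == volume.availability_zone


web_server = Resource("web-server", "us-east-1a")
same_az_volume = Resource("data-volume", "us-east-1a")
other_az_volume = Resource("backup-volume", "us-east-1b")

print(can_attach(web_server, same_az_volume))
print(can_attach(web_server, other_az_volume))
```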


S3 Simple Storage Service
S3 - Bucket 
  • Regionally distributed 
  • Data is stored across multiple facilities (at least 3)
  • Object-level storage
  • Provides unlimited storage of any type of data
  • Provides 99.999999999% (11 9's) of data durability
  • Maximum size of a single uploaded object is 5 TB
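
To get a feel for S3's durability figure of 99.999999999% (eleven 9's), the arithmetic below treats it as a yearly per-object figure and works out the expected loss for a made-up number of stored objects. The object count is an assumption, and reading 5 TB as 5 × 1024^4 bytes is also an assumption made for the size check.

```python
from fractions import Fraction

# Eleven 9's of durability, kept as an exact fraction to avoid float rounding.
DURABILITY = Fraction(99_999_999_999, 100_000_000_000)
annual_loss_rate = 1 - DURABILITY          # 1 in 100,000,000,000 per object

objects_stored = 10_000_000                 # hypothetical bucket contents
expected_losses_per_year = objects_stored * annual_loss_rate
years_per_expected_loss = 1 / expected_losses_per_year

# Assumption: 5 TB interpreted as 5 * 1024**4 bytes.
MAX_OBJECT_BYTES = 5 * 1024**4


def fits_in_one_object(size_bytes: int) -> bool:
    """True if the data is within the 5 TB single-object limit from the notes."""
    return size_bytes <= MAX_OBJECT_BYTES


print("Expected years per lost object:", years_per_expected_loss)
print("1 GiB fits in one object:", fits_in_one_object(1024**3))
```

Under these assumptions, storing ten million objects corresponds to an expected loss of one object every 10,000 years.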


Allows you to store and retrieve an unlimited amount of data
Store data as objects, store objects in buckets
Version objects
Multiple buckets / Unlimited storage
Web enabled
Offers cost savings
Serverless 


  • S3 Standard - 99.999999999% (11 9's) durability, static website hosting

  • S3 Standard-IA (Infrequent Access) - Backups, long-term storage

  • S3 Glacier - Audit data, WORM (Write Once/Read Many)
    • Lifecycle policies - move data between storage classes
    • Long-term analytic data
    • Active archive 
  • S3 Intelligent-Tiering
    • Best for data with unpredictable access patterns
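
One way to remember the storage classes above is as a lookup from access pattern to class. The sketch below is a study aid in plain Python, not an AWS API; the pattern labels ("frequent", "infrequent", "archive", "unpredictable") are assumptions chosen for the example.

```python
# Hypothetical access-pattern labels mapped to the classes listed in the notes.
STORAGE_CLASS_BY_ACCESS_PATTERN = {
    "frequent": "S3 Standard",
    "infrequent": "S3 Standard-IA",
    "archive": "S3 Glacier",
    "unpredictable": "S3 Intelligent-Tiering",
}


def choose_storage_class(access_pattern: str) -> str:
    """Return the storage class the notes associate with an access pattern."""
    try:
        return STORAGE_CLASS_BY_ACCESS_PATTERN[access_pattern]
    except KeyError:
        raise ValueError(f"unknown access pattern: {access_pattern!r}") from None


print(choose_storage_class("infrequent"))
print(choose_storage_class("unpredictable"))
```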

EFS - (Amazon Elastic File System) - Managed
  • Multiple instances reading and writing simultaneously 
  • Linux file system
  • Regional Resource
  • Automatically Scales
  • Amazon EFS is a regional service. It stores data in and across multiple Availability Zones. 
  • The duplicate storage enables you to access data concurrently from all the Availability Zones in the Region where a file system is located. Additionally, on-premises servers can access Amazon EFS using AWS Direct Connect.

Question

You want to store data that is infrequently accessed but must be immediately available when needed. Which Amazon S3 storage class should you use?
S3 Standard-IA


Database

Tech Overview - DB Services

 **Probably just know DynamoDB, RDS, Aurora, Redshift 

  • Aurora is highly available and durable; a cluster keeps 6 copies of the data across 3 AZs (more expensive than RDS)

  • Aurora Serverless - much less expensive than Aurora (runs on an as-needed basis)
    • Good for development or infrequently used apps

  • Neptune - Managed Graph DB
  • Redshift - Columnar DB, petabyte-scale data warehouse (1,000 TB = 1 PB)
    • Instead of reading via rows, it reads via columns
    • Good for working with large amounts of data for reports, analytics
    • Handles PBs of data!!!
  • ElastiCache - Redis or Memcached in-memory data store
    • For caching
  • Caching
    • A cache is an area of a computer’s memory devoted to temporarily storing recently used information. The content, which can include HTML pages, images, files and Web objects, is kept close to where it is needed (for example, on the local hard drive for a web browser) so that the user can access it faster, which improves the efficiency of the computer and its overall performance.
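
The Redshift bullets in the list above say that a columnar database reads via columns instead of rows. The sketch below shows the difference on a tiny made-up table in plain Python; it is not Redshift code, and the table contents are assumptions for illustration.

```python
# Row layout: each record holds every field together.
rows = [
    {"order_id": 1, "region": "east", "amount": 120},
    {"order_id": 2, "region": "west", "amount": 80},
    {"order_id": 3, "region": "east", "amount": 200},
]

# Columnar layout: one list per column, built from the same data.
columns = {name: [row[name] for row in rows] for name in rows[0]}

# Summing from rows walks through every full record.
total_from_rows = sum(row["amount"] for row in rows)

# Summing from columns reads only the one column the report needs.
total_from_columns = sum(columns["amount"])

print(total_from_rows, total_from_columns)
```

For reports and analytics that aggregate a few columns over many records, reading only those columns is the reason a columnar layout helps.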





AWS Foundation Services- Database

RDS - Managed relational database in the AWS Cloud that you can launch in minutes with a few clicks
Automatic patching
Backups
Redundancy
Failover
Disaster recovery 
Storing data in an Amazon Aurora database
Using SQL to organize data
Automatic high availability recovery provided
Customer ownership of data
Customer ownership of schema 
Customer control of network
AWS supported database engines
- MySQL
- PostgreSQL
- Oracle
- Microsoft SQL Server

Lift and shift


Aurora - Fully managed MySQL- and PostgreSQL-compatible relational database with up to 5x the performance of standard MySQL and enterprise-level features
- MySQL
- PostgreSQL
- 1/10th the cost of commercial databases
- Data replication 
- Up to 15 read replicas
- continuous backup to Amazon S3
- Point in time recovery

DynamoDB - A managed NoSQL database offering extremely fast performance, elastic scalability and reliability
Serverless database 
Non-relational (NoSQL) database
Purpose built
Milliseconds response time
Fully managed
Highly scalable
Key-value
Massive throughput capabilities
PB size potential
Granular API access

DynamoDB is a serverless database, which means that you do not have to provision, patch, or manage servers.  You also do not have to install, maintain, or operate software.
Nonrelational databases are sometimes referred to as “NoSQL databases” because they use structures other than rows and columns to organize data.
One type of structural approach for nonrelational databases is key-value pairs. With key-value pairs, data is organized into items (keys), and items have attributes (values). You can think of attributes as being different features of your data.
As the size of your database shrinks or grows, DynamoDB automatically scales to adjust for changes in capacity while maintaining consistent performance. 
This makes it a suitable choice for use cases that require high performance while scaling.
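
The key-value idea described above (items identified by keys, each with its own attributes) can be sketched with a plain Python dictionary. This is an in-memory toy, not the DynamoDB API; `save_item`, `load_item`, and the sample users are hypothetical names for illustration.

```python
# Hypothetical in-memory "table": item key -> attributes of that item.
table = {}


def save_item(table, key, **attributes):
    """Store an item under its key; each item may carry different attributes."""
    table[key] = dict(attributes)


def load_item(table, key):
    """Look an item up directly by key; returns None if the key is absent."""
    return table.get(key)


save_item(table, "user-1", name="Ana", plan="free")
save_item(table, "user-2", name="Ben", plan="pro", country="CA")

print(load_item(table, "user-2"))
print(load_item(table, "user-3"))
```

Note that the two items do not share the same set of attributes, which is the kind of flexibility a rows-and-columns schema would not allow without changing the table definition.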

Redshift - Fast, fully managed, petabyte-scale data warehouse at less than a tenth of the cost of traditional solutions.



AWS Database Migration Service  (DMS)
- Source database remains fully operational during migration
- Downtime is minimized for applications that rely on that database
- The source and target databases don't have to be of the same type
Homogeneous (same type)
- Schema structures
- Data types
- Database code 
- All compatible between source and target
Source
- On-premises
- Amazon EC2
- Amazon RDS
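
The distinction between same-type and different-type migrations can be sketched as a small classifier. The course material calls the different-type case a heterogeneous migration, in which schema structures, data types, and database code need converting before the data moves. The function below is a plain-Python study aid, not part of AWS DMS, and its name and return shape are assumptions.

```python
def describe_migration(source_engine: str, target_engine: str) -> dict:
    """Classify a migration by comparing source and target engine names."""
    same_type = source_engine.strip().lower() == target_engine.strip().lower()
    return {
        # Same engine on both sides: schema, data types, and code are compatible.
        "type": "homogeneous" if same_type else "heterogeneous",
        # Different engines: a conversion step comes before the migration itself.
        "needs_conversion_first": not same_type,
    }


print(describe_migration("MySQL", "MySQL"))
print(describe_migration("Oracle", "PostgreSQL"))
```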

Additional database services
Amazon DocumentDB
Amazon Neptune
Amazon Quantum Ledger Database (Amazon QLDB)
Amazon Managed Blockchain
Amazon ElastiCache
Amazon DynamoDB Accelerator

Amazon DocumentDB is a document database service that supports MongoDB workloads. (MongoDB is a document database program.)

Amazon Neptune is a graph database service.
You can use Amazon Neptune to build and run applications that work with highly connected datasets, such as recommendation engines, fraud detection, and knowledge graphs. 

Amazon Quantum Ledger Database (Amazon QLDB) is a ledger database service. 
You can use Amazon QLDB to review a complete history of all the changes that have been made to your application data.

Amazon Managed Blockchain is a service that you can use to create and manage blockchain networks with open-source frameworks. 
Blockchain is a distributed ledger system that lets multiple parties run transactions and share data without a central authority.

Amazon ElastiCache is a service that adds caching layers on top of your databases to help improve the read times of common requests. 
It supports two types of data stores: Redis and Memcached.
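
A caching layer in front of a database can be sketched as follows: check the cache first, and go to the database only on a miss. This is a minimal in-memory model in plain Python, not the ElastiCache, Redis, or Memcached API; the `CachedReader` class and the sample data are assumptions for illustration.

```python
class CachedReader:
    """Toy read path: look in the cache first, fall back to the database."""

    def __init__(self, database):
        self.database = database      # hypothetical slow data store (a dict here)
        self.cache = {}               # fast in-memory copy of recent reads
        self.database_reads = 0       # counts how often the database is hit

    def get(self, key):
        if key in self.cache:
            return self.cache[key]    # cache hit: no database read needed
        self.database_reads += 1      # cache miss: read from the database
        value = self.database[key]
        self.cache[key] = value       # remember the value for the next request
        return value


reader = CachedReader({"product-1": "keyboard"})
first_read = reader.get("product-1")
second_read = reader.get("product-1")

print(first_read, second_read, reader.database_reads)
```

The repeated request is served from the cache, so the database is read only once, which is how a cache improves the read times of common requests.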

Amazon DynamoDB Accelerator (DAX) is an in-memory cache for DynamoDB. 
It helps improve response times from single-digit milliseconds to microseconds.



Database Migration Service - migrate your database to the cloud easily. Inexpensive, with near-zero downtime

https://www.youtube.com/watch?v=YVe9amljgaw


QUESTION
What are the scenarios in which you should use Amazon Relational Database Service (Amazon RDS)? (Select TWO.)
Using SQL to organize data
Storing data in an Amazon Aurora database


Amazon Redshift
Data warehouse as a service
Up to 10x higher performance than traditional databases.

Other AWS DMS use cases:
Development and test database migrations

Continuous database replication
Database consolidation 


Question 1

1. Which Amazon S3 storage classes are optimized for archival data? (Select TWO.)
S3 Glacier
S3 Glacier Deep Archive

Note:

S3 Standard is a storage class that is ideal for frequently accessed data, not archival data.

S3 Intelligent-Tiering monitors access patterns of objects and automatically moves them between the S3 Standard and S3 Standard-IA storage classes. It is not designed for archival data.

S3 Standard-IA is ideal for data that is infrequently accessed but requires high availability when needed.


2. Which statement or statements are TRUE about Amazon EBS volumes and Amazon EFS file systems?

EBS volumes store data within a single Availability Zone. Amazon EFS file systems store data across multiple Availability Zones.


NOTE:

An EBS volume must be located in the same Availability Zone as the Amazon EC2 instance to which it is attached.

Data in an Amazon EFS file system can be accessed concurrently from all the Availability Zones in the Region where the file system is located.


3. You want to store data in an object storage service. Which AWS service is best for this type of storage?
Amazon Simple Storage Service (Amazon S3)


NOTE

Amazon Managed Blockchain is a service that you can use to create and manage blockchain networks with open-source frameworks. Blockchain is a distributed ledger system that lets multiple parties run transactions and share data without a central authority.

Amazon Elastic File System (Amazon EFS) is a scalable file system used with AWS Cloud services and on-premises resources. It does not store data as object storage.

Amazon Elastic Block Store (Amazon EBS) is a service that provides block-level storage volumes that you can use with Amazon EC2 instances.


4. Which statement best describes Amazon DynamoDB?
A serverless key-value database service


Note:

A service that enables you to run relational databases in the AWS Cloud describes Amazon Relational Database Service (Amazon RDS).

A service that you can use to migrate relational databases, nonrelational databases, and other types of data stores describes AWS Database Migration Service (AWS DMS).

An enterprise-class relational database describes Amazon Aurora.


5. Which service is used to query and analyze data across a data warehouse?
Amazon Redshift


Note:

Amazon Neptune is a graph database service. You can use Amazon Neptune to build and run applications that work with highly connected datasets, such as recommendation engines, fraud detection, and knowledge graphs.

Amazon DocumentDB is a document database service that supports MongoDB workloads.

Amazon ElastiCache is a service that adds caching layers on top of your databases to help improve the read times of common requests.


In Module 5, you learned about the following concepts:
Amazon EC2 instance store and Amazon EBS
Amazon S3
Amazon EFS
Relational databases and Amazon RDS
Nonrelational databases and DynamoDB
Amazon Redshift
AWS DMS
Additional database services and accelerator
