Amazon Red Shift

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 17

Case Study : Amazon shifting to Redshift from oracle

DWH
Oracle Data Warehouse
• Oracle Warehouse Builder which is used to build Oracle Data
Warehouse was built from the ground up in Oracle, it was first
released in January 2000.

• Amazon Used Oracle Data Warehouse for 14 Consecutive years


from 2004 to 2018.

• Huge share of amazon revenue was spent on this service, nearly


13% annually.
Why Amazon turning off it’s Oracle DWH
• Query Performance in Oracle Data warehouse compared to other parallel
execution databases is a little low.
• For Big Data Implementation, it's not very cost effective compared to
other big data solutions that are based on Hadoop systems and RedShift
BI Tools.
• “These [relational] databases are not cloud native. They are not good,
fundamental building blocks for database innovation, and are definitely
not [equipped] for really massive scale” -
• “Even in the past six months, Redshift has become 3.5 times faster and
multiple users can use it concurrently” -
Werner Vogels ,CTO amazon
• The outage highlights the challenges Amazon could face as it looks to
move completely off Oracle’s database by 2020.
• Oracle and Redshift are two different [database] technologies” that handle
“savepoints” differently.
• Savepoints are an important database tool for tracking and recovering
individual transactions. On Prime Day, an excessive number of savepoints
was created, and Amazon’s Redshift software wasn’t able to handle the
pressure, slowing down the overall database performance.
Amazon Redshift
• Amazon Redshift is an Internet hosting service and data
warehouse product which forms part of the larger cloud-
computing platform Amazon Web Services.
• Fast, simple, cost-effective data warehouse that can extend
queries to your data lake
• Amazon Redshift is a fully managed data warehouse service in
the cloud. Its datasets range from 100s of gigabytes to a
petabyte. The initial process to create a data warehouse is to
launch a set of compute resources called nodes, which are
organized into groups called cluster.
Benefits
• FASTER PERFORMANCE
• EASY TO SET UP, DEPLOY, AND MANAGE
• COST-EFFECTIVE
• SCALE QUICKLY TO MEET YOUR NEEDS
• QUERY YOUR DATA LAKE
• SECURE
Featured customers
• Amazon Redshift powers the largest number of data warehousing
deployments in the cloud for business, real-time, and predictive
analyses.
How it Works
Data Warehouse System Architecture
• Client applications
• Connections
• Clusters
• Leader node
• Compute nodes
• Node slices
• Over 270 operating Companies

• Operate in around 60 countries across the globe

• Highly Decentralized Model


Why RedShift ?
• Server Reduction
• Automated IT
• Business Efficiency
• Seamless Operability with Data Centers
BYJU’S USES AWS:
• BYJU’S Uses AWS to Deliver Cutting-Edge Content to 15 Million Students.

• Meeting the mobile app’s fast rate of growth required BYJU’S to find a more scalable and cost-effective
solution than its Heroku cloud platform.

• BYJU’S runs its website and mobile apps on Amazon Elastic Compute Cloud (Amazon EC2) instances. The
company uses Amazon Relational Database Service (Amazon RDS) for PostgreSQL as its primary database
service, and it stores presentations and other educational content in Amazon Simple Storage
Service (Amazon S3) buckets.

• For data analytics, BYJU’S takes advantage of the Amazon Redshift fully managed data warehouse to
analyse app and website user data through the company’s existing business-intelligence software tools.

• Using Amazon Redshift, BYJU’S can evaluate student feedback and capitalize on those insights to provide
a more personalized learning experience.
Benefits Realized after Migrating to AWS:
•Offers a complete learning experience that integrates classes, assessments, and personalized
assignments, along with in-depth analysis and recommendations.

•Scales to meet the demands of more than 15 million students globally.

•Takes advantage of newer technologies to create innovative new products.

•Uses deeper data analysis to personalize learning.


Pros of Redshift
• Exceptionally fast
• High Performance
• Horizontally Scalable
• Massive Storage capacity
• Attractive and transparent pricing
• SQL interface
• AWS ecosystem
Cons of Redshift
• Doesn’t enforce uniqueness
• Only S3, DynamoDB and Amazon EMR support for parallel
upload
• Requires a good understanding of Sort and Dist keys
• Can’t be used as live app database
• Data on Cloud
Conclusion :
• Amazon Redshift is an amazing solution for data warehousing.
• In case you choose to set up Redshift data warehouse, one of the
biggest hurdles you might have to cross is to seamlessly bring
data from your existing data sources into Redshift.
• It has some limitations but it is way ahead of the alternatives
like Bigquery and Snowflake

You might also like