Zhe Zhang’s Post

Building the future of AI infra at NVIDIA. Apache Software Foundation Member; Former Head of Open Source (Ray) + Head of Field Engineering @ Anyscale.

4mo Edited

I'm really excited to share this new blog on Amazon's Exabyte-Scale migration from #Spark to #Ray!

🐘 1.5 EiB of data processed in a quarter
🚀 82% better cost efficiency
💸 $120M in savings per year

OK, you might ask: isn't #BigData processing Apache Spark's bread and butter? 🤔

Well, this is a story about software abstractions. As Patrick Ames wrote in the blog, "... (Amazon engineers) had limited options to resolve performance issues due to Apache Spark successfully (and unfortunately in this case) abstracting away most of the low-level data processing details".

So, the takeaway: if you need more flexibility in data processing (e.g. you need GPUs, or work with #unstructured data like video), let's talk! https://lnkd.in/gesSyxHF to get started

https://lnkd.in/gfj2Xp_2

Amazon’s Exabyte-Scale Migration from Apache Spark to Ray on Amazon EC2 | Amazon Web Services

aws.amazon.com

5 Comments

Max Barker

Hire FAANG talent on Discord 🕹️ | Trusted by top VC backed startups | Send me a DM for access 👋

4mo

https://discord.gg/learnmutiny

1 Reaction

Leo Liang

Building Something New | Venture Partner @ Sancus | Plumber in Data, AI and Blockchain

4mo

The day eventually comes - congrats Anyscale team!

2 Reactions

Akshay Verma

#python #dataops #mlops #opensource

4mo

Really helpful!

Malathi Sankar

Director | Data/ML/Search Engineering and Platform

4mo

Fascinating read. Thanks for sharing!

2 Reactions

Pritam Pan

Senior Data Engineer@Pinterest📌 || AWS || Spark || Iceberg || Airflow || Ex-Groupon

4mo

This is another excellent use case for Ray beyond machine learning. Fascinating read!

2 Reactions


