Jon Haddad’s Post

Apache Cassandra Committer and Consultant

7mo

Really excited to be working with my friend and fellow committer Jordan West on a performance patch for Apache Cassandra that's already showing a MASSIVE 6x reduction in IOPS usage during compaction which makes a huge impact on EBS. In our initial testing we saw a 3x improvement to compaction throughput, just on one thread, and multiple threads will deliver an even bigger improvement. The same patch will reduce system overhead on *any* deployment, but will have the biggest impact when using disks that use quotas on IOPs. The impact is you'll be able to run WAY denser nodes on EBS for the same cost, more predictable random read performance, and we're not even finished. This same patch should deliver improvements to range scans (hello spark jobs!) and repairs. You can follow along here: https://lnkd.in/d5_hcyJQ

Improve disk access patterns during compaction and streaming

issues.apache.org

2 Comments

Julien Laurenceau

7mo

so refreshing to share that... contrast with a lot of project that almost quit the performance optimization field. Performance is hard but it matters

1 Reaction

To view or add a comment, sign in

More Relevant Posts

DoubleCloud

6,505 followers
5mo
Report this post
Kafka has been there for quite some time. Everybody and their uncle uses it for data streaming and processing. You know it can perform, and we know it. But how do you get the most bang for your buck out of it? We rigorously tested Kafka performance, comparing different environments to find the most cost-effective setup in AWS or GCP. We ran dozens of tests to find which compute generation, CPU, and JVM would yield the best price-performance ratio for Kafka. Specifically, we measured how many millions of rows we can ingest into the Kafka broker per one cent. Read more in our blog (spoiler: ARM rocks): https://lnkd.in/eJ7aRpbw
Like Comment
To view or add a comment, sign in
Stefano Fago

Software Solutions Architect R&D
6mo
Report this post
https://lnkd.in/dHTYqJ6T [ main site: https://lnkd.in/d3mcNX-C ] << ...WarpStream is an Apache Kafka® compatible data streaming platform built directly on top of object storage: no inter-AZ bandwidth costs, no disks to manage, and infinitely scalable, all within your VPC... >>

Zero Disks is Better (for Kafka)

warpstream.com
Like Comment
To view or add a comment, sign in
Geraldo Netto

Innovation | Strategy | Software Engineering
9mo
Report this post
"Our cloud database stores billions of files in object storage. With petabytes of data being queried every day, we started bumping into our cloud storage providers' rate-limits, resulting in decreased reliability & performance. We had large memcached clusters in place to absorb & deamplify reads to object storage - but these could hold at most a few hours’ worth of data, and constantly churned due to the excessive volume of data passing through. The conclusion we came to was: we needed much larger caches, ideally without inflating our cloud costs and adding operational complexity. I'll show how we managed to increase our cache size by 45x and reduce our costs by using a little-known feature of memcached called "extstore". Extstore enables offloading of objects to SSDs which can't fit into memory. In this talk I’ll be covering how we use it, how to monitor it, why we chose it, and other considerations. I'll also cover how we use ephemeral storage provided by public cloud vendors in the form of physically-attached SSDs with incredibly high throughput, low latency, and best of all - low cost! This talk is also a story of how products evolve, and how we as a team are buying time in the short term to keep up our reliability while we evolve our storage design in the medium-long term." https://lnkd.in/eT8qbGcm

P99 CONF 2023 | How Grafana Labs Scaled Up Their Memcached 42x & Cut Costs Too by Danny Kopping

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Michael Sullivan

Web Developer | Digital Solutions | Transforming Ideas into Code
9mo
Report this post
Just published part two to this series, explore the magic of serverless computing as we link Lambda functions to the database!

Cloud Chronicles Part 2: Navigating AWS — Lambda Insights

link.medium.com
Like Comment
To view or add a comment, sign in
Kai Waehner Kai Waehner is an Influencer

Global Field CTO | Author | International Speaker | Follow me with Data in Motion
7mo
Report this post
What is #ApacheFlink? How does it complement #ApacheKafka? Why is it different from #KafkaStreams? How does the #cloud help with #streamprocessing? => I discuss all these questions in this lightboard video in under ten minutes... https://lnkd.in/dsihChH2

What is Apache Flink®?

https://www.youtube.com/

1 Comment
Like Comment
To view or add a comment, sign in
Santwana Trivedi

Helping businesses set their data in motion
2mo
Report this post
What is #ApacheFlink? How does it complement #ApacheKafka? Why is it different from #KafkaStreams? How does the #cloud help with #streamprocessing? => Watch Kai Waehner discuss about it in under 10 minutes in this lightboard video https://lnkd.in/gEP39AZt

What is Apache Flink®?

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Manikandan S

AI Explorer | Multi-Cloud Specialist | Cloud Architect | Cloud-Native | Thoughtful Leader | FinOps
2mo
Report this post
𝐃𝐨𝐧’𝐭 𝐋𝐞𝐭 𝐘𝐨𝐮𝐫 𝐄𝐊𝐒 𝐃𝐨𝐬𝐚 𝐁𝐮𝐫𝐧! 🍛⁣ ⁣ Running outdated EKS clusters can scorch your AWS budget faster than a dosa left on the tawa. AWS charges extra for older versions—up to $𝟎.𝟔𝟎/𝐡𝐨𝐮𝐫 𝐩𝐞𝐫 𝐜𝐥𝐮𝐬𝐭𝐞𝐫 if you delay upgrades. Avoiding these hidden costs is as easy as flipping your cluster on time. Want to know how to avoid paying extra for “burnt” clusters? ⁣ #awseks #costoptimisation #kubernetes #clusterupgrades ⁣ Check out the full story here: EKS Upgrades:

EKS Upgrades: Avoiding Extra Costs — Don’t Let Your Dosa Burn

link.medium.com
Like Comment
To view or add a comment, sign in
Kai Waehner Kai Waehner is an Influencer

Global Field CTO | Author | International Speaker | Follow me with Data in Motion
2mo
Report this post
What is #ApacheFlink? How does it complement #ApacheKafka? Why is it different from #KafkaStreams? How does the #cloud help with #streamprocessing? => I discuss all these questions in this lightboard video in under ten minutes... https://lnkd.in/dsihChH2

What is Apache Flink®?

https://www.youtube.com/

5 Comments
Like Comment
To view or add a comment, sign in
Jean-Noel Moyne
9mo
Report this post
NATS JetStream is not just streaming but also a proper KV data store with many features beyond just Put, Get and Delete, and can hold it's own when compared to the reference that is Redis in many use cases. Another great video by Jeremy Saenz going through all the KV features currently available (and some of the upcoming 2.11 features) in NATS.

Jeremy Saenz

Building magical software
9mo

I've seen so many teams replace Redis with NATS.io JetStream KV, and now it's your turn to find out why it's so special. Check out our latest video! https://lnkd.in/gAaiDbhe

JetStream KV: A fascinating alternative to Redis...

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Erik Herz

Chief Executive Officer at Vivoh
1mo
Report this post
With the goal of reducing latency, I ordered an AWS Link encoder and will be testing ingest into AWS MediaConnect via both Zixi from this Link encoder and RTP with FEC from FFmpeg (./ffmpeg -re -i -c copy -map 0 -f rtp_mpegts -fec prompeg=l=5:d=20 rtp://:5000). I would love to get any feedback from people who have run similar tests with any configuration recommendations. Which one will have lower latency?
9 Comments
Like Comment
To view or add a comment, sign in