Casper Claesen

Hasselt, Flemish Region, Belgium
747 followers 500+ connections

About

Multicloud certified data engineer (AWS, Azure and Google) who loves to work with…

Experience

  • Devoteam G Cloud
  • -

    Kontich, Flanders, Belgium

  • -

    Brussels, Brussels-Capital Region, Belgium

Education

  • KU Leuven

    -

    Specialization: software engineering

    Thesis: Configurable, Fine-grained Data Replication for Spike-driven Workloads with Hotspot Objects in NoSQL Storage Systems

    Paper: A YCSB Workload for Benchmarking Hotspot Object Behaviour in NoSQL Databases,
    at the 47th International Conference on Very Large Data Bases, Copenhagen, Denmark, August 16-20, 2021 (https://vldb.org/2021/ & http://tpc.org/tpctc/tpctc2021/default5.asp)

  • -

Publications

  • A YCSB Workload for Benchmarking Hotspot Object Behaviour in NoSQL Databases

    Springer, Cham

    Many contemporary applications have to deal with unexpected spikes or unforeseen peaks in demand for specific data objects, so-called hotspot objects. For example, in social networks, specific media items can go viral quickly and unexpectedly, so properly provisioning for such behavior is not trivial.

    NoSQL databases are specifically designed for enhanced scalability, high availability, and elasticity to deal with increasing data volumes. Although existing performance benchmarking systems such as the Yahoo! Cloud Serving Benchmark (YCSB) provide support to test the performance properties of different databases under identical workloads, they lack support for testing how well these databases can cope with the above-mentioned unexpected hotspot object behaviour.

    To address this shortcoming, we present the design and implementation of a new YCSB workload that is rooted in a formal characterization of hotspot-based spikes. The proposed workload implements the Pitman-Yor distribution and is configurable through a number of parameters, such as spike probability and data locality. As such, it allows for more extensive experimental validation of database systems.

    Our functional validation illustrates how the workload can be used to effectively stress-test different types of databases, and we present comparative results of benchmarking two popular NoSQL databases, Cassandra and MongoDB, in terms of their response to spiked workloads.
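
    As a rough illustration of the Pitman-Yor distribution mentioned in the abstract, the sketch below generates a stream of request keys using the standard Chinese-restaurant construction of the Pitman-Yor process. It is not the paper's YCSB workload: the function name `pitman_yor_keys` and the parameters `discount` and `concentration` are assumptions made for this example, and the workload's spike probability and data locality settings are not modeled.

```python
import random


def pitman_yor_keys(num_requests, discount=0.5, concentration=1.0, seed=0):
    """Return a list of integer keys drawn via a Pitman-Yor process.

    Hypothetical helper for illustration only. Each request either
    introduces a brand-new key or re-requests an existing key with
    probability proportional to (times requested so far - discount),
    so a few keys accumulate most of the traffic.
    """
    if not 0 <= discount < 1:
        raise ValueError("discount must be in [0, 1)")
    if concentration <= -discount:
        raise ValueError("concentration must be greater than -discount")

    rng = random.Random(seed)
    counts = []  # counts[k] = number of requests seen so far for key k
    stream = []

    for n in range(num_requests):
        if not counts:
            is_new = True  # the very first request always creates key 0
        else:
            # Probability that request n + 1 goes to a previously unseen key.
            p_new = (concentration + discount * len(counts)) / (n + concentration)
            is_new = rng.random() < p_new

        if is_new:
            counts.append(1)
            key = len(counts) - 1
        else:
            # Existing key k is chosen with weight counts[k] - discount.
            weights = [c - discount for c in counts]
            key = rng.choices(range(len(counts)), weights=weights)[0]
            counts[key] += 1

        stream.append(key)

    return stream


if __name__ == "__main__":
    from collections import Counter

    requests = pitman_yor_keys(10_000)
    # Show the five most frequently requested keys and their hit counts.
    print(Counter(requests).most_common(5))
```

    A larger `discount` produces a heavier tail of rarely requested keys, while a larger `concentration` makes new keys appear more often early in the stream.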
