Deduplicating Virtual Machine Checkpoints for Distributed System Simulation

  • Typ:Bachelorarbeit
  • Datum:02.11.2013
  • Betreuung:

    Prof. Dr. Frank Bellosa, Marc Rittinghaus

  • Bearbeitung:Nikolai Baudis
  • Links:PDF
  • Abstract:

    Full system simulation suffers – due to its increased complexity – from a much lower execution speed than hardware-accelerated virtualization, resulting in a slowdown of over 1000. To close this performance gap, SimuBoost aims at increasing full system simulation's execution speed by performing parallelized distributed simulations, which are started using checkpoints from a hardware-accelerated virtualization. To reach an optimal degree of parallelization, checkpoints have to be taken at an interval of 1–2 s, which minimzes the overhead of the checkpointing.

    In this thesis, a checkpointing mechanism that meets the requirements for SimuBoost – minimized data amount and downtime – is proposed. To meet these requirements, the checkpointing mechanism performs incremental checkpointing of virtualized systems in QEMU and deduplicates both memory and block device data via a hash-based cache. The checkpoints are saved in a NoSQL database for a simplified distribution.

    The evaluation shows that the hash-based cache greatly decreases the data amount, especially by reusing data that was already checkpointed in the past: Up to 42 % of memory pages can be saved through deduplication. However, the evaluation also shows that the overhead caused by database queries is too high and that it is responsible for 40–60 % of the total downtime. Therefore, this approach needs to be optimized by combining it with CoW to reach a suitable downtime.


     author = {Nikolai Baudis},
     title = {Deduplicating Virtual Machine Checkpoints for Distributed System Simulation},
     type = {Bachelor Thesis},
     school = {System Architecture Group, Karlsruhe Institute of Technology (KIT), Germany},
     month = nov # "2",
     year = 2013,
     note = {\url{}}