SlideShare a Scribd company logo
Understanding
Storage I/O Under
Load
Avi Kivity, CTO
Pavel Emelyanov, Principal Engineer
Presenter
2
Avi Kivity
● Original maintainer of Linux KVM - Kernel-based
Virtual Machine
● Co-maintainer of Seastar, ScyllaDB
● Co-founder of ScyllaDB
Presenter
3
Pavel Emelyanov
● Ex Linux kernel hacker
● Founder of the CRIU project
● Co-maintainer of Seastar & ScyllaDB
4
+ For distributed, data-intensive apps that require high
performance and low latency
+ 400+ users worldwide
+ Results
+ Comcast: Reduced P99 latencies by 95%
+ FireEye: 1500% improvement in throughput
+ Discord: Reduced C* nodes from ~140 to 6
+ iFood: 9X cost reduction vs. DynamoDB
+ Open Source, Enterprise and Cloud options
+ Fully compatible with Apache Cassandra and Amazon
DynamoDB
About ScyllaDB
1ms <1ms
10ms
1M
10M
ScyllaDB Universe of 400+ Users
400+ Companies Use ScyllaDB
Seamless experiences
across content + devices
Fast computation of flight
pricing
Corporate fleet
management
Real-time analytics
2,000,000 SKU -commerce
management
Real-time location tracking
for friends/family
Video recommendation
management
IoT for industrial
machines
Synchronize browser
properties for millions
Threat intelligence service
using JanusGraph
Real time fraud detection
across 6M
transactions/day
Uber scale, mission critical
chat & messaging app
5
Network security threat
detection
Power ~50M X1 DVRs with
billions of reqs/day
Precision healthcare via
Edison AI
Inventory hub for retail
operations
Property listings and
updates
Unified ML feature store
across the business
Cryptocurrency exchange
app
Geography-based
recommendations
Distributed storage for
distributed ledger tech
Global operations- Avon,
Body Shop + more
Predictable performance
for on sale surges
GPS-based exercise
tracking
Part I
Mixed Workloads
6
Current SSDs are Amazing
7
SSDs are Amazing
+ 6.4 GB/s read
+ 3.3 GB/s write
+ 1M read IOPS
+ 200k write IOPS
+ Often, several disks in a single server!
8
SSDs are Amazing, but not Magic
+ 6.4 GB/s read
+ OR 3.3 GB/s write
+ OR 1M read IOPS
+ OR 200k write IOPS
+ Or some kind of mix
+ But what kind of mix?!
9
+ Online transaction processing (OLTP)
+ There’s a real user at the other end
+ Maintenance workloads
+ Scaling out
+ Compaction
+ Backup
+ Analytics (OLAP)
+ Want to soak up free bandwidth, but not under a tight deadline
+ Multi-tenancy
+ Several OLTP and OLAP workloads on the same disk/data
Why mixed workloads?
10
Prevent one workload from dominating
and hurting another workload
The Challenge
11
Part II
Understanding Disks
12
Introducing Diskplorer
+ Tool to test disks at a variety of mixed workloads
+ Open source: https://github.com/scylladb/diskplorer
+ Python, fio, matplotlib
+ Fancy graphs
+ Hours of fun!
13
AWS i3en.3xlarge
14
AWS i3.3xlarge
15
AWS im4gn.4xlarge
16
GCP n2-standard-16, 8 load SSDs
17
GCP n2-standard-8, 2TB Persistent Disk
18
Clunky hard disk
19
Part III
Working with Disks
20
21
+ Equation for the “safety area” can be complex
+ Bandwidth and IOPS are not instantly measurable
Disk math troubles
Disk math (step 1)
22
23
+ Equation for the “safety area” can be complex
+ Bandwidth and IOPS are not instantly measurable
Disk math troubles
Token bucket algo (and its math)
24
Disk math (step 2)
25
Scheduler algo
26
Scheduler safety area
27
28
+ runtime – total CPU time consumed
+ virtual runtimes – total time normalized by entity “weight”
+ algorithm selects entity with smallest virtual runtime
+ long inactive entity “forgets” idle virtual runtime
CPU scheduler classes
Classes and shares
29
Scheduler real work
30
United States
2445 Faber St, Suite #200
Palo Alto, CA USA 94303
Israel
Maskit 4
Herzliya, Israel 4673304
www.scylladb.com
@scylladb
Thank You!
avi@scylladb.com
@avikivity

More Related Content

PDF
Oracle Client Failover - Under The Hood
Ludovico Caldara
 
PDF
Cilium - Fast IPv6 Container Networking with BPF and XDP
Thomas Graf
 
PDF
All about Zookeeper and ClickHouse Keeper.pdf
Altinity Ltd
 
PPTX
Click-Through Example for Flink’s KafkaConsumer Checkpointing
Robert Metzger
 
PDF
Red Hat Global File System (GFS)
Schubert Zhang
 
PDF
Introduction and Deep Dive Into Containerd
Kohei Tokunaga
 
PPTX
Materialized Views and Secondary Indexes in Scylla: They Are finally here!
ScyllaDB
 
PDF
PostgreSQL Deep Internal
EXEM
 
Oracle Client Failover - Under The Hood
Ludovico Caldara
 
Cilium - Fast IPv6 Container Networking with BPF and XDP
Thomas Graf
 
All about Zookeeper and ClickHouse Keeper.pdf
Altinity Ltd
 
Click-Through Example for Flink’s KafkaConsumer Checkpointing
Robert Metzger
 
Red Hat Global File System (GFS)
Schubert Zhang
 
Introduction and Deep Dive Into Containerd
Kohei Tokunaga
 
Materialized Views and Secondary Indexes in Scylla: They Are finally here!
ScyllaDB
 
PostgreSQL Deep Internal
EXEM
 

What's hot (20)

PPTX
eBPF Basics
Michael Kehoe
 
PDF
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
DataStax
 
PDF
美团技术团队 - KVM性能优化
美团点评技术团队
 
PPTX
The top 3 challenges running multi-tenant Flink at scale
Flink Forward
 
PDF
Apache Calcite (a tutorial given at BOSS '21)
Julian Hyde
 
PDF
patroni-based citrus high availability environment deployment
hyeongchae lee
 
PPTX
Kafka Tutorial - DevOps, Admin and Ops
Jean-Paul Azar
 
PDF
Cloud arch patterns
Corey Huinker
 
PPTX
쿠버네티스의 이해 #1
상욱 송
 
PDF
Replacing iptables with eBPF in Kubernetes with Cilium
Michal Rostecki
 
PPTX
Apache Arrow: In Theory, In Practice
Dremio Corporation
 
PDF
제3회난공불락 오픈소스 인프라세미나 - Pacemaker
Tommy Lee
 
PDF
NATS Streaming - an alternative to Apache Kafka?
Anton Zadorozhniy
 
PDF
QEMU Disk IO Which performs Better: Native or threads?
Pradeep Kumar
 
PDF
Understanding oracle rac internals part 2 - slides
Mohamed Farouk
 
PDF
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Spark Summit
 
PDF
왜 쿠버네티스는 systemd로 cgroup을 관리하려고 할까요
Jo Hoon
 
PDF
Docker Networking Deep Dive
Docker, Inc.
 
PDF
Hive join optimizations
Szehon Ho
 
PDF
Embedded Recipes 2018 - Finding sources of Latency In your system - Steven Ro...
Anne Nicolas
 
eBPF Basics
Michael Kehoe
 
Cassandra at Instagram 2016 (Dikang Gu, Facebook) | Cassandra Summit 2016
DataStax
 
美团技术团队 - KVM性能优化
美团点评技术团队
 
The top 3 challenges running multi-tenant Flink at scale
Flink Forward
 
Apache Calcite (a tutorial given at BOSS '21)
Julian Hyde
 
patroni-based citrus high availability environment deployment
hyeongchae lee
 
Kafka Tutorial - DevOps, Admin and Ops
Jean-Paul Azar
 
Cloud arch patterns
Corey Huinker
 
쿠버네티스의 이해 #1
상욱 송
 
Replacing iptables with eBPF in Kubernetes with Cilium
Michal Rostecki
 
Apache Arrow: In Theory, In Practice
Dremio Corporation
 
제3회난공불락 오픈소스 인프라세미나 - Pacemaker
Tommy Lee
 
NATS Streaming - an alternative to Apache Kafka?
Anton Zadorozhniy
 
QEMU Disk IO Which performs Better: Native or threads?
Pradeep Kumar
 
Understanding oracle rac internals part 2 - slides
Mohamed Farouk
 
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Spark Summit
 
왜 쿠버네티스는 systemd로 cgroup을 관리하려고 할까요
Jo Hoon
 
Docker Networking Deep Dive
Docker, Inc.
 
Hive join optimizations
Szehon Ho
 
Embedded Recipes 2018 - Finding sources of Latency In your system - Steven Ro...
Anne Nicolas
 
Ad

Similar to Understanding Storage I/O Under Load (20)

PDF
Different I/O Access Methods for Linux, What We Chose for ScyllaDB, and Why
ScyllaDB
 
PDF
Under The Hood Of A Shard-Per-Core Database Architecture
ScyllaDB
 
PDF
Learning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
ScyllaDB
 
PDF
Running a Cost-Effective DynamoDB-Compatible Database on Managed Kubernetes S...
DevOps.com
 
PDF
Using ScyllaDB for Extreme Scale Workloads
MarisaDelao3
 
PDF
Exploring Phantom Traffic Jams in Your Data Flows
ScyllaDB
 
PDF
What’s New in ScyllaDB Open Source 5.0
ScyllaDB
 
PDF
Transforming the Database: Critical Innovations for Performance at Scale
ScyllaDB
 
PPTX
Qnap iei partners_day_2016 1108
qnapivan
 
PPTX
Docker Introduction
Hao Fan
 
PDF
Running a DynamoDB-compatible Database on Managed Kubernetes Services
ScyllaDB
 
PPTX
QNAP NAS Training 2016
Fernando Barrientos
 
PPTX
Docker intro
spiddy
 
PDF
2016-JAN-28 -- High Performance Production Databases on Ceph
Ceph Community
 
PDF
InfiniBand for the enterprise
Anas Kanzoua
 
PDF
Build Low-Latency Applications in Rust on ScyllaDB
ScyllaDB
 
PDF
PyData Boston 2013
Travis Oliphant
 
PPTX
Building a 10x More Efficient Edge Platform
ScyllaDB
 
PDF
Docker Meetup San Francisco: Radical Agility with Docker & AWS
Volker Pilz
 
PDF
Build DynamoDB-Compatible Apps with Python
ScyllaDB
 
Different I/O Access Methods for Linux, What We Chose for ScyllaDB, and Why
ScyllaDB
 
Under The Hood Of A Shard-Per-Core Database Architecture
ScyllaDB
 
Learning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
ScyllaDB
 
Running a Cost-Effective DynamoDB-Compatible Database on Managed Kubernetes S...
DevOps.com
 
Using ScyllaDB for Extreme Scale Workloads
MarisaDelao3
 
Exploring Phantom Traffic Jams in Your Data Flows
ScyllaDB
 
What’s New in ScyllaDB Open Source 5.0
ScyllaDB
 
Transforming the Database: Critical Innovations for Performance at Scale
ScyllaDB
 
Qnap iei partners_day_2016 1108
qnapivan
 
Docker Introduction
Hao Fan
 
Running a DynamoDB-compatible Database on Managed Kubernetes Services
ScyllaDB
 
QNAP NAS Training 2016
Fernando Barrientos
 
Docker intro
spiddy
 
2016-JAN-28 -- High Performance Production Databases on Ceph
Ceph Community
 
InfiniBand for the enterprise
Anas Kanzoua
 
Build Low-Latency Applications in Rust on ScyllaDB
ScyllaDB
 
PyData Boston 2013
Travis Oliphant
 
Building a 10x More Efficient Edge Platform
ScyllaDB
 
Docker Meetup San Francisco: Radical Agility with Docker & AWS
Volker Pilz
 
Build DynamoDB-Compatible Apps with Python
ScyllaDB
 
Ad

More from ScyllaDB (20)

PDF
Understanding The True Cost of DynamoDB Webinar
ScyllaDB
 
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
PDF
New Ways to Reduce Database Costs with ScyllaDB
ScyllaDB
 
PDF
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
PDF
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
ScyllaDB
 
PDF
Leading a High-Stakes Database Migration
ScyllaDB
 
PDF
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
ScyllaDB
 
PDF
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
ScyllaDB
 
PDF
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
ScyllaDB
 
PDF
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB
 
PDF
ScyllaDB: 10 Years and Beyond by Dor Laor
ScyllaDB
 
PDF
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
ScyllaDB
 
PDF
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
ScyllaDB
 
PDF
Vector Search with ScyllaDB by Szymon Wasik
ScyllaDB
 
PDF
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
ScyllaDB
 
PDF
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
ScyllaDB
 
PDF
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
ScyllaDB
 
PDF
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
ScyllaDB
 
PDF
Lessons Learned from Building a Serverless Notifications System by Srushith R...
ScyllaDB
 
Understanding The True Cost of DynamoDB Webinar
ScyllaDB
 
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
New Ways to Reduce Database Costs with ScyllaDB
ScyllaDB
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
ScyllaDB
 
Leading a High-Stakes Database Migration
ScyllaDB
 
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
ScyllaDB
 
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
ScyllaDB
 
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
ScyllaDB
 
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB
 
ScyllaDB: 10 Years and Beyond by Dor Laor
ScyllaDB
 
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
ScyllaDB
 
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
ScyllaDB
 
Vector Search with ScyllaDB by Szymon Wasik
ScyllaDB
 
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
ScyllaDB
 
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
ScyllaDB
 
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
ScyllaDB
 
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
ScyllaDB
 
Lessons Learned from Building a Serverless Notifications System by Srushith R...
ScyllaDB
 

Recently uploaded (20)

PDF
Software Development Methodologies in 2025
KodekX
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
Chapter 1 Introduction to CV and IP Lecture Note.pdf
Getnet Tigabie Askale -(GM)
 
PDF
Chapter 2 Digital Image Fundamentals.pdf
Getnet Tigabie Askale -(GM)
 
PDF
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
Artjoker Software Development Company
 
PPTX
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira Júnior
 
PDF
Revolutionize Operations with Intelligent IoT Monitoring and Control
Rejig Digital
 
PDF
Why Your AI & Cybersecurity Hiring Still Misses the Mark in 2025
Virtual Employee Pvt. Ltd.
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PDF
Event Presentation Google Cloud Next Extended 2025
minhtrietgect
 
PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
Software Development Methodologies in 2025
KodekX
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
Chapter 1 Introduction to CV and IP Lecture Note.pdf
Getnet Tigabie Askale -(GM)
 
Chapter 2 Digital Image Fundamentals.pdf
Getnet Tigabie Askale -(GM)
 
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
Artjoker Software Development Company
 
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira Júnior
 
Revolutionize Operations with Intelligent IoT Monitoring and Control
Rejig Digital
 
Why Your AI & Cybersecurity Hiring Still Misses the Mark in 2025
Virtual Employee Pvt. Ltd.
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
Event Presentation Google Cloud Next Extended 2025
minhtrietgect
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 

Understanding Storage I/O Under Load

  • 1. Understanding Storage I/O Under Load Avi Kivity, CTO Pavel Emelyanov, Principal Engineer
  • 2. Presenter 2 Avi Kivity ● Original maintainer of Linux KVM - Kernel-based Virtual Machine ● Co-maintainer of Seastar, ScyllaDB ● Co-founder of ScyllaDB
  • 3. Presenter 3 Pavel Emelyanov ● Ex Linux kernel hacker ● Founder of the CRIU project ● Co-maintainer of Seastar & ScyllaDB
  • 4. 4 + For distributed, data-intensive apps that require high performance and low latency + 400+ users worldwide + Results + Comcast: Reduced P99 latencies by 95% + FireEye: 1500% improvement in throughput + Discord: Reduced C* nodes from ~140 to 6 + iFood: 9X cost reduction vs. DynamoDB + Open Source, Enterprise and Cloud options + Fully compatible with Apache Cassandra and Amazon DynamoDB About ScyllaDB 1ms <1ms 10ms 1M 10M ScyllaDB Universe of 400+ Users
  • 5. 400+ Companies Use ScyllaDB Seamless experiences across content + devices Fast computation of flight pricing Corporate fleet management Real-time analytics 2,000,000 SKU -commerce management Real-time location tracking for friends/family Video recommendation management IoT for industrial machines Synchronize browser properties for millions Threat intelligence service using JanusGraph Real time fraud detection across 6M transactions/day Uber scale, mission critical chat & messaging app 5 Network security threat detection Power ~50M X1 DVRs with billions of reqs/day Precision healthcare via Edison AI Inventory hub for retail operations Property listings and updates Unified ML feature store across the business Cryptocurrency exchange app Geography-based recommendations Distributed storage for distributed ledger tech Global operations- Avon, Body Shop + more Predictable performance for on sale surges GPS-based exercise tracking
  • 7. Current SSDs are Amazing 7
  • 8. SSDs are Amazing + 6.4 GB/s read + 3.3 GB/s write + 1M read IOPS + 200k write IOPS + Often, several disks in a single server! 8
  • 9. SSDs are Amazing, but not Magic + 6.4 GB/s read + OR 3.3 GB/s write + OR 1M read IOPS + OR 200k write IOPS + Or some kind of mix + But what kind of mix?! 9
  • 10. + Online transaction processing (OLTP) + There’s a real user at the other end + Maintenance workloads + Scaling out + Compaction + Backup + Analytics (OLAP) + Want to soak up free bandwidth, but not under a tight deadline + Multi-tenancy + Several OLTP and OLAP workloads on the same disk/data Why mixed workloads? 10
  • 11. Prevent one workload from dominating and hurting another workload The Challenge 11
  • 13. Introducing Diskplorer + Tool to test disks at a variety of mixed workloads + Open source: https://github.com/scylladb/diskplorer + Python, fio, matplotlib + Fancy graphs + Hours of fun! 13
  • 17. GCP n2-standard-16, 8 load SSDs 17
  • 18. GCP n2-standard-8, 2TB Persistent Disk 18
  • 21. 21 + Equation for the “safety area” can be complex + Bandwidth and IOPS are not instantly measurable Disk math troubles
  • 23. 23 + Equation for the “safety area” can be complex + Bandwidth and IOPS are not instantly measurable Disk math troubles
  • 24. Token bucket algo (and its math) 24
  • 28. 28 + runtime – total CPU time consumed + virtual runtimes – total time normalized by entity “weight” + algorithm selects entity with smallest virtual runtime + long inactive entity “forgets” idle virtual runtime CPU scheduler classes
  • 31. United States 2445 Faber St, Suite #200 Palo Alto, CA USA 94303 Israel Maskit 4 Herzliya, Israel 4673304 www.scylladb.com @scylladb Thank You! avi@scylladb.com @avikivity