sched-ext Tutorial

Extensible Scheduler Class (better known as sched-ext) is a Linux kernel feature which enables implementing kernel thread schedulers in BPF (Berkeley Package Filter) and dynamically loading them. Essentially, this allows end-users to change their schedulers in userspace without the need to build another kernel just to have a different scheduler.

The schedulers can be found in the scx-scheds and scx-scheds-git package.
Terminal window
```
# Stable branch + scx_loader and scxctl tools.
sudo pacman -S scx-scheds scx-tools

# Bleeding edge branch (This branch includes the latest changes from the master branch.) + scx_loader and scxctl tools.
sudo pacman -S scx-scheds-git scx-tools-git
```
The scx-scheds-git package may not include every experimental scheduler, as some are developed in separate feature branches that are not yet merged into the master branch.

How to Launch and Manage the Scheduler

To start the scheduler, open your terminal and enter the following command:
Example of starting rusty
```
sudo scx_rusty
```

This will launch the rusty scheduler and detach the default scheduler.

To stop the scheduler. Press CTRL + C and the scheduler will then be stopped and the default kernel scheduler will take over again.

scxctl is a CLI DBUS client for interacting with scx_loader.

Features:
- Get the current scheduler and mode
- List all available schedulers
- Start a scheduler in a given mode, or with given arguments
- Switch between schedulers and modes
- Stop the running scheduler
- Restart the running scheduler

scxctl start --sched flash --mode gaming

scxctl stop

scxctl restore

scxctl switch --sched bpfland --mode gaming

scxctl start --sched cosmos --args="-c,75,-m,0-15"

scxctl switch --sched flash --args="-s,20000"

$ scxctl --help
Usage: scxctl <COMMAND>

Commands:
  get      Get the info on the running scheduler
  list     List all supported schedulers
  start    Start a scheduler in a mode or with arguments
  switch   Switch schedulers or modes, optionally with arguments
  stop     Stop the current scheduler
  restart  Restart the current scheduler with original configuration
  restore  Restore the default scheduler from configuration
  help     Print this message or the help of the given subcommand(s)

Options:
  -h, --help     Print help
  -V, --version  Print version

As the name implies, it is a utility that functions as a loader and manager for the sched-ext framework using the D-Bus interface.

While it does not require systemd, it can still be utilized in conjunction with it. Check the transition guide for reference.

Has the ability to stop, start, restart, read information about a scx scheduler and more.
- You can use tools like dbus-send or gdbus to communicate with it.

This guide explains how to use scx_loader with the dbus-send command.

dbus-send --system --print-reply --dest=org.scx.Loader /org/scx/Loader org.scx.Loader.StartScheduler string:scx_rusty uint32:0

# This example starts scx_bpfland with the following flags: -k -c 0
dbus-send --system --print-reply --dest=org.scx.Loader /org/scx/Loader org.scx.Loader.StartSchedulerWithArgs string:scx_bpfland array:string:"-k","-c","0"

dbus-send --system --print-reply --dest=org.scx.Loader /org/scx/Loader org.scx.Loader.StopScheduler

# scx_loader will switch to the default scheduler set in the scx_loader config file
dbus-send --system --print-reply --dest=org.scx.Loader /org/scx/Loader org.scx.Loader.RestoreDefault

dbus-send --system --print-reply --dest=org.scx.Loader /org/scx/Loader org.scx.Loader.SwitchScheduler string:scx_lavd uint32:2
# This switches to scx_lavd with the scheduler mode 2 meaning it starts LAVD in powersaving

dbus-send --system --print-reply --dest=org.scx.Loader /org/scx/Loader org.scx.Loader.SwitchSchedulerWithArgs string:scx_bpfland array:string:"-k","-c","0"

dbus-send --system --print-reply --dest=org.scx.Loader /org/scx/Loader org.freedesktop.DBus.Properties.Get string:org.scx.Loader string:CurrentScheduler

dbus-send --system --print-reply --dest=org.scx.Loader /org/scx/Loader org.freedesktop.DBus.Properties.Get string:org.scx.Loader string:SupportedSchedulers

You can access and configure them via the sched-ext scheduler config button.

SCX Manager is a standalone GUI tool derived from the CachyOS Kernel Manager. It allows users to manage the sched-ext framework and its schedulers through the scx_loader.

Features:

Check which scheduler is currently active
Select a scheduler or profile: (Auto, Gaming, Power save, Low Latency or Server)
Set additional flags
Disable the current scheduler

Screenshot

Scheduler Guide: Profiles and Use Cases

Since there are many schedulers to choose from, we want to give a little introduction about the schedulers in hand.

Feel free to report any issue or feedback to their scheduler repository.

Use scx_schedulername --help to see the available flags and a brief description of what they do.

scx_rusty --help

scx_beerland

Developed by: Andrea Righi (arighi GitHub)

Production Ready?

scx_beerland is a scheduler designed to prioritize locality and scalability.

Prioritizes keeping tasks on the same CPU to maintain cache locality, while also ensuring good scalability across many CPUs by using local DSQs (per-CPU runqueues) when the system is not saturated.

Use cases:
- Cache-intensive workloads
- Systems with a large amount of CPUs
- Gaming: Its known to work surprinsingly well in certain games, although your mileage may vary
- Server: Good for general purpose server workloads due to its scalability and locality optimizations.
- Can be used for desktop usage as well.

Scheduler Modes

None for the moment.

scx_bpfland

Developed by: Andrea Righi (arighi GitHub)

Production Ready?

A vruntime-based sched_ext scheduler that prioritizes interactive workloads. Highly flexible and easy to adapt.

Bpfland when making decisions on which cores to use, it takes in consideration their cache layout and which cores share the same L2/L3 cache leading to fewer cache misses = more performance.

Use cases:
- Gaming
- Desktop usage
- Multimedia/Audio production
- Great interactivity under intensive workloads
- Power saving
- Server

Scheduler Modes

Low Latency

Command-line Flags: -m performance -w
Description: Meant to lower latency at the cost of throughput. Suitable for soft real-time applications like Audio Processing and Multimedia.

Power Save

Command-line Flags: -s 20000 -m powersave -I 100 -t 100
Description: Prioritizes power efficiency. Favors less performant cores (e.g E-cores on Intel).

Server

Command-line Flags: -s 20000 -S
Description: Prioritize tasks with strict affinity. This option can increase throughput at the cost of latency and it is more suitable for server workloads.

scx_cake

Developed by: RitzDaCat (RitzDaCat GitHub)

Production Ready?

scx_cake is an experimental BPF CPU scheduler that adapts the network CAKE algorithm’s DRR++ (Deficit Round Robin++) for CPU scheduling.

4-Tier Classification — Tasks sorted by EWMA avg_runtime into Critical / Interactive / Frame / Bulk
Zero Global Atomics — Per-CPU BSS arrays with MESI-guarded writes eliminate bus locking
Kernel-Delegated Idle Selection — scx_bpf_select_cpu_dfl() for authoritative, zero-staleness CPU selection
Per-LLC DSQ Sharding — Eliminates cross-CCD lock contention on multi-chiplet CPUs
DRR++ Deficit Tracking — Network CAKE’s flow fairness algorithm adapted for CPU task scheduling

Designed for gaming workloads on modern AMD and Intel hardware.

Use cases:
- Gaming

4-Tier System

scx_cake classifies every task into one of four tiers based on its EWMA (Exponential Weighted Moving Average) runtime. Classification is automatic and continuous — tasks move between tiers as their behavior changes.

Tier Gates

Tier	Name	avg_runtime	Typical Workload	Quantum	Starvation
T0	Critical	< 100µs	IRQ handlers, input drivers, audio (PipeWire), network	0.5ms	3ms
T1	Interactive	< 2ms	Compositors, game physics, game AI, short workers	2.0ms	8ms
T2	Frame	< 8ms	Game render threads, video encoding	4.0ms	40ms
T3	Bulk	≥ 8ms	Compilation, background indexing, batch jobs	8.0ms	100ms

[!TIP] No game task should be in T3. Game render threads run 2-8ms per frame → T2. Physics/AI run 0.5-2ms → T1. Input handlers run < 100µs → T0. Only tasks doing 8ms+ uninterrupted CPU work (shader compilation, loading screens) land in T3.

How Classification Works

Initial placement: Based on nice value — nice < 0 → T0, nice 0-10 → T1, nice > 10 → T3
Runtime authority: After ~3 stops, the EWMA avg_runtime becomes authoritative. A nice -5 task that runs 50ms bursts will reclassify to T3 regardless of nice value.
Hysteresis: 10% deadband prevents oscillation at tier boundaries. Promotion requires avg_runtime clearly below the gate; demotion is immediate.
Graduated backoff: Once a tier is stable for 3+ consecutive stops, reclassification frequency drops per-tier: T0 rechecks every 1024th stop, T3 every 16th. Instability resets to full-frequency checking.

DRR++ Deficit Tracking

Adapted from network CAKE’s flow fairness:

Each task starts with a deficit (quantum + new-flow bonus ≈ 10ms credit)
Each execution bout consumes deficit proportional to runtime
When deficit exhausts → new-flow bonus removed → task competes normally
This gives newly spawned threads (game launching a worker) instant responsiveness that naturally decays

DVFS (CPU Frequency Scaling)

Each tier maps to a CPU performance target via RODATA lookup table:

Tier	Target	Rationale
T0-T2	100% (max frequency)	Gaming workloads need full performance
T3	75%	Background work can run slightly slower to save power

On Intel hybrid CPUs (has_hybrid = true), targets are scaled by each core’s cpuperf_cap to prevent over-requesting frequency on E-cores.

Profiles (`--profile, -p`)

Profile	Quantum	Starvation	Use Case
gaming	2ms	100ms	(Default) Balanced for most games
esports	1ms	50ms	Competitive FPS, ultra-low latency
legacy	4ms	200ms	Older CPUs, battery saving
default	2ms	100ms	Alias for gaming

CLI Arguments

Argument	Default	Description
`--profile, -p <PROFILE>`	`gaming`	Select preset profile
`--quantum <µs>`	profile	Base time slice in microseconds
`--new-flow-bonus <µs>`	profile	Extra deficit for newly woken tasks
`--starvation <µs>`	profile	Max run time before forced preemption
`--verbose, -v`	`false`	Enable live TUI stats display
`--interval <secs>`	`1`	TUI refresh interval

Per-Tier Tuning (Gaming Profile)

Tier	Quantum Multiplier	Effective Slice	Starvation Limit
T0 Critical	0.75x	1.5ms	3ms
T1 Interactive	1.0x	2.0ms	8ms
T2 Frame	1.2x	2.4ms	40ms
T3 Bulk	1.4x	2.8ms	100ms

scx_cosmos

Developed by: Andrea Righi (arighi GitHub)

Production Ready?

Lightweight scheduler optimized for preserving task-to-CPU locality.

When the system is not saturated, the scheduler prioritizes keeping tasks on the same CPU using local DSQs. This not only maintains locality but also reduces locking contention compared to shared DSQs, enabling good scalability across many CPUs.

Use cases:
- General-purpose scheduler: the scheduler should adapt itself both for server workloads or desktop workloads.

Scheduler Modes

Auto

Command-line Flags: -s 20000 -d -c 0 -p 0
Description: Sacrifices cache locality and power efficiency for even distribution across all CPUs. Leaning more over to interactive workloads.

Gaming

Command-line Flags: -c 0 -p 0
Description: Disable CPU load tracking and always enforce deadline-based scheduling to improve responsiveness.

Power Save

Command-line Flags: -m powersave -d -p 5000
Description: Prioritizes power efficiency. Favor less performant cores (e.g., E-cores on Intel) and disables deferred wakeups, reducing throughput while increasing power efficiency. CPU load polling increased to 5ms.

Low Latency

Command-line Flags: -m performance -c 0 -p 0 -w
Description: Meant to lower latency at the cost of throughput. Suitable for soft real-time applications like Audio Processing and Multimedia. Always enforce deadline-based scheduling and synchronous wake up optimizations to improve performance predictability.

Server

Command-line Flags: -s 20000
Description: Enable address space affinity to improve locality and performance in certain cache-sensitive workloads. Polling increased to 20ms.

scx_flash

Developed by: Andrea Righi (arighi GitHub)

Production Ready?

A scheduler that focuses on ensuring fairness among tasks and performance predictability.

It operates using an earliest deadline first (EDF) policy, where each task is assigned a “latency” weight. This weight is dynamically adjusted based on how often a task release the CPU before its full time slice is used.

Tasks that release the CPU early are given a higher latency weight, prioritizing them over tasks that fully consume their time slice.

Use cases:
- Gaming
- Latency sensitive workloads such as multimedia or real-time audio processing
- Need for responsiveness under over-stressed situations
- Consistency in performance
- Server

Scheduler Modes

Low Latency

Command-line Flags: -m performance -w -C 0
Description: Meant to lower latency at the cost of throughput. Suitable for soft real-time applications like Audio Processing and Multimedia.

Gaming

Command-line Flags: -m all
Description: Optimizes for high performance in games.

Power Save

Command-line Flags: -m powersave -I 10000 -t 10000 -s 10000 -S 1000
Description: Prioritizes power efficiency. Favor less performant cores (e.g., E-cores on Intel) and introduces a forced idle cycle every 10ms to increase power saving.

Server

Command-line Flags: -m all -s 20000 -S 1000 -I -1 -D -L
Description: Tuned for server workloads. Trades responsiveness for throughput.

scx_lavd

Developed by: Changwoo Min (multics69 GitHub).

Production Ready?

Brief introduction to LAVD from Changwoo:

LAVD is a new scheduling algorithm which is still under development. It is motivated by gaming workloads, which are latency-critical and communication-heavy. It aims to minimize latency spikes while maintaining overall good throughput and fair use of CPU time among tasks.

Use cases:
- Gaming
- Audio Production
- Latency sensitive workloads
- Desktop usage
- Great interactivity under intensive workloads
- Power saving

One of the main and awesome capabilities that LAVD includes is Core Compaction. which without going into technical details: When CPU usage < 50%, Currently active cores will run for longer and at a higher frequency. Meanwhile Idle Cores will stay in C-State (Sleep) for a much longer duration achieving less overall power usage.

Scheduler Modes

Gaming & Low Latency

Command-line Flags: --performance
Description: Maximizes performance by using all available cores, prioritizing physical cores.

Power Save

Command-line Flags: --powersave
Description: Minimizes power consumption while maintaining reasonable performance. Prioritizes efficient cores and threads over physical cores.

scx_pandemonium

Developed by: Will Clingan (willclngn GitHub)

A behavioral classification scheduler that uses EWMA-driven scoring (wakeup frequency, context switch rate, runtime variance) to sort tasks into three dispatch tiers — LAT_CRITICAL, INTERACTIVE, and BATCH — each with its own time slice, preemption rules, and DSQ routing. A CoDel-inspired sojourn rescue mechanism tracks batch queue wait times and rescues aging tasks before they stall, with thresholds that adapt to dispatch rate and core count. Dual burst detection (CUSUM change-point + wakeup rate counter) handles fork storms, and a Rust adaptive control loop adjusts scheduling knobs once per second based on workload regime detection and BPF histogram telemetry.

Compositors (KWin, GNOME Shell, Hyprland, Sway, and others) are automatically boosted to LAT_CRITICAL. A persistent process database learns task classifications across reboots.

Use cases:
- Gaming
- Desktop usage
- Multimedia/Audio production
- Codebase compilation
- Interactivity under intensive workloads
- Mixed workloads

Scheduler Modes

Default (Adaptive)

Command-line Flags: (none — runs adaptive by default)
Description: Full adaptive mode. The Rust control loop detects workload regime (LIGHT / MIXED / HEAVY) and adjusts scheduling parameters in real-time. Best for general desktop use and gaming.

BPF Only

Command-line Flags: --no-adaptive
Description: Disables the Rust adaptive control loop. The BPF scheduler runs with static tuning knobs. Lower overhead, useful for benchmarking or if the adaptive layer is overcorrecting on your workload.

Verbose / Debug

Command-line Flags: -v or --verbose
Description: Enables detailed telemetry output including per-tier dispatch counts, sojourn times, and behavioral classification stats. Useful for diagnosing scheduling behavior.

scx_p2dq

Production Ready?
- Yes. If tuned correctly for your specific workload and hardware.

Developed by: Daniel Hodges (hodgesds GitHub)

A general purpose scheduler that focuses on pick two load balancing between LLCs. Keeps high cache locality and work conservation while providing reasonable latency.

Use cases:
- Server
- Desktop environments
- Gaming (with some manual tuning)

Scheduler Modes

Gaming

Command-line Flags: --task-slice true -f --sched-mode performance
Description: Improves consistency in gaming performance and increases bias towards scheduling on higher performance cores.

Low Latency

Command-line Flags: -y -f --task-slice true
Description: Lowers latency by making interactive tasks stick more to the CPU they were assigned to and increasing the stability on slice time.

Power Save

Command-line Flags: --sched-mode efficiency
Description: Enhances power efficiency by prioritizing power efficient cores.

Server

Command-line Flags: --keep-running
Description: Improves server workloads by allowing tasks to run beyond their slice if the CPU is idle.

scx_tickless

Developed by: Andrea Righi (arighi Github)

Production Ready?
- This scheduler is still experimental and not recommended for production use.

scx_tickless is a server-oriented scheduler designed for cloud computing, virtualization, and high-performance computing workloads.

The scheduler works by routing all scheduling events through a pool of primary CPUs assigned to handle these events. This allows disabling the scheduler’s tick on other CPUs, reducing OS noise.

Use cases:
- Cloud computing
- Virtualization
- High performance computing workloads
- Server

Scheduler Modes

Gaming

Command-line Flags: -f 5000 -s 5000
Description: Boosts gaming performance by increasing how often the scheduler detects CPU contention and triggers context switches with a shorter time slice.

Power Save

Command-line Flags: -f 50
Description: Enhances power efficiency by lowering contention checks.

Low Latency

Command-line Flags: -f 5000 -s 1000
Description: Similar to the gaming profile but with a further reduced slice.

Server

Command-line Flags: -f 100
Description: Reduced how often the scheduler checks for CPU contention to improve throughput at the cost of responsiveness.

scx_rustland

Developed by: Andrea Righi (arighi GitHub)

Production Ready?

For performance-critical production scenarios, other schedulers are likely to exhibit better performance, as offloading all scheduling decisions to user-space comes with a certain cost (even if it’s minimal).

However, a scheduler entirely implemented in user-space holds the potential for seamless integration with sophisticated libraries, tracing tools, external services (e.g., AI), etc.

Hence, there might be situations where the benefits outweigh the overhead, justifying the use of this scheduler in a production environment.

Shares similarities with bpfland, Made with the intention of being easy to read and understand how it works due to its implementation in userspace.

Keep in mind that there is a slight throughput disadvantage when using a userspace scheduler.

Use cases:
- Low latency workloads (Gaming, video conferences and live streaming)
- Desktop usage

scx_rusty

Developed by: David Vernet (Byte-Lab GitHub)

Production Ready?
- Yes. If tuned correctly,

Rusty offers a wide range of features that enhance its capabilities, providing greater flexibility for various use cases. One of these features is tunability, allowing you to customize Rusty to suit your preferences and specific requirements.

Use cases:
- Gaming
- Latency sensitive workloads
- Desktop usage
- Multimedia/Audio production
- Great interactivity under intensive workloads
- Power saving

Configuration and performance testing

LAVD Autopilot & Autopower

Quotes from Changwoo Min:

In autopilot mode, the scheduler adjusts its power mode Powersave, Balanced, or Performance based on the system load, specifically CPU utilization
Autopower: Automatically decide the scheduler’s power mode based on the system’s energy profile aka EPP (Energy Performance Preference).

# Autopower can be activated by the following flag:
--autopower
# e.g:
scx_lavd --autopower

ananicy-cpp & sched-ext

In order to disable/stop ananicy-cpp, run the following command:

systemctl disable --now ananicy-cpp

scx_loader Power Profile Switching

Implemented in the power-profiles-daemon package provided by CachyOS which includes a custom patch to support scx_loader power profile switching.

If scx_loader is currently running, when game-performance is used, it will automatically switch the active scheduler to the Gaming profile when a game is launched, and revert back to the default profile when the game is closed.
When changing between power profiles e.g. in KDE Plasma or GNOME using the power profile switcher, scx_loader will automatically switch to the corresponding scheduler profile:

Power Profile	Scheduler Profile
Power Saver	Power Save
Balanced	Auto
Performance	Gaming

Benchmarking and comparing schedulers with cachyos-benchmarker

The cachyos-benchmarker tool provides an easy way to evaluate and compare the performance of different CPU schedulers.

It runs a comprehensive suite of benchmarks to measure CPU, memory, and overall system performance under various workloads.

The following benchmarks are included:

Test	Measures	Tool
stress-ng cpu-cache-mem	CPU, cache, and memory performance	`stress-ng`
FFmpeg compilation	Parallel build performance	`make`
x265 encoding	Video encoding throughput	`x265`
argon2 hashing	Multithreaded password hashing	`argon2`
perf sched msg	Context switching and IPC performance	`perf`
perf memcpy	Memory throughput `memcpy()`	`perf`
prime calculation	Integer arithmetic and parallelism	`primesieve`
NAMD	Molecular dynamics (scientific workload)	`namd3`
Blender render	CPU-only 3D rendering	`blender`
xz compression	Compression throughput	`xz`
Kernel defconfig build	Kernel compilation performance	`make`
y-cruncher	Mathematical precision and memory stress	`y-cruncher`

cachyos-benchmarker can be used for several purposes, including:

Testing scheduler stability Run the full benchmark suite to detect stalls, crashes, or regressions introduced by scheduler changes. If you are using scx_loader, you can collect logs in case of a stall or crash with:
Terminal window
```
journalctl --unit scx_loader.service --boot 0 > crash.log
```
This will create a file named crash.log in your current directory.
Comparing scheduler performance
- Evaluate performance differences between schedulers. e.G. BPFLAND vs LAVD
Measuring the effect of kernel or scheduler updates
- Compare runs before and after applying patches or version changes to check for performance regressions or improvements.
Testing configuration tweaks
- Assess the impact of changes such as CPU governor settings, SMT toggling, or modified scheduler flags.

Requirements

4 GB RAM or more
At least 8 GB of free storage space
Time and patience - the full benchmark can take over an hour on slower systems

Installation

To install cachyos-benchmarker, run the following command:

sudo pacman -S cachyos-benchmarker

Running the benchmark

Execute cachyos-benchmarker:

cachyos-benchmarker ~/cachyos-benchmarker/
# You can replace ~/cachyos-benchmarker/ with any directory you want the logs to be saved in.

Wait until the preparation steps finish.
Follow the prompts:
- Do you want to drop page cache now? Root privileges needed! (y/N) y
- Please enter a name for this run, or leave empty for default:
Wait for the tests to finish.

Once finished, the following will happen:

Creation of a log file with name like benchie_<name>_<DATE>.log which contains detailed information about the benchmark run.

Example: benchie_p2dq_2025-09-29-2115.log
The benchmark_scraper.py script will automatically execute to generate a summary report in HTML format.

What does the script do?:

Reads all benchie_*.log files in the specified directory.
Extracts the benchmark names, times, and scores.
Sorts or aggregates them.

Prints a clean summary of the results to your terminal and creates an HTML file that can be opened in a browser.

Terminal output example:

stress-ng cpu-cache-mem: 15.26
y-cruncher pi 1b: 31.23
perf sched msg fork thread: 8.892
perf memcpy: 13.53
namd 92K atoms: 53.54
calculating prime numbers: 11.126
argon2 hashing: 6.62
ffmpeg compilation: 53.38
xz compression: 61.13
kernel defconfig: 130.73
blender render: 96.29
x265 encoding: 24.99

Total time (s): 506.72
Total score: 70.71

Name: p2dq
Date: 2025-09-29-2115

System:    Kernel: 6.17.0-1.1-cachyos-p2dq arch: x86_64 bits: 64
           Desktop: KDE Plasma v: 6.4.5 Distro: CachyOS
Memory:    System RAM: total: 32 GiB available: 30.61 GiB used: 7.54 GiB (24.6%)
           Device-1: Channel-A DIMM 0 type: LPDDR5 size: 8 GiB speed: 7500 MT/s
           Device-2: Channel-B DIMM 0 type: LPDDR5 size: 8 GiB speed: 7500 MT/s
           Device-3: Channel-C DIMM 0 type: LPDDR5 size: 8 GiB speed: 7500 MT/s
           Device-4: Channel-D DIMM 0 type: LPDDR5 size: 8 GiB speed: 7500 MT/s
CPU:       Info: 8-core model: AMD Ryzen 7 8845HS w/ Radeon 780M Graphics bits: 64 type: MT MCP cache: L2: 8 MiB
           Speed (MHz): avg: 3366 min/max: 419/5138 cores: 1: 3366 2: 3366 3: 3366 4: 3366 5: 3366 6: 3366 7: 3366 8: 3366 9: 3366 10: 3366 11: 3366 12: 3366 13: 3366 14: 3366 15: 3366 16: 3366

SCX Scheduler: p2dq_1.0.21_gf90c2aa1_dirty_x86_64_unknown_linux_gnu

SCX Version: p2dq_1.0.21_gf90c2aa1_dirty_x86_64_unknown_linux_gnu

 Version         : 0.5.1-1

HTML example of a test result comparing two different branches of the same scheduler:

To compare two or more runs, place the .log files in the same directory before running benchmark_scraper.py. The tool will automatically detect and compare them in the HTML report.

Testing scheduler latency with schbench

schbench is a scheduler benchmark designed to measure scheduler latency under a simulated server-style workload. It spawns a configurable number of “worker” and “message” threads, where messages repeatedly wake up workers. By measuring the latency distribution from wakeup to execution of these worker threads, it provides critical insight into a kernel’s ability to handle thread wakeups, balancing, and CPU contention, especially under load.

Use cases

You can use schbench to:

Evaluate scheduler latency: Identify how quickly threads are scheduled after waking up.
Compare wakeup performance between schedulers: Detect improvements or regressions in context switching and wakeup latency.
Test the effect of kernel or scheduler patches: Assess if tuning or updates affect scheduling fairness and responsiveness.

Installation

schbench is available in the CachyOS repositories:

sudo pacman -S schbench

Running the benchmark

A simple way to run schbenchfor a general latency test is:

schbench -m 2 -t 8 -r 60

This example runs:

2 message threads (-m 2)
8 worker threads per message thread (-t 8)
for 60 seconds total runtime (-r 60)

You can adjust these values depending on your CPU core count and the desired load level.

Here is a table explaining some of the key options:

Option	Description
`-C, --calibrate`	Run calibration and report timing (no benchmark).
`-L, --no-locking`	Disable spinlocks during CPU work (default: locking enabled).
`-m, --message-threads <n>`	Number of message threads (default: 1).
`-t, --threads <n>`	Worker threads per message thread (default: number of CPUs).
`-r, --runtime <sec>`	Benchmark duration (default: 30).
`-F, --cache_footprint <KB>`	Cache footprint size (default: 256).
`-n, --operations <count>`	Number of “think time” operations to perform (default: 5).
`-A, --auto-rps`	Automatically grow RPS until CPU utilization target is reached.
`-R, --rps <count>`	Requests per second mode.
`-p, --pipe <bytes>`	Simulate a pipe transfer test.
`-w, --warmuptime <sec>`	Warm-up duration before collecting stats (default: 0).
`-i, --intervaltime <sec>`	Interval for printing latencies (default: 10).
`-z, --zerotime <sec>`	Interval for zeroing latency stats (default: never).

Understanding the output

After each run, schbench prints latency percentiles like:

Output example

Wakeup Latencies percentiles (usec) runtime 10 (s) (2406 total samples)
  50.0th: 60         (648 samples)
  90.0th: 2034       (968 samples)
* 99.0th: 4104       (211 samples)
  99.9th: 10128      (22 samples)
  min=1, max=10308
Request Latencies percentiles (usec) runtime 10 (s) (2394 total samples)
  50.0th: 49216      (726 samples)
  90.0th: 69760      (954 samples)
* 99.0th: 166656     (212 samples)
  99.9th: 273920     (21 samples)
  min=11770, max=334247
RPS percentiles (requests) runtime 10 (s) (11 total samples)
  20.0th: 234        (3 samples)
* 50.0th: 238        (3 samples)
  90.0th: 241        (4 samples)
  min=230, max=248
current rps: 230.99
Wakeup Latencies percentiles (usec) runtime 10 (s) (2406 total samples)
  50.0th: 60         (648 samples)
  90.0th: 2034       (968 samples)
* 99.0th: 4104       (211 samples)
  99.9th: 10128      (22 samples)
  min=1, max=10308
Request Latencies percentiles (usec) runtime 10 (s) (2406 total samples)
  50.0th: 49216      (729 samples)
  90.0th: 69760      (956 samples)
* 99.0th: 165632     (212 samples)
  99.9th: 273920     (22 samples)
  min=11770, max=334247
RPS percentiles (requests) runtime 10 (s) (11 total samples)
  20.0th: 234        (3 samples)
* 50.0th: 238        (3 samples)
  90.0th: 241        (4 samples)
  min=230, max=248
average rps: 240.60

How to interpret the results

Wakeup Latencies:
- Measures how quickly threads wake up after being signaled.
  - Lower values here (especially the 99th percentile) mean the scheduler is more responsive.
Request Latencies:
- Represents the time taken to complete requests between threads.
  - Lower latency indicates better inter-thread communication and scheduling efficiency.
RPS (Requests Per Second):
- Shows the sustained throughput:
  - A higher average RPS indicates the scheduler can handle more work per second under the given configuration.

In conclusion:

A good scheduler will show low wakeup and request latencies with consistent RPS.
A less efficient scheduler may exhibit high latency spikes or unstable RPS values over time.

Recommendations for benchmarking games

If your desire is to benchmark games to compare how different schedulers perform, here are some tips to get the most accurate results:

Use built-in benchmarks: Many modern games come with built-in benchmarking tools. These are designed to provide consistent results by running the same sequence of events each time.
- Check out this website for a list of games that include built-in benchmarks.
Consistent settings: Ensure that the game settings (resolution, graphics quality, etc.) are the same for each test run.
Close background applications: Other applications running in the background can affect performance. Close unnecessary programs to minimize their impact.
If you’re not using a built-in benchmark, try to perform the same actions in the game for each test run. This could include following the same path, engaging in similar combat scenarios, or performing the same tasks.
- Even not aiming at the same spot can lead to different performance results.
Multiple runs: Perform multiple runs of the benchmark and take the average to account for variability.
Use performance monitoring tools: Tools like MangoHud or GOverlay can provide realtime performance metrics such as FPS, frame times, and CPU/GPU usage.
Take advantage of keyboard shortcuts or macros:
- One example is to create a keybinding on which you can switch between different schedulers or change their modes on the fly while in-game.
  - This can be done using a tool like scxctl or by creating custom scripts that change the active scheduler and its mode.

This website contains a list of benchmarks done by the community using different schedulers or testing various settings.

In order to upload your own benchmarks. You’ll have to link your Discord account to the website and then you can submit your own benchmarks.

Then click on the New benchmark button and fill in the required information.

You can upload multiple results for the same game using different schedulers or settings.
Accepts both MangoHud and Afterburner logs.
Allows searching by title or description.

Transitioning from scx.service to scx_loader: A Comprehensive Guide

First let’s start with a close-up comparison between the scx.service file structure against the scx_loader configuration file structure.

If you previously had LAVD running with the old scx.service like this example below:

# List of scx_schedulers: scx_bpfland scx_central scx_flash scx_lavd scx_layered scx_nest scx_qmap scx_rlfifo scx_rustland scx_rusty scx_simple scx_userland
SCX_SCHEDULER=scx_lavd

# Set custom flags for the scheduler
SCX_FLAGS='--performance'

Then the equivalent on the scx_loader configuration file will look like:

default_sched = "scx_lavd"
default_mode = "Auto"

[scheds.scx_lavd]
auto_mode = ["--performance"]

For more information on how to configure the scx_loader file

Follow the guide below for an easy transition from the scx systemd service to the new scx_loader utility.

 systemctl disable --now scx.service && systemctl enable --now scx_loader.service

# Micro editor is going to create a new file.
sudo micro /etc/scx_loader.toml
# Add the following lines:

default_sched = "scx_bpfland" # Edit this line to the scheduler you want scx_loader to start at boot
default_mode = "Auto" # Possible values: "Auto", "Gaming", "LowLatency", "PowerSave".

# Press CTRL + S to save changes and CTRL + Q to exit Micro.

Restarting the scx_loader
```
systemctl restart scx_loader.service
```
- You’re done, the scx_loader will now load and start the desired scheduler.

Checking the service status
```
systemctl status scx_loader.service
```
Viewing all the service log entries
```
journalctl -u scx_loader.service
```
Viewing only the logs of the current session.
```
journalctl -u scx_loader.service -b 0
```

In order to get a more detailed log, follow these steps.

Edit the service file
```
 sudo systemctl edit scx_loader.service
```
Add the following line under the [Service] section
```
 Environment=RUST_LOG=trace
```

 sudo systemctl restart scx_loader.service

Check the logs again for a more detailed debugging information.

FAQ

Why X scheduler performs worse than the other?

There are many variables to consider when comparing them. For example, how do they measure a task’s weight? Do they prioritize interactive tasks over non-interactive ones? Ultimately, it depends on their design choices.

Why everyone keeps saying this X scheduler is the best for X case but it does not perform as well for me?

Like the previous answer, the choice of CPU and its design such as the core layout, how they share cache across the cores and other related factors can lead to the scheduler operating less efficiently.
That’s why having choices is one of the highlights from the sched-ext framework, so don’t be scared to try one and see which one works best for your use case. Examples: fps stability, maximum performance, responsiveness under intensive workloads etc.

The use cases of these schedulers are quite similar… why is that?

Primarily because they are multipurpose schedulers, which means they can accommodate a variety of workloads, even if they may not excel in every area.

To determine which scheduler suits you best, there’s no better advice than to try it out for yourself.

Why am I missing a scheduler that some users are mentioning or testing in the CachyOS Discord server?

Make sure you’re using the bleeding edge version of the scx-scheds package named as scx-scheds-git

One of the reasons will be that this scheduler is very new and is currently being tested by the users, therefore it has not yet been added to the scx-scheds-git package.

Why did the scheduler suddenly crash? Is it unstable?

There could be a few reasons on why this happened:
- One of the most common reason is that you were using ananicy-cpp alongside the scheduler. This why we added this warning
- Another reason could be that the workload you were running exceeded the limits and capacity of the scheduler causing it to stall.
  - Example of an unreasonable workload: hackbench
- Or the more obvious reason, you’ve found a bug in the scheduler, if so. Please report it as an issue in their GitHub or let them know about it in the CachyOS Discord channel sched-ext

I have previously used the scx_loader in the Kernel Manager GUI. Do I still need to follow the transition steps?

In this particular case, no, it is not necessary because the Kernel Manager already handles the transition process.
- Unless you have previously added custom flags in /etc/default/scx and still want to use them.

sched-ext Tutorial

How to Launch and Manage the Scheduler

Scheduler Guide: Profiles and Use Cases

Scheduler Modes

Scheduler Modes

Low Latency

Power Save

Server

4-Tier System

Tier Gates

How Classification Works

DRR++ Deficit Tracking

DVFS (CPU Frequency Scaling)

Profiles (--profile, -p)

CLI Arguments

Per-Tier Tuning (Gaming Profile)

Scheduler Modes

Auto

Gaming

Power Save

Low Latency

Server

Scheduler Modes

Low Latency

Gaming

Power Save

Server

Scheduler Modes

Gaming & Low Latency

Power Save

Scheduler Modes

Default (Adaptive)

BPF Only

Verbose / Debug

Scheduler Modes

Gaming

Low Latency

Power Save

Server

Scheduler Modes

Gaming

Power Save

Low Latency

Server

Configuration and performance testing

LAVD Autopilot & Autopower

ananicy-cpp & sched-ext

scx_loader Power Profile Switching

Benchmarking and comparing schedulers with cachyos-benchmarker

Requirements

Installation

Running the benchmark

Testing scheduler latency with schbench

Use cases

Installation

Running the benchmark

Understanding the output

How to interpret the results

Recommendations for benchmarking games

Uploading and sharing your benchmarks

Transitioning from scx.service to scx_loader: A Comprehensive Guide

Debugging in the scx_loader

FAQ

Why X scheduler performs worse than the other?

Why everyone keeps saying this X scheduler is the best for X case but it does not perform as well for me?

The use cases of these schedulers are quite similar… why is that?

Why am I missing a scheduler that some users are mentioning or testing in the CachyOS Discord server?

Why did the scheduler suddenly crash? Is it unstable?

I have previously used the scx_loader in the Kernel Manager GUI. Do I still need to follow the transition steps?

Learn More

Profiles (`--profile, -p`)