scx-upstream

mirror of https://github.com/sched-ext/scx.git synced 2024-12-13 12:07:17 +00:00

Author	SHA1	Message	Date
Daniel Hodges	5fdd257862	scx_layered: Pass layer spec for core growth algo Pass in the layer spec when determining the layer core growth algo. This should make it easier to implement layer growth algos that are spec specific. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-10 10:27:08 -07:00
Tejun Heo	56bb963136	build: Use a single top-level rust workspace Rust build was using two separate workspaces - rust/ and scheds/rust. There's no reason to separate them and it makes doc generation tricky. Use single top level workspace so that we can drive all rust building from cargo.	2024-09-08 14:23:48 -10:00
patso	120211d731	split build and test jobs split build and test jobs to reduce ci turnaround time and make it clear what is failing when something fails. also add virtiofsd to deps to make test compilation faster (most test time is compilation) and remove all force 9ps.	2024-09-08 02:54:24 -04:00
Changwoo Min	17e0e08e6e	Merge pull request #621 from multics69/lavd-greedy-fix scx_lavd: improve greedy ratio calculation and more	2024-09-07 10:52:00 +09:00
Tejun Heo	6f8917ceca	Merge pull request #624 from JakeHillion/cleanup-layer_growth_algo scx_layered: clean up Layer::new layer_growth_algo	2024-09-06 15:10:41 -10:00
Jake Hillion	2c008b2afa	scx_layered: clean up Layer::new layer_growth_algo	2024-09-06 18:25:50 +01:00
Changwoo Min	36df970a8f	scx_lavd: add debug print for turbo cores Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-06 19:23:17 +09:00
Changwoo Min	351a1c6656	scx_lavd: enable autopilot mode by default Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-06 19:23:12 +09:00
Andrea Righi	8231f8586a	scx_rlfifo: better documentation and code readability Simplify scx_rlfifo code, add detailed documentation of the scx_rustland_core API and get rid of the additional task queue, since it just makes the code bigger, slower and it doesn't really provide any benefit (considering that we are dispatching the tasks in FIFO order anyway). Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-06 11:25:24 +02:00
Andrea Righi	ed879bae28	scx_rustland_core: expose enq_flags to user-space Pass the enqueue flags to the user-space scheduler through the QueuedTask struct. These flags allow the user-space scheduler to make more informed scheduling decisions. Also bump up scx_rustland_core minor version to reflect the new API (we are not breaking the old API, so we don't need to bump the major version in this case). Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-06 11:25:24 +02:00
Changwoo Min	ebe9375b6a	scx_lavd: pretty printing of status Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-06 16:27:20 +09:00
Changwoo Min	461cb9a3a0	scx_lavd: fix calculation of greedy_ratio The service time (taskc->svc_time) should be the sum of total CPU time consumed, not just a delta. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-06 16:22:40 +09:00
Tejun Heo	46fc2e1a49	version: v1.0.4	2024-09-05 18:12:45 -10:00
Tejun Heo	cd555741d0	rust: Synchronize dependency versions	2024-09-05 17:10:02 -10:00
Changwoo Min	e3243c5d51	Merge pull request #612 from multics69/lavd-monitor scx_lavd: add --monitor flag and two micro-optimizations	2024-09-06 09:33:55 +09:00
Changwoo Min	d9274bd8e6	scx_lavd: drop time slice boost for big cores Unexpectedly, little cores, which have relatively short time slices, have more chances to schedule performance-critical tasks. Hence, it is better to keep the time slice the same regardless of the core type. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-06 09:32:38 +09:00
Changwoo Min	fdecba227c	scx_lavd: print more info with --monitor Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-06 09:32:31 +09:00
Daniel Hodges	0fa369b914	Merge pull request #619 from hodgesds/stats-fixes scx_layered: Fix stats typo	2024-09-05 15:44:15 -04:00
Daniel Hodges	25e1642bbc	scx_layered: Fix stats typo Small typo fix Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-05 14:12:28 -04:00
Andrea Righi	918cfc613d	scx_bpfland: optimize producer/consumer workloads When selecting an idle CPU for a task that has been woken up, prioritize reusing the same CPU if the waker and wakee share the same L3 cache. Otherwise, attempt to migrate the wakee to the waker's CPU, provided it is allowed by the wakee's scheduling domain. This seems to consistently improve FPS performance when the system is not operating over its full capacity. Example: $ __GL_SYNC_TO_VBLANK=0 vblank_mode=0 glxgears -geometry 800x600 - before: ~18305.77 FPS - after: ~19060.62 FPS Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-05 19:02:09 +02:00
Andrea Righi	28050dcd7d	Merge pull request #615 from sched-ext/bpfland-auto scx_bpfland: enable "auto" mode by default	2024-09-05 19:01:50 +02:00
Daniel Hodges	e6ed9b05ba	Merge pull request #614 from hodgesds/layered-stats-fix scx_layered: Fix stats formatting	2024-09-05 12:54:56 -04:00
Andrea Righi	844c00fd26	scx_bpfland: enable "auto" mode by default Rename "turbo domain" to "preferred domain", which is conceptually more generic, and introduce the new option `--preferred-domain CPUMASK`, which allows users to define the preferred domain by specifying a cpumask as a hex number. By default ("auto") the scheduler will always try to detect and use the fastest CPUs in the system. Moreover, adjust the cpufreq logic to use "auto" both with the "balance_power" and "balance_performance" EPP profiles. Then, enable "auto" mode by default: the scheduler will try to automatically determine the optimal primary domain, preferred domain and cpufreq level, based on the selected scheduler and energy profiles. Tested-by: Piotr Gorski <piotr.gorski@cachyos.org> Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-05 16:11:12 +02:00
Daniel Hodges	76ad880475	scx_layered: Fix stats formatting Fix formatting precision of stats to have lower precision for readability. The existing formatting is hard to read: tot= 1538 local=31.27 open_idle= 2.73 affn_viol=23.80 proc=4ms busy= 1.1 util= 16.6 load= 32.7 fallback_cpu= 6 excl_coll=0.06501950585175553 excl_preempt=0.26007802340702213 excl_idle=0.16384915474642392 excl_wakeup=0.25097529258777634 With this fix, the stats are far more readable: tot= 441 local=33.56 open_idle= 0.00 affn_viol=20.63 proc=3ms busy= 0.4 util= 6.3 load= 33.6 fallback_cpu= 6 excl_coll=0.454 excl_preempt=0.000 excl_idle=0.132 excl_wakeup=0.200 Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-05 06:44:54 -04:00
Changwoo Min	f490a55d54	scx_lavd: accumulate more system-wide statistics Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-05 16:03:14 +09:00
Changwoo Min	e5d27d0553	scx_lavd: print basic system status when --monitor is given Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-05 16:03:14 +09:00
Changwoo Min	6b717a3f3d	scx_lavd: add --help-stats option Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-05 16:03:14 +09:00
Changwoo Min	ca1c86eb9c	scx_lavd: improve pick_idle_cpu() for pinned tasks When a pinned task cannot run on either active or overflow sets, we try to stay on the previous CPU which is still okay to run on. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-05 16:03:14 +09:00
Andrea Righi	afc7b5404b	Merge pull request #600 from sched-ext/bpfland-cpufreq scx_bpfland: improve cpufreq awareness	2024-09-05 07:32:10 +02:00
Tejun Heo	f010eda5c0	meson: Remove scheds/rust/*/meson.build These aren't used since `43950c65` ("build: Use workspace to group rust sub-projects"). Drop them.	2024-09-04 06:40:17 -10:00
Andrea Righi	c3cab45f6a	scx_rustland_core: bump up version to 2.0.1 Bump up scx_rustland_core version to include this critical fix that prevents scheduler stalls: `94a3594` ("scx_rustland_core: always dispatch per-cpu kthreads directly") Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-04 08:00:25 +02:00
Andrea Righi	918f1db4bd	scx_bpfland: dynamically adjust cpufreq level in auto mode In auto mode, rather than keeping the previous fixed cpuperf factor, dynamically calculate it based on CPU utilization and apply it before a task runs within its allocated time slot. Interactive tasks consistently receive the maximum scaling factor to ensure optimal performance. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-03 21:36:48 +02:00
Daniel Hodges	9c5717577f	Merge pull request #601 from hodgesds/namespace-helpers scx_helpers: Add pid namespace helpers	2024-09-03 14:38:26 -04:00
Daniel Hodges	8f4e9e5e3b	scx_helpers: Add pid namespace helpers Add pid namespace helpers for translating namespace pids. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-03 11:21:32 -07:00
Andrea Righi	fe6ac15015	scx_bpfland: improve turbo domain CPU selection Always consider the turbo domain when running in "auto" mode. Additionally, when the turbo domain is used, split the CPU idle selection logic into two stages: 1) in ops.select_cpu(), provide the task with a second opportunity to remain within the same LLC 2) in ops.enqueue(), perform another check for an idle CPU, allowing the task to move to a different LLC if an idle CPU within the same LLC is not available. This allows tasks to stick more on turbo-boosted CPUs and CPUs within the same LLC. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-03 09:59:29 +02:00
Andrea Righi	70b93ed641	scx_bpfland: skip idle CPU selection for tasks with changing affinity When tasks are changing CPU affinity it is pointless to try to find an optimal idle CPU. In this case just skip the idle CPU selection step and let the task be dispatched to a global DSQ if needed. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-03 09:59:29 +02:00
Andrea Righi	802d104b46	scx_bpfland: add basic cpufreq support Add hints for the cpufreq governor based on the selected scheduler's performance profile and the current energy performance preference (EPP). With this change applied the scheduler works as follows: scheduler profile (--primary-domain option): - default: - use all cores - cpufreq: use default scaling factor - powersave: - use E-cores - cpufreq: use min scaling factor - performance: - use P-cores - cpufreq: use max scaling factor - auto: - EPP: power, powersave - use E-cores - cpufreq: use min scaling factor - EPP: balance_power (typically battery-powered systems) - use E-cores - cpufreq: use default scaling factor - EPP: balance_performance, performance - use P-cores - cpufreq: use max scaling factor Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-03 09:59:29 +02:00
Andrea Righi	d0fb29a0f7	scx_rustland: aggressively prioritize interactive tasks scx_rustland was originally designed as a PoC to showcase the benefits of implementing specialized schedulers via sched_ext, focusing on a very specific use case: prioritize game responsiveness regardless of what runs in the background. Its original design was subsequently modified to better serve as a general-purpose scheduler, balancing the prioritization of interactive tasks with CPU-intensive ones to prevent over-prioritization. With scx_bpfland serving as a more "general-purpose" scheduler, it makes sense to revisit scx_rustland's original goal and make it much more aggressive at prioritizing interactive tasks, determined as a function of their average amount of context switches. This change makes scx_rustland again a really good PoC to showcase the benefits of having specialized schedulers, by focusing only on a very specific use case: provide a high and stable frames-per-second (fps) while a kernel build is running in the background. = Results = - Test: Run a WebGL application [1] while building the kernel (make -j32) - Hardware: 8-core Intel 11th Gen Intel(R) Core(TM) i7-1195G7 @ 2.90GHz +----------------------+--------+--------+ \| Scheduler \| avg fps\| stdev \| +----------------------+--------+--------+ \| EEVDF \| 28 \| 4.00 \| \| scx_rustland-before \| 43 \| 1.25 \| \| scx_rustland-after \| 60 \| 0.25 \| +----------------------+--------+--------+ [1] https://webglsamples.org/aquarium/aquarium.html Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-09-02 15:53:35 +02:00
Changwoo Min	172fe1efc6	Merge pull request #597 from multics69/lavd-turbo-tuning2 scx_lavd: misc updates (verifier, README, monitor option name, and micro-optimization)	2024-09-02 18:00:26 +09:00
Changwoo Min	0108b83050	scx_lavd: make the old verifier happy (bpf_cpumask_set_cpu) An old BPF verifier does not allow calling bpf_cpumask_set_cpu() in the BPF syscall context, so we defer actual bpf_cpumask_set_cpu() to the timer handler, update_sys_stat(), to work around the problem. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-02 18:00:12 +09:00
Changwoo Min	3bc2fd4977	scx_lavd: update README Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-02 18:00:12 +09:00
Changwoo Min	afbebaeed6	scx_lavd: check a core type of previous cpu at pick_idle_cpu() If a task is performance-critical, pick_idle_cpu() checks if the previous core is a big core or not. If not, don't try to run on previous core since a performance-critical task is better to run on a big core. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-01 17:28:16 +09:00
Changwoo Min	f2122c4197	Merge pull request #595 from multics69/lavd-turbo-tuning scx_lavd: improve the autopilot mode	2024-09-01 16:24:41 +09:00
Andrea Righi	1595445a63	Merge pull request #594 from sched-ext/scx-rustland-core-version-2 scx_rustland_core: bump up major version to 2.0.0	2024-09-01 08:57:32 +02:00
Changwoo Min	5ca4501139	scx_lavd: dynamically decide autopilot's low watermark A single threshold for a low watermark does not work well across systems with various numbers of cores and core types. Instead of using a single low watermark value, we dynamically decide the low watermark: 1) until one little core is fully utilized or 2) until two big cores are fully utilized. This works better across systems. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-09-01 12:46:57 +09:00
Andrea Righi	0aa71c832b	scx_rustland_core: bump up major version to 2.0.0 The scx_rustland_core API has been redesigned recently, breaking the compatibility with the past. Considering that Rust crates should update their major version when the previous API becomes incompatible [1], bump up the version to 2.0.0. [1] https://semver.org/ Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-31 23:23:26 +02:00
Andrea Righi	2cbf252019	scx_bpfland: directly dispatch only per-cpu kthreads with local_kthreads We want to directly dispatch only kthreads when local_kthreads is enabled, not all tasks that can run on a single CPU. Fixes: `7cc1846` ("scx_bpfland: always rely on prev_cpu with single-CPU tasks") Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-31 16:35:54 +02:00
Changwoo Min	4a7b806dd2	scx_lavd: when no_freq_scaling, always set to the max freq When the no_freq_scaling changes during runtime in the autopilot mode, the last target freq set would not be 1024. So the performance mode enabled by the autopilot mode would not run in the best profile. Hence, we set the target freq to 1024 always when no_freq_scaling is set. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-31 18:22:33 +09:00
Daniel Hodges	63a2eecce8	Merge pull request #592 from hodgesds/layered-ts-fixes scx_layered: Fix layer timeslice not being applied	2024-08-30 15:34:57 -04:00
Daniel Hodges	e04b612688	scx_layered: Fix layer timeslice not being applied Fix a small bug where the layer timeslice is not applied. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-30 11:53:42 -07:00
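
Note on 802d104b46 ("scx_bpfland: add basic cpufreq support"): the profile list in that commit message can be read as a lookup table. The following is a minimal Python sketch of that table only, not the scheduler's actual code. The names `Cores`, `Scaling`, `FIXED_PROFILES`, `AUTO_BY_EPP` and `resolve_profile` are hypothetical and chosen for illustration. It reflects the behaviour as described in that one commit; later commits in this log (for example 844c00fd26 and 918f1db4bd) change how "auto" mode picks the cpufreq level.

```python
from enum import Enum


class Cores(Enum):
    """Which set of cores the profile targets (hypothetical names)."""
    ALL = "all cores"
    E_CORES = "E-cores"
    P_CORES = "P-cores"


class Scaling(Enum):
    """cpufreq scaling factor hint (hypothetical names)."""
    MIN = "min"
    DEFAULT = "default"
    MAX = "max"


# Fixed profiles, as listed for the --primary-domain option in the commit message.
FIXED_PROFILES = {
    "default": (Cores.ALL, Scaling.DEFAULT),
    "powersave": (Cores.E_CORES, Scaling.MIN),
    "performance": (Cores.P_CORES, Scaling.MAX),
}

# In "auto" mode the choice depends on the energy performance preference (EPP).
AUTO_BY_EPP = {
    "power": (Cores.E_CORES, Scaling.MIN),
    "powersave": (Cores.E_CORES, Scaling.MIN),
    "balance_power": (Cores.E_CORES, Scaling.DEFAULT),
    "balance_performance": (Cores.P_CORES, Scaling.MAX),
    "performance": (Cores.P_CORES, Scaling.MAX),
}


def resolve_profile(profile, epp=None):
    """Return (cores, scaling) for a profile name; EPP is only used for "auto"."""
    if profile == "auto":
        if epp not in AUTO_BY_EPP:
            raise ValueError(f"unknown or missing EPP value: {epp!r}")
        return AUTO_BY_EPP[epp]
    if profile not in FIXED_PROFILES:
        raise ValueError(f"unknown profile: {profile!r}")
    return FIXED_PROFILES[profile]
```

For example, `resolve_profile("auto", "balance_power")` yields the E-cores with the default scaling factor, which is the "typically battery-powered systems" case in the commit message.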


949 Commits