scx-upstream

mirror of https://github.com/sched-ext/scx.git synced 2024-12-14 20:37:16 +00:00

Author	SHA1	Message	Date
Daniel Hodges	30258cff1b	scx_layered: Update docs for layer_preempt_weight_disable Update docs for layer_preempt_weight_disable and layer_growth_weight_disable. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-09 06:37:54 -07:00
Daniel Hodges	edc673460d	scx_layered: Rename load_adj statistic Rename the `load_adj` statistic to `load_frac_adj`, which is a more accurate representation of what the statistic is calculating. The statistic is a fractional representation of the load of a layer adjusted for infeasible weights. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-09 06:23:37 -07:00
Jake Hillion	c23efb1ed3	Merge pull request #749 from JakeHillion/pr749 layered: split dispatch into no_topo version	2024-10-09 13:15:12 +00:00
Jake Hillion	19d09c3cc1	layered: split dispatch into no_topo version Refactor layered_dispatch into two functions: layered_dispatch_no_topo and layered_dispatch. layered_dispatch will delegate to layered_dispatch_no_topo in the disable_topology case. Although this code doesn't run when loaded by BPF due to the global constant bool blocking it, it makes the functions really hard to parse as a human. As they diverge more and more it makes sense to split them into separate manageable functions. This is basically a mechanical change. I duplicated the existing function, replaced all `disable_topology` with true in `no_topo` and false in the existing function, then removed all branches which can't be hit. Test plan: - Runs on my dev box (6.9.0 fbkernel) with `scx_layered --run-example -n`. - As above with `-t`. - CI.	2024-10-09 13:33:06 +01:00
Daniel Hodges	ef30517936	Merge pull request #758 from hodgesds/layered-ci-fixes scx_layered: Update CI to show stats	2024-10-09 12:30:34 +00:00
Daniel Hodges	feab01dd44	scx_layered: Update CI to show stats Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-09 05:18:04 -07:00
Daniel Hodges	2b5829e275	Merge pull request #763 from ryantimwilson/rusty-default-weights-fix [rusty] Fix load stats when host is under-utilized	2024-10-09 12:14:51 +00:00
likewhatevs	29bb3110ec	Merge pull request #765 from likewhatevs/update-dispatch scx_layered: enable configuring layer iteration when no topo	2024-10-09 06:22:40 +00:00
Pat Somaru	8e2f195af1	enable configuring layer iteration when no topo enable configuring layer iteration order in dispatch when topology is disabled. replace some member_vptr's in that iteration with regular accesses	2024-10-09 01:53:19 -04:00
Andrea Righi	e3e381dc8e	Merge pull request #755 from sched-ext/bpfland-prevent-kthread-stall scx_bpfland: prevent per-CPU DSQ stall with per-CPU kthreads	2024-10-09 05:28:59 +00:00
Ryan Wilson	fbdb6664ec	[rusty] Fix load stats when host is under-utilized	2024-10-08 21:08:07 -07:00
likewhatevs	0ed36a17da	Merge pull request #764 from likewhatevs/fix-ci-2 scx_layered: fix ci	2024-10-09 02:14:55 +00:00
Pat Somaru	59f35fcbec	update stress test settings to constants used in test_scheds	2024-10-08 22:08:13 -04:00
Pat Somaru	c90144d761	Revert "Merge pull request #746 from likewhatevs/layered-delay" This reverts commit `2077b9a799`, reversing changes made to `eb73005d07`.	2024-10-08 22:01:05 -04:00
likewhatevs	e1232b5efc	Merge pull request #760 from hodgesds/layered-example-fix scx_layered: Make stress-ng non exclusive in example	2024-10-08 20:48:18 -04:00
Daniel Hodges	e6773d43b1	scx_layered: Make stress-ng non exclusive in example Test CI hosts are VMs currently and making stress-ng exclusive may starve the host. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-08 10:49:51 -07:00
Daniel Hodges	66f967c06d	Merge pull request #756 from hodgesds/layered-example-stress scx_layered: Add stress-ng example layer	2024-10-08 15:31:44 +00:00
likewhatevs	e1f6c792fe	Merge pull request #757 from JakeHillion/pr757 layered: cleanup warnings in bpf compilation	2024-10-08 15:29:12 +00:00
Jake Hillion	85daa2be32	layered: cleanup warnings in bpf compilation clang is correctly warning that we use various uninitialised variables. clean these up so real errors are easier to read. The largest change here is to non-topological layered_dispatch. The matching_dsq logic seems to be incorrect. It checks whether an uninitialised variable is 0, if it is sets it, then only uses the variable if the value is 0. I have changed this to default to -1, then use the value if it is no longer -1.	2024-10-08 16:25:43 +01:00
Daniel Hodges	e0ddff1403	scx_layered: Add verbose output on stress tests Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-08 08:20:59 -07:00
Daniel Hodges	b803d59e1e	scx_layered: Add verbose output on CI logs Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-08 08:10:49 -07:00
Daniel Hodges	f3191afca7	scx_layered: Add stress-ng example layer Add a stress-ng example layer, which will be used for CI testing with stress-ng. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-08 07:56:54 -07:00
Andrea Righi	c8a9207371	scx_bpfland: prevent per-CPU DSQ stall with per-CPU kthreads Since per-CPU kthreads may show an inconsistent prev_cpu and/or cpumask, dispatch them directly to local DSQ and allow to preempt the current running task. This allows to prevent per-CPU kthread stalls and it also helps to prioritize them, as are usually important for system performance and responsiveness. Moreover, change the behavior of --local-kthreads to prioritize all kthreads when this option is used. This addresses issue #728. NOTE: ideally we may want to fix this in the kernel by making sure to always expose a consistent prev_cpu and cpumask also for kthreads, but at the moment this change allows to prevent some annoying stalls and performance-wise it doesn't seem to introduce any regression. In fact, the usual gaming/fps benchmarks show even a slight improvement in responsiveness with this change applied. Thanks to YUBY from the CachyOS community for all the extremely valuable help with the intensive stress tests. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-10-08 15:02:31 +02:00
Daniel Hodges	d7576d4b44	Merge pull request #754 from minosfuture/cpu_pool_doc scx_layered: Add doc comment to CpuPool	2024-10-08 12:22:55 +00:00
likewhatevs	2077b9a799	Merge pull request #746 from likewhatevs/layered-delay scx_layered: lighten/reduce nested loops in layered dispatch	2024-10-08 11:32:55 +00:00
Ming Yang	0dbb8c2374	scx_layered: Add doc comment to CpuPool Add doc comment to `CpuPool` as a quick reference for each member. Most importantly, differentiate "cpu" and "core", as logical core and physical core, respectively. Signed-off-by: Ming Yang <minos.future@gmail.com>	2024-10-07 21:48:46 -07:00
Pat Somaru	51d9e90d39	formatting	2024-10-07 18:54:30 -04:00
Pat Somaru	d2ac627942	formatting	2024-10-07 18:47:27 -04:00
Pat Somaru	3369836970	formatting	2024-10-07 18:44:44 -04:00
Pat Somaru	e0ce4711d4	flatten and simplify dispatch	2024-10-07 18:36:07 -04:00
Daniel Hodges	eb73005d07	Merge pull request #747 from hodgesds/layered-idle-order scx_layered: Update idle topology selection order	2024-10-07 20:01:38 +00:00
Daniel Hodges	95c0553913	Merge pull request #751 from ryantimwilson/hotplug-crash-bpf scx_rusty: Fix BPF crash during CPU hotplug	2024-10-07 19:09:21 +00:00
Ryan Wilson	a76778a4ab	scx_rusty: Fix BPF crash during CPU hotplug When hotplugging CPUs in rapid succession, scx_rusty would crash with: ``` scx_bpf_error (Failed to lookup dom[4294967295] ``` The root cause is if the scheduler is restarted fast enough, a task on a previously hotplugged CPU may not have moved off that CPU yet. Thus, the CPU -> domain map would contain an invalid domain (u32::max) and we would fail to lookup the domain correctly in rusty_select_cpu for prev_cpu. To fix this, if the CPU is offline, we do not try to allocate to the same NUMA node (assuming hotplug is a rare operation) beyond domestic domain. Instead we use greedy allocation - first idle, then busy - then any CPU.	2024-10-07 11:59:36 -07:00
Tejun Heo	75903f1031	Merge pull request #750 from hodgesds/ebuild-docs docs: Add Gentoo installation instructions	2024-10-07 18:24:45 +00:00
Daniel Hodges	cf25f887e2	docs: Add Gentoo installation instructions Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-07 11:22:10 -07:00
Daniel Hodges	0b497d6df0	scx_layered: Update idle topology selection order Update the idle topology selection order, the current logic is: core architecture (big/little) -> LLC -> NUMA -> Machine It's probably better to try to keep cache lines clean and do: LLC -> core architecture (big/little) -> NUMA -> Machine Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-07 10:34:11 -07:00
Tejun Heo	4979cb8762	Merge pull request #739 from CachyOS/feature/scx-loader-switch-sched scx_loader: Add SwitchScheduler methods to DBUS interface	2024-10-07 16:40:12 +00:00
Tejun Heo	1296de8cb3	Merge pull request #742 from sirlucjan/drop-journald2 Drop journald from libalpn hooks	2024-10-07 16:39:01 +00:00
Daniel Hodges	62b37b6d78	Merge pull request #745 from hodgesds/layered-topo-refactor scx_layered: Improve perf on non topo aware paths	2024-10-07 16:27:50 +00:00
Daniel Hodges	024a2aa658	scx_layered: Improve perf on non topo aware paths Improve the performance on non topology aware paths by skipping some map lookups and uneccessary initializations. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-07 07:56:18 -07:00
Daniel Hodges	198cfce940	Merge pull request #740 from hodgesds/layered-topo-smt scx_layered: Use idle smt mask for idle selection	2024-10-07 14:13:48 +00:00
Daniel Hodges	24fba4ab8d	scx_layered: Add idle smt layer configuration Add support for layer configuration for idle CPU selection. This allows layers to choose whether or not to restrict idle CPU selection to SMT idle CPUs. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-07 06:58:54 -07:00
Daniel Hodges	2f280ac025	scx_layered: Use idle smt mask for idle selection In the non topology aware code the idle smt mask is used for finding idle cpus. Update topology aware idle selection to also use the idle smt mask. In certain benchmarks this can improve performance. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-07 05:40:59 -07:00
Daniel Hodges	30feecc5ae	Merge pull request #743 from hodgesds/layered-big-little-mask scx_layered: Add big cpumask	2024-10-07 11:05:01 +00:00
Daniel Hodges	d86638ef0b	scx_layered: Add big cpumask Add big cpumask to scx_layered and prefer selecting big idle cores when using the BigLittle growth algo. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-10-06 14:05:12 -04:00
Andrea Righi	2801b109fd	Merge pull request #741 from sched-ext/bpfland-rework-lowlatency-mode scx_bpfland: rework lowlatency mode	2024-10-06 16:10:16 +00:00
Piotr Gorski	5414d052f1	Drop journald from libalpn hooks Signed-off-by: Piotr Gorski <lucjan.lucjanov@gmail.com>	2024-10-06 13:12:56 +02:00
Andrea Righi	9a29547e5b	scx_bpfland: rework lowlatency mode In lowlatency mode (option --lowlatency) tasks are ordered using a deadline that is evaluated as the vruntime minus a certain "bonus", determined in function of the max time slice and the average amount of voluntary context switches, to amplify the priority boost of the tasks that are voluntarily releasing the CPU (which are typically interactive). However, this method can be extremely unfair in some cases: tasks with short bursts of voluntary context switches may receive a huge priority boost, making the rest of the system almost unresponsive (see massive hackbench stress tests for example). To prevent this rework the task's deadline logic to use the vruntime and a "deadline component" that is a function of the average used time slice, scaled using a dynamic task priority (evaluated as the static task priority and the its average amount of voluntary context switches). This logic seems to prevent excessive prioritization of tasks performing short intensive bursts of voluntary context switches. It also makes lowlatency mode in scx_bpfland (somehow) more similar to the deadline logic used by scx_rusty. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-10-05 17:44:09 +02:00
Changwoo Min	a673dcf809	Merge pull request #736 from multics69/scx-futex-v1 scx_lavd: split main.bpf.c into multiple files	2024-10-05 13:11:15 +09:00
Vladislav Nepogodin	7bd61f4334	scx_loader: Add SwitchScheduler methods to DBUS interface These methods allow switching between different schedulers without requiring manual stopping and starting.	2024-10-05 02:57:17 +04:00

1 2 3 4 5 ...

2002 Commits