JakeHillion/scx

mirror of https://github.com/JakeHillion/scx.git synced 2024-12-01 21:37:12 +00:00

Author	SHA1	Message	Date
Daniel Hodges	632fcfe4ae	Merge pull request #648 from hodgesds/layered-llc-stats scx_layered: Add stats for XNUMA/XLLC migrations	2024-09-12 13:23:23 -04:00
Daniel Hodges	dde6e0c7f9	scx_utils: Add node/llc id to core topology Add ids for node/llc in the Core topology struct.	2024-09-12 10:05:02 -07:00
Daniel Hodges	aee19dd9a1	scx_layered: Add topology aware core growth selection Add topology aware core growth selection. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-12 06:48:51 -07:00
Daniel Hodges	14a19dc3ca	scx_layered: Add random layer growth algo Add a random layer growth algo. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-12 05:35:54 -07:00
Daniel Hodges	ae57f8d1f9	scx_rusty: Initialize node cpumask Initialize the node cpumask, which was previously uninitialized causing metric calculations to be wrong when attempting to lookup CPUs in the node cpumask. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-11 13:14:44 -07:00
Jake Hillion	8ca45cfa37	lint: enable cargo fmt (#643 ) Use `cargo fmt` with a specific nightly branch in the CI to enforce formatting. Globally format these files while the diff is still small so we can stay on top of it. Test plan: - CI lint check passes.	2024-09-11 10:03:20 +01:00
Daniel Hodges	43ec8bfe82	scx_layered: Add stats for XNUMA/XLLC migrations Add stats for XNUMA/XLLC migrations. An example of the output is shown: ``` hodgesd : util/frac= 5.4/ 0.1 load/frac= 301.0/ 0.3 tasks= 476 tot= 3168 local=97.82 wake/exp/reenq= 2.18/ 0.00/ 0.00 keep/max/busy= 0.03/ 0.00/ 0.03 kick= 0.00 yield/ign= 0.09/ 0 open_idle= 0.00 mig= 6.82 xnuma_mig= 6.82 xllc_mig= 4.86 affn_viol= 0.00 preempt/first/idle/fail= 0.00/ 0.00/ 0.00/ 0.00 min_exec= 0.00/ 0.00ms cpus= 2 [ 2, 4] 00000000 00000010 00001000 normal : util/frac= 28.7/ 0.7 load/frac= 101704.7/ 95.8 tasks= 2450 tot= 4660 local=99.06 wake/exp/reenq= 0.88/ 0.06/ 0.00 keep/max/busy= 1.03/ 0.00/ 0.00 kick= 0.06 yield/ign= 0.04/ 400 open_idle=15.73 mig=23.45 xnuma_mig=23.45 xllc_mig= 3.07 affn_viol= 0.00 preempt/first/idle/fail= 0.00/ 0.00/ 0.00/ 0.88 min_exec= 0.00/ 0.00ms cpus= 2 [ 2, 2] 00000001 00000100 00000000 excl_coll=12.55 excl_preempt= 0.00 random : util/frac= 0.0/ 0.0 load/frac= 0.0/ 0.0 tasks= 0 tot= 0 local= 0.00 wake/exp/reenq= 0.00/ 0.00/ 0.00 keep/max/busy= 0.00/ 0.00/ 0.00 kick= 0.00 yield/ign= 0.00/ 0 open_idle= 0.00 mig= 0.00 xnuma_mig= 0.00 xllc_mig= 0.00 affn_viol= 0.00 preempt/first/idle/fail= 0.00/ 0.00/ 0.00/ 0.00 min_exec= 0.00/ 0.00ms cpus= 0 [ 0, 0] 00000000 00000000 00000000 excl_coll= 0.00 excl_preempt= 0.00 stress-ng: util/frac= 4189.1/ 99.2 load/frac= 4200.0/ 4.0 tasks= 43 tot= 62 local= 0.00 wake/exp/reenq= 0.00/100.0/ 0.00 keep/max/busy=2433.9/177.4/ 0.00 kick=100.0 yield/ign= 3.23/ 0 open_idle= 0.00 mig=54.84 xnuma_mig=54.84 xllc_mig=35.48 affn_viol= 0.00 preempt/first/idle/fail= 0.00/ 0.00/ 0.00/ 0.00 min_exec= 0.00/ 0.00ms cpus= 4 [ 4, 4] 00000300 00030000 00000000 excl_coll= 0.00 excl_preempt= 0.00 ``` Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-10 19:53:28 -07:00
likewhatevs	85863d0e1c	Merge pull request #644 from hodgesds/layered-topo-order scx_layered: Pass layer spec for core growth algo	2024-09-10 14:49:37 -04:00
Daniel Hodges	5fdd257862	scx_layered: Pass layer spec for core growth algo Pass in the layer spec when determining the layer core growth algo. This should make it easier to implement layer growth algos that are spec specific. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-10 10:27:08 -07:00
Samuel Nair	c6af1aa1c8	scx_layered: Fix typo in stats	2024-09-10 08:44:57 -07:00
Jake Hillion	2c008b2afa	scx_layered: clean up Layer::new layer_growth_algo	2024-09-06 18:25:50 +01:00
Tejun Heo	46fc2e1a49	version: v1.0.4	2024-09-05 18:12:45 -10:00
Daniel Hodges	0fa369b914	Merge pull request #619 from hodgesds/stats-fixes scx_layered: Fix stats typo	2024-09-05 15:44:15 -04:00
Daniel Hodges	25e1642bbc	scx_layered: Fix stats typo Small typo fix Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-05 14:12:28 -04:00
Daniel Hodges	76ad880475	scx_layered: Fix stats formatting Fix formatting precision of stats to have lower precision for readability. The existing formatting is hard to read: tot= 1538 local=31.27 open_idle= 2.73 affn_viol=23.80 proc=4ms busy= 1.1 util= 16.6 load= 32.7 fallback_cpu= 6 excl_coll=0.06501950585175553 excl_preempt=0.26007802340702213 excl_idle=0.16384915474642392 excl_wakeup=0.25097529258777634 With this fix stats are far more readable formatting: tot= 441 local=33.56 open_idle= 0.00 affn_viol=20.63 proc=3ms busy= 0.4 util= 6.3 load= 33.6 fallback_cpu= 6 excl_coll=0.454 excl_preempt=0.000 excl_idle=0.132 excl_wakeup=0.200 Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-09-05 06:44:54 -04:00
Tejun Heo	f010eda5c0	meson: Remove scheds/rust/*/meson.build These aren't used since `43950c65` ("build: Use workspace to group rust sub-projects"). Drop them.	2024-09-04 06:40:17 -10:00
Daniel Hodges	e04b612688	scx_layered: Fix layer timeslice not being applied Fix a small bug where the layer timeslice is not applied. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-30 11:53:42 -07:00
Daniel Hodges	47184e9d19	Merge pull request #582 from hodgesds/layered-growth-interface scx_layered: Add layer growth config	2024-08-29 18:49:59 -04:00
Daniel Hodges	7e0329e45c	scx_layered: Add layer growth config Add a per layer config for different implementations of layer growth algorithms. Convert the existing default logic into a default layer growth algorithm and add a linear implementation. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-28 19:17:24 -07:00
Daniel Hodges	cf765562c7	scx_layered: Update docs for layer slice setting Add docs for layer slice setting. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-28 22:12:07 -04:00
Daniel Hodges	a23308e7b0	scx_layered: Add more docs on tuning Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-28 12:38:05 -07:00
Daniel Hodges	96326b1ef3	scx_layered: Add additional docs Add some additional docs on tuning layered. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-28 12:27:26 -07:00
Daniel Hodges	cc450f1a4b	scx_layered: Add per layer timeslice Allow setting a different timeslice per layer. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-28 11:21:03 -07:00
Daniel Hodges	c511b42b7b	scx_layered: Make verification easier on older kernels Refactor some BPF code to make verification easier on older kernels. This is to make it easier to maintain backports. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-28 08:05:10 -07:00
Daniel Hodges	41cebb807a	Merge pull request #569 from anh0516/main scx_layered: Clean up in-code documentation; add commas for consistency	2024-08-27 09:47:29 -04:00
Avraham Hollander	7a43801d76	Add quotes for clarity	2024-08-26 13:20:01 -04:00
Avraham Hollander	07039f1f07	scx_layered: Documentation cleanup	2024-08-26 13:03:52 -04:00
Daniel Hodges	c45c2de39f	scx_layered: Update help for tgid matching Forgot to add doc for tgid matching Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-26 07:06:21 -07:00
Tejun Heo	43950c65bd	build: Use workspace to group rust sub-projects meson build script was building each rust sub-project under rust/ and scheds/rust/ separately. This means that each rust project is built independently which leads to a couple problems - 1. There are a lot of shared dependencies but they have to be built over and over again for each proejct. 2. Concurrency management becomes sad - we either have to unleash multiple cargo builds at the same time possibly thrashing the system or build one by one. We've been trying to solve this from meson side in vain. Thankfully, in issue #546, @vimproved suggested using cargo workspace which makes the sub-projects share the same target directory and built together by the same cargo instance while still allowing each project to behave independently for development and publishing purposes. Make the following changes: - Create two cargo workspaces - one under rust/, the other under scheds/rust/. Each contains all rust projects underneath it. - Don't let meson descend into rust/. These are libraries used by the rust schedulers. No need to build them from meson. Cargo will build them as needed. - Change the rust_scheds build target to invoke `cargo build` in scheds/rust/ and let cargo do its thing. - Remove per-scheduler meson.build files and instead generate custom_targets in scheds/rust/meson.build which invokes `cargo build -p $SCHED`. - This changes rust binary directory. Update README and meson-scripts/install_rust_user_scheds accordingly. - Remove per-scheduler Cargo.lock as scheds/rust/Cargo.lock is shared by all schedulers now. - Unify .gitignore handling. The followings are build times on Ryzen 3975W: Before: ________________________________________________________ Executed in 165.93 secs fish external usr time 40.55 mins 2.71 millis 40.55 mins sys time 3.34 mins 36.40 millis 3.34 mins After: ________________________________________________________ Executed in 36.04 secs fish external usr time 336.42 secs 0.00 millis 336.42 secs sys time 36.65 secs 43.95 millis 36.61 secs Wallclock time is reduced 5x and CPU time 7x.	2024-08-25 00:47:58 -10:00
Tejun Heo	625381280c	scx_stats: Shorten exported names and add prelude module Let's make it a bit easier to use: - Shorten exported names by changing the prefix from ScxStats to Stats. This should be distinctive enough and more inline with how most libraries name their exports. - Importing the right set of traits can be tricky. Introduce prelude module so that importing is a bit less painful.	2024-08-24 22:04:25 -10:00
Tejun Heo	1bba713a29	Merge pull request #542 from sched-ext/htejun/scx_stats scx_stats, scx_rusty, scx_layered: Implement `--help-stats`	2024-08-24 15:38:36 -10:00
Daniel Hodges	5a2012763e	scx_layered: Add layer match for tgid Add layer match for tgid. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-23 23:00:28 -04:00
Tejun Heo	25e437753c	scx_layered, scx_rusty: Implement --help-stats which shows all the defined stats. While at it, make some cosmetic updates.	2024-08-23 12:39:47 -10:00
Tejun Heo	405bcc63fe	scx_stats: Make ScxStatsServerData a public carrier of data needed for stats server And move related ops into it. This is a bit more natural and will also allow doing other operaitons (e.g. describing stats) without launching the server.	2024-08-23 12:23:57 -10:00
Tejun Heo	9e3b4e6db0	scx_stats: A bit of cleanups and renames	2024-08-23 09:09:02 -10:00
Tejun Heo	b6ccb87bec	Merge pull request #539 from sched-ext/htejun/scx_rusty scx_rusty: Convert to scx_stats	2024-08-23 08:42:47 -10:00
Tejun Heo	8c8912ccea	Merge branch 'main' into htejun/scx_rusty	2024-08-23 07:50:23 -10:00
Tejun Heo	44a0f1b124	scx_utils: Factor out monitor_stats() from scx_rusty and scx_layered	2024-08-23 06:46:19 -10:00
Tejun Heo	ae3024e938	scx_layered: Add --stats and make --monitor behavior consistent with scx_rusty	2024-08-23 05:52:52 -10:00
Daniel Hodges	11b978a892	scx_layered: Add pid/ppid matches Add matches for pid/ppid. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-23 07:20:05 -07:00
Tejun Heo	d6ac5fbd9c	scx_layered: Drop SCX_OPS_ENQ_LAST The meaning of SCX_OPS_ENQ_LAST will change with future kernel updates and enqueueing on local DSQ will no longer be sufficient to avoid stalls. No reason to do it anyway. Just drop it.	2024-08-21 13:13:59 -10:00
Tejun Heo	4d1f0639d8	Version: v1.0.3	2024-08-21 06:42:11 -10:00
Daniel Hodges	f2a6661a85	Merge pull request #524 from hodgesds/layered-core-fixes scx_layered: Fix core selection	2024-08-21 08:13:33 -04:00
Daniel Hodges	4d1c932619	scx_layered: Fix core selection Fix a bug introduced in #510 where it assumed core ids are incremental. This refactors the core ordering for layers to be far more simple and provide some space for layer core isolation in low utilization. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-20 19:26:53 -07:00
Tejun Heo	c0418250f4	scx_layered: Add --run-example option So that scx_layered can be run in CI environment in a single command.	2024-08-19 20:50:10 -10:00
Daniel Hodges	05a2721f8e	Merge pull request #510 from hodgesds/layered-core-topo-selection scx_layered: Use topology for core selection	2024-08-19 20:01:16 -04:00
Tejun Heo	d01b49bd0e	scx_layered: Fix verification failure `4fccc06905` ("scx_layered: Fix uninitialized variable") causes the following verification failure. Fix it by moving assignments below range checking. Validating match_layer() func#1... 283: R1=scalar() R2=scalar() R3=mem_or_null(id=49,sz=1) R10=fp0 ; int match_layer(u32 layer_id, pid_t pid, const char cgrp_path) @ main.bpf.c:1029 283: (7b) (u64 )(r10 -24) = r3 ; R3=mem_or_null(id=49,sz=1) R10=fp0 fp-24_w=mem_or_null(id=49,sz=1) 284: (bc) w7 = w1 ; R1=scalar() R7_w=scalar(smin=0,smax=umax=0xffffffff,var_off=(0x0; 0xffffffff)) ; struct layer layer = &layers[layer_id]; @ main.bpf.c:1033 285: (bc) w1 = w7 ; R1_w=scalar(id=50,smin=0,smax=umax=0xffffffff,var_off=(0x0; 0xffffffff)) R7_w=scalar(id=50,smin=0,smax=umax=0xffffffff,var_off=(0x0; 0xffffffff)) 286: (27) r1 = 1061192 ; R1_w=scalar(smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) 287: (18) r8 = 0xffffc90002a26000 ; R8_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080) 289: (0f) r8 += r1 ; R1_w=scalar(smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) R8_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080,smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) ; u32 nr_match_ors = layer->nr_match_ors; @ main.bpf.c:1034 290: (bf) r1 = r8 ; R1_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080,smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) R8_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080,smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) 291: (07) r1 += 1060992 ; R1_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080,off=0x103080,smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) 292: (61) r1 = (u32 *)(r1 +0) R1 unbounded memory access, make sure to bounds check any such access processed 1099 insns (limit 1000000) max_states_per_insn 2 total_states 72 peak_states 72 mark_read 9 -- END PROG LOAD LOG --	2024-08-19 13:18:20 -10:00
Daniel Hodges	b3793e0069	scx_layered: Use topology for core selection Currently the core selection logic in scx_layered uses the first available core in the bitmask. This is suboptimal when the scheduler is configured with specific NUMA/LLC restrictions. The ideal core selection logic should try to find the least used cores within the preferred scheduling domain and allocate new cpus from shared cores within that domain. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-19 15:51:35 -07:00
Tejun Heo	3498a2b899	Merge pull request #514 from sched-ext/htejun/scx_stats scx_stats, scx_layered: Implement independent stats client sessions	2024-08-19 11:24:53 -10:00
Tejun Heo	f6bc52d31e	scx_layered: Make --monitor behavior more useful - If --monitor is specified with layer specs, the scheduler also starts stats monitoring on a thread. - Standalone monitoring mode no longer exits when the scheduler isn't there.	2024-08-19 10:55:02 -10:00

1 2 3 4

178 Commits