JakeHillion/scx

mirror of https://github.com/JakeHillion/scx.git synced 2024-12-03 22:37:11 +00:00

Author	SHA1	Message	Date
Tejun Heo	d01b49bd0e	scx_layered: Fix verification failure `4fccc06905` ("scx_layered: Fix uninitialized variable") causes the following verification failure. Fix it by moving assignments below range checking. Validating match_layer() func#1... 283: R1=scalar() R2=scalar() R3=mem_or_null(id=49,sz=1) R10=fp0 ; int match_layer(u32 layer_id, pid_t pid, const char *cgrp_path) @ main.bpf.c:1029 283: (7b) *(u64 *)(r10 -24) = r3 ; R3=mem_or_null(id=49,sz=1) R10=fp0 fp-24_w=mem_or_null(id=49,sz=1) 284: (bc) w7 = w1 ; R1=scalar() R7_w=scalar(smin=0,smax=umax=0xffffffff,var_off=(0x0; 0xffffffff)) ; struct layer *layer = &layers[layer_id]; @ main.bpf.c:1033 285: (bc) w1 = w7 ; R1_w=scalar(id=50,smin=0,smax=umax=0xffffffff,var_off=(0x0; 0xffffffff)) R7_w=scalar(id=50,smin=0,smax=umax=0xffffffff,var_off=(0x0; 0xffffffff)) 286: (27) r1 *= 1061192 ; R1_w=scalar(smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) 287: (18) r8 = 0xffffc90002a26000 ; R8_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080) 289: (0f) r8 += r1 ; R1_w=scalar(smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) R8_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080,smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) ; u32 nr_match_ors = layer->nr_match_ors; @ main.bpf.c:1034 290: (bf) r1 = r8 ; R1_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080,smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) R8_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080,smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) 291: (07) r1 += 1060992 ; R1_w=map_value(map=bpf_bpf.bss,ks=4,vs=16979080,off=0x103080,smin=0,smax=umax=0x103147ffefceb8,smax32=0x7ffffff8,umax32=0xfffffff8,var_off=(0x0; 0x1ffffffffffff8)) 292: (61) r1 = *(u32 *)(r1 +0) R1 unbounded memory access, make sure to bounds check any such access processed 1099 insns (limit 1000000) max_states_per_insn 2 total_states 72 peak_states 72 mark_read 9 -- END PROG LOAD LOG --	2024-08-19 13:18:20 -10:00
Tejun Heo	3498a2b899	Merge pull request #514 from sched-ext/htejun/scx_stats scx_stats, scx_layered: Implement independent stats client sessions	2024-08-19 11:24:53 -10:00
Tejun Heo	f6bc52d31e	scx_layered: Make --monitor behavior more useful - If --monitor is specified with layer specs, the scheduler also starts stats monitoring on a thread. - Standalone monitoring mode no longer exits when the scheduler isn't there.	2024-08-19 10:55:02 -10:00
Tejun Heo	d03e48eb75	scx_layered: Implement per-stats-client nr_layer_cpus_ranges tracking With this, every client sees the correct nr_layer_cpus_ranges without interfering with each other.	2024-08-19 09:12:51 -10:00
Tejun Heo	448aacfd60	scx_layered: Initialize Stats.prev_layer_cycles properly on new() So that new stats session doesn't start with an inflated utilization number.	2024-08-19 08:40:40 -10:00
Tejun Heo	25d7e6f787	scx_layered: Implement on-demand statistics generation Instead of keeping one copy of sched_stats, each stats server session carries their own so that stats can be generated independently by each client at any interval. CPU allocation min/max tracking is broken for now.	2024-08-19 08:27:36 -10:00
Tejun Heo	27c530e17e	scx_stats: Add missing trait exports	2024-08-19 07:16:43 -10:00
Tejun Heo	0cf5ca605d	scx_layered: Move processing_dur accounting into Stats and protect it with Arc<Mutex<>>	2024-08-19 06:25:23 -10:00
Tejun Heo	a77fe372d6	scx_stats: Make server shutdown when connection is dropped and add communication channel This will make implementing connection sessions easier where each stats client connection maintains a set of states.	2024-08-19 06:23:16 -10:00
I Hsin Cheng	4fccc06905	scx_layered: Fix uninitialized variable Fix the uninitialized variable "layer" in the function match_layer which caused the compiling process to fail. "layer" is supposed to be the same as "&layers[layer_id]". Signed-off-by: I Hsin Cheng <richard120310@gmail.com>	2024-08-17 23:32:53 +08:00
Tejun Heo	3a688cfde7	scx_stats: Add support for no-value user attributes and a bunch of other changes - Allow no-value user attributes which are automatically assigned "true" when specified. - Make "top" attribute string "true" instead of bool true for consistency. Testing for existence is always enough for value-less attributes. - Don't drop leading "_" from user attribute names when storing in dicts. Dropping makes things more confusing. - Add "_om_skip" to scx_layered fields which don't jive well with OM. scxstats_to_openmetrics.py is updated accordingly and no longer generates warnings on those fields. - Examples and README updated accordingly.	2024-08-16 07:52:02 -10:00
Tejun Heo	c16b48d7b2	scheds/rust: Include Cargo.lock in the repo Binary packages are expected to include Cargo.lock in the repo so that the produced binaries match across different builds.	2024-08-15 23:08:35 -10:00
Tejun Heo	22167aeb14	Merge pull request #502 from sched-ext/htejun/scx_stats scx_stats: Refine scx_stats and implement scxstats_to_openmetrics.py	2024-08-15 22:55:11 -10:00
Tejun Heo	570ca56c57	scx_layered: s/_om_field_prefix/_om_prefix/	2024-08-15 21:29:58 -10:00
Tejun Heo	af01dd19ec	Merge pull request #500 from sched-ext/htejun/scx_stats scx_stats, scx_layered: Add `om_prefix` attribute and fix s/stat/stats/ stragglers	2024-08-15 21:27:38 -10:00
Tejun Heo	ea453e51d3	scx_stats: Rename "all" attribute to "top" and clean up examples a bit	2024-08-15 21:24:55 -10:00
Tejun Heo	a910fa451a	scx_layered: Add _om attributes to LayerStats for OpenMetrics piping	2024-08-15 19:11:49 -10:00
Tejun Heo	6a5d6f7c27	scx_stats: Replace field_prefix attribute with '_' prefixed user attributes	2024-08-15 19:09:59 -10:00
Tejun Heo	a9922deaa2	scx_stats: Add "all" attribute and rename metadata type strings	2024-08-15 14:50:00 -10:00
Tejun Heo	ebc1a89c34	scx_stats: s/stat/stats/ stragglers	2024-08-15 14:00:00 -10:00
Tejun Heo	bafd67b568	scx_stats: Fix parsing for multiple stat attributes The code was assuming single attribute per #[stat()] block. Update it so that there can be multiple comma separated attributes in a single block.	2024-08-15 13:46:20 -10:00
Tejun Heo	8f361af077	scx_layered: Shorten stat field descriptions	2024-08-15 13:25:48 -10:00
Tejun Heo	1912e05f0b	Merge pull request #499 from sched-ext/htejun/scx_stats scx_stats: Misc changes to sync dep versions and publish on crates.io	2024-08-15 12:32:44 -10:00
Tejun Heo	0b9c8b5cbd	scx_stats: Update versions to 0.2.0 to republish	2024-08-15 12:29:27 -10:00
Daniel Hodges	0319afc88e	scx_layered: Update nr_cpus when resizing layers After updating scx_layered to be topology aware the nr_cpus field on the layer was not being updated properly. Update layer growing/shrinking logic to correctly update the nr_cpus count. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-15 13:22:26 -07:00
Tejun Heo	b614cf848f	scx_layered: Make monitor time based iterations dumber This makes ctrl-c a bit more responsive without complicating code.	2024-08-15 09:23:29 -10:00
Tejun Heo	45fb724ee2	scx_layered: Restore cpumask reporting	2024-08-15 09:12:29 -10:00
Tejun Heo	751a38e34e	scx_layered: Refactor stats printing code	2024-08-15 08:53:19 -10:00
Tejun Heo	a4f424056e	scx_layered: Move stats server launching to stats.rs	2024-08-15 06:30:42 -10:00
Tejun Heo	17afc72479	scx_stats: Rename cleanups - s/stat/stats/ on several stragglers. - Rename traits so that they are more distinctive from struct and other names and follow the convention.	2024-08-15 06:24:56 -10:00
Tejun Heo	a091d5ea7d	scx_layered: s/monitor.rs/stats.rs/ and make stats refresh code struct ops	2024-08-15 06:13:05 -10:00
Tejun Heo	8aae9a5de2	scx_stats: s/scx_stat/scx_stats/ Use plural form which is more widespread and also used in scheduler implementations. No functional changes.	2024-08-15 05:31:34 -10:00
Tejun Heo	6e466d18df	scx_layered: Initial switch to scx_stat - This makes the scheduler side simpler and allows on-demand monitoring. - OpenMetrics support is dropped for now. Will add a generic tool for it. - This is a naive conversion. Will be further refined. scx_layered no longer prints statistics by default. To watch statistics, run `scx_layered --monitor` while the scheduler is running.	2024-08-14 13:48:41 -10:00
Tejun Heo	7820ec9b46	scx_stat, scx_layered: cargo fmt	2024-08-14 11:47:37 -10:00
Daniel Hodges	646cefd46d	Merge pull request #477 from hodgesds/layered-global-match scx_rusty: Make layer matching a global function	2024-08-12 09:14:58 -04:00
Daniel Hodges	be5213e129	scx_rusty: Make layer matching a global function Layer matching currently takes a large number of bpf instructions. Moving layer matching to a global function will reduce the overall instruction count and allow for other layer matching methods such as glob. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-12 05:44:34 -07:00
Tejun Heo	63c4a0191f	Merge branch 'main' into topic/inlined-skeleton-members	2024-08-08 14:23:37 -10:00
Tejun Heo	cd6a4d72c7	Bump versions for 1.0.2 release	2024-08-08 14:10:16 -10:00
Tejun Heo	7c3ffe96e1	Unify crate dependency versions Different sub-projects are using different versions for the same crates. Synchronize them to the latest.	2024-08-08 13:26:47 -10:00
Daniel Hodges	d5efcd3245	scx_layered: Fix cred declaration The use of the cred struct should be const. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-06 05:22:12 -07:00
Daniel Hodges	1f922b9a73	scx_layered: Add support for disabling topology awareness Add a parameter to disable topology awareness. This is useful when trying to compare the scheduling performance of topology aware scheduling compared to the previous scheduling strategy. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-02 08:07:19 -07:00
Daniel Hodges	de7b5fe190	scx_layered: Fix dispatch fallback CPU selection When the previous CPU for a task is not known do not fall back to dispatching to CPU 0, use the current CPU. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-31 12:35:22 -07:00
Daniel Hodges	4f12bebaa5	scx_layered: Add per cpu layer iterator offset Add a per cpu counter offset to round robin when iterating on layers. This is to make selection from different layers more fair. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-30 10:44:41 -07:00
Daniel Hodges	4c3fd6cd9b	scx_layered: Rename UserId and GroupId TLDR; rename UserId and GroupId to UIDEquals and GIDEquals. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-24 15:09:08 -07:00
Daniel Hodges	55f6d68eef	scx_layered: Add user and group layers Add a layer match based on either the effective user id or the effective group id. This allows for creating layers for individual users or groups. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-24 15:09:08 -07:00
Daniel Hodges	2803f9c127	scx_layered: Fix formatting issues Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-24 14:39:02 -07:00
Daniel Hodges	0814abf0b8	scx_layered: Add node topology awareness Add NUMA node topology awareness for scx_layered. This borrows some of the NUMA handling from scx_rusty and allows layers to set a node mask. Different layer kinds will use the node mask differently. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-24 09:53:48 -07:00
David Vernet	4f11e2abe2	layered: Don't dispatch to LO_FALLBACK_DSQ Non-kthreads with custom affinities in non-open layers are dispatched into a LO_FALLBACK_DSQ, with the idea being that they're penalized for their custom affinities. When a host is fully utilized, these tasks can end up being starved due to LO_FALLBACK_DSQ being consumed only when there are no other layers to consume from. In internal workloads at Meta, we've observed that this can happen in practice. Longer term, we can probably address this by implementing layer weights and applying that to fallback DSQs to avoid starvation. For now, let's just dispatch them to HI_FALLBACK_DSQ to avoid this starvation issue. Signed-off-by: David Vernet <void@manifault.com>	2024-07-19 19:14:18 -05:00
Daniel Hodges	b98a9f56a8	scx_layered: Add separate module for metrics Refactor the main module for scx_layered to move metrics into a separate module. This change makes no functional difference; it only changes code structure. This will make it a little easier to navigate the logic in the main scheduler code. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-19 07:40:24 -07:00
Daniel Müller	565aec3662	rust: Update libbpf-rs & libbpf-cargo to 0.24 Update libbpf-rs & libbpf-cargo to 0.24. Among other things, generated skeletons now contain directly accessible map and program objects, no longer necessitating the use of accessor methods. As a result, the risk for mutability conflicts is reduced greatly. Signed-off-by: Daniel Müller <deso@posteo.net>	2024-07-16 11:48:52 -07:00


131 Commits