JakeHillion/scx

mirror of https://github.com/JakeHillion/scx.git synced 2024-12-01 21:37:12 +00:00

Author	SHA1	Message	Date
David Vernet	a7b39f24e2	Merge pull request #165 from sched-ext/reduce-rust-build-load build: limit the maximum amount of parallel cargo build	2024-02-29 09:19:42 -06:00
Andrea Righi	274eb8b4d8	build: limit the maximum amount of parallel cargo build Each cargo build is already parallelized, spreading multiple rustc across all the available CPUs by default. Allowing to run multiple instances of cargo at the same time doesn't provide any benefit and it can only increase the risk of triggering OOM conditions or overloading the build system. Therefore, limit the amount of parallel cargo build instances to 1. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-29 15:25:32 +01:00
David Vernet	cf7404fb03	Merge pull request #164 from sirlucjan/services-update2 scx: update /etc/default/scx	2024-02-28 12:30:01 -06:00
Piotr Gorski	f87fe20de2	scx: update /etc/default/scx Signed-off-by: Piotr Gorski <lucjan.lucjanov@gmail.com>	2024-02-28 18:59:55 +01:00
David Vernet	7278d88632	Merge pull request #161 from sched-ext/scx-user Introduce scx_rustland_core: a generic layer to implement user-space schedulers in Rust	2024-02-28 10:57:19 -06:00
Andrea Righi	0d1c6555a4	scx_rustland_core: generate source files in-tree There is no need to generate source code in a temporary directory with RustLandBuilder(), we can simply generate code in-tree and exclude the generated source files from .gitignore. Having the generated source files in-tree can help to debug potential build issues (and it also allows to drop the the tempfile crate dependency). Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-28 17:49:44 +01:00
Andrea Righi	06d8170f9f	scx_utils: introduce Builder() Introduce a Builder() class in scx_utils that can be used by other scx crates (such as scx_rustland_core) to prevent code duplication. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-28 17:49:44 +01:00
Andrea Righi	2ac1a5924f	scx_rustland_core: introduce RustLandBuilder() Introduce a wrapper to scx_utils::BpfBuilder that can be used to build the BPF component provided by scx_rustland_core. The source of the BPF components (main.bpf.c) is included in the crate as an array of bytes, the content is then unpacked in a temporary file to perform the build. The RustLandBuilder() helper is also used to generate bpf.rs (that implements the low-level user-space Rust connector to the BPF commponent). Schedulers based on scx_rustland_core can simply use RustLandBuilder(), to build the backend provided by scx_rustland_core. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-28 17:49:44 +01:00
Andrea Righi	e23426e299	scx_rustland_core: introduce method bpf.update_tasks() Introduce a helper function to update the counter of queued and scheduled tasks (used to notify the BPF component if the user-space scheduler has still some pending work to do). Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-28 17:49:44 +01:00
Andrea Righi	00e25530bc	scx_rlfifo: simple user-space FIFO scheduler written in Rust Implement a FIFO scheduler as an example usage of scx_rustland_core. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-28 17:49:44 +01:00
Andrea Righi	cf43129d89	scx_rustland: update documentation scx_rustland has significantly evolved since its original design. With the introduction of scx_rustland_core and the inclusion of the scx_rlfifo example, scx_rustland's focus can be shifted from solely being an "easy-to-read Rust scheduler template" to a fully functional scheduler. For this reason, update the README and documentation to reflect its revised design, objectives, and intended use cases. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-28 17:49:44 +01:00
Andrea Righi	871a6c10f9	scx_rustland_core: include scx_rustland backend Move the BPF component of scx_rustland to scx_rustland_core and make it available to other user-space schedulers. NOTE: main.bpf.c and bpf.rs are not pre-compiled in the scx_rustland_core crate, they need to be included in the user-space scheduler's source code in order to be compiled/linked properly. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-28 17:49:44 +01:00
Andrea Righi	416d6a940f	rust: introduce scx_rustland_core crate Introduce a separate crate (scx_rustland_core) that can be used to implement sched-ext schedulers in Rust that run in user-space. This commit only provides the basic layout for the new crate and the abstraction to the custom allocator. In general, any scheduler that has a user-space component needs to use the custom allocator to prevent potential deadlock conditions, caused by page faults (a kthread needs to run to resolve the page fault, but the scheduler is blocked waiting for the user-space page fault to be resolved => deadlock). However, we don't want to necessarily enforce this constraint to all the existing Rust schedulers, some of them may do all user-space allocations in safe paths, hence the separate scx_rustland_core crate. Merging this code in scx_utils would force all the Rust schedulers to use the custom allocator. In a future commit the scx_rustland backend will be moved to scx_rustland_core, making it a totally generic BPF scheduler framework that can be used to implement user-space schedulers in Rust. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-28 17:49:44 +01:00
David Vernet	4dfb898a08	Merge pull request #159 from sched-ext/load_balancer Add new infeasible.rs crate	2024-02-26 11:15:50 -06:00
David Vernet	c71946e16a	Merge pull request #162 from jordalgo/cargo-gate Gate cargo build options behind 'enable_rust'	2024-02-26 11:07:26 -06:00
David Vernet	8b04a2687f	rusty: Use new infeasible crate Now that we have a new 'infeasible' crate that abstracts the logic for implementing the infeasible weights solution. Let's update rusty to use it. Signed-off-by: David Vernet <void@manifault.com>	2024-02-26 10:51:54 -06:00
David Vernet	b1dc37889f	infeasible: Add a new infeasible crate for load balancing We want to avoid every scheduler implementation from having to implement the solution to the infeasible weights problem, but we also want to enable sufficient flexibility where not every program has to have the same partition of scheduling domains, etc. To enable this, a new infeasible crate is added which encapsulates all of the logic for being given duty cycle and weight, and performing the necessary math to adjust for infeasibility. Signed-off-by: David Vernet <void@manifault.com>	2024-02-26 10:51:52 -06:00
Jordan Rome	9cedac45ee	Gate cargo build options behind 'enable_rust' This is to not have to require cargo when building the c schedulers.	2024-02-25 17:51:43 -08:00
David Vernet	869b1c2a4c	Merge pull request #160 from sched-ext/ci-exclude-example-scheds ci: eclude scx_qmap and scx_userland from testing	2024-02-24 11:55:30 -06:00
Andrea Righi	d1cfe1765d	ci: eclude scx_qmap and scx_userland from testing These two schedulers are provided mostly as examples / PoC, so we should exclude them from our periodic testing, to prevent triggering false positives in our CI. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-24 10:24:06 +01:00
Tejun Heo	77ee1cd41f	Merge pull request #158 from sched-ext/htejun sync-to-kernel.sh: Sync scx_central and scx_flatcg	2024-02-23 14:21:56 -10:00
Tejun Heo	7e578bb034	sync-to-kernel.sh: Sync scx_central and scx_flatcg They got added back as example schedulers in the kernel tree.	2024-02-23 14:21:03 -10:00
Tejun Heo	be9708aade	Merge pull request #157 from sched-ext/htejun common: Cosmetic change for consistency	2024-02-23 10:55:44 -10:00
Tejun Heo	b997b4ad17	common: Cosmetic change for consistency	2024-02-23 10:55:00 -10:00
David Vernet	70960ab162	Merge pull request #155 from sched-ext/rustland_refactor scx_rustland: cpu topology refactoring	2024-02-23 14:27:48 -06:00
Tejun Heo	948b2f0fc8	Merge pull request #156 from sched-ext/htejun scheds/sync-to-kernel.sh: Warn and skip if destination file is missing instead of failing	2024-02-23 09:32:38 -10:00
Tejun Heo	c789b4dd66	scheds/sync-to-kernel.sh: Warn and skip if destination file is missing instead of failing We aren't gonna sync all headers to the kernel tree.	2024-02-23 09:28:41 -10:00
David Vernet	87eab38506	rustland: Update rustland to use topology.rs The new topology crate allows us to replace the custom rustland topology logic with the logic in the topology crate itself. Signed-off-by: David Vernet <void@manifault.com>	2024-02-23 13:09:06 -06:00
David Vernet	f85cde7038	topology: Add maps to cores and cpus from root Topology object For convenience, let's provide callers with a way to easily look up cores and CPUs from the root topology object. Signed-off-by: David Vernet <void@manifault.com>	2024-02-23 13:09:02 -06:00
David Vernet	9904d03289	Merge pull request #154 from sched-ext/topology_refactor Topology refactor	2024-02-23 11:02:33 -06:00
David Vernet	43624a87ce	rusty: Use new topology crate Now that we have this new Topology crate, let's update Rusty to use it instead of using the old one. Signed-off-by: David Vernet <void@manifault.com>	2024-02-23 10:39:55 -06:00
David Vernet	c5a3b83bbd	topology: Add new topology crate The topology.rs crate is insufficiently generic, and reflects implementation details of scx_rusty more than it provides generic use cases for modeling a host's topology. This adds a new topology2.rs crate that will replace topology.rs. We have this as an intermediate commit so that we don't bundle updating scx_rusty with adding this crate. Signed-off-by: David Vernet <void@manifault.com>	2024-02-23 10:39:00 -06:00
David Vernet	608df7f96f	Merge pull request #152 from sched-ext/topology_refactor Add cpumask.rs crate	2024-02-22 18:07:55 -06:00
David Vernet	a2f531e429	topology: Refactor topology.rs to use cpumask crate Now that we have cpumask.rs, we can remove some logic from topology.rs and have it create and use Cpumasks. Signed-off-by: David Vernet <void@manifault.com>	2024-02-22 17:25:44 -06:00
David Vernet	bd15eb8e41	cpumask: Add new Cpumask crate Let's add a Cpumask trait that schedulers can use to avoid all having to deal directly with BitVec and the like. Signed-off-by: David Vernet <void@manifault.com>	2024-02-22 17:24:26 -06:00
Tejun Heo	4dc77f8ddf	Merge pull request #149 from davemarchevsky/davemarchevsky_nice_equals scx_layered: Add MATCH_NICE_EQUALS match kind	2024-02-22 06:38:17 -10:00
Dave Marchevsky	9f510f18cd	scx_layered: Add MATCH_NICE_EQUALS match kind I have a usecase where specific nice values are used to bucket tasks into groups that are handled separately by different `scx_layered` policies, with no implications of relative priority between niceness X, X + 1, X - 1, etc. In other words, nicevals are used as simple tags in this scenario. If we wanted to treat a specific niceness this way e.g. `11`, we could do so with AND'd MATCH_NICE_{ABOVE,BELOW} like so: ```json "matches" : [ [ { "NiceAbove": 10 }, { "NiceBelow": 12 }, ], ], ``` But this is unnecessarily verbose and doesn't communicate the intent of the match very well. Adding a `NiceEquals` matcher simplifies the config and makes intent obvious: ```json "matches" : [ [ { "NiceEquals": 11 }, ], ], ``` This PR adds support for such a matcher. Also, rename `layer_match.nice_above_or_below` to just `layer_match.nice`, as the former doesn't describe the newly-added matcher's use of the field. It's still obvious that `layer_match.nice` is relevant to MATCH_NICE_{ABOVE, BELOW, EQUALS}. Signed-off-by: Dave Marchevsky <davemarchevsky@fb.com>	2024-02-22 04:08:07 -08:00
Tejun Heo	b77996f9c5	Merge pull request #150 from sched-ext/refresh_after_attach Refresh after attach	2024-02-21 21:51:02 -10:00
David Vernet	615b594e1c	layered: Don't refresh cpumasks before attaching As mentioned in the previous commit, for some reason we're sometimes (non-deterministically) not seeing the updated cpumask / layer values in BPF if we initialize the cpumasks here before attaching. Let's undo this for now so to avoid observing buggy behavior, until we figure it out. Signed-off-by: David Vernet <void@manifault.com>	2024-02-21 19:19:45 -06:00
David Vernet	68d317079a	Revert "layered: Set layered cpumask in scheduler init call" This reverts commit `56ff3437a2`. For some reason we seem to be non-deterministically failing to see the updated layer values in BPF if we initialize before attaching. Let's just undo this specific part so that we can unblock this being broken, and we can figure it out async. Signed-off-by: David Vernet <void@manifault.com>	2024-02-21 19:17:19 -06:00
David Vernet	ebce76b0cb	Merge pull request #148 from sched-ext/layered_fixes layered: Fix static configuration, and dispatch for Grouped layers	2024-02-21 16:12:12 -06:00
David Vernet	31df8fbd09	layered: Consume from layer with cpumask in layered_dispatch Currently, in layered_dispatch, we do the following: 1. Iterate over all preempt=true layers, and first try to consume from them. 2. Iterate over all confined layers, and consume from them if we find a layer with a cpumask that contains the consuming CPU. 3. Iterate over all grouped and open layers and consume from them in ordered sequence. In (2), we're only iterating over confined layers, but we should also be iterating over grouped layers. Otherwise, despite a consuming CPU being allocated to a specific grouped layer, the CPU will consume from whichever grouped or open layer has a task that's ready to run. Signed-off-by: David Vernet <void@manifault.com>	2024-02-21 15:38:23 -06:00
David Vernet	56ff3437a2	layered: Set layered cpumask in scheduler init call In layered_init, we're currently setting all bits in every layers' cpumask, and then asynchronously updating the cpumasks at later time to reflect their actual values at runtime. Now that we're updating the layered code to initialize the cpumasks before we attach the scheduler, we can instead have the init path actually refresh and initialize the cpumasks directly. Signed-off-by: David Vernet <void@manifault.com>	2024-02-21 15:38:23 -06:00
David Vernet	1f834e7f94	layered: Initialize layers before attaching scheduler We currently have a bug in layered wherein we could fail to propagate layer updates from user space to kernel space if a layer is never adjusted after it's first initialized. For example, in the following configuration: [ { "name": "workload.slice", "comment": "main workload slice", "matches": [ [ { "CgroupPrefix": "workload.slice/" } ] ], "kind": { "Grouped": { "cpus_range": [30, 30], "util_range": [ 0.0, 1.0 ], "preempt": false } } }, { "name": "normal", "comment": "the rest", "matches": [ [] ], "kind": { "Grouped": { "cpus_range": [2, 2], "util_range": [ 0.0, 1.0 ], "preempt": false } } } ] Both layers are static, and need only be resized a single time, so the configuration would never be propagated to the kernel due to us never calling update_bpf_layer_cpumask(). Let's instead have the initialization propagate changes to the skeleton before we attach the scheduler. This has the advantage both of fixing the bug mentioned above where a static configuration is never propagated to the kernel, and that we don't have a short period when the scheduler is first attached where we don't make optimal scheduling decisions due to the layer resizing not being propagated. Signed-off-by: David Vernet <void@manifault.com>	2024-02-21 15:38:21 -06:00
David Vernet	59bd206559	Merge pull request #146 from sched-ext/rusty_topology	2024-02-21 08:33:25 -06:00
David Vernet	49065de8df	scx: Demote panic! to warn! in topology crate We currently panic! if we're building a Topology that detects more than two siblings on a physical core. This can and will likely happen on multi-socket machines. Given that we're planning to add support for detecting NUMA nodes soon, let's just demote the panic! to a warn!. Signed-off-by: David Vernet <void@manifault.com>	2024-02-20 21:01:17 -06:00
Tejun Heo	22d635c385	Merge pull request #141 from jordalgo/rusty-logging Add libbpf logging to rust schedulers	2024-02-20 13:52:39 -10:00
Tejun Heo	4c63c7884c	Merge pull request #145 from sched-ext/rustland-cmdline-options scx_rustland: additional comand line options	2024-02-20 13:51:59 -10:00
Andrea Righi	80de48ec83	scx_rustland: introduce --builtin-idle Add a command line option to enable/disable the sched-ext built-in idle selection logic in the user-space scheduler. With this option the user-space scheduler will try to dispatch tasks on the CPU selected during the .select_cpu() phase (using the built-in idle selection logic). Without this option the user-space scheduler will try to dispatch tasks to the first CPU available. The former can be useful to improve throughput, since tasks are more likely to stick on the same CPU, while the latter can provide better system responsiveness, especially when the system is significantly busy. Given that, by default, tasks can be dispatched directly bypassing the user-space scheduler if an idle CPU is found during .select_cpu(), the user-space scheduler is primarily engaged only when the system is busy (no idle CPUs are available). Under these circumstances, it is typically more efficient to dispatch tasks on the first available CPU. Hence, the default behavior is to ignore built-in idle selection logic in the user-space scheduler. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-21 00:25:14 +01:00
Andrea Righi	e487d71032	scx_rustland: simply CPU selection by relying on built-in idle selection Checking if a CPU is idle or busy in the user-space scheduler is a bit redundant, considering that we also rely on the built-in idle selection logic in the BPF part. Therefore get rid of the additional idle selection logic in the user-space scheduler and rely on the built-in idle selection. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-02-21 00:25:14 +01:00

... 2 3 4 5 6 ...

578 Commits