scx-upstream

mirror of https://github.com/sched-ext/scx.git synced 2024-12-02 23:37:12 +00:00

Author	SHA1	Message	Date
Andrea Righi	d67dfe50f9	scx_rustland: treat the CPU running the user-space scheduler as idle Considering the CPU where the user-space scheduler is running as busy doesn't really provide any benefit, since the user-space scheduler is constantly dispatching an amount of tasks equal to the amount of idle CPUs and then yields (therefore its own CPU should be considered idle). Considering the CPU where the user-space scheduler is running as busy doesn't provide any benefit, as the scheduler consistently dispatches tasks equal to the number of idle CPUs and then yields (therefore its own CPU should be considered idle). This also allows to reduce the overall user-space scheduler CPU utilization, especially when the system is mostly idle, without introducing any measurable performance regression. Measuring the average CPU utilization of a (mostly) idle system over a time period of 60 sec: - wihout this patch: 5.41% avg cpu util - with this patch: 2.26% avg cpu util Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-29 21:14:58 +01:00
Andrea Righi	05f5c69747	ci: use virtme-ng to test the schedulers Use virtme-ng to run the schedulers after they're built; virtme-ng allows to pick an arbitrary sched-ext enabled kernel and run it virtualizing the entire user-space root filesystem, so we can basically exceute the recompiled schedulers inside such kernel. This should allow to catch potential run-time issue in advance (both in the kernel and the schedulers). The sched-ext kernel is taken from the Ubuntu ppa (ppa:arighi/sched-ext) at the moment, since it is the easiest / fastest way to get a precompiled sched-ext kernel to run inside the Ubuntu 22.04 testing environment. The schedulers are tested using the new meson target "test_sched", the specific actions are defined in meson-scripts/test_sched. By default each test has a timeout of 30 sec, after the virtme-ng completes the boot (that should be enough to initialize the scheduler and run the scheduler for some seconds), while the total lifetime of the virtme-ng guest is set to 60 sec, after this time the guest will be killed (this allows to catch potential kernel crashes / hangs). If a single scheduler fails the test, the entire "test_sched" action will be interrupted and the overall test result will be considered a failure. At the moment scx_layered is excluded from the tests, because it requires a special configuration (we should probably pre-generate a default config in the workflow actions and change the scheduler to use the default config if it's executed without any argument). Moreover, scx_flatcg is also temporarily excluded from the tests, because of these known issues: - https://github.com/sched-ext/scx/issues/49 - https://github.com/sched-ext/sched_ext/pull/101 Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-29 15:54:10 +01:00
Andrea Righi	dbc8e23980	scx_userland: flush stdout when printing stats Periodically flush stdout to help following the scheduler progress during testing. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-29 15:53:12 +01:00
Andrea Righi	614a1ff901	scx_flatcg: flush stdout when printing stats Periodically flush stdout to help following the scheduler progress during testing. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-29 15:53:12 +01:00
Tejun Heo	3206464405	Merge pull request #55 from arighi/scx-rustland-doc scx_rustland: add documentation to scheds/rust/README.md	2023-12-29 17:35:09 +09:00
Andrea Righi	cc17780c24	scx_rustland: add documentation to scheds/rust/README.md Add documentation for scx_rustland to the README.md files of the Rust schedulers. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-29 09:13:54 +01:00
Tejun Heo	d2a173fc51	Merge pull request #53 from sched-ext/htejun Suppress the deprecation warning from bindgen and bump versions	2023-12-29 07:07:06 +09:00
Tejun Heo	98773131df	Bump versions to publish scx_utils fedora compat change	2023-12-29 06:58:45 +09:00
Tejun Heo	c47a4b6716	scx_utils: Explain what's going on with bindgen version and suppress deprecation warning This is a followup to https://github.com/sched-ext/scx/pull/50. See the comment in BpfBuilder::bindgen_bpf_intf() for details.	2023-12-29 06:56:07 +09:00
Tejun Heo	1d868dbf89	Merge pull request #50 from jordalgo/downgrade-bindgen Downgrade bindgen to 0.68	2023-12-29 06:28:20 +09:00
Tejun Heo	e230e86272	Merge pull request #52 from arighi/scx-rustland-update-idle scx_rustland: introduce update_idle callback	2023-12-29 06:10:40 +09:00
Andrea Righi	6df4d7e0c6	scx_rustland: introduce an update_idle() callback Move the logic to activate the userspace scheduler to an update_idle() callback, which is called when the CPU is about to go idle. This disables the built-in idle tracking mechanism, so it allows to rely completely on the internal CPU ownership logic (via get_cpu_owner() and set_cpu_owner()) and it also allows to share the idle state with the user-space scheduler via the BPF_MAP_TYPE_ARRAY cpu_map. Moreover, when the user-space scheduler is activated, kick the idle cpu to trigger immediate dispatch and avoid bubbles in the scheduling pipeline. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-28 14:41:08 +01:00
Andrea Righi	1baae38e7f	Revert "scx_rustland: always dispatch kthreads on the local CPU" This reverts commit `9237e1d` ("scx_rustland: always dispatch kthreads on the local CPU"). Do not always prioritize all kthreads, we may have unbound workqueue workers that can consume a lot of CPU cycles (e.g., encryption workers), so we definitely want to apply the scheduling for those. Therefore, restore the old behavior to prioritize only per-CPU kthreads. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-28 14:40:03 +01:00
Tejun Heo	990cd058fe	Merge pull request #48 from arighi/scx-rustland-userspace-interlocking scx_rustland: clarify and improve BPF / userspace interlocking	2023-12-28 08:26:55 +09:00
Jordan Rome	c8a721b033	Downgrade bindgen to 0.68 This is so we can package scx_utils into fedora without having to upgrade rust-bindgen (https://bodhi.fedoraproject.org/updates/FEDORA-2023-18e7f124e1). To make this happen we need to stop using the `CargoCallbacks::new` constructor which was added in 0.69. Old way seems legit according to the docs: https://rust-lang.github.io/rust-bindgen/non-system-libraries.html	2023-12-27 12:19:28 -08:00
Andrea Righi	9237e1d835	scx_rustland: always dispatch kthreads on the local CPU Adding extra overhead to any kthread can potentially slow down the entire system, so make sure this never happens by dispatching all kthreads directly on the same local CPU (not just the per-CPU kthreads), bypassing the user-space scheduler. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-27 14:15:46 +01:00
Andrea Righi	f0ece7af6b	scx_rustland: wake-up user-space scheduler when a CPU is released Trigger the user-space scheduler only upon a task's CPU release event (avoiding its activation during each enqueue event) and only if there are tasks waiting to be processed by the user-space scheduler. This should save unnecessary calls to the user-space scheduler, reducing the overall overhead of the scheduler. Moreover, rename nr_enqueues to nr_queued and store the amount of tasks currently queued to the user-space scheduler (that are waiting to be dispatched). Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-27 14:15:46 +01:00
Andrea Righi	7d01be9568	scx_rustland: provide get/set_cpu_owner() Provide the following primitives to get and set CPU ownership in the BPF part. This improves code readability and these primitives can be used by the BPF part as a baseline to implement a better CPU idle tracking in the future. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-27 14:15:39 +01:00
Andrea Righi	cd7e1c6248	scx_rustland: clarify BPF / user-space interlocking BPF doesn't have full memory model yet, and while strict atomicity might not be necessary in this context, it is advisable to enhance clarity in the interlocking model. To achieve this, provide the following primitives to operate on usersched_needed: static void set_usersched_needed(void) static bool test_and_clear_usersched_needed(void) Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-26 14:28:24 +01:00
Tejun Heo	8443d8ac16	Merge pull request #47 from arighi/scx-rustland-cpu scx_rustland improvements	2023-12-24 06:29:15 +09:00
Andrea Righi	e038a530ae	scx_rustland: dispatch tasks in batch Dispatch tasks in a batch equal to the amount of idle CPUs in the system. This allows to reduce the pressure on the dispatcher queues, improving the effectiveness of the scheduler (by having more tasks sitting in the scheduler task pool) and mitigating potential priority inversion issues. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-23 10:44:03 +01:00
Andrea Righi	4d98862674	scx_rustland: expose CPU information to the user-space scheduler Provide an interface for the BPF dispatcher and user-space scheduler to share CPU information. This information can empower the user-space scheduler to make more informed decisions and enable the implementation of a broader range of scheduling policies. With this change the BPF dispatcher provides a CPU map (one entry per CPU) that stores the pid that is running on each CPU (0 if the CPU is idle). The CPU map is updated by the BPF dispatcher in the .running() and .stopping() callbacks. The dispatcher then sends to the user-space scheduler a suggestion of the candidate CPU for each task that needs to run (that is always the previously used CPU), along with all the task's information. The user-space scheduler can decide to confirm the selected CPU or to choose a different one, using all the shared CPU information. Lastly, the selected CPU is communicated back to the dispatcher along with all the task's information and the BPF dispatcher takes care of executing the task on the selected CPU, eventually triggering a migration. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-23 10:38:56 +01:00
Andrea Righi	968ac80a3f	scx_rustland: handle graceful vs non-graceful exit Do not report an exit error message if it's empty. Moreover, distinguish between a graceful exit vs a non-graceful exit. In general, try to follow the behavior of user_exit_info.h for the C schedulers. NOTE: in the future the whole exit handling probably can be moved to a more generic place (scx_utils) to prevent code duplication across schedulers and also to prevent small inconsistencies like this one. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-22 19:44:14 +01:00
Tejun Heo	c7b52d485d	Merge pull request #45 from sirlucjan/0.1.3 Bump to 0.1.3	2023-12-22 08:50:45 +09:00
Piotr Gorski	c6eb66616f	Bump to 0.1.3 Signed-off-by: Piotr Gorski <lucjan.lucjanov@gmail.com>	2023-12-22 00:48:50 +01:00
Tejun Heo	d3e8e52b1a	Merge pull request #44 from arighi/scx-rustland scx_rustland: rename from scx_rustlite	2023-12-22 08:40:01 +09:00
Andrea Righi	f7f0e3236c	scx_rustland: rename from scx_rustlite Rename scx_rustlite to scx_rustland to better represent the mirroring of scx_userland (in C), but implemented in Rust. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-22 00:20:14 +01:00
David Vernet	4cadb92003	Merge pull request #38 from arighi/scx-rustlite scx_rustlite: simple vtime-based scheduler written in Rust	2023-12-21 13:31:52 -06:00
Andrea Righi	086c6dffc8	scx_rustlite: simple user-space scheduler written in Rust This scheduler is made of a BPF component (dispatcher) that implements the low level sched-ext functionalities and a user-space counterpart (scheduler), written in Rust, that implements the actual scheduling policy. The main goal of this scheduler is to be easy to read and well documented, so that newcomers (i.e., students, researchers, junior devs, etc.) can use this as a template to quickly experiment scheduling theory. For this reason the design of this scheduler is mostly focused on simplicity and code readability. Moreover, the BPF dispatcher is completely agnostic of the particular scheduling policy implemented by the user-space scheduler. For this reason developers that are willing to use this scheduler to experiment scheduling policies should be able to simply modify the Rust component, without having to deal with any internal kernel / BPF details. Future improvements: - Transfer the responsibility of determining the CPU for executing a particular task to the user-space scheduler. Right now this logic is still fully implemented in the BPF part and the user-space scheduler can only decide the order of execution of the tasks, that significantly restricts the scheduling policies that can be implemented in the user-space scheduler. - Experiment the possibility to send tasks from the user-space scheduler to the BPF dispatcher using a batch size, instead of draining the task queue completely and sending all the tasks at once every single time. A batch size should help to reduce the overhead and it should also help to reduce the wakeups of the user-space scheduler. Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2023-12-21 18:53:30 +01:00
Tejun Heo	cfb41a77fc	Merge pull request #43 from sched-ext/ubuntu_2204_ci ci: Run CI job on Ubuntu 22.04	2023-12-21 06:55:45 +09:00
David Vernet	1bf04d0972	ci: Run CI job on Ubuntu 22.04 Andrea pointed out that we can and should be using Ubuntu 22.04. Unfortunately it still doesn't ship some of the deps we need like clang-17, but it does at least ship virtme-ng, so it's good for us to use this so that we can actually test running the schedulers in a virtme-ng VM when it supports being run in docker. Also, update the job to run on pushes, and not just when a PR is opened Suggested-by: Andrea Righi <andrea.righi@canonical.com> Signed-off-by: David Vernet <void@manifault.com>	2023-12-20 15:29:49 -06:00
Tejun Heo	79b0c3ea89	Merge pull request #41 from multics69/link-blogs Update README for additional resources (blog posts and articles)	2023-12-19 16:57:32 +09:00
Changwoo Min	23cecf2532	Update README for additional resources (blog posts and articles)	2023-12-19 12:28:44 +09:00
David Vernet	eb7b3c99f0	Merge pull request #40 from sched-ext/ci scx: Add CI action that builds schedulers for PRs	2023-12-18 21:17:47 -06:00
David Vernet	4523b10e45	scx: Add CI action that builds schedulers for PRs When Ubuntu ships with sched_ext, we can also maybe test loading the schedulers (not sure if the runners can run as root though). For now, we should at least have a CI job that lets us verify that the schedulers can _build_. To that end, this patch adds a basic CI action that builds the schedulers. As is, this is a bit brittle in that we're having to manually download and install a few dependencies. I don't see a better way for now without hosting our own runners with our own containers, but that's a bigger investment. For now, hopefully this will get us _some_ coverage. Signed-off-by: David Vernet <void@manifault.com>	2023-12-18 21:12:50 -06:00
Tejun Heo	3049d60883	Merge pull request #39 from sched-ext/nest_fixes Fix some things in Nest	2023-12-18 13:15:41 -10:00
David Vernet	318c06fa9c	nest: Skip out of idle cpu selection on exec() path The core sched code calls select_task_rq() in a few places: the task wakeup path (typical path), the fork() path, and the exec() path. For nest scheduling, we don't want to select a core from the nest on the exec() path. If we were previously able to find an idle core, we would have found it on the fork() path, so we don't gain much by checking on the exec() path. In fact, it's actually harmful, because we could incorrectly blow up the primary nest unnecessarily by bumping the same task between multiple cores for no reason. Let's just opt-out of select_task_rq calls on the exec() path. Suggested-by: Julia Lawall <julia.lawall@inria.fr> Signed-off-by: David Vernet <void@manifault.com>	2023-12-18 13:51:15 -06:00
David Vernet	ab0e36f9ce	scx_nest: Apply r_impatient if no task is found in primary nest Julia pointed out that our current implementation of r_impatient is incorrect. r_impatient is meant to be a mechanism for more aggressively growing the primary nest if a task repeatedly isn't able to find a core. Right now, we trigger r_impatient if we're not able to find an attached or previous core in the primary nest, but we _should_ be triggering it only if we're unable to find _any_ core in the primary nest. Fixing the implementation to do this drastically decreases how aggressively we grow the primary nest when r_impatient is in effect. Reported-by: Julia Lawall <julia.lawall@inria.fr> Signed-off-by: David Vernet <void@manifault.com>	2023-12-18 11:05:36 -06:00
Tejun Heo	239d5d1d2c	Merge pull request #37 from jordalgo/folder-restructure Restructure scheds folder names	2023-12-17 15:15:54 -10:00
Jordan Rome	e9a9d32ab6	Restructure scheds folder names - combine c and kernel-examples as it's confusing to have both - rename 'rust-user' and 'c-user' to just 'rust' and 'c', which is simpler - update and fix sync-to-kernel.sh	2023-12-17 13:14:31 -08:00
Tejun Heo	52381a9764	Merge pull request #36 from danielocfb/topic/libbpf-rs-update rust: Update libbpf-rs & libbpf-cargo to 0.22	2023-12-14 12:41:20 -10:00
Daniel Müller	fed1dae9da	rust: Update libbpf-rs & libbpf-cargo to 0.22 This is a follow on to #32, which got reverted. I wrongly assumed that scx_rusty resides in the sched_ext tree and consumes published version of scx_utils. With this change we update the other in-tree dependencies. I built scx_layered & scx_rusty. I bumped scx-utils to 0.4, because the libbpf-cargo seems to be part of the public API surface and libbpf-cargo 0.21 and 0.22 are not compatible with each other. Signed-off-by: Daniel Müller <deso@posteo.net>	2023-12-14 14:33:58 -08:00
Tejun Heo	185ad61ad8	Merge pull request #34 from jordalgo/readme-meson-build Update meson install in readme	2023-12-14 10:34:43 -10:00
Tejun Heo	300bd192b5	Merge pull request #35 from sched-ext/htejun Revert "scx_utils: Update libbpf-cargo to 0.22"	2023-12-14 10:33:56 -10:00
Tejun Heo	995520e415	Revert "scx_utils: Update libbpf-cargo to 0.22" This reverts commit `6d8ba3e3d7`. Piotr Gorski reports build breakage after the commit: https://paste.cachyos.org/p/aa14709.txt	2023-12-14 10:32:25 -10:00
Jordan Rome	8b02b41f51	Update meson install in readme	2023-12-14 12:26:51 -08:00
Tejun Heo	4f5ca907ac	Merge pull request #32 from danielocfb/topic/libbpf-cargo-0.22 scx_utils: Update libbpf-cargo to 0.22	2023-12-14 09:28:47 -10:00
Daniel Müller	6d8ba3e3d7	scx_utils: Update libbpf-cargo to 0.22 It's the latest and greatest. Given the repository setup, this is likely a breaking change from a semver perspective.	2023-12-14 11:18:27 -08:00
David Vernet	b8f70fa09a	Merge pull request #31 from jordalgo/minor-rusty-refactor minor refactor of scx_rusty	2023-12-14 09:35:18 -06:00
Jordan Rome	ba35b97bb7	minor refactor of scx_rusty	2023-12-14 07:33:53 -08:00

... 3 4 5 6 7 ...

351 Commits