JakeHillion/scx

mirror of https://github.com/JakeHillion/scx.git synced 2024-12-01 21:37:12 +00:00

Author	SHA1	Message	Date
Tejun Heo	43950c65bd	build: Use workspace to group rust sub-projects meson build script was building each rust sub-project under rust/ and scheds/rust/ separately. This means that each rust project is built independently which leads to a couple problems - 1. There are a lot of shared dependencies but they have to be built over and over again for each proejct. 2. Concurrency management becomes sad - we either have to unleash multiple cargo builds at the same time possibly thrashing the system or build one by one. We've been trying to solve this from meson side in vain. Thankfully, in issue #546, @vimproved suggested using cargo workspace which makes the sub-projects share the same target directory and built together by the same cargo instance while still allowing each project to behave independently for development and publishing purposes. Make the following changes: - Create two cargo workspaces - one under rust/, the other under scheds/rust/. Each contains all rust projects underneath it. - Don't let meson descend into rust/. These are libraries used by the rust schedulers. No need to build them from meson. Cargo will build them as needed. - Change the rust_scheds build target to invoke `cargo build` in scheds/rust/ and let cargo do its thing. - Remove per-scheduler meson.build files and instead generate custom_targets in scheds/rust/meson.build which invokes `cargo build -p $SCHED`. - This changes rust binary directory. Update README and meson-scripts/install_rust_user_scheds accordingly. - Remove per-scheduler Cargo.lock as scheds/rust/Cargo.lock is shared by all schedulers now. - Unify .gitignore handling. The followings are build times on Ryzen 3975W: Before: ________________________________________________________ Executed in 165.93 secs fish external usr time 40.55 mins 2.71 millis 40.55 mins sys time 3.34 mins 36.40 millis 3.34 mins After: ________________________________________________________ Executed in 36.04 secs fish external usr time 336.42 secs 0.00 millis 336.42 secs sys time 36.65 secs 43.95 millis 36.61 secs Wallclock time is reduced 5x and CPU time 7x.	2024-08-25 00:47:58 -10:00
Tejun Heo	625381280c	scx_stats: Shorten exported names and add prelude module Let's make it a bit easier to use: - Shorten exported names by changing the prefix from ScxStats to Stats. This should be distinctive enough and more inline with how most libraries name their exports. - Importing the right set of traits can be tricky. Introduce prelude module so that importing is a bit less painful.	2024-08-24 22:04:25 -10:00
Tejun Heo	1bba713a29	Merge pull request #542 from sched-ext/htejun/scx_stats scx_stats, scx_rusty, scx_layered: Implement `--help-stats`	2024-08-24 15:38:36 -10:00
Avraham Hollander	66b5dd0de9	Clean up scx_rusty help info a bit	2024-08-24 11:56:12 -04:00
Tejun Heo	25e437753c	scx_layered, scx_rusty: Implement --help-stats which shows all the defined stats. While at it, make some cosmetic updates.	2024-08-23 12:39:47 -10:00
Tejun Heo	405bcc63fe	scx_stats: Make ScxStatsServerData a public carrier of data needed for stats server And move related ops into it. This is a bit more natural and will also allow doing other operaitons (e.g. describing stats) without launching the server.	2024-08-23 12:23:57 -10:00
Tejun Heo	9e3b4e6db0	scx_stats: A bit of cleanups and renames	2024-08-23 09:09:02 -10:00
Tejun Heo	8c8912ccea	Merge branch 'main' into htejun/scx_rusty	2024-08-23 07:50:23 -10:00
Tejun Heo	44a0f1b124	scx_utils: Factor out monitor_stats() from scx_rusty and scx_layered	2024-08-23 06:46:19 -10:00
Tejun Heo	0f04a93dd1	scx_rusty: Add stat descriptions and make minor adjustments	2024-08-23 05:46:13 -10:00
Tejun Heo	36865234f8	scx_rusty: Add scx_stats annotations necessary for openmetrics translation	2024-08-23 04:59:08 -10:00
Tejun Heo	2f3f473cd3	scx_rusty: Improve timestamp reporting	2024-08-23 04:31:27 -10:00
Tejun Heo	76934f3aab	scx_rusty: Convert to scx_stats This allows scx_rusty to avoid generating excessive logs for statistics while still allowing detailed monitoring on demand.	2024-08-22 19:44:12 -10:00
Tejun Heo	16c07a5cd9	scx_rusty: Don't reset bpf_stats, remember prev states and calculate delta This will ease transition to scx_stats.	2024-08-22 13:02:23 -10:00
Tejun Heo	13fa48a871	scx_rusty: Separate out stats generation and formatting to prepare for scx_stats conversion.	2024-08-22 10:03:10 -10:00
Tejun Heo	b4564520e5	scx_rusty: Simplify Stats structs and take id out of the structs to prepare for scx_stats conversion. While at it, make some cosmetic changes.	2024-08-22 08:45:33 -10:00
Tejun Heo	4834dec684	scx_rusty: Move stats structs to stats.rs and rename for consistency	2024-08-21 22:04:38 -10:00
Tejun Heo	f726f0b73b	Version: Cargo.lock	2024-08-21 06:45:19 -10:00
Tejun Heo	4d1f0639d8	Version: v1.0.3	2024-08-21 06:42:11 -10:00
Tejun Heo	5cf4212330	Revert "rusty: Integrate stats with the metrics framework" This reverts commit `83373b1f4e` in prepration for converting to scx_stats.	2024-08-20 21:59:25 -10:00
Tejun Heo	516a7590db	scx_rusty: Revert log_recorder conversion scx_rusty will be converted to scx_stats in a similar fashin with scx_layered. Undo log_recorder conversion in preparation.	2024-08-20 21:59:20 -10:00
Tejun Heo	1da249f063	scx_utils::topology: Always use NR_CPU_IDS and NR_CPUS_POSSIBLE Always use the LazyLock versions and drop the counterparts from Topology.	2024-08-20 21:57:56 -10:00
Tejun Heo	f7c193e528	scx_utils, scx_rusty: Minor updates to version handling - Update scx_utils/build.rs so that 12 char SHA1 is generated instead of full one. - Add --version to scx_rusty. Use custom one as we don't want to use the default cargo version one.	2024-08-20 21:03:05 -10:00
Tejun Heo	8f786be08f	scx_rusty: cargo fmt	2024-08-20 21:03:05 -10:00
Tejun Heo	4440567949	scx_rusty: Update Cargo.lock	2024-08-20 21:03:05 -10:00
I Hsin Cheng	5d85937842	scx_rusty: Fix typo Signed-off-by: I Hsin Cheng <richard120310@gmail.com>	2024-08-16 22:03:59 +08:00
Tejun Heo	c16b48d7b2	scheds/rust: Include Cargo.lock in the repo Binary packages are expected to include Cargo.lock in the repo so that the produced binaries match across different builds.	2024-08-15 23:08:35 -10:00
I Hsin Cheng	15b40de408	scx_rusty: Fix logical error when filtering tasks The logic of tasks filtering were moved from find_first_candidate() into a vector filter operation in commit `1c3b563`. However, it was forgotten to transfer the logic with "NOT" since now .filter() will populate the tasks we want, rather than .skip_while() which was throwing unwanted tasks out. That's why the logic here should be reverse so we won't take kworker or migrated tasks into considerations. Signed-off-by: I Hsin Cheng <richard120310@gmail.com>	2024-08-10 22:56:20 +08:00
Tejun Heo	63c4a0191f	Merge branch 'main' into topic/inlined-skeleton-members	2024-08-08 14:23:37 -10:00
Tejun Heo	cd6a4d72c7	Bump versions for 1.0.2 release	2024-08-08 14:10:16 -10:00
Tejun Heo	7c3ffe96e1	Unify crate dependency versions Different sub-projects are using different versions for the same crates. Synchronize them to the latest.	2024-08-08 13:26:47 -10:00
David Vernet	5401876430	Revert "rusty: Rework deadline as a signed sum"	2024-07-25 14:50:45 -05:00
David Vernet	09536aa15d	Merge pull request #309 from sched-ext/rusty_improved_dl rusty: Rework deadline as a signed sum	2024-07-25 13:44:54 -05:00
David Vernet	c1ad602ce5	rusty: Transfer latency priority between CPU-intensive and interactive tasks In some scenarios, a CPU-intensive task may be on the critical path for interactive workloads. For example, you may have a game with CPU-intensive tasks that are crunching the logic for the game, and that's required for the game to proceed without being choppy. To support such workflows, this change adds logic to allow a non-interactive task to inherit the lower (i.e. stronger) latency priority of another task if it wakes or is woken by that task. Signed-off-by: David Vernet <void@manifault.com>	2024-07-25 11:55:40 -05:00
David Vernet	933ea9baa1	rusty: Rework deadline as a signed sum Currently, a task's deadline is computed as its vtime + a scaled function of its average runtime (with its deadline being scaled down if it's more interactive). This makes sense intuitively, as we do want an interactive task to have an earlier deadline, but it also has some flaws. For one thing, we're currently ignoring duty cycle when determining a task's deadline. This has a few implications. Firstly, because we reward tasks with higher waker and blocked frequencies due to considering them to be part of a work chain, we implicitly penalize tasks that rarely ever use the CPU because those frequencies are low. While those tasks are likely not part of a work chain, they also should get an interactivity boost just by pure virtue of not using the CPU very often. This should in theory be addressed by vruntime, but because we cap the amount of vtime that a task can accumulate to one slice, it may not be adequately reflected after a task runs for the first time. Another problem is that we're minimizing a task's deadline if it's interactive, but we're also not really penalizing a task that's a super CPU hog by increasing its deadline. We sort of do a bit by applying a higher niceness which gives it a higher deadline for a lower weight, but its somewhat minimal considering that we're using niceness, and that the best an interactive task can do is minimize its deadline to near zero relative to its vtime. What we really want to do is "negatively" scale an interactive task's deadline with the same magnitude as we "positively" scale a CPU-hogging task's deadline. To do this, we make two major changes to how we compute deadline: 1. Instead of using niceness, we now instead use our own straightforward scaling factor. This was chosen arbitrarily to be a scaling by 1000, but we can and should improve this in the future. 2. We now create a _signed_ linear latency priority factor as a sum of the three following inputs: - Work-chain factor (log_2 of product of blocked freq and waker freq) - Inverse duty cycle factor (log_2 of the inverse of a task's duty cycle -- higher duty cycle means lower factor) - Average runtime factor (Higher avg runtime means higher average runtime factor) We then compute the latency priority as: lat_prio := Average runtime factor - (work-chain factor + duty cycle factor) This gives us a signed value that can be negative. With this, we can compute a non-negative weight value by calculating a weight from the absolute value of lat_prio, and use this to scale slice_ns. If lat_prio is negative we calculate a task's deadline as its vtime MINUS its scaled slice_ns, and if it's positive, it's the task's vtime PLUS scaled slice_ns. This ends up working well because you get a higher weight both for highly interactive tasks, and highly CPU-hogging / non-interactive tasks, which lets you scale a task's deadline "more negatively" for interactive tasks, and "more positively" for the CPU hogs. With this change, we get a significant improvement in FPS. On a 7950X, if I run the following workload: $ stress-ng -c $((8 * $(nproc))) 1. I get 60 FPS when playing Stellaris (while time is progressing at max speed), whereas EEVDF gets 6-7 FPS. 2. I get ~15-40 FPS while playing Civ6, whereas EEVDF seems to get < 1 FPS. The Civ6 benchmark doesn't even start after over 4 minutes in the initial frame with EEVDF, but gets us 13s / turn with rusty. 3. It seems that EEVDF has improved with Terraria in v6.9. It was able to maintain ~30-55 FPS, as opposed to the ~5-10FPS we've seen in the past. rusty is still able to maintain a solid 60-62FPS consistently with no problem, however.	2024-07-25 11:55:03 -05:00
Daniel Müller	98af514972	scx_rusty: Simplify LoadBalancer::populate_tasks_by_load() Simplify LoadBalancer::populate_tasks_by_load() by cutting out the heap allocation bits, by moving mutable accesses in front of immutable ones. Because multiple immutable accesses (between bss and rodata) do not conflict, we don't need the intermediate PID storage. Signed-off-by: Daniel Müller <deso@posteo.net>	2024-07-23 13:59:26 -07:00
I Hsin Cheng	2525b94af4	scx_rusty: Remove unused variable Remove unused variable "has_preferred_dom". Signed-off-by: I Hsin Cheng <richard120310@gmail.com>	2024-07-17 20:30:17 +08:00
I Hsin Cheng	bf2f0fbf35	scx_rusty: Remove skip_while in find_first_candidate Followed commit `1c3b563`, move the checking of task.migrated.get() into the vector filter. In this way, we can remove the skip_while() call in find_first_candidate(). Signed-off-by: I Hsin Cheng <richard120310@gmail.com>	2024-07-17 20:27:12 +08:00
Daniel Müller	565aec3662	rust: Update libbpf-rs & libbpf-cargo to 0.24 Update libbpf-rs & libbpf-cargo to 0.24. Among other things, generated skeletons now contain directly accessible map and program objects, no longer necessitating the use of accessor methods. As a result, the risk for mutability conflicts is reduced greatly. Signed-off-by: Daniel Müller <deso@posteo.net>	2024-07-16 11:48:52 -07:00
Daniel Hodges	27122a8a00	scx_rusty: refactor mempolicy handling bpf code and load balancing This change refactors some of the helper methods for getting the preferred node for tasks using mempolicy. The load balancing logic in try_find_move_task is updated to allow for a filter, which is used to filter for tasks with a preferred mempolicy. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-16 09:40:00 -07:00
Daniel Hodges	43a263aa75	scx_rusty: Use preferred node mask with balancer Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-16 08:11:19 -07:00
Daniel Hodges	bab6e9523c	scx_rusty: Add mempolicy checks to rusty This change makes scx_rusty mempolicy aware. When a process uses set_mempolicy it can change NUMA memory preferences and cause performance issues when tasks are scheduled on remote NUMA nodes. This change modifies task_pick_domain to use the new helper method that returns the preferred node id. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-07-16 08:11:19 -07:00
I Hsin Cheng	1c3b563caf	scx_rusty: Pre-check task domain mask with pull domain mask Instead of performing domain mask checking inside "find_first_candidate()" every time, check whether the tasks within push domain are abled to run on pull domain by performing the mask check at vector generation stage. This way can also avoid repeated computation generated by the same (task, pull_dom) pair as they'll try to check whether the pull domain is in the task domain mask. Also since whether a task is a kworker won't change in time, we can perform the check earlier and put it in the filter, too. Signed-off-by: I Hsin Cheng <richard120310@gmail.com>	2024-07-16 21:48:06 +08:00
Tejun Heo	51334b5c4d	Bump versions for 1.0.1 release	2024-07-15 13:21:52 -10:00
Tejun Heo	761ec142ce	Bump most versions to 1.0.0 sched_ext is about to be merged upstream. There are some compatibility breaking changes and we're making the current sched_ext/for-6.11 1edab907b57d ("sched_ext/scx_qmap: Pick idle CPU for direct dispatch on !wakeup enqueues") the baseline. Tag everything except scx_mitosis as 1.0.0. As scx_mitosis is still in early development and is currently temporarily disabled, only the patchlevel is bumped.	2024-07-12 11:34:14 -10:00
Andrea Righi	d76551bbd3	scx_rusty: fix stats map initialization The stats map in scx_rusty is a BPF_MAP_TYPE_PERCPU_ARRAY, with its size determined by num_possible_cpus(). Initializing it with nr_cpu_ids() can result in errors such as: Error: Failed to zero stat Caused by: number of values 6 != number of cpus 8 Fix by using num_possible_cpus() to initialize it. Fixes: `263e02f6` ("rusty: Use nr_cpu_ids instead of nr_cpus_possible") Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-06-30 17:37:14 +02:00
Andrea Righi	cf4883fbf8	meson: introduce serialize build option With commit `5d20f89a` ("scheds-rust: build rust schedulers in sequence"), schedulers are now built serially one after the other to prevent meson and cargo from forking NxN parallel tasks. However, this change has made building a single scheduler much more cumbersome, due to the chain of dependencies. For example, building scx_rusty using the specific meson target would still result in all schedulers being built, because they all depend on each other. To address this issue, introduce the new meson build option `serialize=true\|false` (default is false). This option allows to disable the schedulers' build chain, restoring the old behavior. With this option enabled, it is now possible to build just a single scheduler, parallelizing the cargo build properly, without triggering the build of the others. Example: $ meson setup build -Dbuildtype=release -Dserialize=false $ meson compile -C build scx_rusty Signed-off-by: Andrea Righi <andrea.righi@canonical.com>	2024-06-28 10:17:37 +02:00
David Vernet	fe3ce64a9b	Revert "scx_rusty: Refactor ridx assignment in populate_tasks_by_load"	2024-06-26 17:35:22 -04:00
David Vernet	d42bae4fcf	rusty: Print build ID when rusty is loaded When someone is testing schedulers, we often have to ask what version the scheduler is running as. Now that we can access the build ID from rust schedulers, let's update scx_rusty to print the build ID when rusty first starts running. This results in output such as the following: ``` [void@maniforge scx]$ rusty 19:04:26 [INFO] Running scx_rusty (build ID: 0.8.1-g2043d2537f37c8d75753bb65eb75bca965067564 x86_64-unknown-linux-gnu/debug) 19:04:26 [INFO] NUMA[00] mask= 0b11111111111111111111111111111111 19:04:26 [INFO] DOM[00] mask= 0b00000000111111110000000011111111 19:04:26 [INFO] DOM[01] mask= 0b11111111000000001111111100000000 19:04:26 [INFO] Rusty scheduler started! ``` Signed-off-by: David Vernet <void@manifault.com>	2024-06-25 11:44:46 -05:00
David Vernet	9d9ece11aa	Merge pull request #384 from jfernandez/log-recorder scx_utils: Add log_recorder module for metrics-rs	2024-06-25 11:43:37 -05:00

1 2 3

137 Commits