JakeHillion/scx

mirror of https://github.com/JakeHillion/scx.git synced 2024-11-29 20:50:22 +00:00

Author	SHA1	Message	Date
Changwoo Min	00430c3ded	scx_lavd: make a loop easier to correctly verify With an ill combination of old kernel and old LLVM, the BPF verifier incorrectly detects an infinite loop. After changing the loop with a constant end, the old verifier can pass the code. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-27 17:11:20 +09:00
Changwoo Min	09cff560aa	Merge pull request #566 from multics69/lavd-turbo scx_lavd: prioritize the turbo boost-able cores	2024-08-27 08:47:25 +09:00
Daniel Hodges	83cd26eb9e	Merge pull request #564 from hodgesds/layered-help scx_layered: Update help for tgid matching	2024-08-26 14:52:53 -04:00
Andrea Righi	35db89e90d	Merge pull request #568 from sched-ext/rustland-core-design-improv scx_rustland_core: small core design improvements	2024-08-26 20:06:21 +02:00
Andrea Righi	1427d7d347	scx_rlfifo: enhance code design Refactor the code design to make it more suitable as a template for implementing advanced scheduling policies. In particular, create separate loops for task consumption and task dispatching. This will make the scheduler easier to adapt as a foundation for implementing more complex scheduling policies. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-26 16:10:54 +02:00
Daniel Hodges	c45c2de39f	scx_layered: Update help for tgid matching Forgot to add doc for tgid matching Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-26 07:06:21 -07:00
Changwoo Min	9807e561f0	scx_lavd: prioritize the turbo boost-able cores Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 17:57:33 +09:00
Changwoo Min	cd5b2bf664	scx_lavd: replace nix signal handler to ctrlc Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 17:57:33 +09:00
Changwoo Min	e887c56da0	scx_lavd: add "--version" option, which prints the current version Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 17:57:33 +09:00
Changwoo Min	0f97ca3066	scx_lavd: drop time slice calculation in ops.select_cpu() Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 17:55:00 +09:00
Changwoo Min	4e3c36ca3f	scx_lavd: handle the missing cases in time slice calculation Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	be7d06e280	scx_lavd: make the old BPF verifier happy :-( Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	82f55b95b2	scx_lavd: add a fast path in pick_idle_cpu() when SMT is not activated Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	38779dbe8b	scx_lavd: improve pick_idle_cpu() Now it checks an active cpumask within a previous core's compute domain before checking the full active CPUs. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	d1d9e97d08	scx_lavd: reduce LAVD_CPDOM_MAX_DIST to 4 The BPF verifier in the old kernel gives up to analysis the nested loop in the consume_task(). We reduce the loop less complex by reducing LAVD_CPDOM_MAX_DIST from 6 to 4 in order to make the verifier happy. Note that the theoretical maximum distance is 6 (numa > llc > core type) but there is no such hardware today, hence reducing it to 6 should be okay in next few years, when hopefully the verifier becomes smarter. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	950710990f	scx_lavd: move time slice calculation to ops.enqueue() and ops.select_cpu() Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	954b684a70	scx_lavd: update nr_queued_task every system stat update interval Updating nr_queue_task every runqueue operation is expensive and unnecessary. So we do update every system state update interval and use moving average, which is accurate enough. Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	4f906f1f49	scx_lavd: update README since it supports multi-CCX/NUMA Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	9551657b42	scx_lavd: prefer big cores in the performance mode Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	d4bb35e651	scx_lavd: use itertools::iproduct!() for a nested loop Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Changwoo Min	9368c6881d	scx_lavd: replace get_task_cpu_id() to scx_bpf_task_cpu() Signed-off-by: Changwoo Min <changwoo@igalia.com>	2024-08-26 11:43:29 +09:00
Andrea Righi	a469f0f1ce	Merge pull request #561 from sched-ext/bpfland-fix-energy-profile-refresh scx_bpfland: prevent reading energy profile if not available	2024-08-25 18:31:34 +02:00
Tejun Heo	ca13e13ad6	Merge pull request #559 from sched-ext/htejun/cargo-workspace build: Use workspace to group rust sub-projects	2024-08-25 06:26:18 -10:00
Andrea Righi	f8acd069f0	scx_bpfland: prevent reading energy profile if not available Avoid to periodically read the current performance profile from /sys/devices/system/cpu/cpufreq/policy0/energy_performance_preference if it's not available (i.e., with older CPUs or kernels without cpufreq). This fixes issue #560. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-25 16:53:35 +02:00
Andrea Righi	8853d9a9f2	Merge pull request #548 from sched-ext/rustland-core-refactoring scx_rustland_core: user-space framework refactoring	2024-08-25 16:39:28 +02:00
Tejun Heo	43950c65bd	build: Use workspace to group rust sub-projects meson build script was building each rust sub-project under rust/ and scheds/rust/ separately. This means that each rust project is built independently which leads to a couple problems - 1. There are a lot of shared dependencies but they have to be built over and over again for each proejct. 2. Concurrency management becomes sad - we either have to unleash multiple cargo builds at the same time possibly thrashing the system or build one by one. We've been trying to solve this from meson side in vain. Thankfully, in issue #546, @vimproved suggested using cargo workspace which makes the sub-projects share the same target directory and built together by the same cargo instance while still allowing each project to behave independently for development and publishing purposes. Make the following changes: - Create two cargo workspaces - one under rust/, the other under scheds/rust/. Each contains all rust projects underneath it. - Don't let meson descend into rust/. These are libraries used by the rust schedulers. No need to build them from meson. Cargo will build them as needed. - Change the rust_scheds build target to invoke `cargo build` in scheds/rust/ and let cargo do its thing. - Remove per-scheduler meson.build files and instead generate custom_targets in scheds/rust/meson.build which invokes `cargo build -p $SCHED`. - This changes rust binary directory. Update README and meson-scripts/install_rust_user_scheds accordingly. - Remove per-scheduler Cargo.lock as scheds/rust/Cargo.lock is shared by all schedulers now. - Unify .gitignore handling. The followings are build times on Ryzen 3975W: Before: ________________________________________________________ Executed in 165.93 secs fish external usr time 40.55 mins 2.71 millis 40.55 mins sys time 3.34 mins 36.40 millis 3.34 mins After: ________________________________________________________ Executed in 36.04 secs fish external usr time 336.42 secs 0.00 millis 336.42 secs sys time 36.65 secs 43.95 millis 36.61 secs Wallclock time is reduced 5x and CPU time 7x.	2024-08-25 00:47:58 -10:00
Andrea Righi	894f9582d0	scx_rustland_core: hide shutdown boilerplate in BpfScheduler Refactor the code to hide the shutdown handling inside BpfScheduler and simply use the exited() method to check when the scheduler is stopped. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-25 12:17:04 +02:00
Tejun Heo	152a8471cc	scx_bpfland: When reporting stats, use interval deltas Three of the reported stats are cumulative. While they obviously can be processed into delta values, that holds for the other direction too and the cumulative values are difficult to make intutive sense of. Report interval delta values instead. Note that a stats client can reliably build back cumulative values even under heavy system contention - the delta values reported between two consecutive reads are guaranteed to be correct regardless of the duration of the interval.	2024-08-24 23:14:57 -10:00
Tejun Heo	bd68e230b9	scx_bpfland: Convert to scx_stats Use scx_stats instead of prometheus for stats reporting. This has a few advantages: - Stats metadata can be defined more succinctly. - Natural support for nesting statistics which will be useful in making scheduler components composable. - Support for multiple programmable readers where each reader can use their own reading interval. - Built-in stats help message generation. - Openmetrics integration is still available through scx_stats/scripts/scxstats_to_openmetrics.py.	2024-08-24 23:14:55 -10:00
Tejun Heo	625381280c	scx_stats: Shorten exported names and add prelude module Let's make it a bit easier to use: - Shorten exported names by changing the prefix from ScxStats to Stats. This should be distinctive enough and more inline with how most libraries name their exports. - Importing the right set of traits can be tricky. Introduce prelude module so that importing is a bit less painful.	2024-08-24 22:04:25 -10:00
Andrea Righi	a2e97fecbb	scx_rustland_core: merge verbose and debug in the same option There is no reason to have two separate options for "verbose" and "debug" mode. Just merge the two and always use "debug". If enabled, increase verbosity to stdout and enable reporting BPF scheduling events in debugfs (e.g., /sys/kernel/debug/tracing/trace_pipe). Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-25 09:45:20 +02:00
Andrea Righi	cb16a11342	scx_rustland_core: get rid of the global scheduler's slice_us Since scx_rustland_core enables setting a time slice on a per-task basis during task dispatch, there's no need to maintain a global time slice in the BPF component. Instead, a global time slice can simply be managed in user-space, achieving the same outcome. Therefore, drop the global slice_us property from BpfScheduler to simplify the API. NOTE: if a time slice is not specified for a task, SCX_SLICE_DFL will be used by default. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-25 09:45:18 +02:00
Andrea Righi	e404bee5e7	scx_rustland / scx_rlfifo: small code format fixes Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-25 09:44:52 +02:00
Andrea Righi	1cd11ba916	scx_rlfifo: improve documentation and code readability Add more comments to make the source code more understandable, so that it can be easily used as a template for implementing more complex scheduling policies. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-25 09:44:28 +02:00
Tejun Heo	35a4326aee	scx_lavd: Drop unnecessary stat field explanation on startup The scheduling instances no longer prints out sched samples. No reason to print field explanation on startup.	2024-08-24 18:48:54 -10:00
Changwoo Min	02ad793c78	Merge branch 'main' into htejun/scx_lavd-stats	2024-08-25 11:57:41 +09:00
Changwoo Min	8b1874c27f	Merge pull request #552 from CachyOS/lavd-mutli-cxx2 scx_lavd: Drop message about unsupported multi-CXX support	2024-08-25 11:48:12 +09:00
Tejun Heo	fdfb7f60f4	Merge branch 'main' into htejun/scx_lavd-stats	2024-08-24 15:53:53 -10:00
Tejun Heo	55e5b8b43f	scx_lavd: Switch to scx_stats Scheduling sample reporting is switched to use scx_stats. This makes the scheduler run without making too much noise while still allowing monitoring on demand. It can also make introspection more dynamic - e.g. it shouldn't be difficult to add other monitoring commands which take scheduling samples based on different criteria or add other types of staisitcs. --nr_sched-samples is replaced with --monitor-nr-samples.	2024-08-24 15:53:02 -10:00
Tejun Heo	1bba713a29	Merge pull request #542 from sched-ext/htejun/scx_stats scx_stats, scx_rusty, scx_layered: Implement `--help-stats`	2024-08-24 15:38:36 -10:00
Peter Jung	906d054770	scx_lavd: Drop message about unsupported multi-CXX support Signed-off-by: Peter Jung <admin@ptr1337.dev>	2024-08-25 01:10:38 +02:00
Andrea Righi	0aa23481de	scx_rustland_core: drop update_tasks() and introduce notify_complete() The update_tasks() API is somewhat confusing, so replace it with a clearer API, notify_complete(). This new API will return control to the BPF component and inform it about the number of tasks still pending in the user-space scheduler. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-25 00:45:23 +02:00
Daniel Hodges	e81faef103	Merge pull request #544 from hodgesds/layered-tgid scx_layered: Add layer match for tgid	2024-08-24 16:58:19 -04:00
Andrea Righi	5ece102554	scx_rustland: get rid of unnecessary debugging information Additional statistics will be re-added later via scx_stats. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 21:29:10 +02:00
Andrea Righi	cef8ff8757	scx_rustland_core: get rid of the low_power API The low-power API is a bit of a hack implemented purely in the BPF layer, this should be better re-implemented with some concepts of topology awareness. Therefore, get rid of this API for now. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 21:29:10 +02:00
Andrea Righi	be7ef1009b	scx_rlfifo: user-space idle CPU selection Select an idle CPU from user-space, instead of always dispatching on the first CPU available. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 21:29:10 +02:00
Andrea Righi	568e292a24	scx_rustland_core: get rid of the exiting task API The current API used to notify the user-space scheduler when a task exits is really confusing (setting a negative value in queued_task_ctx.cpu), and it's also possible to detect task exiting events from user-space (or check in procfs, even if it's slower). In any case, a better API should be provided for this, so drop the current one for now. NOTE: this will cause additional memory usage for scx_rustland, but it can be fixed/addressed later in a separate commit (i.e., providing a periodic garbage collector for the unused task entries). Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 21:29:10 +02:00
Andrea Righi	5d544ea264	scx_rustland_core: move CPU idle selection logic in user-space Allow user-space scheduler to pick an idle CPU via self.bpf.select_cpu(pid, prev_task, flags), mimicking the BPF's select_cpu() iterface. Also remove the full_user option and always rely on the idle selection logic from user-space. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 21:28:13 +02:00
Andrea Righi	1dd329dd7d	scx_rustland: update Cargo.lock Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 20:24:48 +02:00
Andrea Righi	106d59d997	scx_rlfifo: update Cargo.lock Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 20:24:48 +02:00

1 2 3 4 5 ...

864 Commits