scx-upstream

mirror of https://github.com/sched-ext/scx.git synced 2024-12-01 07:00:23 +00:00

Author	SHA1	Message	Date
Tejun Heo	5ae459c9fc	Merge pull request #550 from CachyOS/fix-clang-rc get_clang_ver: Fix regex for LLVM RC Versions	2024-08-24 15:37:50 -10:00
Peter Jung	0faa0efe74	get_clang_ver: Fix regex for LLVM RC Versions Signed-off-by: Peter Jung <admin@ptr1337.dev>	2024-08-25 00:49:00 +02:00
Daniel Hodges	e81faef103	Merge pull request #544 from hodgesds/layered-tgid scx_layered: Add layer match for tgid	2024-08-24 16:58:19 -04:00
Andrea Righi	016aae759f	Merge pull request #545 from sched-ext/bpfland-honor-avg-nvcsw scx_bpfland: always honor average nvcsw in lowlatency mode	2024-08-24 20:24:33 +02:00
Tejun Heo	21021e938b	Merge pull request #547 from anh0516/docs-cleanup scx_lavd: Fix my own formatting error in #543 scx_rusty: help info and README cleanup	2024-08-24 08:17:18 -10:00
Avraham Hollander	b50cd3cdd8	Merge branch 'sched-ext:main' into docs-cleanup	2024-08-24 12:04:20 -04:00
Avraham Hollander	66b5dd0de9	Clean up scx_rusty help info a bit	2024-08-24 11:56:12 -04:00
Avraham Hollander	c34a470024	scx_lavd: Fix my own formatting error	2024-08-24 11:36:19 -04:00
Andrea Righi	5a08855a86	scx_bpfland: always honor average nvcsw in lowlatency mode Keep evaluating the average number of voluntary context switches for each task when lowlatency mode is enabled, even when interactive tasks classification is disabled (via `-c 0`). The average nvcsw is also used in lowlatency mode to evaluate the proportional bonus to the tasks' deadline and it shouldn't be ignored when interactive tasks classification is disabled. Moreover, make sure that such bonus never exceeds the starvation threshold. Keep in mind that it is still possible to disable the periodic average nvcsw evaluation with `-c 0`, without specifying `--lowlatency`. Fixes: `6a22853` ("scx_bpfland: introduce --lowlatency option") Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 10:42:22 +02:00
Tejun Heo	e238e01e8e	Merge pull request #543 from anh0516/docs-cleanup scx_bpfland, scx_lavd: Improve help info a bit	2024-08-23 17:09:29 -10:00
Daniel Hodges	5a2012763e	scx_layered: Add layer match for tgid Add layer match for tgid. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-23 23:00:28 -04:00
Avraham Hollander	bedb18b48e	Improve scx_lavd help info A lot of scx_lavd's options do not clearly explain what they do. Add some short explanations, clean up the existing ones, and direct the user to read the in-code documentation for more info.	2024-08-23 18:56:14 -04:00
Avraham Hollander	d6e27b59e7	Clean up scx_bpfland help info a bit	2024-08-23 18:55:04 -04:00
Andrea Righi	ab2e408385	Merge pull request #541 from sirlucjan/new-flags scx-scheds: Update scx_bpfland suggested flags	2024-08-24 00:36:57 +02:00
Piotr Gorski	924156c398	scx-scheds: Update scx_bpfland suggested flags Signed-off-by: Piotr Gorski <lucjan.lucjanov@gmail.com>	2024-08-23 21:23:20 +02:00
Andrea Righi	e72676ede3	Merge pull request #540 from sched-ext/bpfland-cpufreq-awareness scx_bpfland: cpu frequency and energy awareness	2024-08-23 21:17:34 +02:00
Tejun Heo	b6ccb87bec	Merge pull request #539 from sched-ext/htejun/scx_rusty scx_rusty: Convert to scx_stats	2024-08-23 08:42:47 -10:00
Daniel Hodges	7d45059fa9	Merge pull request #538 from hodgesds/layered-pid scx_layered: Add pid/ppid matches	2024-08-23 14:08:40 -04:00
Andrea Righi	6a5a6ed18c	README: add a link to the scx_bpfland topology awareness video demo Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-23 19:53:25 +02:00
Tejun Heo	8c8912ccea	Merge branch 'main' into htejun/scx_rusty	2024-08-23 07:50:23 -10:00
Andrea Righi	50684e4569	scx_bpfland: introduce Intel Turbo Boost awareness Make `--primar-domain auto` aware of turbo boosted CPUs and prioritize them over the primary scheduling domain when the energy model `balance_power` is used (typically when running on battery power with the "balanced" profile). With this change the scheduling hierarchy becomes the following: 1) CPUs in the turbo scheduling domain 2) CPUs in the primary scheduling domain 3) full-idle SMT CPUs 4) CPUs in the same L2 cache 5) CPUs in the same L3 cache 6) CPUs in the task's allowed domain And the idle selection logic is modified as following: - In the turbo scheduling domain: - pick same full-idle SMT CPU - pick any other full-idle SMT CPU sharing the same L2 cache - pick any other full-idle SMT CPU sharing the same L3 cache - pick any other full-idle SMT CPU - pick same idle CPU - pick any other idle CPU sharing the same L2 cache - pick any other idle CPU sharing the same L3 cache - pick any other idle SMT CPU - In the primary scheduling domain: - pick same full-idle SMT CPU - pick any other full-idle SMT CPU sharing the same L2 cache - pick any other full-idle SMT CPU sharing the same L3 cache - pick any other full-idle SMT CPU - pick same idle CPU - pick any other idle CPU sharing the same L2 cache - pick any other idle CPU sharing the same L3 cache - pick any other idle SMT CPU - In the entire task domain: - pick any other idle CPU Keep in mind that the turbo domain will be evaluated only when the scheduler is started with `--primary-domain auto` and only when the `balance_power` energy profile is used. The turbo domain is always made using the subset of CPUs in the system with the highest max frequency. If such subset can't be determined (for example if all the CPUs in the primary domain have all the same frequency), the turbo domain will be ignored. Prioritizing turbo boosted CPUs can help to improve performance by forcing the governor to scale up their frequency, without increasing too much power consumption, due to the fact that tasks will be preferably confined into a reduced amount of cores. This change seems to improve performance, without increasing much power consuption, on Intel laptops while using the `balanced_power` energy profile. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-23 19:49:08 +02:00
Andrea Righi	d958dd4482	scx_bpfland: introduce dynamic energy profile Introduce the new option `--primary-domain auto`. With this option the scheduler will dynamically adjusts the primary scheduling domain at run-time, in function of the current energy profile reported in /sys/devices/system/cpu/cpufreq/policy0/energy_performance_preference. When the `power` energy profile is selected, the primary scheduling domain will prioritize E-cores. Alternatively, when the `performance` profile is selected, it will prioritize P-cores. For all the other energy profiles, all the CPUs in the system will be used. Note that this option is only relevant on hybrid architectures with P-cores and E-cores. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-23 19:49:01 +02:00
Andrea Righi	bb7248ce61	scx_utils::cpumask: introduce is_empty() and is_full() Introduce new methods to CpuMask to check if no bits are set or if all bits are set. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-23 19:48:53 +02:00
Andrea Righi	115bfc8184	Merge pull request #536 from sched-ext/bpfland-lowlatency-mode scx_bpfland: introduce --lowlatency option	2024-08-23 19:48:08 +02:00
Tejun Heo	44a0f1b124	scx_utils: Factor out monitor_stats() from scx_rusty and scx_layered	2024-08-23 06:46:19 -10:00
Tejun Heo	e635e7eac8	Merge pull request #537 from sirlucjan/new-default scx-scheds: set scx_bpfland as default scheduler	2024-08-23 05:55:59 -10:00
Tejun Heo	ae3024e938	scx_layered: Add --stats and make --monitor behavior consistent with scx_rusty	2024-08-23 05:52:52 -10:00
Tejun Heo	0f04a93dd1	scx_rusty: Add stat descriptions and make minor adjustments	2024-08-23 05:46:13 -10:00
Tejun Heo	36865234f8	scx_rusty: Add scx_stats annotations necessary for openmetrics translation	2024-08-23 04:59:08 -10:00
Tejun Heo	70ba4cb9ef	scx_stats: Fix multiple labels handling in scxstats_to_openmetrics.py OM labels() was called with an array which is then incorrectly interpreted as a single label. Unpack it to list of arguments. While at it, make error reporting a bit more robust.	2024-08-23 04:56:30 -10:00
Tejun Heo	2f3f473cd3	scx_rusty: Improve timestamp reporting	2024-08-23 04:31:27 -10:00
Daniel Hodges	11b978a892	scx_layered: Add pid/ppid matches Add matches for pid/ppid. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-23 07:20:05 -07:00
Piotr Gorski	3f0fcc319c	scx-scheds: set scx_bpfland as default scheduler Signed-off-by: Piotr Gorski <lucjan.lucjanov@gmail.com>	2024-08-23 15:47:24 +02:00
Tejun Heo	76934f3aab	scx_rusty: Convert to scx_stats This allows scx_rusty to avoid generating excessive logs for statistics while still allowing detailed monitoring on demand.	2024-08-22 19:44:12 -10:00
Tejun Heo	4678476ca7	scx_stats_derive: Each _AssScxStastMeta assertion should have an unique ID	2024-08-22 13:53:27 -10:00
Tejun Heo	16c07a5cd9	scx_rusty: Don't reset bpf_stats, remember prev states and calculate delta This will ease transition to scx_stats.	2024-08-22 13:02:23 -10:00
Tejun Heo	13fa48a871	scx_rusty: Separate out stats generation and formatting to prepare for scx_stats conversion.	2024-08-22 10:03:10 -10:00
Tejun Heo	b4564520e5	scx_rusty: Simplify Stats structs and take id out of the structs to prepare for scx_stats conversion. While at it, make some cosmetic changes.	2024-08-22 08:45:33 -10:00
Andrea Righi	6a2285398d	scx_bpfland: introduce --lowlatency option Introduce the new `--lowlatency` option, which enables switching between the default pure vruntime-based scheduling (more optimized for server workloads) and a deadline-based scheduling (better suited for low-latency workloads). When the low-latency mode is activated, a task's deadline is calculated as its vruntime, adjusted by a bonus proportional to the task's average number of voluntary context switches (the more voluntary context switches, the shorter the deadline). This feature enhances the prioritization of interactive tasks even more, proportionally to their average voluntary context switches, also within the two main global queues (priority / shared) and it helps to maintain interactive workloads always responsive, even in presence of heavy non-interactive background work. Low-latency mode allows to prevent audio cracking even in presence of a large amount of short-lived tasks with pseudo-interactive behavior (i.e, hackbench) and it enables achieving approximately a +33% average frames-per-second (FPS) in the typical "gaming while building the kernel" benchmark. However, it can also amplify the de-prioritization of CPU-intensive tasks, making this option more suitable for specific low-latency scenarios. Therefore the low-latency mode is disabled by default and it can only be enabled via the `--lowlatency` option. Tested-by: Piotr Gorski (piotrgorski@cachyos.org) Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-22 13:26:19 +02:00
Andrea Righi	dca4af151e	Merge pull request #535 from sched-ext/bpfland-time-slice-control scx_bpfland: better time slice control	2024-08-22 10:32:16 +02:00
Tejun Heo	4834dec684	scx_rusty: Move stats structs to stats.rs and rename for consistency	2024-08-21 22:04:38 -10:00
Andrea Righi	b0a8e4a91e	scx_bpfland: better time slice control Explicitly replenish the task's time slice from ops.dispatch() if the task still wants to run and no other task is selected. In this way the sched_ext core won't automatically re-schedule the task on the same CPU, implicitly assigning a time slice of SCX_SLICE_DFL. Moreover, instead of determining the task time slice in ops.enqueue(), refresh the time slice immediately before the task is started on its assigned CPU in ops.running(). This allows to use a more precise time slice, adjusted based on the actual amount of tasks that are currently waiting to be scheduled. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-22 09:23:37 +02:00
Tejun Heo	8c35a1976f	Merge pull request #534 from sched-ext/htejun/misc scx_layered: Drop SCX_OPS_ENQ_LAST	2024-08-21 19:57:54 -10:00
Tejun Heo	d6ac5fbd9c	scx_layered: Drop SCX_OPS_ENQ_LAST The meaning of SCX_OPS_ENQ_LAST will change with future kernel updates and enqueueing on local DSQ will no longer be sufficient to avoid stalls. No reason to do it anyway. Just drop it.	2024-08-21 13:13:59 -10:00
Tejun Heo	52d97c041d	Merge pull request #533 from sched-ext/htejun/release Version: Cargo.lock	2024-08-21 06:46:02 -10:00
Tejun Heo	f726f0b73b	Version: Cargo.lock	2024-08-21 06:45:19 -10:00
Tejun Heo	98fd3ec3b0	Merge pull request #532 from sched-ext/htejun/release Version: v1.0.3	2024-08-21 06:43:36 -10:00
Tejun Heo	4d1f0639d8	Version: v1.0.3	2024-08-21 06:42:11 -10:00
Tejun Heo	2e63c0be60	Merge pull request #531 from sched-ext/bpfland-fix-unused-var scx_bpfland: drop unused variable	2024-08-21 06:40:16 -10:00
Tejun Heo	6a2faf2e17	Merge pull request #530 from sched-ext/htejun/misc scx_utils::topology: Use lazy_static instead of LazyLock	2024-08-21 06:04:14 -10:00

1 2 3 4 5 ...

1443 Commits