scx-upstream

mirror of https://github.com/sched-ext/scx.git synced 2024-12-01 07:00:23 +00:00

Author	SHA1	Message	Date
Tejun Heo	35a4326aee	scx_lavd: Drop unnecessary stat field explanation on startup The scheduling instances no longer prints out sched samples. No reason to print field explanation on startup.	2024-08-24 18:48:54 -10:00
Changwoo Min	02ad793c78	Merge branch 'main' into htejun/scx_lavd-stats	2024-08-25 11:57:41 +09:00
Changwoo Min	8b1874c27f	Merge pull request #552 from CachyOS/lavd-mutli-cxx2 scx_lavd: Drop message about unsupported multi-CXX support	2024-08-25 11:48:12 +09:00
Tejun Heo	fdfb7f60f4	Merge branch 'main' into htejun/scx_lavd-stats	2024-08-24 15:53:53 -10:00
Tejun Heo	55e5b8b43f	scx_lavd: Switch to scx_stats Scheduling sample reporting is switched to use scx_stats. This makes the scheduler run without making too much noise while still allowing monitoring on demand. It can also make introspection more dynamic - e.g. it shouldn't be difficult to add other monitoring commands which take scheduling samples based on different criteria or add other types of staisitcs. --nr_sched-samples is replaced with --monitor-nr-samples.	2024-08-24 15:53:02 -10:00
Tejun Heo	1bba713a29	Merge pull request #542 from sched-ext/htejun/scx_stats scx_stats, scx_rusty, scx_layered: Implement `--help-stats`	2024-08-24 15:38:36 -10:00
Peter Jung	906d054770	scx_lavd: Drop message about unsupported multi-CXX support Signed-off-by: Peter Jung <admin@ptr1337.dev>	2024-08-25 01:10:38 +02:00
Daniel Hodges	e81faef103	Merge pull request #544 from hodgesds/layered-tgid scx_layered: Add layer match for tgid	2024-08-24 16:58:19 -04:00
Andrea Righi	016aae759f	Merge pull request #545 from sched-ext/bpfland-honor-avg-nvcsw scx_bpfland: always honor average nvcsw in lowlatency mode	2024-08-24 20:24:33 +02:00
Avraham Hollander	66b5dd0de9	Clean up scx_rusty help info a bit	2024-08-24 11:56:12 -04:00
Avraham Hollander	c34a470024	scx_lavd: Fix my own formatting error	2024-08-24 11:36:19 -04:00
Andrea Righi	5a08855a86	scx_bpfland: always honor average nvcsw in lowlatency mode Keep evaluating the average number of voluntary context switches for each task when lowlatency mode is enabled, even when interactive tasks classification is disabled (via `-c 0`). The average nvcsw is also used in lowlatency mode to evaluate the proportional bonus to the tasks' deadline and it shouldn't be ignored when interactive tasks classification is disabled. Moreover, make sure that such bonus never exceeds the starvation threshold. Keep in mind that it is still possible to disable the periodic average nvcsw evaluation with `-c 0`, without specifying `--lowlatency`. Fixes: `6a22853` ("scx_bpfland: introduce --lowlatency option") Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-24 10:42:22 +02:00
Tejun Heo	48092c6f88	scx_lavd: Relay introspection output in stats::TaskSample This indirection doesn't make any visible behavior difference now but will be used to implement scx_stats support.	2024-08-23 18:49:36 -10:00
Tejun Heo	725fa7f1be	Merge branch 'main' into htejun/scx_stats	2024-08-23 17:10:08 -10:00
Daniel Hodges	5a2012763e	scx_layered: Add layer match for tgid Add layer match for tgid. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-23 23:00:28 -04:00
Avraham Hollander	bedb18b48e	Improve scx_lavd help info A lot of scx_lavd's options do not clearly explain what they do. Add some short explanations, clean up the existing ones, and direct the user to read the in-code documentation for more info.	2024-08-23 18:56:14 -04:00
Avraham Hollander	d6e27b59e7	Clean up scx_bpfland help info a bit	2024-08-23 18:55:04 -04:00
Tejun Heo	25e437753c	scx_layered, scx_rusty: Implement --help-stats which shows all the defined stats. While at it, make some cosmetic updates.	2024-08-23 12:39:47 -10:00
Tejun Heo	405bcc63fe	scx_stats: Make ScxStatsServerData a public carrier of data needed for stats server And move related ops into it. This is a bit more natural and will also allow doing other operaitons (e.g. describing stats) without launching the server.	2024-08-23 12:23:57 -10:00
Tejun Heo	7bd35b6cd3	scx_lavd: Cargo.lock update (caused by scx_utils depending on scx_stats)	2024-08-23 09:21:44 -10:00
Andrea Righi	e72676ede3	Merge pull request #540 from sched-ext/bpfland-cpufreq-awareness scx_bpfland: cpu frequency and energy awareness	2024-08-23 21:17:34 +02:00
Tejun Heo	9e3b4e6db0	scx_stats: A bit of cleanups and renames	2024-08-23 09:09:02 -10:00
Tejun Heo	b6ccb87bec	Merge pull request #539 from sched-ext/htejun/scx_rusty scx_rusty: Convert to scx_stats	2024-08-23 08:42:47 -10:00
Daniel Hodges	7d45059fa9	Merge pull request #538 from hodgesds/layered-pid scx_layered: Add pid/ppid matches	2024-08-23 14:08:40 -04:00
Tejun Heo	8c8912ccea	Merge branch 'main' into htejun/scx_rusty	2024-08-23 07:50:23 -10:00
Andrea Righi	50684e4569	scx_bpfland: introduce Intel Turbo Boost awareness Make `--primar-domain auto` aware of turbo boosted CPUs and prioritize them over the primary scheduling domain when the energy model `balance_power` is used (typically when running on battery power with the "balanced" profile). With this change the scheduling hierarchy becomes the following: 1) CPUs in the turbo scheduling domain 2) CPUs in the primary scheduling domain 3) full-idle SMT CPUs 4) CPUs in the same L2 cache 5) CPUs in the same L3 cache 6) CPUs in the task's allowed domain And the idle selection logic is modified as following: - In the turbo scheduling domain: - pick same full-idle SMT CPU - pick any other full-idle SMT CPU sharing the same L2 cache - pick any other full-idle SMT CPU sharing the same L3 cache - pick any other full-idle SMT CPU - pick same idle CPU - pick any other idle CPU sharing the same L2 cache - pick any other idle CPU sharing the same L3 cache - pick any other idle SMT CPU - In the primary scheduling domain: - pick same full-idle SMT CPU - pick any other full-idle SMT CPU sharing the same L2 cache - pick any other full-idle SMT CPU sharing the same L3 cache - pick any other full-idle SMT CPU - pick same idle CPU - pick any other idle CPU sharing the same L2 cache - pick any other idle CPU sharing the same L3 cache - pick any other idle SMT CPU - In the entire task domain: - pick any other idle CPU Keep in mind that the turbo domain will be evaluated only when the scheduler is started with `--primary-domain auto` and only when the `balance_power` energy profile is used. The turbo domain is always made using the subset of CPUs in the system with the highest max frequency. If such subset can't be determined (for example if all the CPUs in the primary domain have all the same frequency), the turbo domain will be ignored. Prioritizing turbo boosted CPUs can help to improve performance by forcing the governor to scale up their frequency, without increasing too much power consumption, due to the fact that tasks will be preferably confined into a reduced amount of cores. This change seems to improve performance, without increasing much power consuption, on Intel laptops while using the `balanced_power` energy profile. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-23 19:49:08 +02:00
Andrea Righi	d958dd4482	scx_bpfland: introduce dynamic energy profile Introduce the new option `--primary-domain auto`. With this option the scheduler will dynamically adjusts the primary scheduling domain at run-time, in function of the current energy profile reported in /sys/devices/system/cpu/cpufreq/policy0/energy_performance_preference. When the `power` energy profile is selected, the primary scheduling domain will prioritize E-cores. Alternatively, when the `performance` profile is selected, it will prioritize P-cores. For all the other energy profiles, all the CPUs in the system will be used. Note that this option is only relevant on hybrid architectures with P-cores and E-cores. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-23 19:49:01 +02:00
Tejun Heo	44a0f1b124	scx_utils: Factor out monitor_stats() from scx_rusty and scx_layered	2024-08-23 06:46:19 -10:00
Tejun Heo	ae3024e938	scx_layered: Add --stats and make --monitor behavior consistent with scx_rusty	2024-08-23 05:52:52 -10:00
Tejun Heo	0f04a93dd1	scx_rusty: Add stat descriptions and make minor adjustments	2024-08-23 05:46:13 -10:00
Tejun Heo	36865234f8	scx_rusty: Add scx_stats annotations necessary for openmetrics translation	2024-08-23 04:59:08 -10:00
Tejun Heo	2f3f473cd3	scx_rusty: Improve timestamp reporting	2024-08-23 04:31:27 -10:00
Daniel Hodges	11b978a892	scx_layered: Add pid/ppid matches Add matches for pid/ppid. Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2024-08-23 07:20:05 -07:00
Tejun Heo	76934f3aab	scx_rusty: Convert to scx_stats This allows scx_rusty to avoid generating excessive logs for statistics while still allowing detailed monitoring on demand.	2024-08-22 19:44:12 -10:00
Tejun Heo	16c07a5cd9	scx_rusty: Don't reset bpf_stats, remember prev states and calculate delta This will ease transition to scx_stats.	2024-08-22 13:02:23 -10:00
Tejun Heo	13fa48a871	scx_rusty: Separate out stats generation and formatting to prepare for scx_stats conversion.	2024-08-22 10:03:10 -10:00
Tejun Heo	b4564520e5	scx_rusty: Simplify Stats structs and take id out of the structs to prepare for scx_stats conversion. While at it, make some cosmetic changes.	2024-08-22 08:45:33 -10:00
Andrea Righi	6a2285398d	scx_bpfland: introduce --lowlatency option Introduce the new `--lowlatency` option, which enables switching between the default pure vruntime-based scheduling (more optimized for server workloads) and a deadline-based scheduling (better suited for low-latency workloads). When the low-latency mode is activated, a task's deadline is calculated as its vruntime, adjusted by a bonus proportional to the task's average number of voluntary context switches (the more voluntary context switches, the shorter the deadline). This feature enhances the prioritization of interactive tasks even more, proportionally to their average voluntary context switches, also within the two main global queues (priority / shared) and it helps to maintain interactive workloads always responsive, even in presence of heavy non-interactive background work. Low-latency mode allows to prevent audio cracking even in presence of a large amount of short-lived tasks with pseudo-interactive behavior (i.e, hackbench) and it enables achieving approximately a +33% average frames-per-second (FPS) in the typical "gaming while building the kernel" benchmark. However, it can also amplify the de-prioritization of CPU-intensive tasks, making this option more suitable for specific low-latency scenarios. Therefore the low-latency mode is disabled by default and it can only be enabled via the `--lowlatency` option. Tested-by: Piotr Gorski (piotrgorski@cachyos.org) Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-22 13:26:19 +02:00
Tejun Heo	4834dec684	scx_rusty: Move stats structs to stats.rs and rename for consistency	2024-08-21 22:04:38 -10:00
Andrea Righi	b0a8e4a91e	scx_bpfland: better time slice control Explicitly replenish the task's time slice from ops.dispatch() if the task still wants to run and no other task is selected. In this way the sched_ext core won't automatically re-schedule the task on the same CPU, implicitly assigning a time slice of SCX_SLICE_DFL. Moreover, instead of determining the task time slice in ops.enqueue(), refresh the time slice immediately before the task is started on its assigned CPU in ops.running(). This allows to use a more precise time slice, adjusted based on the actual amount of tasks that are currently waiting to be scheduled. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-22 09:23:37 +02:00
Tejun Heo	d6ac5fbd9c	scx_layered: Drop SCX_OPS_ENQ_LAST The meaning of SCX_OPS_ENQ_LAST will change with future kernel updates and enqueueing on local DSQ will no longer be sufficient to avoid stalls. No reason to do it anyway. Just drop it.	2024-08-21 13:13:59 -10:00
Tejun Heo	f726f0b73b	Version: Cargo.lock	2024-08-21 06:45:19 -10:00
Tejun Heo	4d1f0639d8	Version: v1.0.3	2024-08-21 06:42:11 -10:00
Andrea Righi	fedfee0bd6	scx_bpfland: drop unused variable With the global scx_utils::NR_CPU_IDS we don't need Topology anymore in init_primary_domain(), so drop the variable to fix the following build warning: warning: unused variable: `topo` --> src/main.rs:385:9 \| 385 \| topo: &Topology, \| ^^^^ help: if this is intentional, prefix it with an underscore: `_topo` \| = note: `#[warn(unused_variables)]` on by default Fixes: `1da249f` ("scx_utils::topology: Always use NR_CPU_IDS and NR_CPUS_POSSIBLE") Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-21 17:46:12 +02:00
Andrea Righi	9f7a11bba6	Merge pull request #528 from sched-ext/bpfland-turbo-boost scx_bpfland: properly classify Intel Turbo Boost CPUs	2024-08-21 17:40:25 +02:00
Daniel Hodges	f2a6661a85	Merge pull request #524 from hodgesds/layered-core-fixes scx_layered: Fix core selection	2024-08-21 08:13:33 -04:00
Tejun Heo	9c62019c81	Merge pull request #527 from sched-ext/htejun/scx_utils scx_utils::cpumask,topology: Misc updates	2024-08-20 22:25:25 -10:00
Andrea Righi	695e3b25b0	scx_bpfland: classify CPUs depending of their the base frequency Use the base frequency, instead of maximum frequency, to classify fast and slow CPUs. This ensures accurate distinction between Intel Turbo Boost CPUs and genuinely faster CPUs when auto-detecting the primary scheduling domain. Signed-off-by: Andrea Righi <andrea.righi@linux.dev>	2024-08-21 10:16:41 +02:00
Andrea Righi	e0fb99835d	Merge pull request #525 from sched-ext/bpfland-disable-interactive scx_bpfland: allow to completely disable interactive classification	2024-08-21 10:02:43 +02:00
Tejun Heo	5cf4212330	Revert "rusty: Integrate stats with the metrics framework" This reverts commit `83373b1f4e` in prepration for converting to scx_stats.	2024-08-20 21:59:25 -10:00

1 2 3 4 5 ...

699 Commits