linux

mirror of https://github.com/FEX-Emu/linux.git synced 2024-12-27 20:07:09 +00:00

Vincent Guittot 43f4d66637 sched: Improve sysbench performance by fixing spurious active migration

Since commit `caeb178c60` ("sched/fair: Make update_sd_pick_busiest() ...") sd_pick_busiest returns a group that can be neither imbalanced nor overloaded but is only more loaded than others. This change was introduced to ensure a better load balance in systems that are not overloaded, but as a side effect it can also generate useless active migrations between groups.

Let's take the example of 3 tasks on a quad-core system. We will always have an idle core, so the load balance will find a busiest group (core) whenever an ILB is triggered, and it will force an active migration (once above the nr_balance_failed threshold) so the idle core becomes busy but another core becomes idle. With the next ILB, the freshly idle core will try to pull the task of a busy CPU.

The number of spurious active migrations is not so large on a quad-core system because the ILB is not triggered that often. But it becomes significant as soon as you have more than one sched_domain level, as on a dual cluster of quad cores, where the ILB is triggered every tick when you have more than 1 busy_cpu.

We need to ensure that the migration generates a real improvement and will not only move the avg_load imbalance to another CPU.

Before `caeb178c60`, the filtering of such use cases was ensured by the following test in f_b_g:

    if ((local->idle_cpus < busiest->idle_cpus) &&
        busiest->sum_nr_running <= busiest->group_weight)

This patch modifies the condition to take into account the situation where the busiest group is not overloaded: if the difference between the number of idle CPUs in the 2 groups is less than or equal to 1 and the busiest group is not overloaded, moving a task will not improve the load balance but just move it.

A test with sysbench on a dual cluster of quad cores gives the following results:

    command: sysbench --test=cpu --num-threads=5 --max-time=5 run

The HZ is 200, which means that 1000 ticks have fired during the test.

With Mainline, perf gives the following figures:

    Samples: 727 of event 'sched:sched_migrate_task'
    Event count (approx.): 727
    Overhead  Command          Shared Object  Symbol
    ........  ...............  .............  ..............
      12.52%  migration/1      [unknown]      [.] 00000000
      12.52%  migration/5      [unknown]      [.] 00000000
      12.52%  migration/7      [unknown]      [.] 00000000
      12.10%  migration/6      [unknown]      [.] 00000000
      11.83%  migration/0      [unknown]      [.] 00000000
      11.83%  migration/3      [unknown]      [.] 00000000
      11.14%  migration/4      [unknown]      [.] 00000000
      10.87%  migration/2      [unknown]      [.] 00000000
       2.75%  sysbench         [unknown]      [.] 00000000
       0.83%  swapper          [unknown]      [.] 00000000
       0.55%  ktps65090charge  [unknown]      [.] 00000000
       0.41%  mmcqd/1          [unknown]      [.] 00000000
       0.14%  perf             [unknown]      [.] 00000000

With this patch, perf gives the following figures:

    Samples: 20 of event 'sched:sched_migrate_task'
    Event count (approx.): 20
    Overhead  Command          Shared Object  Symbol
    ........  ...............  .............  ..............
      80.00%  sysbench         [unknown]      [.] 00000000
      10.00%  swapper          [unknown]      [.] 00000000
       5.00%  ktps65090charge  [unknown]      [.] 00000000
       5.00%  migration/1      [unknown]      [.] 00000000

Signed-off-by: Vincent Guittot <vincent.guittot@linaro.org>
Reviewed-by: Rik van Riel <riel@redhat.com>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Link: http://lkml.kernel.org/r/1412170735-5356-1-git-send-email-vincent.guittot@linaro.org
Signed-off-by: Ingo Molnar <mingo@kernel.org>

2014-10-03 05:46:54 +02:00
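
The two filters described in the commit message can be compared side by side. The following is a minimal user-space sketch, not the kernel code: the struct, the enum, and both function names are hypothetical stand-ins, and the new condition is written from the prose description of the patch rather than copied from fair.c.

```c
#include <stdbool.h>

/* Hypothetical, simplified stand-in for a scheduling group's classification. */
enum sketch_group_type {
    SKETCH_GROUP_OTHER,       /* only more loaded than others */
    SKETCH_GROUP_IMBALANCED,
    SKETCH_GROUP_OVERLOADED
};

/* Hypothetical, simplified stand-in for per-group load-balance statistics. */
struct sketch_group_stats {
    unsigned int idle_cpus;       /* idle CPUs in the group */
    unsigned int sum_nr_running;  /* runnable tasks in the group */
    unsigned int group_weight;    /* CPUs in the group */
    enum sketch_group_type type;
};

/*
 * Test quoted in the commit message as the filter used before caeb178c60.
 * Returns true when the situation is treated as already balanced.
 */
bool sketch_old_filter_balanced(const struct sketch_group_stats *local,
                                const struct sketch_group_stats *busiest)
{
    return local->idle_cpus < busiest->idle_cpus &&
           busiest->sum_nr_running <= busiest->group_weight;
}

/*
 * Condition as described in prose by the commit message (an assumption about
 * its exact form): if the busiest group is not overloaded and the local group
 * has at most one more idle CPU than the busiest group, moving a task only
 * relocates the imbalance, so treat the situation as balanced.
 */
bool sketch_new_filter_balanced(const struct sketch_group_stats *local,
                                const struct sketch_group_stats *busiest)
{
    return busiest->type != SKETCH_GROUP_OVERLOADED &&
           local->idle_cpus <= busiest->idle_cpus + 1;
}
```

For the commit's example of 3 tasks on a quad-core system, take the idle core as the local group (1 idle CPU) and a busy core as the busiest group (0 idle CPUs, 1 running task, not overloaded). The quoted older test does not report the pair as balanced, whereas the sketched new condition does, which is the case the patch aims to filter out.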
..
auto_group.c	sched: Change autogroup_move_group() to use for_each_thread()	2014-08-20 09:47:18 +02:00
auto_group.h
clock.c	kernel: use macros from compiler.h instead of __attribute__((...))	2014-04-07 16:36:11 -07:00
completion.c
core.c	sched: Use rq->rd in sched_setaffinity() under RCU read lock	2014-09-24 14:47:11 +02:00
cpuacct.c	cgroup: rename cgroup_subsys->base_cftypes to ->legacy_cftypes	2014-07-15 11:05:09 -04:00
cpuacct.h
cpudeadline.c	sched/deadline: Fix inter- exclusive cpusets migrations	2014-09-24 14:46:57 +02:00
cpudeadline.h	sched/deadline: Replace NR_CPUS arrays	2014-05-22 10:21:28 +02:00
cpupri.c	Merge commit '3cf2f34' into sched/core, to fix build error	2014-06-12 13:46:37 +02:00
cpupri.h	sched/cpupri: Replace NR_CPUS arrays	2014-05-22 10:21:29 +02:00
cputime.c	sched, time: Fix lock inversion in thread_group_cputime()	2014-09-19 12:35:17 +02:00
deadline.c	sched/deadline: Fix inter- exclusive cpusets migrations	2014-09-24 14:46:57 +02:00
debug.c	sched: print_rq(): Don't use tasklist_lock	2014-09-24 14:47:04 +02:00
fair.c	sched: Improve sysbench performance by fixing spurious active migration	2014-10-03 05:46:54 +02:00
features.h	sched: Rename capacity related flags	2014-06-05 11:52:32 +02:00
idle_task.c	sched: Transform resched_task() into resched_curr()	2014-07-16 13:38:19 +02:00
idle.c	sched: Let the scheduler see CPU idle states	2014-09-24 14:46:58 +02:00
Makefile
proc.c	cpuidle: menu: Lookup CPU runqueues less	2014-08-06 21:17:45 +02:00
rt.c	sched/rt: Use resched_curr() in task_tick_rt()	2014-09-24 14:47:12 +02:00
sched.h	sched, mips, ia64: Remove __ARCH_WANT_UNLOCKED_CTXSW	2014-09-24 14:47:05 +02:00
stats.c
stats.h
stop_task.c	sched: Add wrapper for checking task_struct::on_rq	2014-08-20 14:52:59 +02:00
wait.c	sched: Allow wait_on_bit_action() functions to support a timeout	2014-07-16 15:10:41 +02:00