提交 55eaa7c1 编写于 作者: S Stanislaw Gruszka 提交者: Ingo Molnar

sched: Avoid cputime scaling overflow

Here is patch, which adds Linus's cputime scaling algorithm to the
kernel.

This is a follow up (well, fix) to commit
d9a3c982 ("sched: Lower chances
of cputime scaling overflow") which commit tried to avoid
multiplication overflow, but did not guarantee that the overflow
would not happen.

Linus crated a different algorithm, which completely avoids the
multiplication overflow by dropping precision when numbers are
big.

It was tested by me and it gives good relative error of
scaled numbers. Testing method is described here:
http://marc.info/?l=linux-kernel&m=136733059505406&w=2

Originally-From: Linus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NStanislaw Gruszka <sgruszka@redhat.com>
Cc: Frederic Weisbecker <fweisbec@gmail.com>
Cc: rostedt@goodmis.org
Cc: Dave Hansen <dave@sr71.net>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/20130430151441.GC10465@redhat.comSigned-off-by: NIngo Molnar <mingo@kernel.org>
上级 25f55d9d
...@@ -506,34 +506,47 @@ void account_idle_ticks(unsigned long ticks) ...@@ -506,34 +506,47 @@ void account_idle_ticks(unsigned long ticks)
} }
/* /*
* Perform (stime * rtime) / total with reduced chances * Perform (stime * rtime) / total, but avoid multiplication overflow by
* of multiplication overflows by using smaller factors * loosing precision when the numbers are big.
* like quotient and remainders of divisions between
* rtime and total.
*/ */
static cputime_t scale_stime(u64 stime, u64 rtime, u64 total) static cputime_t scale_stime(u64 stime, u64 rtime, u64 total)
{ {
u64 rem, res, scaled; u64 scaled;
if (rtime >= total) { for (;;) {
/* /* Make sure "rtime" is the bigger of stime/rtime */
* Scale up to rtime / total then add if (stime > rtime) {
* the remainder scaled to stime / total. u64 tmp = rtime; rtime = stime; stime = tmp;
*/ }
res = div64_u64_rem(rtime, total, &rem);
scaled = stime * res; /* Make sure 'total' fits in 32 bits */
scaled += div64_u64(stime * rem, total); if (total >> 32)
} else { goto drop_precision;
/*
* Same in reverse: scale down to total / rtime /* Does rtime (and thus stime) fit in 32 bits? */
* then substract that result scaled to if (!(rtime >> 32))
* to the remaining part. break;
*/
res = div64_u64_rem(total, rtime, &rem); /* Can we just balance rtime/stime rather than dropping bits? */
scaled = div64_u64(stime, res); if (stime >> 31)
scaled -= div64_u64(scaled * rem, total); goto drop_precision;
/* We can grow stime and shrink rtime and try to make them both fit */
stime <<= 1;
rtime >>= 1;
continue;
drop_precision:
/* We drop from rtime, it has more bits than stime */
rtime >>= 1;
total >>= 1;
} }
/*
* Make sure gcc understands that this is a 32x32->64 multiply,
* followed by a 64/32->64 divide.
*/
scaled = div_u64((u64) (u32) stime * (u64) (u32) rtime, (u32)total);
return (__force cputime_t) scaled; return (__force cputime_t) scaled;
} }
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册