mirror of
https://github.com/FEX-Emu/linux.git
synced 2025-01-12 20:31:49 +00:00
Merge branch 'turbostat' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux into pm-tools
Pull turbostat updates for 4.6 from Len Brown. * 'turbostat' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux: tools/power turbostat: bugfix: TDP MSRs print bits fixing tools/power turbostat: correct output for MSR_NHM_SNB_PKG_CST_CFG_CTL dump tools/power turbostat: call __cpuid() instead of __get_cpuid() tools/power turbostat: indicate SMX and SGX support tools/power turbostat: detect and work around syscall jitter tools/power turbostat: show GFX%rc6 tools/power turbostat: show GFXMHz tools/power turbostat: show IRQs per CPU tools/power turbostat: make fewer systems calls tools/power turbostat: fix compiler warnings tools/power turbostat: add --out option for saving output in a file tools/power turbostat: re-name "%Busy" field to "Busy%" tools/power turbostat: Intel Xeon x200: fix turbo-ratio decoding tools/power turbostat: Intel Xeon x200: fix erroneous bclk value tools/power turbostat: allow sub-sec intervals tools/power turbostat: Decode MSR_MISC_PWR_MGMT tools/power turbostat: decode HWP registers x86 msr-index: Simplify syntax for HWP fields tools/power turbostat: CPUID(0x16) leaf shows base, max, and bus frequency tools/power turbostat: decode more CPUID fields
This commit is contained in:
commit
3fdb74649b
@ -230,10 +230,10 @@
|
|||||||
#define HWP_PACKAGE_LEVEL_REQUEST_BIT (1<<11)
|
#define HWP_PACKAGE_LEVEL_REQUEST_BIT (1<<11)
|
||||||
|
|
||||||
/* IA32_HWP_CAPABILITIES */
|
/* IA32_HWP_CAPABILITIES */
|
||||||
#define HWP_HIGHEST_PERF(x) (x & 0xff)
|
#define HWP_HIGHEST_PERF(x) (((x) >> 0) & 0xff)
|
||||||
#define HWP_GUARANTEED_PERF(x) ((x & (0xff << 8)) >>8)
|
#define HWP_GUARANTEED_PERF(x) (((x) >> 8) & 0xff)
|
||||||
#define HWP_MOSTEFFICIENT_PERF(x) ((x & (0xff << 16)) >>16)
|
#define HWP_MOSTEFFICIENT_PERF(x) (((x) >> 16) & 0xff)
|
||||||
#define HWP_LOWEST_PERF(x) ((x & (0xff << 24)) >>24)
|
#define HWP_LOWEST_PERF(x) (((x) >> 24) & 0xff)
|
||||||
|
|
||||||
/* IA32_HWP_REQUEST */
|
/* IA32_HWP_REQUEST */
|
||||||
#define HWP_MIN_PERF(x) (x & 0xff)
|
#define HWP_MIN_PERF(x) (x & 0xff)
|
||||||
|
@ -34,7 +34,10 @@ name as necessary to disambiguate it from others is necessary. Note that option
|
|||||||
\fB--debug\fP displays additional system configuration information. Invoking this parameter
|
\fB--debug\fP displays additional system configuration information. Invoking this parameter
|
||||||
more than once may also enable internal turbostat debug information.
|
more than once may also enable internal turbostat debug information.
|
||||||
.PP
|
.PP
|
||||||
\fB--interval seconds\fP overrides the default 5-second measurement interval.
|
\fB--interval seconds\fP overrides the default 5.0 second measurement interval.
|
||||||
|
.PP
|
||||||
|
\fB--out output_file\fP turbostat output is written to the specified output_file.
|
||||||
|
The file is truncated if it already exists, and it is created if it does not exist.
|
||||||
.PP
|
.PP
|
||||||
\fB--help\fP displays usage for the most common parameters.
|
\fB--help\fP displays usage for the most common parameters.
|
||||||
.PP
|
.PP
|
||||||
@ -61,7 +64,7 @@ displays the statistics gathered since it was forked.
|
|||||||
.nf
|
.nf
|
||||||
\fBCPU\fP Linux CPU (logical processor) number. Yes, it is okay that on many systems the CPUs are not listed in numerical order -- for efficiency reasons, turbostat runs in topology order, so HT siblings appear together.
|
\fBCPU\fP Linux CPU (logical processor) number. Yes, it is okay that on many systems the CPUs are not listed in numerical order -- for efficiency reasons, turbostat runs in topology order, so HT siblings appear together.
|
||||||
\fBAVG_MHz\fP number of cycles executed divided by time elapsed.
|
\fBAVG_MHz\fP number of cycles executed divided by time elapsed.
|
||||||
\fB%Busy\fP percent of the interval that the CPU retired instructions, aka. % of time in "C0" state.
|
\fBBusy%\fP percent of the interval that the CPU retired instructions, aka. % of time in "C0" state.
|
||||||
\fBBzy_MHz\fP average clock rate while the CPU was busy (in "c0" state).
|
\fBBzy_MHz\fP average clock rate while the CPU was busy (in "c0" state).
|
||||||
\fBTSC_MHz\fP average MHz that the TSC ran during the entire interval.
|
\fBTSC_MHz\fP average MHz that the TSC ran during the entire interval.
|
||||||
.fi
|
.fi
|
||||||
@ -83,13 +86,14 @@ Note that multiple CPUs per core indicate support for Intel(R) Hyper-Threading T
|
|||||||
\fBRAM_%\fP percent of the interval that RAPL throttling was active on DRAM.
|
\fBRAM_%\fP percent of the interval that RAPL throttling was active on DRAM.
|
||||||
.fi
|
.fi
|
||||||
.PP
|
.PP
|
||||||
.SH EXAMPLE
|
.SH PERIODIC EXAMPLE
|
||||||
Without any parameters, turbostat displays statistics ever 5 seconds.
|
Without any parameters, turbostat displays statistics ever 5 seconds.
|
||||||
(override interval with "-i sec" option, or specify a command
|
Periodic output goes to stdout, by default, unless --out is used to specify an output file.
|
||||||
for turbostat to fork).
|
The 5-second interval can be changed with th "-i sec" option.
|
||||||
|
Or a command may be specified as in "FORK EXAMPLE" below.
|
||||||
.nf
|
.nf
|
||||||
[root@hsw]# ./turbostat
|
[root@hsw]# ./turbostat
|
||||||
CPU Avg_MHz %Busy Bzy_MHz TSC_MHz
|
CPU Avg_MHz Busy% Bzy_MHz TSC_MHz
|
||||||
- 488 12.51 3898 3498
|
- 488 12.51 3898 3498
|
||||||
0 0 0.01 3885 3498
|
0 0 0.01 3885 3498
|
||||||
4 3897 99.99 3898 3498
|
4 3897 99.99 3898 3498
|
||||||
@ -145,7 +149,7 @@ cpu0: MSR_IA32_THERM_STATUS: 0x88340000 (48 C +/- 1)
|
|||||||
cpu1: MSR_IA32_THERM_STATUS: 0x88440000 (32 C +/- 1)
|
cpu1: MSR_IA32_THERM_STATUS: 0x88440000 (32 C +/- 1)
|
||||||
cpu2: MSR_IA32_THERM_STATUS: 0x88450000 (31 C +/- 1)
|
cpu2: MSR_IA32_THERM_STATUS: 0x88450000 (31 C +/- 1)
|
||||||
cpu3: MSR_IA32_THERM_STATUS: 0x88490000 (27 C +/- 1)
|
cpu3: MSR_IA32_THERM_STATUS: 0x88490000 (27 C +/- 1)
|
||||||
Core CPU Avg_MHz %Busy Bzy_MHz TSC_MHz SMI CPU%c1 CPU%c3 CPU%c6 CPU%c7 CoreTmp PkgTmp PkgWatt CorWatt GFXWatt
|
Core CPU Avg_MHz Busy% Bzy_MHz TSC_MHz SMI CPU%c1 CPU%c3 CPU%c6 CPU%c7 CoreTmp PkgTmp PkgWatt CorWatt GFXWatt
|
||||||
- - 493 12.64 3898 3498 0 12.64 0.00 0.00 74.72 47 47 21.62 13.74 0.00
|
- - 493 12.64 3898 3498 0 12.64 0.00 0.00 74.72 47 47 21.62 13.74 0.00
|
||||||
0 0 4 0.11 3894 3498 0 99.89 0.00 0.00 0.00 47 47 21.62 13.74 0.00
|
0 0 4 0.11 3894 3498 0 99.89 0.00 0.00 0.00 47 47 21.62 13.74 0.00
|
||||||
0 4 3897 99.98 3898 3498 0 0.02
|
0 4 3897 99.98 3898 3498 0 0.02
|
||||||
@ -171,14 +175,16 @@ The --debug option adds additional columns to the measurement ouput, including C
|
|||||||
See the field definitions above.
|
See the field definitions above.
|
||||||
.SH FORK EXAMPLE
|
.SH FORK EXAMPLE
|
||||||
If turbostat is invoked with a command, it will fork that command
|
If turbostat is invoked with a command, it will fork that command
|
||||||
and output the statistics gathered when the command exits.
|
and output the statistics gathered after the command exits.
|
||||||
|
In this case, turbostat output goes to stderr, by default.
|
||||||
|
Output can instead be saved to a file using the --out option.
|
||||||
eg. Here a cycle soaker is run on 1 CPU (see %c0) for a few seconds
|
eg. Here a cycle soaker is run on 1 CPU (see %c0) for a few seconds
|
||||||
until ^C while the other CPUs are mostly idle:
|
until ^C while the other CPUs are mostly idle:
|
||||||
|
|
||||||
.nf
|
.nf
|
||||||
root@hsw: turbostat cat /dev/zero > /dev/null
|
root@hsw: turbostat cat /dev/zero > /dev/null
|
||||||
^C
|
^C
|
||||||
CPU Avg_MHz %Busy Bzy_MHz TSC_MHz
|
CPU Avg_MHz Busy% Bzy_MHz TSC_MHz
|
||||||
- 482 12.51 3854 3498
|
- 482 12.51 3854 3498
|
||||||
0 0 0.01 1960 3498
|
0 0 0.01 1960 3498
|
||||||
4 0 0.00 2128 3498
|
4 0 0.00 2128 3498
|
||||||
@ -192,12 +198,12 @@ root@hsw: turbostat cat /dev/zero > /dev/null
|
|||||||
|
|
||||||
.fi
|
.fi
|
||||||
Above the cycle soaker drives cpu5 up its 3.9 GHz turbo limit.
|
Above the cycle soaker drives cpu5 up its 3.9 GHz turbo limit.
|
||||||
The first row shows the average MHz and %Busy across all the processors in the system.
|
The first row shows the average MHz and Busy% across all the processors in the system.
|
||||||
|
|
||||||
Note that the Avg_MHz column reflects the total number of cycles executed
|
Note that the Avg_MHz column reflects the total number of cycles executed
|
||||||
divided by the measurement interval. If the %Busy column is 100%,
|
divided by the measurement interval. If the Busy% column is 100%,
|
||||||
then the processor was running at that speed the entire interval.
|
then the processor was running at that speed the entire interval.
|
||||||
The Avg_MHz multiplied by the %Busy results in the Bzy_MHz --
|
The Avg_MHz multiplied by the Busy% results in the Bzy_MHz --
|
||||||
which is the average frequency while the processor was executing --
|
which is the average frequency while the processor was executing --
|
||||||
not including any non-busy idle time.
|
not including any non-busy idle time.
|
||||||
|
|
||||||
@ -233,7 +239,7 @@ in the brand string in /proc/cpuinfo. On a system where
|
|||||||
the TSC stops in idle, TSC_MHz will drop
|
the TSC stops in idle, TSC_MHz will drop
|
||||||
below the processor's base frequency.
|
below the processor's base frequency.
|
||||||
|
|
||||||
%Busy = MPERF_delta/TSC_delta
|
Busy% = MPERF_delta/TSC_delta
|
||||||
|
|
||||||
Bzy_MHz = TSC_delta/APERF_delta/MPERF_delta/measurement_interval
|
Bzy_MHz = TSC_delta/APERF_delta/MPERF_delta/measurement_interval
|
||||||
|
|
||||||
|
File diff suppressed because it is too large
Load Diff
Loading…
x
Reference in New Issue
Block a user