File: cpuprofile-fileformat.adoc

package info (click to toggle)
google-perftools 2.18.1-1
  • links: PTS, VCS
  • area: main
  • in suites: sid
  • size: 10,616 kB
  • sloc: cpp: 78,789; ansic: 17,902; sh: 10,783; python: 4,278; makefile: 1,127; ruby: 198; asm: 130; awk: 12
file content (161 lines) | stat: -rw-r--r-- 4,832 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
= Gperftools CPU Profiler Binary Data File Format
:reproducible:

[.normal]
This file documents the binary data file format produced by the
gperftools CPU Profiler. It is one of "legacy" formats supported by
the pprof tool. For information about using the CPU Profiler, see
link:cpuprofile.html[its user guide].

The profiler source code, which generates files using this format, is at
`src/profiler.cc`.

== CPU Profile Data File Structure

CPU profile data files each consist of four parts, in order:

* Binary header
* Binary profile records
* Binary trailer
* Text list of mapped objects

The binary data is expressed in terms of "slots." These are words large
enough to hold the program's pointer type, i.e., for 32-bit programs
they are 4 bytes in size, and for 64-bit programs they are 8 bytes. They
are stored in the profile data file in the native byte order (i.e.,
little-endian for x86 and x86_64).

== Binary Header

The binary header format is show below. Values written by the profiler,
along with requirements currently enforced by the analysis tools, are
shown in parentheses.

[cols=",",options="header",]
|===
|slot |data
|0 |header count (0; must be 0)
|1 |header slots after this one (3; must be >= 3)
|2 |format version (0; must be 0)
|3 |sampling period, in microseconds
|4 |padding (0)
|===

The headers currently generated for 32-bit and 64-bit little-endian (x86
and x86_64) profiles are shown below, for comparison.

[cols=",,,,,",options="header",]
|===
| |hdr count |hdr words |version |sampling period |pad
|32-bit or 64-bit (slots) |0 |3 |0 |10000 |0

|32-bit (4-byte words in file) |`0x00000` |`0x00003` |`0x00000`
|`0x02710` |`0x00000`

|64-bit LE (4-byte words in file) |`0x00000 0x00000`
|`0x00003 0x00000` |`0x00000 0x00000` |`0x02710 0x00000`
|`0x00000 0x00000`
|===

The contents are shown in terms of slots, and in terms of 4-byte words
in the profile data file. The slot contents for 32-bit and 64-bit
headers are identical. For 32-bit profiles, the 4-byte word view matches
the slot view. For 64-bit profiles, each (8-byte) slot is shown as two
4-byte words, ordered as they would appear in the file.

The profiling tools examine the contents of the file and use the
expected locations and values of the header words field to detect
whether the file is 32-bit or 64-bit.

== Binary Profile Records

The binary profile record format is shown below.

[cols=2*]
|===
|slot
|data

|0
|sample count, must be >= 1

|1
|number of call chain PCs (num_pcs), must be >= 1

|2 .. (num_pcs + 1)
|call chain PCs, most-recently-called function first.
|===

The total length of a given record is 2 + num_pcs.

Note that multiple profile records can be emitted by the profiler having
an identical call chain. In that case, analysis tools should sum the
counts of all records having identical call chains.

*Note:* Some profile analysis tools terminate if they see _any_ profile
record with a call chain with its first entry having the address 0.
(This is similar to the binary trailer.)

=== Example

This example shows the slots contained in a sample profile record.

[cols=",,,,",]
|===
|5 |3 |0xa0000 |0xc0000 |0xe0000
|===

In this example, 5 ticks were received at PC 0xa0000, whose function had
been called by the function containing 0xc0000, which had been called
from the function containing 0xe0000.

== Binary Trailer

The binary trailer consists of three slots of data with fixed values,
shown below.

[cols=",",options="header",]
|===
|slot |value
|0 |0
|1 |1
|2 |0
|===

Note that this is the same data that would contained in a profile record
with sample count = 0, num_pcs = 1, and a one-element call chain
containing the address 0.

== Text List of Mapped Objects

The binary data in the file is followed immediately by a list of mapped
objects. This list consists of lines of text separated by newline
characters.

Each line describes one mapping as produced by SaveProcSelfMaps. For
example:

....
  40000000-40015000 r-xp 00000000 03:01 12845071   /lib/ld-2.3.2.so
....

The first address must start at the beginning of the line. This is
essentially the same format as Linux's `/proc/<pid>/maps` file.
Recent Linux systems have this format documented in
link:https://man7.org/linux/man-pages/man5/proc_pid_maps.5.html[`man 5
proc_pid_maps`]. Less recent systems document it under `man 5 proc`.

Tools ignore minor:major number and inode number. And only executable
mappings really need to be present. See
`src/base/proc_maps_iterator.{h,cc}` for how it is produced.

Unrecognized lines should be ignored by analysis tools.

Note, original pprof tool also supported processing `$build`
"variable" when processing mappings, but we never produced such
mappings. So we don't document this anymore.

'''''

Original authror: Chris Demetriou (cgd) +
Last update by: Aliaksei Kandratsenka +