1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392
|
2025-11-20 Mauro Carvalho Chehab <mchehab@kernel.org>
- Version 0.8.4
* Makefile.am: package completions at the source tarball
* rasdaemon.spec.in: add install rules for completions file
* unified-sel.c: use tabs instead of spaces for indent
* Makefile.am: sort if groups
* Makefile.am: use one file per line, in alphabetic order
* INSTALL: update it with the autogenerated one from new autotools
* rasdaemon: Fix CPU isolate errors with limited CPUs at startup
* rasdaemon: Update the supported modules in the HiSilicon common section
* rasdaemon: Add support for the CXL memory sparing events
* rasdaemon: cxl: Add validity check for parsing and logging the corrected memory error threshold event flags and error count
* rasdaemon: cxl: Update handling of Common Event Record to CXL spec rev3.2
* rasdaemon: fix post-processing options
* rasdaemon: cxl: Add validity check for parse and log hdr_maint_op_sub_class field
* rasdaemon: cxl: Update ras-report.c with missing maintenance operation subclass information
* rasdaemon: cxl: Add parse and log linear cache address alias emission for cxl events
* Makefile.am: install completions files
* completions: add completions files for zsh and bash
* ras-events: Revert page_size to 4096 in get_pagesize()
* ras-events: fix for return error code in add_event_handler()
* ras-events: fix buffer overflow in add_event_handler read
* unified-sel: added AER Uncorrectable Error reporting to SEL on OpenBMC
* README.md: Fix spelling and grammar errors in README.md
* rasdaemon: Fix coding style issues
* rasdaemon: add event level for event record
* aer: print pci device name and vendor/device id
* rasdaemon: erst: decode panic mce through erst
* rasdaemon: introduce poison page statistics
* rasdaemon: support memory corrected error statistics
* dmidecode Requires on riscv64 Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
* rasdaemon: skip doesn't exist event
* config: add rsyslog config
* config: add syslog-ng and logrotate config
* rasdaemon: trace SIGBUS event for hardware error
2024-03-10 Mauro Carvalho Chehab <mchehab@kernel.org>
- Version 0.8.3
* ras-diskerror-handler: fix checkpatch warnings
* ras-diskerror-handler: Use the right dev_t decoding for diskerror handler
* rasdaemon: Add new modules supported by HiSilicon common section
* rasdaemon: Fix some static check warning
* rasdaemon: Fix few compilation warnings in non standard hisilicon code
* rasdaemon: Fix some compilation alarms in ras-record.h.
* rasdaemon: ras-mc-ctl: Update logging of CXL memory module data to align with CXL spec rev 3.1
* rasdaemon: ras-mc-ctl: Update logging of CXL DRAM event data to align with CXL spec rev 3.1
* rasdaemon: ras-mc-ctl: Update logging of CXL general media event data to align with CXL spec rev 3.1
* rasdaemon: ras-mc-ctl: Update logging of common event data to align with CXL spec rev 3.1
* rasdaemon: ras-mc-ctl: Fix logging of memory event type in CXL DRAM error table
* rasdaemon: cxl: Update memory module event to CXL spec rev 3.1
* rasdaemon: cxl: Update CXL DRAM event to CXL spec rev 3.1
* rasdaemon: cxl: Update CXL general media event to CXL spec rev 3.1
* rasdaemon: cxl: Add Component Identifier formatting for CXL spec rev 3.1
* rasdaemon: cxl: Update common event to CXL spec rev 3.1
* rasdaemon: cxl: Add automatic indexing for storing CXL fields in SQLite database
* rasdaemon: cxl: Fix mismatch in region field's name with kernel DRAM trace event
* rasdaemon: cxl: Fix logging of memory event type of DRAM trace event
* rasdaemon: Fix for parsing error when trace event's format file is larger than PAGE_SIZE
* rasdaemon: Add page offline support for cxl memory
* Add labels for ASRock X370 Taichi
* Add labels for ASRock X570 Creator
* Add labels for ASRock X570S PG Riptide
* rasdaemon: mce: decode io port for bus error
* rasdaemon: arm: do not print error msg if field not found
* rasdaemon: add DE error type for AMD
* rasdaemon: Fix the display format of JaguarMicro vendor no standard errors
2024-07-19 Mauro Carvalho Chehab <mchehab@kernel.org>
- Version 0.8.2
* ChangeLog: fix 0.8.1 release date
* checkpatch.pl: warn also about strcat and sprintf usages
* ci.yml: run checkpatch when doing tests
* Makefile: add types.h to the list of headers
* Makefile: only enable rbtree if needed
* mce-intel: drop a code commented a long time ago with an action
* mce-intel-ivb/mce-intel-sb: remove code commented with #if 0
* New feature: support memory row CE threshold policy
* non-standard-jaguarmicro: avoid CamelCase
* ras-arm-handler: cope with latest upstream changes
* ras-arm-handler: Parse and log ARM Processor Error Info table
* ras-arm-handler: use GENMASK() macro
* rasdaemon: add .editorconfig file to follow our coding style
* rasdaemon: alphabetically sort includes
* rasdaemon: check if sscanf() processed all arguments on dev_name
* rasdaemon: cleanup coding style
* rasdaemon: coding style cleanup
* rasdaemon: CXL: Extract, log and record region info from cxl_general_media and cxl_dram events
* rasdaemon: CXL: Fix uncorrectable macro spelling
* rasdaemon: don't use braces for single statement blocks
* rasdaemon: don't use unsafe strcpy, strcat and sprintf
* rasdaemon: enforce SPDX license tags
* rasdaemon: fix a coding style issue
* rasdaemon: fix checkpatch warnings
* rasdaemon: fix initialization potential issues
* rasdaemon: fix for compilation warning in ras-memory-failure-handler.c
* rasdaemon: fix mem_fail_event build breakage
* rasdaemon: fix some coding style issues
* rasdaemon: fix some typos and correct spelling
* rasdaemon: ipmitool SEL logging of AER CEs on OpenBMC platforms
* rasdaemon: modify support for vendor-specific machine check error information
* rasdaemon: move type macros to a separate header (types.h)
* rasdaemon: rbtree: removed unused definition for RB_ROOT
* rasdaemon: use __func__ instead of the name of the function
* ras-events: demote log information about trace being enabled/disabled
* ras-events: drop a dead code to check number of CPUs
* ras-events: fix -d option to work again
* ras-events: fix warning ‘filter_ras_mc_event’ defined but not used
* ras-events: make returned error code consistent
* ras-mc-ctl: log hpa and region info from cxl_general_media and cxl_dram tables
* ras-page-isolation: drop an unused variable
* ras-page-isolation: fix location_fields size
* ras-page-isolation: fix an overflow bug when units->val=1
* ras-page-isolation: use snprintf() instead of sprintf()
* scripts/checkpatch.pl: add support for checking SPDX
* scripts/checkpatch.pl: do some additional cleanups
* scripts/checkpatch.pl: set default mode to strict
* scripts/checkpatch.pl: some improvements to reduce false positives
* types.h: add an implementation for strscpy() and strscat()
* types.h: don't depend on linux/bits.h
* unified-sel: convert license boilerplate to SPDX
2024-07-16 Mauro Carvalho Chehab <mchehab@kernel.org>
- Version 0.8.1
* Lots of new labels added;
* Added a script to check coding style;
* Detect non-x86 CPUs on MCE handler;
* Added an error injection script for QEMU;
* Allow external scripts to handle memory errors;
* Fix ARM processor compatibility with upstream;
* Add support for CXL events;
* Add support for microcode and PPIN on MCE traces;
* Add non-standard decodes for Yitian and Jaguarmicro;
* Add EmeraldRapids Intel CPU;
* Add Corrected Internal Error for aer_cor_errors;
* Add support for disable ras events in runtime;
* Improve AMD SMCA handling;
* Fix several compilation and runtime issues;
* Add option on ras-mc-ctl to exclude old events from reports;
* Other improvements at ras-mc-ctl tool;
* Documentation updates;
* Code cleanups;
* Coding style cleanups.
2023-02-18 Mauro Carvalho Chehab <mchehab@kernel.org>
- Version 0.8.0
* This version now uses libtraceevent. Since its beginning,
rasdaemon came with an early version of this library. Now,
instead of keeping it embedded, use it from the system's
package.
* Fix mock build target and rasdaemon.spec.in
* Update README with instructions about contributing and
convert to markdown
* Fix a regression with Kernel 6.1-rc6
* make distcheck now works
* Add labels for ASRockRack model X399D8A-2T
2023-01-21 Mauro Carvalho Chehab <mchehab@kernel.org>
- Version 0.7.0
* Add labels for ASUS TUF GAMING B450-PLUS II
* Add four modules supported by HiSilicon common section
* Updated HiSilicon platform name
* Relocate reading and display Kunpeng920 errors to under Kunpeng9xx
* Add support to display the HiSilicon vendor errors for a specified module
* Add printing usage if necessary parameters are not passed for the vendor-error options
* Reformat error info of the HiSilicon Kunpeng920
* Modify error statistics for HiSilicon KunPeng9xx common errors
* Modify recording Hisilicon common error data
* Support cpu fault isolation for recoverable errors
* Support cpu fault isolation for corrected errors
* Use XSI version of strerror_r on non glibc systems
* Use the new block_rq_error tracepoint
* Fix bank limit types check
* Properly handle localtime() failure
* Fix for a memory out-of-bounds issue and optimized code to remove duplicate function.
* Fix possible but unlikely file descriptor leak
* Fix bashisms
2022-04-12 Mauro Carvalho Chehab <mchehab@kernel.org>
- Version 0.6.8
* Fix some issues related to sysconfigdir
* Some fixes for hisi boards
* Update ras-mc-ctl manpage to match current options
* Fix ras-mc-ctl when parsing some dimm sizes
* New asrock x570 motherboard label
* New Supermicro labels
* Support MCE for AMD CPU family 19h
* Add new SMCA bank types with error decoding
* Add error handling for Ampere-specific errors.
* Add support for multi-arch builds
2021-05-26 Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
- Version 0.6.7
* Support for Ice Lake and Sapphire Rapids
* Support for HiSilicon Kunpeng9xx
* Support for Ampere
* Support for memory failure events
* Support for ARM processor error information
* Support for decoding for new SMCA Load Store bank type
* Add 8 channel decoding for SMCA systems
* Improvements at the page isolation logic
* New labels: A2SDi-8C-HLN4F, A2SDi-8C+-HLN4F, ASUS PRIME X570-PRO
* New labels: Supermicro X10SRA-F and H8DGU
* Added support to specify SYSCONFDEFDIR
* RASSTATEDIR is now created at runtime
* Use a linked list for non-standard error decoding interface
* PCIe AER now displas PCIe dev name
* Fixed a memory leak
* Several fixes
* Added ppc64le to travis build
2020-07-21 Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
- Version 0.6.6
* Support for new AMD SMCA bank types
* Add decoders for more hip08 events
* Add support for memory Corrected Error predictive failure analysis
* Some bugs fixed
2019-11-20 Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
- Version 0.6.5
* Several fixes for error handling logic
* Alter tables on SQL in case of errors during update
* store PCIe dev name and TLP header for the aer event
2019-10-10 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.6.4
* Change DB for hip08 in order to better handle some OEM data
* Fix an issue of sqlite3 integer bind parameter mismatch
* Update instructions about sending patches
* Fix URLs to git.kernel.org repositories in README file
* Fix file descriptor leak in ras-report.c:setup_report_socket()
* Initialize record.cpu before pevent_print_event().
* Flush trace buffer immediately, not on next call
* Replace whitespaces by tabs
* Fix build with musl
2019-08-23 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.6.3
* Added support for ARM Scalable MCA
* Added support for HiSilicon HIP08
* Added support for Hygon Dhyana family 18h processor
* Added support for disk I/O error monitoring
* Added devlink events
* Integrate rasdaemon build tests with Travis CI
* Fixed asdaemon high CPU usage when part of CPUs offline
* Fixed mcgstatus message print
* Some other minor fixes
2018-08-14 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.6.2
* Update INSTALL from the auto-tools generated one
* Reorder this ChangeLog place new stuff at the beginning
* add option to show error counts at ras-mc-ctl
* Do some new gcc 8.1 warning cleanups
* Use separate string array for PCIe AER error status
* Fix PCIe AER error type
2018-04-25 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.6.1
* Update DIMM labels for 2-socket servers
* Add Skylake Xeon MSCOD values
* ARM: fully initialize ras_arm_event
* Update my email
2017-10-14 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.6.0
* Added support for non-standard CPER error sections
* Added support for Hisilicon HIP07 SAS HW module
* Added support for ARM events
* Updated DIMM labels for Intel Skylake servers
2016-06-08 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.5.9
* Add Knights Mill and updated DELL labels
* Configure now reports enabled options
2016-04-15 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.5.8
* Add Broadwell EP/EX MSCOD and Broadwell DE MSCOD values
2016-02-05 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.5.7
* Add model numbers for Broadwell-EP/EX and -DE
* Add support for Knights Landing processor
2015-07-03 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.5.6
* Add internal errors of IA32_MC4_STATUS for Haswell
* Use MCA error msg as error_msg
* Unnecessary comma for empty mc_location string
* Remove a space from mcgstatus_msg
* Add support to log Local Machine Check Exception (LMCE)
2015-06-03 Mauro Carvalho Chehab <mchehab+samsung@kernel.org>
- Version 0.5.5
* Improve INSTALL summary instructions
* Add support to match the machine by system's product name
* Add support for Haswell/Broadwell/Knights Landing
* Some bug fixes on some MCE handlers
2014-08-15 Mauro Carvalho Chehab <m.chehab@samsung.com>
- Version 0.5.4
* Fix a bug while parsing dimm labels on amd64
* Enable database recording by default on systemd service file
* Correct range while parsing top, middle and lower layers
2014-08-10 Mauro Carvalho Chehab <m.chehab@samsung.com>
- Version 0.5.3
* Add support for extlog trace events
* Some fixes affecting sqlite handling
* Handle failures of snprintf()
* Fix mce numfield decoded error
2014-04-03 Mauro Carvalho Chehab <m.chehab@samsung.com>
- Version 0.5.2
* Some fixes for ABRT report support
2014-03-25 Mauro Carvalho Chehab <m.chehab@samsung.com>
- Version 0.5.1
* Fix patches at *.service files
* Some fixes and documentation for --record option
2014-02-16 Mauro Carvalho Chehab <m.chehab@samsung.com>
- Version 0.5.0
* Initial ABRT support
2013-09-10 Mauro Carvalho Chehab <m.chehab@samsung.com>
- Version 0.4.2
* Fixes ras-mc-ctl layout
2013-05-29 Mauro Carvalho Chehab <mchehab+redhat@kernel.org>
- Version 0.4.1
* Some fixes, mostly at sqlite3 code
* Add support at ras-mc-ctl to query database
2013-05-28 Mauro Carvalho Chehab <mchehab+redhat@kernel.org>
- Version 0.4.0
* Several fixes
* Get rid of pthreads, to avoid troubles with sqllite3
(requires Kernel 3.10 or upper)
* Add memory error decoding on MCE traces
2013-05-20 Mauro Carvalho Chehab <mchehab+redhat@kernel.org>
- Version 0.3.0
* Several fixes
* Add support for MCE traces
* Add support for PCI AER traces
* Add a target to build it on rpm-based distros
2013-05-08 Mauro Carvalho Chehab <mchehab+redhat@kernel.org>
- Version 0.2.0
* Add support to log via syslog
* Add ras-mc-ctl script to handle dimm labels
* Add a rpm spec file
* Make sqlite3 code experimental
* Add manpages and systemd services
* Update to take advantage of tracing features on Kernel 3.10
2013-03-12 Mauro Carvalho Chehab <mchehab+redhat@kernel.org>
- Version 0.1.0
* Initial version
|