Solved WHEA Logger 18 crashes

  • Hi there and welcome to PC Help Forum (PCHF), a more effective way to get the Tech Support you need!
    We have Experts in all areas of Tech, including Malware Removal, Crash Fixing and BSOD's , Microsoft Windows, Computer DIY and PC Hardware, Networking, Gaming, Tablets and iPads, General and Specific Software Support and so much more.

    Why not Click Here To Sign Up and start enjoying great FREE Tech Support.

    This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.
  • Hello everyone We want to personally apologize to everyone for the downtime that we've experienced. We are working to get everything back up as quickly as possible. Due to the issues we've had, your password will need to be reset. Please click the button that says "Forgot Your Password" and change it. We are working to have things back to normal. Emails are fixed and should now send properly. Thank you all for your patience. Thanks, PCHF Management
Status
Not open for further replies.

Kristián Zsigó

PCHF Member
Nov 9, 2022
8
1
24
Computer Type: Desktop
GPU: AMD RX 5700
CPU: RYZEN 5 3600 6 CORE 12 THREADS
Motherboard: MSI B450 TOMAHAWK MAX (MS-7C02)
BIOS Version: 3.F0 07/23/2022
RAM: CORSAIR VENGEANCE® RGB PRO 16GB (2 x 8GB) DDR4 DRAM 3200MHz C16
PSU: SilverStone Essential Gold ET650-HG 650W
Operating System & Version: Windows 11 Pro 22H2 Build 22621.819
GPU Drivers: 21.50.21.11-220428a-382767C-AMD-Software-Adrenalin-Edition
Chipset Drivers: AMD B450 CHIPSET DRIVERS VERSION 4.09.23.507
Background Applications: LGHUB, iCUE AMD Adrenaline, HWINFO64, Malwarebytes
Description of Original Problem: Every time I try to play any game, it randomly crashes the whole system. It can happen after one hour of playing, or after a few days. Every time it's WHEA Logger 18 with random APIC ID (it basically crashed on every core ID ). The PC was build in 2019, these errors started in the begginening of this year.
Troubleshooting: I have tried every software fix like sfc. I also turned off cpu overclock and pbo. Temps are always stable, around 65 °C. Gpu is overclocked to 1850mhz with 1100mv max, vram is on 1860mhz. Gpu temps are also stable, around 75°C. I did all OCCT tests without errors. I also did MEMTEST86 for multiple hours without errors. I also tried changing RAM timing and voltage. RAM is currently running default at 2666mhz, no XMP. At this point I am trying to determine which HW component is responsible for the errors. I have a suspicion it's the RAM or CPU, but i am not 100% sure. So any suggestions what should I try next ?
 
Hello

Try resetting the overclock to a lower setting and see if that helps.. Also have you been running your pc at these settings since 2019?
 
Hello

Try resetting the overclock to a lower setting and see if that helps.. Also have you been running your pc at these settings since 2019?
Cpu is currently running on stock values, no overclock. And yes, I have been running at least PBO in ryzen master since start. There were some crashes from the start, but only in some games and it was never system crash, only display driver timeout.
 
Any crash dmps, if yes, see below;

1. Copy any dmp files from C:\Windows\Minidump onto the desktop.
2. Select all of them, right-click on one, and click on Send To> New Compressed (zipped) Folder.
3. Upload the zip folder using the Attach button, bottom left of the dialogue input box

Do not rule out the PSU as being the cause.
 
  • Like
Reactions: veeg
Any crash dmps, if yes, see below;

1. Copy any dmp files from C:\Windows\Minidump onto the desktop.
2. Select all of them, right-click on one, and click on Send To> New Compressed (zipped) Folder.
3. Upload the zip folder using the Attach button, bottom left of the dialogue input box

Do not rule out the PSU as being the cause.
C:\Windows\Minidump doesn't exist for me. I have checked startup and recovery options, if creating minidump files is configured and windows should create those files correctly.
 
As I said in my previous reply, I have checked, if small memory dumps are enabled, like it's shown in the link - they were already enabled.
 
Are you actually getting any BSOD and subsequent Windows Stop Code on the screen.
 
Nope, just system crash and reboot. Usually, when the crash happens, I lose video input, the last sound loops for a few seconds and then, the system reboots.
 
That explains it then, the computer is not crashing.

Software such as Windows can crash and when it does crash you get a BSOD and when enabled a crash dmp is generated, programs or games when they crash can on occasion close to the desktop but the computer will still be 100% functional.

Hardware failure such as a weak power supply and/or overheating are not software related and when a computer for example suddenly turns off, freezes or the screen goes black etc the behaviour should be described as the "computer shut down unexpectedly" or froze etc and not as having crashed as the latter implies a software issue as opposed to an obvious hardware issue when described properly.

Having the correct info means that helpers will not be looking for a software issue when the problem is clearly hardware related.
 
Okay, so let me correct myself. Computer is "shutting down unexpectedly". After every incident, it generates these WHEA Logger errors. I think it's hardware failure too (that's why I am posting in internal hardware section). From my research, the PSU shouldn't be weak, or low quality. In my initial post I have also stated, that none of the components is overheating. I am just trying to find out, which component is faulty.
 
  • Like
Reactions: veeg
A massive clue is WHEA which = Windows Hardware Error Architecture which is a generic error message for " some hardware " fell over and what you describe is typical behaviour of a weak PSU that is overheating internally.

PSU: SilverStone Essential Gold ET650-HG 650W

A decent brand years ago but not any more, your PSU was first released in 2017, had a pretty high failure rate and a sign of its lesser quality is the miserly three year warranty, recommended brands like Corsair, EVGA and Seasonic offer seven, ten and twelve year warranties for their PSUs which is showing the confidence that they have in their products, you mention the rig being built in 2019 so unless it was built in either November or December it will be out of warranty and needs to be replaced in any event.

Suggest that you see if you can borrow a known good 650W or above Gold efficiency rated PSU to swap in for testing purposes, if you can`t do this the only alternative is to get a local tech who will have the required equipment to test it properly.

I have also stated, that none of the components is overheating.

Been doing this stuff for years so am quite well placed to evaluate symptoms, you obviously are not aware that PSUs have internal thermal sensors that shut down PSUs when they overheat, there is no way for an end user to measure a PSUs internal temps and that is the reason why trying an alternative and appropriate PSU be tried or the services of a local tech are sought, your computer to do as you see fit and we can do no more.
 
Since I switched to different PSU, no unexpected shutdowns so far. If there are no reboots for another week, I think it's safe to say it was a bad PSU causing the problem.
 
Status
Not open for further replies.