WHEA Logger 18 crashes

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • Kristian_Zsigo
    PCHF Member
    • Nov 2022
    • 8

    #1

    WHEA Logger 18 crashes

    Computer Type: Desktop
    GPU: AMD RX 5700
    CPU: RYZEN 5 3600 6 CORE 12 THREADS
    Motherboard: MSI B450 TOMAHAWK MAX (MS-7C02)
    BIOS Version: 3.F0 07/23/2022
    RAM: CORSAIR VENGEANCE® RGB PRO 16GB (2 x 8GB) DDR4 DRAM 3200MHz C16
    PSU: SilverStone Essential Gold ET650-HG 650W
    Operating System & Version: Windows 11 Pro 22H2 Build 22621.819
    GPU Drivers: 21.50.21.11-220428a-382767C-AMD-Software-Adrenalin-Edition
    Chipset Drivers: AMD B450 CHIPSET DRIVERS VERSION 4.09.23.507
    Background Applications: LGHUB, iCUE AMD Adrenaline, HWINFO64, Malwarebytes
    Description of Original Problem: Every time I try to play any game, it randomly crashes the whole system. It can happen after one hour of playing, or after a few days. Every time it’s WHEA Logger 18 with random APIC ID (it basically crashed on every core ID ). The PC was build in 2019, these errors started in the begginening of this year.
    Troubleshooting: I have tried every software fix like sfc. I also turned off cpu overclock and pbo. Temps are always stable, around 65 °C. Gpu is overclocked to 1850mhz with 1100mv max, vram is on 1860mhz. Gpu temps are also stable, around 75°C. I did all OCCT tests without errors. I also did MEMTEST86 for multiple hours without errors. I also tried changing RAM timing and voltage. RAM is currently running default at 2666mhz, no XMP. At this point I am trying to determine which HW component is responsible for the errors. I have a suspicion it’s the RAM or CPU, but i am not 100% sure. So any suggestions what should I try next ?
  • veeg
    PCHF Director
    • Jul 2016
    • 8978

    #2
    Hello

    Try resetting the overclock to a lower setting and see if that helps.. Also have you been running your pc at these settings since 2019?

    Comment

    • Kristian_Zsigo
      PCHF Member
      • Nov 2022
      • 8

      #3
      Originally posted by veeg
      Hello

      Try resetting the overclock to a lower setting and see if that helps.. Also have you been running your pc at these settings since 2019?
      Cpu is currently running on stock values, no overclock. And yes, I have been running at least PBO in ryzen master since start. There were some crashes from the start, but only in some games and it was never system crash, only display driver timeout.

      Comment

      • veeg
        PCHF Director
        • Jul 2016
        • 8978

        #4
        Let’s get more help.

        @Evan Omo

        Comment

        • phillpower2
          PCHF Administrator
          • Sep 2016
          • 15206

          #5
          Any crash dmps, if yes, see below;
          1. Copy any dmp files from C:\Windows\Minidump onto the desktop.
          2. Select all of them, right-click on one, and click on Send To> New Compressed (zipped) Folder.
          3. Upload the zip folder using the Attach button, bottom left of the dialogue input box

          Do not rule out the PSU as being the cause.

          Comment

          • Kristian_Zsigo
            PCHF Member
            • Nov 2022
            • 8

            #6
            Originally posted by phillpower2
            Any crash dmps, if yes, see below;
            1. Copy any dmp files from C:\Windows\Minidump onto the desktop.
            2. Select all of them, right-click on one, and click on Send To> New Compressed (zipped) Folder.
            3. Upload the zip folder using the Attach button, bottom left of the dialogue input box

            Do not rule out the PSU as being the cause.
            C:\Windows\Minidump doesn’t exist for me. I have checked startup and recovery options, if creating minidump files is configured and windows should create those files correctly.

            Comment

            • phillpower2
              PCHF Administrator
              • Sep 2016
              • 15206

              #7
              Please, no need to quote every reply and nothing wrong with my eyesight so no need for bold text either

              Make sure that small memory dmps are enabled, see Configure the dump type

              Comment

              • Kristian_Zsigo
                PCHF Member
                • Nov 2022
                • 8

                #8
                As I said in my previous reply, I have checked, if small memory dumps are enabled, like it’s shown in the link - they were already enabled.

                Comment

                • phillpower2
                  PCHF Administrator
                  • Sep 2016
                  • 15206

                  #9
                  Are you actually getting any BSOD and subsequent Windows Stop Code on the screen.

                  Comment

                  • Kristian_Zsigo
                    PCHF Member
                    • Nov 2022
                    • 8

                    #10
                    Nope, just system crash and reboot. Usually, when the crash happens, I lose video input, the last sound loops for a few seconds and then, the system reboots.

                    Comment

                    • phillpower2
                      PCHF Administrator
                      • Sep 2016
                      • 15206

                      #11
                      That explains it then, the computer is not crashing.

                      Software such as Windows can crash and when it does crash you get a BSOD and when enabled a crash dmp is generated, programs or games when they crash can on occasion close to the desktop but the computer will still be 100% functional.

                      Hardware failure such as a weak power supply and/or overheating are not software related and when a computer for example suddenly turns off, freezes or the screen goes black etc the behaviour should be described as the “computer shut down unexpectedly” or froze etc and not as having crashed as the latter implies a software issue as opposed to an obvious hardware issue when described properly.

                      Having the correct info means that helpers will not be looking for a software issue when the problem is clearly hardware related.

                      Comment

                      • Kristian_Zsigo
                        PCHF Member
                        • Nov 2022
                        • 8

                        #12
                        Okay, so let me correct myself. Computer is “shutting down unexpectedly”. After every incident, it generates these WHEA Logger errors. I think it’s hardware failure too (that’s why I am posting in internal hardware section). From my research, the PSU shouldn’t be weak, or low quality. In my initial post I have also stated, that none of the components is overheating. I am just trying to find out, which component is faulty.

                        Comment

                        • phillpower2
                          PCHF Administrator
                          • Sep 2016
                          • 15206

                          #13
                          A massive clue is WHEA which = Windows Hardware Error Architecture which is a generic error message for " some hardware " fell over and what you describe is typical behaviour of a weak PSU that is overheating internally.
                          PSU: SilverStone Essential Gold ET650-HG 650W

                          A decent brand years ago but not any more, your PSU was first released in 2017, had a pretty high failure rate and a sign of its lesser quality is the miserly three year warranty, recommended brands like Corsair, EVGA and Seasonic offer seven, ten and twelve year warranties for their PSUs which is showing the confidence that they have in their products, you mention the rig being built in 2019 so unless it was built in either November or December it will be out of warranty and needs to be replaced in any event.

                          Suggest that you see if you can borrow a known good 650W or above Gold efficiency rated PSU to swap in for testing purposes, if you can`t do this the only alternative is to get a local tech who will have the required equipment to test it properly.
                          I have also stated, that none of the components is overheating.

                          Been doing this stuff for years so am quite well placed to evaluate symptoms, you obviously are not aware that PSUs have internal thermal sensors that shut down PSUs when they overheat, there is no way for an end user to measure a PSUs internal temps and that is the reason why trying an alternative and appropriate PSU be tried or the services of a local tech are sought, your computer to do as you see fit and we can do no more.

                          Comment

                          • Kristian_Zsigo
                            PCHF Member
                            • Nov 2022
                            • 8

                            #14
                            Okay, I will try different PSU and see, if it solves my problem. Thanks for help.

                            Comment

                            • phillpower2
                              PCHF Administrator
                              • Sep 2016
                              • 15206

                              #15
                              You are welcome and good luck (y)

                              Just to reiterate;
                              Originally posted by phillpower2
                              Suggest that you see if you can borrow a known good 650W or above Gold efficiency rated PSU
                              A proven brand such as those mentioned - if you can - would be best;
                              Originally posted by phillpower2
                              Corsair, EVGA and Seasonic

                              Comment

                              Working...