Hey all, first post, but I'm at my wit's end. First off, here's my build:
AMD Ryzen 9 7950X
ASUS ROG Strix X670E-F Gaming Wifi
2 sticks of Kingston Technology Fury Beast 16GB 6000MT/s DDR5 CL36 AMD Expo RAM
ASUS GeForce RTX Nvidia 4070 Ti TUF Gaming
Corsair RMX Series RM1000x, 1000W Gold Power Supply
Samsung 980 Pro SSD 1TB PCIe NVMe Gen 4 M.2 for OS and odds and ends storage
Samsung 980 Pro SSD 2TB PCIe NVMe Gen 4 M.2 for programs and multimedia storage
I'm going to be as thorough as possible with my troubleshooting; if you'd prefer to avoid the long read, a summary will be provided at the end.
It ran fine for 4 months, then suddenly, three days ago, every time I loaded into Diablo 4 it would crash to desktop without giving a reason. I figured it was probably a Diablo thing, so I loaded up another game. Played for around 6 minutes, boom, crash to desktop. I assumed a GPU driver issue, so I opened Nvidia's GeForce Experience and tried to update drivers, but the download kept failing, saying that either I had no hard drive space (I have 600GB left on the 1TB, which is my C: drive) or that the download was corrupted. I decided to try a manual install, but that failed too. Thinking it could be a drive issue at this point, I tried downloading something other than drivers to both drives, and both were successful (and all of my existing files were still accessible, which in my experience means it most likely isn't a drive problem). At this point during my research, I realized that Chrome had been giving a lot of "Aw snap" pages during my browsing, which I had ignored as Chrome being Chrome at first, but I started to think that might be an indicator of the true issue. It was around this time, while I was browsing, that I had my first reboot. No errors, nothing, just a black screen and a reboot. I loaded back in, started digging some more and this time, pretty quickly afterwards, I had a BSOD with no error code to dig into.
With the quick crash, I began to suspect overheating, so I gave it a break and booted back in later. After monitoring for a good twenty to thirty minutes while doing a range of non-gaming activities, it never got above 70 degrees Celsius during processing-intensive activities, and it baselined at around 34 degrees when idle. Overheating was out of the question, to my mind.
I began to dig around Device Manager and realized that my GPU had an error flag next to it. I uninstalled the drivers and GeForce Experience and reinstalled them (successfully this time) and the error cleared up, so I tried booting back into Diablo 4, which promptly crashed again with no error log, but not before I noticed a strange warbling effect on the right side of the screen. I decided to boot up an EA game to see if maybe it'd have a different error message which, finally, it did, crashing mere moments after I loaded in. The message read something along the lines of "Crash reason GPUDeviceRemoved." Must be the GPU, right? But I realized that, for a good portion of the crashes, the GPU being used was the onboard graphics, not the Nvidia card, because of the errors I later fixed with the fresh driver install. So I'm blanking. It could be RAM; a good majority of the issues with the crashes, reboots, etc. seem like RAM issues, but then the GPU is acting squirrelly too. It could be the entire mobo, with how widespread the issues are, too. The final things I tested were Cinebench, which ran for a good while before crashing, and the Windows Memory Diagnostic tool, which didn't finish either time I ran it.
Currently, my gaming PC has been relegated to the role of a glorified Netflix machine, and even that crashes from time to time (not to desktop; the Chrome page crashes and I have to refresh to make it continue).
Side Note: A week before these issues popped up, I cleaned my PC by blowing the dust out. To make sure I didn't somehow wiggle or blow something loose, I reseated my components and tested things again, to no avail.
TL;DR
-Chrome tabs throwing errors after being open for a bit (YouTube usually throws "Aw Snap" error on initial load too)
-Intensive games crash almost as soon as I get beyond the opening menus (one game gave a "GPUDeviceRemoved" error)
-Random reboots (both with the Nvidia GPU active and while an error in Device Manager had forced the onboard graphics to take over)
-BSODs without error codes
-Cinebench not finishing its tests
-Windows Memory Diagnostic tool crashing before it finishes its tests
-The system is not overheating, according to several separate monitors I ran
-Couldn't update Nvidia Drivers
If I think of anything else, or if anyone has more questions, I'll add to this. I'm just trying to get to the root of the problem. Any ideas on what to troubleshoot, what the issue sounds like, etc.? Like I said, I'm stumped...