I have problems with my newly built work PC. I didn't build it myself; I only provided what hardware I wanted, and it arrived already built. I also have no idea if the problem is hardware related, or if sth needs to be updated or sth else. Here is everything that I managed to collect about the problem:
My PC randomly crashes while running algorithms for MRI image reconstruction or segmentation inside Docker containers. The Docker logs contain no errors. I've limited the amount of CPUs to 9, memory to 32GB and 2.5GB for swap.
The crashes appear to be completely random – sometimes it crashes within a few minutes, other times the same workload runs for several hours without any issues.
I've extensively checked the logs using the journalctl command, but I can't find anything that could be connected to the crash. There are also no new logs in /var/crash after the crash occurs.
When I run the same programs and data on my laptop (which has significantly weaker hardware), the laptop does not crash. I ran a stress test for the CPU last night, and it passed perfectly.
PC Build Specs:
My PC randomly crashes while running algorithms for MRI image reconstruction or segmentation inside Docker containers. The Docker logs contain no errors. I've limited the amount of CPUs to 9, memory to 32GB and 2.5GB for swap.
The crashes appear to be completely random – sometimes it crashes within a few minutes, other times the same workload runs for several hours without any issues.
I've extensively checked the logs using the journalctl command, but I can't find anything that could be connected to the crash. There are also no new logs in /var/crash after the crash occurs.
When I run the same programs and data on my laptop (which has significantly weaker hardware), the laptop does not crash. I ran a stress test for the CPU last night, and it passed perfectly.
PC Build Specs:
- CPU: Intel Core i7-14700
- Motherboard: B660 (DDR4 support)
- RAM: 64 GB (2×32 GB) DDR4-3200
- GPU: NVIDIA RTX 4070 12 GB
- Storage: 4 TB NVMe SSD
- PSU: 1500W
- OS: Ubuntu 20.04.2