Computer crashing every ~5 minutes.

QS1993

Member
Hi there, had my computer about a week. It is now crashing every 5 minutes. I've done a bit of investigating and am now a bit stuck.

My computer just completely freezes. It occasionally turns itself off when it does this, but otherwise just the entire computer gets stuck on whatever I was looking at. It then boots completely fine, but after a few minutes will do the same thing again. When I go to eventviewer I get this:

WHEA-LOGGER Event ID 16

A fatal hardware error has occurred.

Component: PCI Express Root Port
Error Source: Advanced Error Reporting (PCI Express)

Primary Bus:Device:Function: 0x0:0x1:0x0
Secondary Bus:Device:Function: 0x0:0x0:0x0
Primary Device Name:DCI\VEN_8086&DEV_A70D&SUBSYS_88821043&REV_01
Secondary Device Name:

(Sorry, that is obviously : D but it turns it into the smiley)
I am leaning towards thinking this is a hardware issue. When I attempt to run the intel diagnostics, my computer crashes every single time. I have removed and reinserted the ram too. I have had my computer for just over a week and this has only just started happening with no changes to anything.

I will attach my specs below too:

CaseHYTE Y40 WHITE MID-TOWER CASE
Processor (CPU)Intel® Core™ i5 14-Core Processor i5-13600K (Up to 5.1GHz) 24MB Cache
MotherboardASUS® TUF GAMING Z790-PLUS WIFI D4 (LGA1700, USB 3.2, PCIe 5.0) - ARGB Ready
Memory (RAM)64GB Corsair VENGEANCE DDR4 3200Mhz (2 x 32GB)
Graphics Card8GB NVIDIA GEFORCE RTX 4060 Ti - HDMI, DP, LHR
1st M.2 SSD Drive2TB SOLIDIGM P41+ GEN 4 M.2 NVMe PCIe SSD (up to 4125MB/sR, 3325MB/sW)
1st Storage Drive2TB Samsung 870 QVO 2.5" SSD, SATA 6Gb/s (up to 560MB/sR | 530MB/sW)
DVD/BLU-RAY DriveNOT REQUIRED
Power SupplyCORSAIR 650W TXm SERIES™ SEMI-MODULAR 80 PLUS® GOLD, ULTRA QUIET
Power Cable1 x 1.5 Metre UK Power Cable (Kettle Lead)
Processor CoolingDeepCool AG400 Performance ARGB CPU Cooler
Thermal PasteARCTIC MX-4 EXTREME THERMAL CONDUCTIVITY COMPOUND
Extra Case Fans3 x 120mm Thermaltake TOUGHFAN 12 Case Fans
Sound CardONBOARD 6 CHANNEL (5.1) HIGH DEF AUDIO (AS STANDARD)
Network CardONBOARD LAN PORT
Wireless Network CardNOT REQUIRED
USB/Thunderbolt OptionsMIN. 2 x USB 3.0 & 2 x USB 2.0 PORTS @ BACK PANEL + MIN. 2 FRONT PORTS
Operating SystemNO OPERATING SYSTEM REQUIRED
Operating System LanguageUnited Kingdom - English Language
Windows Recovery MediaNO RECOVERY MEDIA REQUIRED
Office SoftwareFREE 30 Day Trial of Microsoft 365® (Operating System Required)
Anti-VirusNO ANTI-VIRUS SOFTWARE
BrowserGoogle Chrome™
Surge Protection6 Socket 2m Surge Protector
Cable Management3 x PCS 1.5M Zip Cable Tidy - Professional Cable Management
 

SpyderTracks

We love you Ukraine
If you can upload the following?

 

QS1993

Member
Hi there - can definitely do this, but i've not once had a BSOD. It is all just the entire system crashing and me having to hard reset it.

I have included what you have asked for.

https://we.tl/t-yUzDEAiTBb

Thought i'd include a bit more information. Its probably turned off about 40-50 times in the last 5 hours. It seems stable when i'm in safe mode, but haven't tried for any extended period of time. Sometimes it crashes immediately following startup, sometimes makes it 10-15 minutes.
 
Last edited:

ubuysa

The BSOD Doctor
From your log error the key bit of information is this...
Code:
Primary Device Name: DCI\VEN_8086&DEV_A70D&SUBSYS_88821043&REV_01
These VEN & DEV identifiers uniquely identify the hardware device that failed. Unfortunately none of the databases have that specific device listed! However, the VEN identifier of 8086 indicates it's an Intel device, probably a chipset device. I suggest you download the Intel Driver Support and Assistant and use that to look for driver updates for your Intel devices.

Do please upload the data @SpyderTracks requested.
 

QS1993

Member
Hi there - I did upload the data I think. It’s in the wetransfer link in my previous reply. Let me know if anything is missing or if you’d prefer a different website. Thank you for your help.

I have done that and no driver updates available. An update is that my computer ran memtest86 last night and no errors were found. Still crashing repeatedly in windows though.
 

SpyderTracks

We love you Ukraine
Hi there - I did upload the data I think. It’s in the wetransfer link in my previous reply. Let me know if anything is missing or if you’d prefer a different website. Thank you for your help.

I have done that and no driver updates available. An update is that my computer ran memtest86 last night and no errors were found. Still crashing repeatedly in windows though.
No dump files?
 

QS1993

Member
Sorry - will do when I get back home later. I just assumed that because it wasn’t actually blue screening they wouldn’t exist!
 

ubuysa

The BSOD Doctor
So the device we had the VBEN & DEV identifiers (8086&DEV_A70D) is the Intel(R) PCIe RC 010 G5 - A70D device, from your system info...
Code:
[Conflicts/Sharing]

Resource    Device   
I/O Port 0x00006000-0x00006FFF    Intel(R) PCIe RC 010 G5 - A70D   
I/O Port 0x00006000-0x00006FFF    NVIDIA GeForce RTX 4060 Ti   
       
Memory Address 0x84000000-0x850FFFFF    Intel(R) PCIe RC 010 G5 - A70D   
Memory Address 0x84000000-0x850FFFFF    NVIDIA GeForce RTX 4060 Ti   
       
IRQ 17    High Definition Audio Controller   
IRQ 17    High Definition Audio Controller   
       
Memory Address 0x87100000-0x87103FFF    Standard NVM Express Controller   
Memory Address 0x87100000-0x87103FFF    Intel(R) PCIe RC 060 (x4) G4 - A74D
I have no idea whether the apparent I/O port and memory range conflict is significant or not. PCS would be able to tell you that. The crashes do seem to happen around the same time as the hardware error you noted, and these errors occur several times in the log...
Code:
Log Name:      System
Source:        Microsoft-Windows-WHEA-Logger
Date:          24/09/2023 19:04:44
Event ID:      16
Task Category: None
Level:         Error
Keywords:     
User:          LOCAL SERVICE
Computer:      Hayden-PC
Description:
A fatal hardware error has occurred.

Component: PCI Express Root Port
Error Source: Advanced Error Reporting (PCI Express)

Primary Bus:Device:Function: 0x0:0x1:0x0
Secondary Bus:Device:Function: 0x0:0x0:0x0
Primary Device Name:PCI\VEN_8086&DEV_A70D&SUBSYS_88821043&REV_01
Secondary Device Name:

You ordered it with no OS, did you by any chance activate the testing system that you found on the drive? If so, that was a mistake. We often see problems when users activate that testing system.

BTW. We have relatives visiting for a holiday this week so my reposes may be a bit tardy for this week. Sorry. :cool:
 

Scott

Behold The Ford Mondeo
Moderator
I would suggest a clean install just to rule out any configuration issues.

Have the Chipset drivers handy, along with the Nvidia GPU drivers, prior to the install. I would tend to have the Network drivers handy too, although this should be less important.

Install Windows, Install the Chipset, Install the NVidia drivers, install the network if not already covered then let Windows do all updates, include any option updates available.

If you're still getting issues after that it would suggest hardware. Unusual to find motherboard issues but not completely unheard of. Most PCie issues are towards drivers IME. Used to be RST causing issues not so long ago.
 

QS1993

Member
Hi, thank you both for your replies.

I have done a completely fresh install of windows, installed chipset, and when I go to install the nvidia drivers my computer bluescreens every single time now. My computer had never blue screened before - it was just crashing. It says "WHEA_UNCORRECTABLE_ERROR"

I have attached new of everything you asked for, including minidumps.

https://we.tl/t-GUtp7bFMtO
 

SpyderTracks

We love you Ukraine
Hi, thank you both for your replies.

I have done a completely fresh install of windows, installed chipset, and when I go to install the nvidia drivers my computer bluescreens every single time now. My computer had never blue screened before - it was just crashing. It says "WHEA_UNCORRECTABLE_ERROR"

I have attached new of everything you asked for, including minidumps.

https://we.tl/t-GUtp7bFMtO
That would suggest a GPU failure

Where are you sourcing the nvidia drivers?
 

QS1993

Member
I have tried both using Geforce experience and also just downloading them directly from the Nvidia website.

Edit: At third time of asking, I have managed to get the drivers installed from the Nvidea website. Now it is back to not blue screening again and simply crashing as it was before.

Double edit : Just to confirm, my system is more stable when the display port / hdmi are plugged into integrated graphics, except sometimes it will bluescreen on startup, saying "WHEA_UNCORRECTABLE_ERROR". If I get past startup, it will run fine. It has at times blue screened 3-4 times in a row before actually getting to the desktop. It sometimes then blue screens when I attempt to do things that use the integrated graphics. If the display port / hdmi are plugged into the graphics card, the system will crash every 5 minutes or so (with no blue screen).

I've attached another link that has a second lot of minidumps /event viewers. Thank you as always for your time.

https://we.tl/t-rOm0LdW4xk
 
Last edited:

ubuysa

The BSOD Doctor
It's most likely to be the GPU - assuming you have ALL the necssary drivers installed. Is there anything in Device Manager with a yallow triangle containing a black exclamation amrk next to it?

Of the three dumps, two are a PCIe device failure - the bugcheck code indicates a PCIe device failure, and we can see the Microsoft pci.sys driver called immediately prior to the WHEA error...
Code:
19: kd> knL
 # Child-SP          RetAddr               Call Site
00 ffffb901`b3ef1c48 fffff801`52f23b4b     nt!KeBugCheckEx
01 ffffb901`b3ef1c50 fffff801`4e4a10c0     nt!HalBugCheckSystem+0xeb
02 ffffb901`b3ef1c90 fffff801`530179c6     PSHED!PshedBugCheckSystem+0x10
03 ffffb901`b3ef1cc0 fffff801`53017db5     nt!WheaRecoveryBugCheck+0x56
04 ffffb901`b3ef1cf0 fffff801`547adf49     nt!WheaReportHwError+0x3d5
05 ffffb901`b3ef1dc0 fffff801`547a9bc1     pci!PciRpRcecHandleAerInterrupt+0x2e9
06 ffffb901`b3ef1e20 fffff801`547aa026     pci!ExpressRootPortAerInterruptRoutine+0xa1
07 ffffb901`b3ef1e90 fffff801`547aa0e9     pci!ExpressRootPortInterruptRoutine+0x46
08 ffffb901`b3ef1ef0 fffff801`52d39bd1     pci!ExpressRootPortMessageRoutine+0x9
09 ffffb901`b3ef1f20 fffff801`52d1b7fd     nt!KiInterruptMessageDispatch+0x11
0a ffffb901`b3ef1f50 fffff801`52e3401f     nt!KiCallInterruptServiceRoutine+0x16d
0b ffffb901`b3ef1f90 fffff801`52e342e7     nt!KiInterruptSubDispatch+0x11f
0c fffff502`765f79f0 fffff801`52e36efa     nt!KiInterruptDispatch+0x37
0d fffff502`765f7b80 00000000`00000000     nt!KiIdleLoop+0x5a
The other dump is a graphics card or driver issue, we see the Windows TDR function called to reset the driver and graphics card following a hang (TDR is part of the dxgkrnl.sys Microsoft DirectX driver). The Nvidia graphics driver nvlddmkm.sys is also involved, although it's not seen in this limited stack trace (it is seen in the full stack trace)...
Code:
9: kd> !dpx
Start memory scan  : 0xfffff90dfda4f7a8 ($csp)
End memory scan    : 0xfffff90dfda50000 (Kernel Stack Base)

               rsp : 0xfffff90dfda4f7a8 : 0xfffff8077edadf6e : dxgkrnl!TdrBugcheckOnTimeout+0xfe
0xfffff90dfda4f7a8 : 0xfffff8077edadf6e : dxgkrnl!TdrBugcheckOnTimeout+0xfe
0xfffff90dfda4f7e8 : 0xfffff8077ed5dd42 : dxgkrnl!ADAPTER_RENDER::Reset+0x12a
0xfffff90dfda4f818 : 0xfffff8077ed559ad : dxgkrnl!DXGADAPTER::Reset+0x60d
0xfffff90dfda4f888 : 0xfffff8077edad800 : dxgkrnl!TdrResetFromTimeoutWorkItem
0xfffff90dfda4f8c8 : 0xfffff8077edad6c5 : dxgkrnl!TdrResetFromTimeout+0x15
0xfffff90dfda4f8f8 : 0xfffff8077edad822 : dxgkrnl!TdrResetFromTimeoutWorkItem+0x22
0xfffff90dfda4f918 : 0xfffff80763949ac0 : nt!ExNode0
0xfffff90dfda4f930 : 0xfffff80763949ac0 : nt!ExNode0
0xfffff90dfda4f938 : 0xfffff80762eb8bd5 : nt!ExpWorkerThread+0x155
0xfffff90dfda4f988 : 0xfffff80763949ac0 : nt!ExNode0
0xfffff90dfda4f990 : 0xfffff80763949ac0 : nt!ExNode0
0xfffff90dfda4f9a0 : 0xffffa38e6809f1e0 : 0xffffa38e680e0280 : 0xfffff8076386b4c0 : nt!MiSystemPartition
0xfffff90dfda4faf8 : 0xfffff80762fcb072 : nt!EtwTraceContextSwap+0xb2
0xfffff90dfda4fb28 : 0xfffff80762e12667 : nt!PspSystemThreadStartup+0x57
0xfffff90dfda4fb48 : 0xfffff80762eb8a80 : nt!ExpWorkerThread
0xfffff90dfda4fb78 : 0xfffff807630370a4 : nt!KiStartSystemThread+0x34
0xfffff90dfda4fb90 : 0xfffff80762e12610 : nt!PspSystemThreadStartup
You do appear to have the latest version on nvlddmkm.sys installed...
Code:
9: kd> lmDvm nvlddmkm
Browse full module list
start             end                 module name
fffff807`933d0000 fffff807`96d48000   nvlddmkm T (no symbols)      
    Loaded symbol image file: nvlddmkm.sys
    Image path: \SystemRoot\System32\DriverStore\FileRepository\nv_dispig.inf_amd64_4e58e7ac1d277d04\nvlddmkm.sys
    Image name: nvlddmkm.sys
    Browse all global symbols  functions  data
    Timestamp:        Tue Sep 12 22:56:26 2023 (6500C26A)
    CheckSum:         03870084
    ImageSize:        03978000
    Translations:     0000.04b0 0000.04e4 0409.04b0 0409.04e4
    Information from resource tables:
Since we're seeing both PCIe errors and TDR errors I'd suspect the graphics card itself, or possibly the motherboard or slot. If you're CERTAIN that you have all the correct drivers installed ( because this could be a bad/missing chipset driver issue) then this is an RMA to PCS I'm afraid.
 
Last edited:

QS1993

Member
How can I be certain I’ve got all the chipset drivers installed? I’ve got the intel driver utility and installed everything it asked me to, and done all windows updates. Is that it? Nothing in device manager has a yellow triangle.

Other forums mention this might be an issue with a riser cable/ specifically my mobo and gpu. Do you think that is likely?

Thank you again for all of your help anyway!
 

Scott

Behold The Ford Mondeo
Moderator
If you've used the automated tool it should have everything. It's not the ideal way of doing it, but I think it should suffice and I would never expect this sort of behaviour.

I'm almost convinced that it's a hardware fault. It could be something silly like a dislodged GPU though, from travelling around.

If you are confident in doing so, try removing the GPU entirely. Be sure to note where the power connectors are in etc. There will be a couple of screws at the external end. Once removed, use the motherboard GPU connection to run the system (As I believe you have tried before).

If you still get issues, it would suggest the motherboard to me. If you don't get any issues, carefully re-insert the GPU and plug in the power as it was before. Ensure that the card is in properly and the cables are in nice and snug.

If everything is in properly and seated well and you still get the issues, it's by far most likely the GPU. The only thing I can think of it relating to other than that is the physical PCIe connection on the motherboard. To rule out the connection you could try the next PCIe connection.

The only reason I would want to 100% (or as near as) rule on the GPU is that it's a very simple switch with PCS as they would simply send you out a new one to swap with the current one. The last thing you want is to receive the new one and it's some sort of motherboard hardware fault.
 

QS1993

Member
Temporary update is that I have done what you asked with the connectors - unplugged everything from the motherboard and plugged it all back in. My computer now is running with it connected to the GPU and has not crashed once in the 2 hours since. I will update in a couple of days but that genuinely might have been it!
 

Scott

Behold The Ford Mondeo
Moderator
Fingers crossed. An imperfect seat on the PCIe would cause strange gremlins with temperature flex.
 

Virtue

New member
Temporary update is that I have done what you asked with the connectors - unplugged everything from the motherboard and plugged it all back in. My computer now is running with it connected to the GPU and has not crashed once in the 2 hours since. I will update in a couple of days but that genuinely might have been it!
hi mate, I am having same issue, did you resolve your problem with just unplug/plugin back resolution. Like you say sometimes turning off display keeps playing song, sometimes if I am playing game, it is totally turning off display and sound, and fully powering up fans. When I check the system logs in event viewer, it says fatal error occreed on PCI\VEN_8086&DEV_A70D&SUBSYS_88821043&REV_01. Very rare but sometimes it is working all day. Can you please help me if you fix your problem?
 

Martinr36

MOST VALUED CONTRIBUTOR
hi mate, I am having same issue, did you resolve your problem with just unplug/plugin back resolution. Like you say sometimes turning off display keeps playing song, sometimes if I am playing game, it is totally turning off display and sound, and fully powering up fans. When I check the system logs in event viewer, it says fatal error occreed on PCI\VEN_8086&DEV_A70D&SUBSYS_88821043&REV_01. Very rare but sometimes it is working all day. Can you please help me if you fix your problem?
If you're having problems, I'd advise starting your own post and supplying the relevant info
 
Top