Hi,
I'm building a Windows Server 2012 R2 system using some existing server parts that we have available. I started with one S5500BC board and added the required hardware, which is an LSI 9260-8i card (this card was already in a number of old servers we have with the same board and worked fine) and I've added an Intel I350-T4 network card and a 2 port USB 3 card. I have done the exact same upgrades / additions to 2 other servers based on the same S5500BC boards and Server 2012 R2 and neither of those other 2 servers have any issues.
So with this build, the server lost power 2 nights in a row, windows just logging the standard 'kernel power' error, which is as if the plug was pulled (but it wasn't). After the second night of this I installed the Intel ASC and it told me there was an uncorrectable ECC issue in dim slot B1. I powered down the server and moved the RAM about and it still reported the same error in the same slot. I did this a couple of times to be sure and it always reported DIMM slot B1 as the problem, regardless of what stick was in it. I took this to mean that the motherboard itself was faulty so I replaced the whole board with another S5500BC we have spare. This is a working board that also (like the first one) has never given any reason to believe there are any problems with it.
After putting the second board in, I ran the ASC again and the same B1 error was present. I thought the may be the ASC calling up the old logs so I uninstalled and reinstalled it, and the memory error was gone. I checked it after around 12 hours of the server running and this remained the same. However a couple of nights ago the server again 'lost power' or just shut down ungracefully. I check the ASC again, and the same B1 error is back! On a completely different board.
What could be going on here? I quite urgently need to get this server stable. I have tried to update the firmwares using the S5500BC_BIOS63_BMC61_FRUSDR22_ME112 package, I ran the one boot windows flash utility (v9.7 build 21) pointing to the extracted location using the following command:
flashupdt -u C:\TempPath
But I get the following:
Update file configuration: XXX S5500BC,1.0
*ERROR* BMC responded with incompatible values
Could anyone please help? The only thing that is different about this server to the other 2 that are stable is that this one has 8 drives instead of 6. But it had 8 drives anyway during it's previous installation and there were no issues... ??
Thanks.