Channel: Intel Communities : All Content - Servers

2 S5500BC Server Boards Both Producing DIMM B1 Uncorrectable ECC Errors


Hi,

 

I'm building a Windows Server 2012 R2 system from existing server parts that we have available. I started with one S5500BC board and added the required hardware: an LSI 9260-8i card (this card was already in a number of our old servers with the same board and worked fine), an Intel I350-T4 network card and a 2-port USB 3 card. I have made exactly the same upgrades and additions to 2 other servers based on the same S5500BC board and Server 2012 R2, and neither of those servers has any issues.

 

With this build, the server lost power 2 nights in a row, with Windows just logging the standard 'Kernel-Power' error, as if the plug had been pulled (but it hadn't). After the second night of this I installed the Intel ASC, and it told me there was an uncorrectable ECC error in DIMM slot B1. I powered down the server and moved the RAM around, and it still reported the same error in the same slot. I did this a couple of times to be sure, and it always reported DIMM slot B1 as the problem, regardless of which stick was in it. I took this to mean that the motherboard itself was faulty, so I replaced the whole board with a spare S5500BC. This is a working board that, like the first one, had never given any reason to believe there was a problem with it.
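The swap test described above can be written down as a small sketch. This is only an illustration of the reasoning, not output from any Intel tool: the function name `isolate_fault` and the stick labels are hypothetical, and each observation is simply (slot, stick, whether an error was reported).

```python
from collections import defaultdict

def isolate_fault(observations):
    """Decide whether memory errors follow the slot or the DIMM.

    observations: list of (slot, dimm_id, error_seen) tuples, one per test run.
    The labels are whatever you write on the sticks; nothing here is read
    from hardware.
    """
    slots_with_errors = defaultdict(set)  # slot -> DIMMs that errored in it
    dimms_with_errors = defaultdict(set)  # DIMM -> slots in which it errored
    for slot, dimm, error_seen in observations:
        if error_seen:
            slots_with_errors[slot].add(dimm)
            dimms_with_errors[dimm].add(slot)

    if not slots_with_errors:
        return "no errors observed"
    # Same slot fails with different sticks: the stick is unlikely to be at fault.
    if len(slots_with_errors) == 1 and len(dimms_with_errors) > 1:
        return "error follows the slot"
    # Same stick fails in different slots: the stick is the likely culprit.
    if len(dimms_with_errors) == 1 and len(slots_with_errors) > 1:
        return "error follows the DIMM"
    return "inconclusive"

# Hypothetical runs matching the description: B1 errors with either stick.
runs = [
    ("B1", "stick1", True),
    ("A1", "stick2", False),
    ("B1", "stick2", True),
    ("A1", "stick1", False),
]
print(isolate_fault(runs))
```

With these runs the result points at the slot rather than a stick, which is why I went on to replace the board.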

 

After putting the second board in, I ran the ASC again and the same B1 error was present. I thought this might be the ASC pulling up the old logs, so I uninstalled and reinstalled it, and the memory error was gone. I checked again after the server had been running for around 12 hours and there was still no error. However, a couple of nights ago the server again 'lost power', or at least shut down ungracefully. I checked the ASC again, and the same B1 error is back, on a completely different board!

 

What could be going on here? I quite urgently need to get this server stable. I have tried to update the firmware using the S5500BC_BIOS63_BMC61_FRUSDR22_ME112 package. I ran the One Boot Flash Update utility for Windows (v9.7 build 21), pointing it to the extracted location with the following command:

 

flashupdt -u C:\TempPath

 

But I get the following:

 

Update file configuration: XXX S5500BC,1.0

*ERROR* BMC responded with incompatible values

 

Could anyone please help? The only difference between this server and the other 2 that are stable is that this one has 8 drives instead of 6. But it already had 8 drives during its previous installation and there were no issues then.

 

Thanks.

