• Dell x86 server error: EDAC MC1: CE row 0, channel 0, label "CPU_SrcID#1_Channel#1_DIMM#0"


    1. Checking the messages log and dmesg shows many EDAC errors, for example:

    EDAC MC1: CE row 1, channel 0, label "CPU_SrcID#1_Channel#1_DIMM#1": 6317 Unknown error(s): memory read on FATAL area OVERFLOW: cpu=1 Err=0001:0092 (ch=2), addr = 0x7fc3afe40 => socket=1, Channel=1(mask=2), rank=4
    
    EDAC MC1: CE row 2, channel 0, label "CPU_SrcID#1_Channel#2_DIMM#0": 5793 Unknown error(s): memory read on FATAL area OVERFLOW: cpu=1 Err=0001:0092 (ch=2), addr = 0x546da78c0 => socket=1, Channel=2(mask=4), rank=0
    
    EDAC MC1: CE row 3, channel 0, label "CPU_SrcID#1_Channel#2_DIMM#1": 5017 Unknown error(s): memory read on FATAL area OVERFLOW: cpu=1 Err=0001:0092 (ch=2), addr = 0x696e5cbc0 => socket=1, Channel=2(mask=4), rank=5
    
    EDAC MC1: CE row 1, channel 0, label "CPU_SrcID#1_Channel#1_DIMM#1": 3525 Unknown error(s): memory read on FATAL area OVERFLOW: cpu=1 Err=0001:0092 (ch=2), addr = 0x74e70c240 => socket=1, Channel=1(mask=2), rank=4
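
With many such lines in the log, a per-DIMM tally can make the pattern easier to see. The following is a minimal sketch, not a standard tool: the function name `edac_ce_by_label` is made up for illustration, and it assumes that the number printed before "Unknown error(s)" is a per-message error count, which the original log format suggests but does not state.

```shell
# Tally EDAC "CE" messages per DIMM label from kernel log text on stdin.
# Output columns: label, number of messages, sum of the per-message counts.
# Assumption: the number after the label is an error count for that message.
edac_ce_by_label() {
    sed -n 's/.*EDAC MC[0-9]*: CE .*label "\([^"]*\)": \([0-9][0-9]*\) .*/\1 \2/p' |
        awk '{ n[$1]++; sum[$1] += $2 }
             END { for (l in n) print l, n[l], sum[l] }' |
        sort
}

# Typical use (reading the kernel log usually requires root):
#   dmesg | edac_ce_by_label
```

Lines that do not match the CE pattern are ignored, so the whole log can be piped in unfiltered.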

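    2. Identify the failing DIMMs. The output below lists the correctable-error (CE) counters for the 8 DIMM entries on cpu0 and cpu1; a non-zero count means errors were recorded. Here all four entries under mc1 are affected.

    In these paths, mc is the memory controller (one per CPU here), csrow is the chip-select row, and ch is the channel, in EDAC's terms. Judging by the labels in the log above, each csrow entry on this machine corresponds to one DIMM slot (for example, row 1 on MC1 is CPU_SrcID#1_Channel#1_DIMM#1).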

    [root@localhost ~]#  grep "[0-9]" /sys/devices/system/edac/mc/mc*/csrow*/ch*_ce_count
    /sys/devices/system/edac/mc/mc0/csrow0/ch0_ce_count:0
    /sys/devices/system/edac/mc/mc0/csrow1/ch0_ce_count:0
    /sys/devices/system/edac/mc/mc0/csrow2/ch0_ce_count:0
    /sys/devices/system/edac/mc/mc0/csrow3/ch0_ce_count:0
    /sys/devices/system/edac/mc/mc1/csrow0/ch0_ce_count:21248125
    /sys/devices/system/edac/mc/mc1/csrow1/ch0_ce_count:11360507
    /sys/devices/system/edac/mc/mc1/csrow2/ch0_ce_count:18691380
    /sys/devices/system/edac/mc/mc1/csrow3/ch0_ce_count:9044537
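
To show only the affected entries together with their slot labels, the counters can be filtered in a small loop. This is a sketch under assumptions: the function name `edac_nonzero_ce` is hypothetical, and it relies on a `chN_dimm_label` file sitting next to each `chN_ce_count` file, which the csrow-style sysfs layout normally provides; if the label file is missing, "unknown" is printed instead.

```shell
# List EDAC channels whose correctable-error counter is non-zero.
# $1: EDAC root directory (defaults to the sysfs path used above).
# Output columns (tab-separated): relative path, count, DIMM label.
edac_nonzero_ce() {
    root=${1:-/sys/devices/system/edac/mc}
    for f in "$root"/mc*/csrow*/ch*_ce_count; do
        [ -r "$f" ] || continue
        count=$(cat "$f")
        # Skip zero counters and anything that is not a number.
        [ "$count" -gt 0 ] 2>/dev/null || continue
        # chN_ce_count -> chN_dimm_label (assumed to exist alongside it)
        label_file=${f%_ce_count}_dimm_label
        label=unknown
        if [ -r "$label_file" ]; then
            label=$(cat "$label_file")
        fi
        printf '%s\t%s\t%s\n' "${f#"$root"/}" "$count" "$label"
    done
}

# Typical use:
#   edac_nonzero_ce
```

Taking the root directory as an argument keeps the function usable against a copied sysfs tree, for example one collected from another host.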

    dmidecode -t memory shows detailed information about the installed memory modules.
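
The full dmidecode output is long, so a one-line-per-slot summary can help when looking up the module to replace. The sketch below is illustrative only: `dimm_inventory` is a made-up name, and it assumes the usual "Memory Device" record layout with `Locator:`, `Size:` and `Serial Number:` fields. How a Locator value maps to the EDAC label is platform-specific and is not derived here.

```shell
# Summarise `dmidecode -t memory` output read from stdin.
# Output columns (tab-separated): Locator, Size, Serial Number.
dimm_inventory() {
    awk -F': *' '
        function emit() { print loc "\t" size "\t" serial }
        /^Memory Device/                  { in_dev = 1; loc = ""; size = ""; serial = ""; next }
        in_dev && /^[ \t]+Locator:/       { loc = $2 }
        in_dev && /^[ \t]+Size:/          { size = $2 }
        in_dev && /^[ \t]+Serial Number:/ { serial = $2 }
        in_dev && /^[ \t]*$/              { emit(); in_dev = 0 }
        END                               { if (in_dev) emit() }
    '
}

# Typical use (dmidecode requires root):
#   dmidecode -t memory | dimm_inventory
```

Fields such as `Bank Locator:` are deliberately not matched, since the patterns require the field name to start right after the leading whitespace.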

  • Original article: https://www.cnblogs.com/ad-note/p/12697123.html