Random Kernel panics, restarts, and now fans out of control

Discussion in 'Mac Pro' started by msintros, Sep 11, 2014.

  1. msintros macrumors member

    Joined:
    Jul 7, 2014
    #1
    A few days ago I started getting random kernel panics and restarts. I opened up the computer, sprayed a bit of compressed air to clean things out, and removed and reseated every hard drive and memory module. When I turned the computer back on, the fans were totally out of control. They were always inexplicably over-revved despite normal temperature readings (see my other thread: http://forums.macrumors.com/showthread.php?t=1751951&highlight=need+noise+help), a problem I was never able to solve, but now they sound like they're running at maximum rpm at idle.

    The error report looks like this:

    Interval Since Last Panic Report: 18995745 sec
    Panics Since Last Report: 4
    Anonymous UUID: CB26DA1C-001B-4318-8CBD-7F7852994348

    Mon Sep 8 18:32:49 2014
    Machine-check capabilities (cpu 6) 0x0000000000000806:
    family: 6 model: 15 stepping: 7 microcode: 102
    Intel(R) Xeon(R) CPU X5365 @ 3.00GHz
    6 error-reporting banks
    threshold-based error status present
    Machine-check status 0x0000000000000005:
    restart IP valid
    machine-check in progress
    MCA error-reporting registers:
    IA32_MC0_STATUS(0x401): 0x1000000020000000 invalid
    IA32_MC1_STATUS(0x405): 0x0000000000000000 invalid
    IA32_MC2_STATUS(0x409): 0x0000000000000000 invalid
    IA32_MC3_STATUS(0x40d): 0x0020000000000000 invalid
    IA32_MC4_STATUS(0x411): 0x0000000000000011 invalid
    IA32_MC5_STATUS(0x415): 0xb200000084200e0f valid
    MCA error code: 0x0e0f
    Model specific error code: 0x8420
    Other information: 0x00000000
    Threshold-based status: Undefined
    Status bits:
    Processor context corrupt
    Error enabled
    Uncorrected error
    panic(cpu 7 caller 0x2cf3cc): "Machine Check at 0x002d3432, trapno:0x12, err:0x0," "registers:\n" "CR0: 0x80010033, CR2: 0x01bf0000, CR3: 0x00100000, CR4: 0x00000660\n" "EAX: 0x00000000, EBX: 0x00008b37, ECX: 0x7d9b7000, EDX: 0x00008b01\n" "ESP: 0x8c9a3c58, EBP: 0x8c9a3c58, ESI: 0x00000004, EDI: 0x51d3dbe5\n" "EFL: 0x00000002, EIP: 0x002d3432\n"@/SourceCache/xnu/xnu-1699.32.7/osfmk/i386/trap_native.c:258
    Backtrace (CPU 7), Frame : Return Address (4 potential args on stack)
    0x83b5df38 : 0x2203de (0x6b08cc 0x83b5df58 0x229fb0 0x0)
    0x83b5df68 : 0x2cf3cc (0x6bde94 0x6bdfc8 0x2d3432 0x12)
    0x83b5e0d8 : 0x2cf409 (0x83b5e110 0xde 0x6bdfc8 0x1)
    0x83b5e0f8 : 0x2e6182 (0x83b5e110 0x0 0x0 0x0)
    0x8c9a3c58 : 0x2d3831 (0x14 0x7fffffff 0x8c9a3c88 0x23191be)
    0x8c9a3c68 : 0x23191be (0x83a76100 0x2c4b7118 0x8c9a3cc8 0x153b1800)
    0x8c9a3c88 : 0x231928b (0x83a76100 0xffffffff 0x7fffffff 0x51d3f59e)
    0x8c9a3cb8 : 0x231164f (0x83a76100 0x51d3f59e 0x8b37 0x8eb5c30)
    0x8c9a3d58 : 0x23129fe (0x51d3f59e 0x8b37 0x3 0x0)
    0x8c9a3de8 : 0x2d0cde (0xffffffff 0x7fffffff 0x8c9a3e08 0x2cd4ca)
    0x8c9a3e08 : 0x22d3e5 (0x83b83000 0x51d3b7f5 0x8b37 0x83b83de0)
    0x8c9a3e48 : 0x22e1c0 (0x16a727f0 0x83b83000 0x0 0x0)
    0x8c9a3e98 : 0x22f6c2 (0x16a7280c 0x237fb680 0x8b8b 0x0)
    0x8c9a3ef8 : 0x22f821 (0x2150f0 0x0 0x0 0x16a727f0)
    0x8c9a3f18 : 0x21505d (0x2150f0 0x113 0x60 0x57e40)
    0x8c9a3f48 : 0x21b385 (0x26d739e0 0x113 0x60 0x57e40)
    0x8c9a3f98 : 0x2b7bb7 (0x15daa214 0x1 0x15daa244 0x8)
    0x8c9a3fc8 : 0x2e60c7 (0x15daa210 0x0 0x10 0x162ef350)
    Kernel Extensions in backtrace:
    com.apple.driver.AppleIntelCPUPowerManagement(195.0)[D1550426-D346-4805-A777-0663C69080DA]@0x230f000->0x2338fff

    BSD process name corresponding to current thread: mdworker

    Mac OS version:
    11G63

    Kernel version:
    Darwin Kernel Version 11.4.2: Thu Aug 23 16:26:45 PDT 2012; root:xnu-1699.32.7~1/RELEASE_I386
    Kernel UUID: 859B45FB-14BB-35ED-B823-08393C63E13B
    System model name: MacPro2,1 (Mac-F4208DA9)

    System uptime in nanoseconds: 153069712715384
    last loaded kext at 112108308259707: com.apple.driver.iPodSBCDriver 1.6.0 (addr 0xda5000, size 16384)
    last unloaded kext at 115961094295901: com.apple.driver.AppleUSBCDC 4.1.22 (addr 0xd57000, size 16384)
    loaded kexts:
    com.digidesign.iokit.DigiIO 7.4.2f1
    com.digidesign.iokit.DigiDal 7.4.2f1
    com.apple.driver.iPodSBCDriver 1.6.0
    com.apple.filesystems.udf 2.2.1
    com.apple.filesystems.autofs 3.0
    com.apple.driver.AppleUpstreamUserClient 3.5.9
    com.apple.driver.AppleMCCSControl 1.0.33
    com.apple.driver.AppleHDA 2.2.5a5
    com.apple.kext.ATIFramebuffer 7.3.2
    com.apple.driver.AudioAUUC 1.59
    com.apple.ATIRadeonX3000 7.3.2
    com.apple.iokit.IOUserEthernet 1.0.0d1
    com.apple.iokit.IOBluetoothSerialManager 4.0.8f17
    com.apple.Dont_Steal_Mac_OS_X 7.0.0
    com.apple.driver.AppleMCEDriver 1.1.9
    com.apple.driver.AudioIPCDriver 1.2.3
    com.apple.driver.ApplePolicyControl 3.1.33
    com.apple.driver.ACPI_SMC_PlatformPlugin 5.0.0d8
    com.apple.driver.AppleLPC 1.6.0
    com.apple.filesystems.ntfs 3.10.1
    com.apple.driver.CSRUSBBluetoothHCIController 4.0.8f17
    com.apple.iokit.SCSITaskUserClient 3.2.1
    com.apple.AppleFSCompression.AppleFSCompressionTypeDataless 1.0.0d1
    com.apple.AppleFSCompression.AppleFSCompressionTypeZlib 1.0.0d1
    com.apple.BootCache 33
    com.apple.driver.XsanFilter 404
    com.apple.iokit.IOAHCIBlockStorage 2.1.0
    com.apple.driver.AppleFWOHCI 4.9.0
    com.apple.driver.AppleAHCIPort 2.3.1
    com.apple.driver.AppleIntelPIIXATA 2.5.1
    com.apple.driver.AirPortBrcm43224 501.36.15
    com.apple.driver.AppleUSBHub 5.1.0
    com.apple.driver.AppleIntel8254XEthernet 2.1.3b1
    com.apple.driver.AppleEFINVRAM 1.6.1
    com.apple.driver.AppleUSBEHCI 5.1.0
    com.apple.driver.AppleUSBUHCI 5.1.0
    com.apple.driver.AppleACPIButtons 1.5
    com.apple.driver.AppleRTC 1.5
    com.apple.driver.AppleHPET 1.7
    com.apple.driver.AppleSMBIOS 1.9
    com.apple.driver.AppleACPIEC 1.5
    com.apple.driver.AppleAPIC 1.6
    com.apple.driver.AppleIntelCPUPowerManagementClient 195.0.0
    com.apple.nke.applicationfirewall 3.2.30
    com.apple.security.quarantine 1.4
    com.apple.security.TMSafetyNet 8
    com.apple.driver.AppleIntelCPUPowerManagement 195.0.0
    com.apple.driver.AppleUSBHIDKeyboard 160.7
    com.apple.driver.AppleHIDKeyboard 160.7
    com.apple.iokit.IOUSBHIDDriver 5.0.0
    com.apple.kext.triggers 1.0
    com.apple.driver.AppleSMBusController 1.0.10d0
    com.apple.driver.DspFuncLib 2.2.5a5
    com.apple.kext.ATI5000Controller 7.3.2
    com.apple.kext.ATISupport 7.3.2
    com.apple.iokit.IOSurface 80.0.2
    com.apple.iokit.IOSerialFamily 10.0.5
    com.apple.iokit.IOFireWireIP 2.2.5
    com.apple.driver.AppleHDAController 2.2.5a5
    com.apple.iokit.IOHDAFamily 2.2.5a5
    com.apple.iokit.IOAudioFamily 1.8.6fc18
    com.apple.kext.OSvKernDSPLib 1.3
    com.apple.driver.AppleGraphicsControl 3.1.33
    com.apple.iokit.IONDRVSupport 2.3.4
    com.apple.iokit.IOGraphicsFamily 2.3.4
    com.apple.driver.AppleSMC 3.1.3d10
    com.apple.driver.IOPlatformPluginLegacy 5.0.0d8
    com.apple.driver.IOPlatformPluginFamily 5.1.1d6
    com.apple.driver.AppleUSBBluetoothHCIController 4.0.8f17
    com.apple.iokit.IOBluetoothFamily 4.0.8f17
    com.apple.driver.AppleUSBMergeNub 5.1.0
    com.apple.iokit.IOSCSIBlockCommandsDevice 3.2.1
    com.apple.iokit.IOUSBMassStorageClass 3.0.3
    com.apple.driver.AppleUSBComposite 5.0.0
    com.apple.iokit.IOSCSIMultimediaCommandsDevice 3.2.1
    com.apple.iokit.IOBDStorageFamily 1.7
    com.apple.iokit.IODVDStorageFamily 1.7.1
    com.apple.iokit.IOCDStorageFamily 1.7.1
    com.apple.iokit.IOATAPIProtocolTransport 3.0.0
    com.apple.iokit.IOSCSIArchitectureModelFamily 3.2.1
    com.apple.iokit.IOFireWireFamily 4.4.8
    com.apple.iokit.IOAHCIFamily 2.0.8
    com.apple.iokit.IOATAFamily 2.5.1
    com.apple.iokit.IO80211Family 420.3
    com.apple.iokit.IONetworkingFamily 2.1
    com.apple.iokit.IOUSBUserClient 5.0.0
    com.apple.iokit.IOUSBFamily 5.1.0
    com.apple.driver.AppleEFIRuntime 1.6.1
    com.apple.iokit.IOHIDFamily 1.7.1
    com.apple.iokit.IOSMBusFamily 1.1
    com.apple.security.sandbox 177.11
    com.apple.kext.AppleMatch 1.0.0d1
    com.apple.driver.DiskImages 331.7
    com.apple.iokit.IOStorageFamily 1.7.2
    com.apple.driver.AppleKeyStore 28.18
    com.apple.driver.AppleACPIPlatform 1.5
    com.apple.iokit.IOPCIFamily 2.7
    com.apple.iokit.IOACPIFamily 1.4
    Model: MacPro2,1, BootROM MP21.007F.B06, 8 processors, Quad-Core Intel Xeon, 3 GHz, 16 GB, SMC 1.15f3
    Graphics: ATI Radeon HD 5770, ATI Radeon HD 5770, PCIe, 1024 MB
    Memory Module: DIMM Riser A/DIMM 1, 2 GB, DDR2 FB-DIMM, 667 MHz, 0x80CE, 0x4D3339355435373530455A342D4345363620
    Memory Module: DIMM Riser A/DIMM 2, 2 GB, DDR2 FB-DIMM, 667 MHz, 0x80CE, 0x4D3339355435373530455A342D4345363620
    Memory Module: DIMM Riser B/DIMM 1, 2 GB, DDR2 FB-DIMM, 667 MHz, 0x80CE, 0x4D3339355435373530455A342D4345363620
    Memory Module: DIMM Riser B/DIMM 2, 2 GB, DDR2 FB-DIMM, 667 MHz, 0x80CE, 0x4D3339355435373530455A342D4345363620
    Memory Module: DIMM Riser A/DIMM 3, 2 GB, DDR2 FB-DIMM, 667 MHz, 0x80CE, 0x4D3339355435373530455A342D4345363620
    Memory Module: DIMM Riser A/DIMM 4, 2 GB, DDR2 FB-DIMM, 667 MHz, 0x80CE, 0x4D3339355435373530455A342D4345363620
    Memory Module: DIMM Riser B/DIMM 3, 2 GB, DDR2 FB-DIMM, 667 MHz, 0x80CE, 0x4D3339355435373530455A342D4345363620
    Memory Module: DIMM Riser B/DIMM 4, 2 GB, DDR2 FB-DIMM, 667 MHz, 0x80CE, 0x4D3339355435373530455A342D4345363620
    AirPort: spairport_wireless_card_type_airport_extreme (0x14E4, 0x87), Broadcom BCM43xx 1.0 (5.10.131.36.15)
    Bluetooth: Version 4.0.8f17, 2 service, 18 devices, 1 incoming serial ports
    Network Service: Ethernet 2, Ethernet, en1
    PCI Card: ATI Radeon HD 5770, sppci_displaycontroller, Slot-1
    PCI Card: ATI Radeon HD 5770, ATY,HoolockParent, Slot-1
    Serial ATA Device: ST3250820AS P, 250.06 GB
    Serial ATA Device: Hitachi HDS721010KLA330, 1 TB
    Serial ATA Device: ST3400620AS, 400.09 GB
    Serial ATA Device: WDC WD5000AACS-00ZUB0, 500.11 GB
    Parallel ATA Device: PIONEER DVD-RW DVR-118L
    Parallel ATA Device: PIONEER DVD-RW DVR-118L
    USB Device: External HDD, 0x1058 (Western Digital Technologies, Inc.), 0x1003, 0xfd400000 / 3
    USB Device: USB2.0 Hub, 0x05e3 (Genesys Logic, Inc.), 0x0608, 0xfd200000 / 2
    USB Device: eLicenser, 0x0819, 0x0101, 0xfd240000 / 6
    USB Device: Keyboard Hub, apple_vendor_id, 0x1006, 0xfd210000 / 5
    USB Device: Apple Keyboard, apple_vendor_id, 0x0220, 0xfd212000 / 8
    USB Device: USB2.0 Hub, 0x05e3 (Genesys Logic, Inc.), 0x0608, 0xfd230000 / 4
    USB Device: USB OPTICAL MOUSE, 0x093a (Pixart Imaging, Inc.), 0x2510, 0xfd231000 / 7
    USB Device: Bluetooth USB Host Controller, apple_vendor_id, 0x8206, 0x5d200000 / 2
    USB Device: composite_device, 0x0763 (M-Audio), 0x1002, 0x3d100000 / 2
    FireWire Device: built-in_hub, 800mbit_speed
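
For anyone who wants to check the "Status bits" section of the report by hand, here is a minimal sketch of how the one valid register, IA32_MC5_STATUS, can be split into fields. The bit positions follow the architectural layout described in Intel's machine-check documentation; the function name and the returned dictionary are made up for illustration, and the bus/interconnect test only reflects the generic compound error code format, not anything specific to this Xeon or to Apple's panic reporter.

```python
# Bit positions of the architectural flags in an IA32_MCi_STATUS register.
# The helper below is a hypothetical illustration, not part of any real tool.
FLAG_BITS = {
    "VAL": 63,    # register contains valid error information
    "OVER": 62,   # an earlier error was overwritten
    "UC": 61,     # uncorrected error
    "EN": 60,     # error reporting was enabled
    "MISCV": 59,  # IA32_MCi_MISC holds extra data
    "ADDRV": 58,  # IA32_MCi_ADDR holds an address
    "PCC": 57,    # processor context corrupt
}


def decode_mci_status(status: int) -> dict:
    """Split a 64-bit IA32_MCi_STATUS value into its main fields."""
    flags = [name for name, bit in FLAG_BITS.items() if (status >> bit) & 1]
    mca_code = status & 0xFFFF
    return {
        "flags": flags,
        "mca_error_code": mca_code,                      # bits 15:0
        "model_specific_code": (status >> 16) & 0xFFFF,  # bits 31:16
        # Compound codes of the form 000F 1PPT RRRR IILL are
        # bus/interconnect errors (bits 15:13 clear, bit 11 set).
        "bus_or_interconnect": (mca_code & 0xE800) == 0x0800,
    }


decoded = decode_mci_status(0xB200000084200E0F)  # bank 5 from the report
print(decoded["flags"])
print(hex(decoded["mca_error_code"]), hex(decoded["model_specific_code"]))
```

Run against the bank 5 value, this gives the same MCA error code (0x0e0f), model-specific code (0x8420), and status bits that the report lists. On its own, the code points at a generic bus/interconnect error rather than naming a specific component.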


    Other searches on Google make it look like this is probably a RAM issue, but why are the fans acting up now? Does anyone have any suggested courses of action?
     
  2. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #2
    Ok, one thing I did was shut down and do a power reset (or whatever it's called - I unplugged the power cord and held down the power button for about 20 seconds before plugging it back in). When I restarted, my fans were the quietest they've ever been without manual fan control intervention (around 500 rpm, which is about what one would hope for, whereas before they were closer to 1500 rpm even at idle).

    In Mac Fan Control, though, I can see my temperature readings: http://i.imgur.com/OVPzLez.png The only ones that really concern me are my DIMM modules. Is that really high? It looks really high to me. 128 Celsius is higher than the boiling point of water. Do I have a memory problem here? Is this the cause of my kernel panics?
     
  3. AxoNeuron, Sep 11, 2014
    Last edited: Sep 11, 2014

    AxoNeuron macrumors 65816

    Joined:
    Apr 22, 2012
    Location:
    The Left Coast
    #3
    Is this a Mac Pro? ECC RAM? Yes, based solely on your diagnostic report, it does look like a RAM issue.

    Edit: have you been playing around with kexts? Specifically, this one: com.apple.driver.AppleIntelCPUPowerManagement
     
  4. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #4
    I had to look up what a kext is, so nope ;)

    Not sure what kind of RAM, but yeah, it is a Mac Pro 2,1.

    Do those RAM temp readings look high to you? I don't know what is normal, but they look so much higher than everything else that I have to wonder.
     
  5. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #5
    I ran a hardware test (holding D at startup with install disc in the drive) and it came back with 1 error:

    4SNS/1/40000000:TM3S

    Any idea what this means?
     
  6. Peace macrumors Core

    Joined:
    Apr 1, 2005
    Location:
    Space--The ONLY Frontier
    #6
    Are you sure it says 128 Celsius? If so, that is way too high and the RAM should be removed ASAP.


    My guess is the memory is part of your problem, but the controller should have shut the system down if the RAM temperature were really that high.
     
  7. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #7
    Click the picture link and see the screenshot for yourself. It seems almost implausibly high, but that's what it says, unless Mac Fan Control is not reading correctly from the sensor.

    By the way, after the most recent crash, the fan speeds are back to their jet-engine speed like before (almost 2900 rpm for CPU_MEM and EXHAUST - the other two are normal).
     
  8. Peace macrumors Core

    Joined:
    Apr 1, 2005
    Location:
    Space--The ONLY Frontier
    #8
    Might want to read this:

    https://discussions.apple.com/message/20286218#20286218


    They seem to think it's a sensor problem, and in your case, with the memory running so hot, it could be the controller.

    Try resetting the SMC / PRAM.

    If that doesn't help, shut it down, because it's probably a bad sensor connection or the memory controller.
     
  9. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #9
    I just shut down my system and opened the side panel. There is a red light lit on one of the two memory boards. What does this mean?

    I let the chips cool (they were warm but definitely didn't feel hot enough to boil water) and put them back in (I think I reversed the slots they were in).

    On restart, I re-opened Mac Fan Control and the temps read in the mid to upper 50s and are slowly climbing to 58 and 63 degrees Celsius respectively. This seems more plausible.

    But the light on the same memory board is still lit.
     
  10. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #10
    Did a PRAM reset and an SMC reset. Not sure if the PRAM reset had any effect at all, but the SMC reset (which I did a little earlier) brought the fan speed back to normal again. The lower fan speed seems to be letting the RAM temperature creep up again very quickly, though: I just restarted and it's already crossed 80 degrees and climbing. The light is still lit on one of the memory boards as well.

    Should I remove the memory chip that is associated with the lit light, or is that likely to have no effect? I don't really know what I should do here.
     
  11. MacVidCards Suspended

    Joined:
    Nov 17, 2008
    Location:
    Hollywood, CA
    #11
    Don't be afraid, you'll never get a real answer that way.

    All we can do is guess; you can find the answer.
     
  12. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #12
    It would help to know how hot the RAM should be getting. I can't really seem to find information about this.

    I have Mac Fan Control turned on and both the CPU and Exhaust fans are set to run according to how hot the DIMM modules are. I have them set to start working at 40 Celsius, but it seems even at idle, they quickly get to over 60 Celsius with the fans at full blast. If I let the fans idle at around 500 rpm, the RAM climbs in temperature relatively quickly. The light on DIMM 2 on the bottom card is also still lit.

    I haven't had any crashes or panics since setting the fans to be DIMM dependent, but the fans are also running at almost maximum speed, which is annoying because it's so loud.
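
A sensor-based fan setting like the one described above can be thought of as a simple ramp between a minimum and a maximum speed. The sketch below is only an illustration of that idea, not how Mac Fan Control actually computes speeds. The 40 C start point and the roughly 500 and 2900 rpm limits come from this thread, while the 70 C full-speed point and the function name are assumptions.

```python
def fan_rpm(temp_c, start_c=40.0, full_c=70.0, min_rpm=500, max_rpm=2900):
    """Linear fan ramp: min_rpm at or below start_c, max_rpm at or above full_c.

    full_c=70.0 is an assumed value; the thread only gives the start point.
    """
    if full_c <= start_c:
        raise ValueError("full_c must be higher than start_c")
    if temp_c <= start_c:
        return min_rpm
    if temp_c >= full_c:
        return max_rpm
    # Fraction of the way from the start temperature to the full-speed one.
    fraction = (temp_c - start_c) / (full_c - start_c)
    return round(min_rpm + fraction * (max_rpm - min_rpm))


for temp in (35, 45, 60, 75):
    print(temp, "C ->", fan_rpm(temp), "rpm")
```

With a ramp like this, DIMMs that settle above 60 C at idle keep the fans near the top of the range, which matches the constant noise described above.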
     
  13. h9826790 macrumors G3

    Joined:
    Apr 3, 2014
    Location:
    Hong Kong
    #13
    My ambient temperature is 37C, fan at 600RPM, RAM temperature between 43-48C stable.
     
  14. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #14
    Any theories on what would cause the RAM specifically to overheat?
     
  15. 666sheep macrumors 68040

    Joined:
    Dec 7, 2009
    Location:
    Poland
    #15
    DDR2 FB-DIMMs with Apple heatsinks usually run at about 60 C. Those with regular, flat heatsinks run at about 80 C. The AMB is rated to a bit over 120 C (I don't remember the exact value now).
    Anything up to ~80 C is safe for this type of RAM.

    My 2,1 has its exhaust fan set to 1100 rpm because I use flat-heatsink RAM and I have 8x 4GB, which produce a good amount of heat together. It has worked like this for a few years without problems.
     
  16. msintros thread starter macrumors member

    Joined:
    Jul 7, 2014
    #16
    Interesting, thanks.

    I do think these are supposed to get hot because the heat sink or whatever the metal sheath is says "hot surface" on it, which to me indicates that it is normal for it to get kinda hot, but most people seem to be telling me that RAM should idle at like 45 C.

    I have 8 x 2GB sticks. The main thing concerning me is that they weren't always this hot. I have a screenshot from back when my other thread about fan noise was created (if you follow the link in the top and look down a few posts you'll see it) where it showed that my RAM was idling at closer to what other people say is normal (in the 40-50 C range) with my fans at around 1500rpm. Why they now seem to require the fans to be on max (around 2800-2900 rpm) to maintain an idle temperature of under 70 C is concerning to me.

    I've also noticed my processor core readings appear somewhat erratic. Even at idle they sometimes get up to 66 C (for both CPU A and B), but at other times they show closer to 40 C.

    Last night I tried taking out each of the two memory cards in turn and starting the computer up. If I only had a card in the bottom slot, Mac Fan Control wouldn't even tell me what the temperatures of the DIMM modules were. With a card in the top slot, it read both DIMM module temperatures, which were still really high. After putting both back in, the aforementioned DIMM light was off. However, on one restart the power light started flashing at me and the computer wouldn't boot. Disconnecting all the USB peripherals allowed it to boot. The light remains off and things appear to be running OK, except for the fact that my fans still have to run really high for the RAM to maintain a temperature under 70 C.
     
