14

I am running 64 bit Ubuntu 11.10 on an i7 with 8gigs of ram. I thought of putting this on askubuntu.com but decided that maybe the question has a much broader appeal.

I have the following error message popping up when I run math simulations.

CPUn: Core temperature above threshold, cpu clock throttled (total events = xxxxxxx)

CPUn: Package temperature above threshold, cpu clock throttled (total events = xxxxxxx)

I realize that this is a hardware warning message (machine check exception, correct me if I am wrong). How do I turn these messages off? Since it doesn't seem to have a detrimental effect of my calculations or my computer (presumably), I don't like it cluttering up my virtual console screen with hundreds of these messages.

dearN
  • 211
  • 1
  • 2
  • 7

5 Answers5

14

You said:

I had cpu cooling (fan) but it went kaput :(

This is why you are having this problem. Your CPU is overheating; these messages are warning you that the system is throttling down in order to prevent permanent physical damage to the system.

I generally expect a Core i7 desktop CPU running under full load to be somewhere around 75-80 degrees Celsius. At around 95-100, thermal throttling kicks in. If it gets significantly above that, the computer will power itself off without warning in an attempt to prevent permanent damage.

The proper thing to do is to replace the CPU fan. Replacement OEM fan/heatsink combos can be had on eBay for less than $20; or you may want to invest a little more cash in something better if you intend to overclock, or run lots of these computations. A better fan may run quieter for the same CPU temperatures, or may cool the CPU more effectively allowing for overclocking, or both.

Ignoring this message, you will eventually destroy your CPU or lose significant amounts of performance or both.

4

I have no knowledge about possibility for turning off such warnings, but I'd like to point out, that actually - your kernel needs to lower your CPU frequency because of constant overheating. It's not a warning about high temperature 10 degrees before the yellow light would light on. I'd recommend you to try to get better cpu/mainboard cooling - otherwise your hardware can get seriously damaged in not so long period of time.
Although it's not a way to switch off the warnings completely: you can pipe the output of the program to a file and later use less or watch with cat/tail to see the newest results without the warnings.

mrówa
  • 191
4

In order to favorably remove the warning clutter IS to address the hardware component that has failed (the fan) or its related components (heat sink clogged, vents blocked), just like if your SMART data for your hard drive was warning you of bad blocks. Yes you can keep usung the drive but the possability of imminent failure is being given to you to try and avoid catastrophic loss of data. Thus back up and replacment would be the wise advice. So in short you need to address the kaput fan. You are being told that the system is throttled(slowing down, in an attempt to stay running and reducing heat). The warning is that the system has acheived or nearing is max temp threshold and that safegaurd procedures may take over before permanent failure occurs. Or Permanent failure will occur.

I can represent this with a little hummor :)

if y + f > G = 0 M

y = Bear Grylls (he represent you)

f = fish (your mathamatical load to the CPU)

g = Grizzly (Heat build up)

M = chubby sportsman on wheel (your CPU)

When you

bg

Apply

trout

to

mow

Doing the computations..

chase

with no fan or means to disipate the heat...

grizzly

You will end up with a grizzly with full belly of chubby sportsman and the wheel will stop turning!! You will eventualy have to get a new chubby sportsman to run on the wheel. That can be expensive when a simple fan replacment will fix the problem and allow your CPU to opperate a maximum performance.

Carl B
  • 6,660
1

In the case of such errors You should install the package :

aptitude install lm-sensors

After that You could check sensors temperature with following command :

sensors

If it's too high (around 100%), You should check Your cooling system.

-1

I was facing same issues but solved by

Turning of TPM mode in BIOS.