Hardware :: Mcelog - HARDWARE ERROR - Get Deadlock Hangs Once Or Twice A Week
Jan 9, 2011
I have my desktop with following hardware:
1. MB: ASUS P5QL SE/EPU
2. RAM: 2 x 2GB Corsair PC2-8500 (1066 MHz)
3. CPU: Intel Dual-Core E6500
4. GPU: nVidia GeForce 9400 GT with binary driver
5. Net: D-Link DWA-520 with madwifi driver
This machine has been running for more than two years without any problem, but during the last two or three months I have been getting deadlock hangs once or twice a week. It runs Slackware-current 32-bit with a custom-compiled vanilla kernel with two additional patches, BFS and TuxOnIce. The deadlocks usually occur when there is no user activity: only rtorrent is running and two KDE4 sessions are open (with firefox, okular, claws-mail, goldendict, virtualbox and other memory-consuming apps).
Here is an excerpt from my syslog:
Quote:
I would just like a clue: what should I replace first, the memory or the CPU? The price is almost the same, around a hundred bucks. I ran memtest86 3.5a and got 1983 memory errors, but I have had several false positives with earlier versions of memtest86 and I don't trust it 100%. My plan is to update the BIOS firmware, update to a new kernel (I'm waiting for 2.6.37) and only then replace the RAM.
Dec 7, 2009
I get the following error in /var/log/mcelog:
Code:
Mon Nov 30 05:29:29 2009
HARDWARE ERROR. This is *NOT* a software problem!
[code]....
Mar 23, 2010
I'm experiencing a problem with cron.hourly running mcelog.cron on Fedora 11:
Quote:
mcelog: warning: 18446744073709551600 bytes ignored in each record
mcelog: consider an update
I've read a dead-end email thread from February 2010 describing what I am seeing on Fedora here:
Does anyone have any insight into this problem? This is the relevant portion of strace(1)ing mcelog, which shows that /dev/mcelog is open()ed and configured correctly:
Quote:
open("/dev/mcelog", O_RDONLY) = 3
ioctl(3, MTRRIOC_SET_ENTRY, 0x7fff13129bac) = 0
ioctl(3, MTRRIOC_DEL_ENTRY, 0x7fff13129ba8) = 0
[code]...
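One observation on the number in the warning, offered as a sketch rather than a diagnosis: 18446744073709551600 equals 2^64 - 16, which is how -16 appears when printed as an unsigned 64-bit value. That would be consistent with a size difference going negative, for example if the record length the kernel reports is 16 bytes smaller than the structure this mcelog build expects. That interpretation is an assumption and is not confirmed anywhere in the thread. The helper name as_signed_64 is made up for the illustration.

```python
# The byte count copied from the mcelog warning quoted above.
WARNING_VALUE = 18446744073709551600


def as_signed_64(value):
    """Reinterpret an unsigned 64-bit integer as a two's-complement signed one."""
    value &= (1 << 64) - 1
    return value - (1 << 64) if value >= (1 << 63) else value


if __name__ == "__main__":
    print(as_signed_64(WARNING_VALUE))  # prints -16
```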
May 12, 2011
The following two pieces of code share printing to stdout using a POSIX semaphore, /dev/shm/sem.abcd
sema1.c:
Code:
int j;
sem_t *sem = sem_open( "/abcd", O_CREAT, S_IRUSR|S_IWUSR, 1 );
j = 0;
while (j < 100) {
[Code].....
If started at the same time, the first will finish in about 10 seconds and the second in about 20 seconds.
What I want to ask is this: if the first program crashes at Checkpoint A, then B will never get to continue. How do programmers normally avoid this kind of deadlock caused by a crash inside the critical section?
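One common way around this, sketched here in Python rather than the C of the original programs, is to use a lock that the kernel owns on behalf of the process. A POSIX named semaphore keeps its count when the holder dies, whereas an advisory flock() lock is dropped automatically when the holding process exits and its descriptors are closed. The lock file path and the helper names exclusive_lock and crash_while_holding are assumptions made for the illustration; robust pthread mutexes are another option that is not shown here.

```python
import fcntl
import os
from contextlib import contextmanager


@contextmanager
def exclusive_lock(path):
    """Hold an exclusive advisory lock on `path` for the duration of the block."""
    fd = os.open(path, os.O_CREAT | os.O_RDWR, 0o600)
    try:
        fcntl.flock(fd, fcntl.LOCK_EX)  # blocks until no other process holds it
        yield fd
    finally:
        os.close(fd)  # closing the descriptor releases the lock


def crash_while_holding(path):
    """Fork a child that takes the lock and dies without ever unlocking it."""
    pid = os.fork()
    if pid == 0:
        fd = os.open(path, os.O_CREAT | os.O_RDWR, 0o600)
        fcntl.flock(fd, fcntl.LOCK_EX)
        os._exit(1)  # simulated crash inside the critical section
    os.waitpid(pid, 0)
```

After crash_while_holding() returns, a later `with exclusive_lock(path):` in any process is expected to proceed, because the dead child's lock went away with it. The data protected by the lock may still be half-updated, so the next holder has to be prepared to detect and repair that.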
Jun 16, 2011
Thanks to a recommendation and extensive help at phpbuilder.com, I have a multithreaded PHP script running in the cloud which fetches images and stores them in the cloud. It's fast and seems quite stable. However, I believe I'm faced with a situation where I need to introduce an additional lock to my code, so I'm faced with the task of managing multiple locks while avoiding deadlock. I was wondering if anyone had experience managing multiple locks in a MT environment and could recommend specific functions and data structures. I've been reading up on resource management in MT code and understand a few things:
1) Any shared resource (global and static vars, memory, file descriptors, etc.) generally needs to be protected by some kind of mutex, lock, or other sync var.
2) There are certain necessary conditions for deadlock to occur.
3) You generally need a sequence or hierarchy of your resources so that all threads request them in the same order. This sequence must be the same and immutable for all participating threads, whether local or remote.
4) Recursion is a very common cause of deadlock, so a process should know what locks it has acquired previously in order to avoid blocking in the attempt to re-acquire those same locks again.
5) There are a variety of algorithms described such as the Banker's Algorithm, the Chandy/Misra solution, etc. to help avoid deadlock.
I'm hoping to come up with a technique for properly handling multiple locks/mutexes/sync vars that I can re-use in the future, but I'm still coming to grips with the algorithm descriptions and am unsure precisely what sorts of data structures or functions I'll need.
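Points 3 and 4 above can be sketched together. The example is in Python rather than PHP, and the names RankedLock and acquire_in_order are hypothetical; it assumes every lock is given a unique, fixed rank that all threads agree on. Locks are always taken in ascending rank, whatever order the caller lists them in, and each one is re-entrant so a thread that already holds it does not block itself.

```python
import threading
from contextlib import ExitStack, contextmanager


class RankedLock:
    """A re-entrant lock with a fixed position in a global ordering."""

    def __init__(self, rank, name):
        self.rank = rank  # assumed unique across all locks
        self.name = name
        self._lock = threading.RLock()  # same thread may re-acquire safely

    def acquire(self):
        self._lock.acquire()

    def release(self):
        self._lock.release()


@contextmanager
def acquire_in_order(*locks):
    """Acquire all given locks in ascending rank; release in reverse order."""
    ordered = sorted(set(locks), key=lambda lock: lock.rank)
    with ExitStack() as stack:
        for lock in ordered:
            lock.acquire()
            stack.callback(lock.release)  # callbacks run last-in, first-out
        yield ordered
```

With this in place, one thread calling acquire_in_order(a, b) and another calling acquire_in_order(b, a) both take the lower-ranked lock first, which removes the circular wait that a deadlock needs. It does not help with locks acquired outside the helper, so the discipline only works if every multi-lock acquisition goes through it.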
Jan 4, 2010
I am connecting servers using NFSv4. The shared directories are on servers running Debian 4, while the client that reads from them runs Debian 5.0.3. The problem is that one of these file servers suddenly stops responding: you cannot list it from the Debian 5 server, df hangs, and the web application that uses it stops responding to requests that touch the shared directory, since they block. Then the load on the server starts to increase until the server cannot respond (over 90). I have found many entries in the syslog that refer to this, like:
ma25555 kernel: [1200285.732919] nfs: server 10.xxx.xxx.xxx not responding, still trying
Dec 31 08:16:33 ma25555 kernel: [1200289.815378] INFO: task java:9702 blocked for more than 120 seconds.
Dec 31 08:16:33 ma25555 kernel: [1200289.835249] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
code....
I tested the connection between the two servers with ping for one day and everything was OK (zero packets lost).
There are three other servers running Debian 4 that are working fine.
Jun 3, 2011
Does anyone know how to change the first day of the week in the calendar popup to Monday? I already looked in /usr/share/gnome-shell/js/ui/calendar.js but I didn't see where to change that (my knowledge of JavaScript isn't very good yet).
Feb 6, 2010
I installed Ubuntu a month ago, and since then my Vista has blue-screened almost every week. I have Vista Business SP2 32-bit and Ubuntu 9.10 32-bit in dual boot. Details about one of the blue screens are below:
Problem signature:
Problem Event Name: BlueScreen
OS Version: 6.0.6002.2.2.0.256.6
Locale ID: 1033
Additional information about the problem:
BCCode: be
BCP1: 837020B7
BCP2: 28EA0860
BCP3: 8BB63A18
BCP4: 0000000E
OS Version: 6_0_6002
Service Pack: 2_0
Product: 256_1
Files that help describe the problem:
C:\Windows\Minidump\Mini020110-01.dmp
C:\Users\nafiseh\AppData\Local\Temp\WER-50060-0.sysdata.xml
C:\Users\nafiseh\AppData\Local\Temp\WER8304.tmp.version.txt
Read our privacy statement: [URL]
Dec 3, 2009
I logged on to my IBM 600E with SUSE 11.1 + on Thanksgiving, and the auto update started and is still running today, 12/02/2009. The Pentium CPU is at 77 to 97%, with 93% memory usage plus some swap usage. Ethernet and wireless are actively working.
Separately, the CD-ROM is not detected, except by the hardware report I sent in. My external Zip100 is visible, as is the plug-in floppy drive.
My Linux knowledge is very limited.
Nov 10, 2010
I am using the following to find the Day of the Week on my server which is in UTC.
Code:
But say I want to find the day of the week in another timezone: how would one go about it? I had a look at date --help but could not find any timezone-related switches, so I am not sure if this is possible using this method.
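The underlying idea is to take one moment in UTC and render it in the target zone before asking for the weekday name. A minimal Python sketch follows; the helper name weekday_in_zone is made up for the illustration, and the named-zone line in the comment assumes the system timezone database is installed. Setting the TZ environment variable for a single date invocation rests on the same principle.

```python
from datetime import datetime, timedelta, timezone
from zoneinfo import ZoneInfo  # Python 3.9+; named zones need system tzdata


def weekday_in_zone(utc_moment, tz):
    """Return the weekday name of a timezone-aware moment as seen in `tz`."""
    return utc_moment.astimezone(tz).strftime("%A")


# Example call with a named zone (requires the timezone database):
#   weekday_in_zone(datetime.now(timezone.utc), ZoneInfo("America/New_York"))
# A fixed offset works without any database:
#   weekday_in_zone(datetime.now(timezone.utc), timezone(timedelta(hours=9)))
```

The weekday can differ from the server's for part of each day, which is why the conversion has to happen before formatting rather than by adding a day afterwards.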
Update:
This appears to work but is this the only way?
Code:
May 28, 2010
I am using Ubuntu 10.04 64-bit on my Vaio VGN-NS325J, with Eclipse as my IDE. But I have a big problem: while I am debugging my app, the interface locks up. There is no response to mouse clicks or the keyboard. The machine is still working; if an application is running in the background, it carries on with its job. But I can't interact with anything. It only happens during debugging; otherwise it is fine. In addition, my app runs on JBoss 4.2.2, if that helps.
Nov 21, 2010
I have a recurring problem. After a new Fedora installation the calendar week starts on Sunday and I want to change it to Monday. I have tried many guides without success (the latest is this: [URL]). Can anyone say how to change it? The version is Fedora 14.
Apr 20, 2011
Suse 11.3/Gnome 2.30.0
Evolution 2.30.1.2
I cannot get the monthly view to start the week on Sunday. The settings seem right but they don't stick. Restarting Evolution always defaults to a week start on Monday even though the defaults display Sunday. I've viewed the default settings the following two ways:
Within Evolution... Edit > Preferences > Calendar and Tasks > week starts on "Sunday"
GNOME Configuration editor.../apps/evolution/calendar/display/week_start_day = 0 (Sunday)
Jan 29, 2010
My system specification: I am using a stand-alone PC with the following configuration:
AMD Sempron(tm) processor 2500+
704 MB RAM
ASUS motherboard
Ubuntu version 9.10
When I click Ubuntu Software Center -> search for any software -> select the software -> click the arrow at the right corner -> then click the install button, nothing happens. It was installing for one week and now it does not. It gives no error message and takes no action; it just stays static. Is it a paid version, or am I missing something basic? I have tried the following:
sudo dpkg --configure -a
sudo apt-get update
sudo apt-get upgrade
Jun 22, 2010
If I use my Ubuntu a few times per week, there isn't any problem. But if I don't use it for a short time (a week), then when I want to use it again and select it in GRUB, after the first loading screen (on Ubuntu 10.04, the purple screen with Ubuntu in white and some dots underneath) my screen goes black. The monitor is receiving a signal, because the LED is green and not orange as it is in standby mode. On Windows I don't have any problem.
It first happened with Ubuntu 9.10. I didn't find a solution, so I formatted the partition and installed 10.04. But now it has happened again with 10.04. I have an ATI 5770 and I have tried drivers from 9.12 through 10.4 (the latest ones are the best I have had). On versions before 9.10 I had no problem, but I can't confirm that, because I had a different graphics card (NVIDIA 9800GT). The 5770 didn't work well on the previous version, so I updated to 9.10 and, after this problem, to 10.04.
On the Spanish forum, people don't know a solution for this problem; they have never read about anything similar. I hope you know a solution, because the only one I have found that works is to format and reinstall.
Sep 28, 2010
I'm running Ubuntu 10.04, which was upgraded from a 9.x install. I never had this problem on my 9.x install, but since the "upgrade" my desktop has been freezing. I'm not exactly sure how to troubleshoot this. Looking at /var/log/messages, the last entries I see before having to do a hard reboot are:
Code:
Sep 27 19:27:07 gingerk sudo: pam_sm_authenticate: username = [me]
Sep 27 19:27:07 gingerk sudo: pam_sm_authenticate: /home/me is already mounted
Sep 27 20:56:40 gingerk kernel: [102839.046817] operapluginwrap[26712]: segfault at 0 ip (null) sp bf9ce11c error 4 in
[code]....
May 19, 2011
My manager has asked me to look at a memory drop on a system while I was running tests on a Linux machine. There is a big dip in the memory graph produced by another tool. I do not know which processes were responsible for those memory dips. Is there any way I can find the memory utilization of processes during the last week?
Mar 30, 2010
I have been asked to deliver the logs for last Friday, but the problem is that I have configured the squid logs to rotate weekly, not daily. How can I extract the entries generated on Friday from the weekly file?
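A minimal sketch of one approach, assuming the file is in Squid's native access.log format, where each line begins with a UNIX timestamp in seconds (with a fractional part). If a custom logformat is in use, the parsing would need to change. The function name lines_for_day is hypothetical, and the timezone defaults to UTC only for the sake of the example.

```python
from datetime import date, datetime, timezone


def lines_for_day(lines, day, tz=timezone.utc):
    """Yield the log lines whose leading epoch timestamp falls on `day` in `tz`."""
    for line in lines:
        fields = line.split(None, 1)
        if not fields:
            continue  # blank line
        try:
            stamp = float(fields[0])
        except ValueError:
            continue  # line does not start with a numeric timestamp
        if datetime.fromtimestamp(stamp, tz).date() == day:
            yield line


# Example use (the file names here are placeholders):
#   with open("access.log") as src, open("friday.log", "w") as dst:
#       dst.writelines(lines_for_day(src, date(2010, 3, 26)))
```

Passing the server's local timezone as tz matters when "Friday" is meant in local time, since entries close to midnight would otherwise land on the wrong day.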
Jun 24, 2011
How can I get the week number in Linux using gawk with a different first day of the week? The date command can give me the week number with +%V, but that is based on Monday (1-53), or with +%U (based on Sunday, 0-53).
I tried this: date -d "ddmmyy+2days" +%V, but the result is not correct. I want the first day of the week to be Saturday.
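The arithmetic behind %U can be generalised to any first day of the week. The sketch below is in Python rather than gawk, and the function name week_number is made up for the illustration. It follows the %U convention: days before the first occurrence of the chosen weekday belong to week 0, and that first occurrence starts week 1. Whether that convention or an ISO-style one (as in %V) is wanted is an assumption to check.

```python
from datetime import date


def week_number(day, first_weekday=5):
    """Week of the year (0-53) with weeks starting on `first_weekday`.

    Weekdays are numbered as in date.weekday(): Monday=0 ... Sunday=6,
    so the default of 5 means weeks start on Saturday.
    """
    jan1 = date(day.year, 1, 1)
    # Days from 1 January to the first `first_weekday` of the year.
    offset = (first_weekday - jan1.weekday()) % 7
    day_of_year = (day - jan1).days  # zero-based
    if day_of_year < offset:
        return 0
    return (day_of_year - offset) // 7 + 1
```

With first_weekday=6 the result matches %U, and with first_weekday=0 it matches %W, which gives a quick way to sanity-check the Saturday variant against the date command.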
Jul 15, 2011
Coming from Fedora, I had a web/mail/SQL server with two hard disks. One (sda1) held Fedora, and all backups were on sdb1.
I started having issues with sda1, so I started copying files to sdb1. Two weeks later I noticed similar issues and started copying new backup files to sdb1. But once I started doing this I lost the connection to the server.
It turned out (in rescue mode) that I couldn't access sda anymore; it couldn't even read the data stored on it. It didn't show up in fdisk anymore either.
So after a lot of trying I asked for the disk to be replaced and installed Ubuntu 10.04 LTS on it. After installing everything, I tried to restore one of the MySQL backups from the sdb disk, but halfway through I started getting errors. I stopped the restore and did some other things I wanted to take care of first (apache2 config etc.).
I then tried to reboot to be sure that was working as well, and then the problems started. I couldn't access the server anymore, and in rescue mode (a live CD my host provides; it's a dedicated server, by the way) I didn't see sdb anymore.
I told them this, and after they told me they had managed to get it back, I went into rescue mode and tried to recover bad blocks with the e2fsck command.
Code:
Trying to restore this bad superblock doesn't work. I've tried for almost all of them:
Code:
e2fsck: Attempt to read block from filesystem resulted in short read while trying to open /dev/sdb1
Could this be a zero-length partition?
What are the chances of two hard disks failing? Surely there must be something else wrong? Did a bad driver (which would be weird, since I didn't do any updates like that) damage both of my hard disks under Fedora?
I should also note that when I tried to copy the backups from sdb1 to sda1 (the new one), I got a read-only error and some other I/O errors.
Just for information, I'll show fstab and mtab as well: /etc/fstab
Code:
Code:
I haven't added the sdb here yet, but it was here in Fedora so that can't be the problem.
And this is /var/log/messages when the problems emerged
Code:
Then on the Ubuntu forum they suggested I do this:
I ran smartctl and it gave me this:
Code:
Code:
Feb 5, 2011
I am going to take the RHCE exam this month. I would like some tips on how to revise for it.
Feb 2, 2011
We need to collect data from Nagios once a week and have it mailed. What would be a good place to get it? I am thinking of starting with nagios/var/nagios.log: scan it once a week and get the most recent entries for each host.
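The scan described above could look roughly like the following. It is a sketch that assumes alert lines in the usual Nagios log shape, "[epoch] HOST ALERT: host;..." or "[epoch] SERVICE ALERT: host;...", and ignores every other kind of entry. The function name latest_per_host is hypothetical, and mailing the result is left out.

```python
import re

# "[1296604800] HOST ALERT: web01;DOWN;HARD;1;..." style lines (assumed format).
ALERT_LINE = re.compile(r"^\[(\d+)\] ((?:HOST|SERVICE) ALERT): ([^;]+);(.*)$")


def latest_per_host(lines, since_epoch):
    """Map each host to its most recent alert at or after `since_epoch`.

    Values are (epoch, alert_kind, remaining_fields) tuples.
    """
    latest = {}
    for line in lines:
        match = ALERT_LINE.match(line.rstrip("\n"))
        if not match:
            continue  # not an alert entry
        stamp = int(match.group(1))
        if stamp < since_epoch:
            continue  # older than the reporting window
        host = match.group(3)
        if host not in latest or stamp >= latest[host][0]:
            latest[host] = (stamp, match.group(2), match.group(4))
    return latest
```

Calling it with since_epoch set to seven days before the current time gives the weekly window. If the log is rotated during the week, the archived files would need to be scanned as well.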
Apr 27, 2010
I have been using an ARM device with 512 MB of RAM and flash to send messages via Bluetooth to everyone in the proximity. I am doing this with OpenProximity (Python) and it worked for the last week. It did stop twice, but since I didn't have time to find the cause, I just rebooted it (Windows style) and it worked again. Then it suddenly stopped working again last Saturday. When I ran /etc/init.d/bluetooth restart it did not report a failure, but when I ran /etc/init.d/bluetooth status it said "bluetooth is not running ... failed!". This is what I got in the syslog:
Code:
Apr 27 13:52:45 debian bluetoothd[31243]: Bluetooth daemon 4.60
Apr 27 13:52:45 debian bluetoothd[31243]: Starting SDP server
Apr 27 13:52:45 debian bluetoothd[31243]: Starting experimental netlink support
Apr 27 13:52:45 debian bluetoothd[31243]: Failed to find Bluetooth netlink family
Apr 27 13:52:45 debian bluetoothd[31243]: Failed to init netlink plugin
Apr 27 13:52:45 debian bluetoothd[31243]: Can't create GN bridge
[Code]...
You'll probably notice that the first message shows version 4.60 and the second one 4.63. This is because I tried to upgrade, which did not work completely, but the error above is the same as before the upgrade.
I don't know if it matters, but I am using Debian Squeeze and the output of uname -a is: Linux debian 2.6.32.2 #1 PREEMPT Fri Dec 18 22:54:28 MST 2009 armv5tel GNU/Linux. I hope someone knows how this could happen so suddenly and, even better, knows how to fix it.
EDIT: Forgot to say, I am using a SENA Parani UD100 Bluetooth USB Adapter which uses the Toshiba Bluetooth stack.
UPDATE: It won't crash until the Bluetooth stick is inserted, so it should be a driver problem?
Jun 14, 2010
I've been running Lucid and Karmic for a while now. After an update last week (including a kernel) and a reboot, sound no longer works. Results of 'sudo aplay -l':
Code:
**** List of PLAYBACK Hardware Devices ****
card 0: NVidia [HDA NVidia], device 0: ALC888 Analog [ALC888 Analog]
[code]....
Jan 2, 2011
I'd like to tweak my Conky so it automatically displays the days of the week correctly. So for example, if TODAY were Tuesday, it would look like this:
Tuesday
Wednesday
Thursday
Friday
[etc.]
And then tomorrow, *automatically*, it would look like this:
Wednesday
Thursday
Friday
[etc.]
I know I can get it to display *today's* date like this:
${time %A}
But how do I get it to display tomorrow's date?
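Since ${time} only formats the current moment, one way to get the following days is to compute the names in a small script and have Conky run it through one of its exec-style objects such as ${execi}. The sketch below only shows the date arithmetic; the function name upcoming_day_names and the idea of saving it as a standalone script are assumptions for the illustration.

```python
from datetime import date, timedelta


def upcoming_day_names(start, count=7):
    """Return `count` weekday names, beginning with the day `start` falls on."""
    return [(start + timedelta(days=i)).strftime("%A") for i in range(count)]


if __name__ == "__main__":
    # One name per line, starting with today.
    print("\n".join(upcoming_day_names(date.today())))
```

Because the list is rebuilt from the current date each time the script runs, it rolls forward on its own at midnight, subject to how often Conky is told to re-run the command.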
Apr 12, 2011
I've installed Osmo and lots of software from Software Center, but nothing suits me. What I need is a lightweight program with these capabilities: week and day views (something like Google Calendar), advanced repeating tasks on a weekly basis (so I can set a task once a week, once every two weeks, and so on) and alarms.
Oct 2, 2009
I recently installed CentOS 5.3 on a Dell PowerEdge 1800 server that used to have Microsoft Windows SBS 2003 on it. Everything seemed to be going fine until a few days ago. Here's the sum of it all...
- System seems to randomly hang, at which it becomes unavailable on the network (can't be pinged, monitor won't come back on, and you can't wake it up... have to reboot, the scroll lock and caps lock keys flash at the same time)
- Received this error message...
Uhhuh. NMI received for unknown reason a0.
You probably have a hardware problem with your RAM chips.
Dazed and confused, but trying to continue.
...it doesn't crash at this point though. I ran memtest86 twice and did not experience any problems. I updated the BIOS to the newest version, which contained a few bug fixes, but still experienced the same problems
Feb 20, 2010
I have just made a clean installation of FC12. The only change I have made for now is to disable the NetworkManager and set a static ip address. My problem is that when I click "shutdown" it doesn't complete the shutdown and hangs due to some kind of kernel error. Then I have to use the power-switch to shut it down. It doesn't happen every time, but it happens often (every 3rd shutdown or so).
I get these error messages on the screen:
Code:
Unmounting file systems:[OK]
Halting system...
--------[ cut here ]-----------
kernel BUG at kernel/time/clockevents.c:262!
invalid opcode: 0000 [#1] SMP
last sysfs file: /sys/mmodule/ip6_tables/refcnt
Modules linked in: {long list}
Pid: 1809, comm: hal Not tainted (2.6.31.12-174.2.22.fc12.i686 #1) OptiPlex GX270
This is exactly what's described here: [URL].
Mar 22, 2010
GParted won't let me install Ubuntu 9.10 64-bit. Every time, it hangs at 47% and throws a sector size error, something like: doesn't support sector size 2048 and the code is HIGHLY EXPERIMENTAL. My 1 TB hard drive consists of the following partitions:
100MB Windows 7 Reserved
900GB+ Windows 7
30GB EXT4
1MB unallocated space
Is there a workaround for this? I've tried installing Linux Mint and Ubuntu but both gave me the same error.
Jul 20, 2010
I started my laptop this morning with no problems. It was doing a file system check, which I quit (pressing C) after about 30%. The only problem I had was that wired networking didn't seem to be working (though wireless was). No problem: this happens sometimes and a restart fixes it. After the restart, the GRUB screen hangs at "Loading stage 1.5" for ages before issuing Error 25. I tried a restart. Same thing. And again. I'm on Lucid 10.04, with no other OS installed, on a Toshiba laptop that up to now has had no problems with the last two versions of Ubuntu.