| |
 |
|
| Linux Forum Index » Linux Development » Mysterious server lockups with Ubuntu Hardy... |
|
Page 1 of 1 |
|
| Author |
Message |
| Ignoramus4557... |
Posted: Sun May 11, 2008 9:43 pm |
|
|
|
Guest
|
I have a server with Hardy server edition installed (2.6.24). No GUI
on it. It runs apache, mysql, is an NFS client and Samba server. Runs
some of our custom scripts, nothing major.
Once in about every 2 days, it seems to lock up randomly.
Fortunately, it has an IPMI card and I programmed it to run an IPMI
watchdog, so it reboots itself very quickly upon lockup,
automatically. Still, I obviously do not like it in the least and hope
for something better.
If there is something that I can do, like enable debug mode of some
sort, etc, I will be happy to get started.
--
Due to extreme spam originating from Google Groups, and their inattention
to spammers, I and many others block all articles originating
from Google Groups. If you want your postings to be seen by
more readers you will need to find a different means of
posting on Usenet.
http://improve-usenet.org/ |
|
|
| Back to top |
|
|
|
| Mark Hobley... |
Posted: Mon May 12, 2008 5:08 am |
|
|
|
Guest
|
Ignoramus4557 <ignoramus4557 at (no spam) nospam.4557.invalid> wrote:
Quote: Once in about every 2 days, it seems to lock up randomly.
Try setting the following boot options:
noapic nolapic pci=noacpi
Regards,
Mark.
--
Mark Hobley,
393 Quinton Road West,
Quinton, BIRMINGHAM.
B32 1QE. |
|
|
| Back to top |
|
|
|
| Ignoramus4557... |
Posted: Mon May 12, 2008 6:58 am |
|
|
|
Guest
|
On 2008-05-12, Mark Hobley <markhobley at (no spam) hotpop.donottypethisbit.com> wrote:
Quote: Ignoramus4557 <ignoramus4557 at (no spam) nospam.4557.invalid> wrote:
Once in about every 2 days, it seems to lock up randomly.
Try setting the following boot options:
noapic nolapic pci=noacpi
Regards,
where in menu.lst should I specify them? Any idea?
--
Due to extreme spam originating from Google Groups, and their inattention
to spammers, I and many others block all articles originating
from Google Groups. If you want your postings to be seen by
more readers you will need to find a different means of
posting on Usenet.
http://improve-usenet.org/ |
|
|
| Back to top |
|
|
|
| The Natural Philosopher... |
Posted: Mon May 12, 2008 12:32 pm |
|
|
|
Guest
|
Ignoramus4557 wrote:
Quote: I have a server with Hardy server edition installed (2.6.24). No GUI
on it. It runs apache, mysql, is an NFS client and Samba server. Runs
some of our custom scripts, nothing major.
Once in about every 2 days, it seems to lock up randomly.
Fortunately, it has an IPMI card and I programmed it to run an IPMI
watchdog, so it reboots itself very quickly upon lockup,
automatically. Still, I obviously do not like it in the least and hope
for something better.
If there is something that I can do, like enable debug mode of some
sort, etc, I will be happy to get started.
Probably hardware/heat.
Look at logs. |
|
|
| Back to top |
|
|
|
| Ignoramus17662... |
Posted: Mon May 12, 2008 12:44 pm |
|
|
|
Guest
|
On 2008-05-12, The Natural Philosopher <a at (no spam) b.c> wrote:
Quote: Ignoramus4557 wrote:
I have a server with Hardy server edition installed (2.6.24). No GUI
on it. It runs apache, mysql, is an NFS client and Samba server. Runs
some of our custom scripts, nothing major.
Once in about every 2 days, it seems to lock up randomly.
Fortunately, it has an IPMI card and I programmed it to run an IPMI
watchdog, so it reboots itself very quickly upon lockup,
automatically. Still, I obviously do not like it in the least and hope
for something better.
If there is something that I can do, like enable debug mode of some
sort, etc, I will be happy to get started.
Probably hardware/heat.
Look at logs.
I do not think so. The server is not that busy. If lockups continue
after disabling ACPI, I will explore this possibility using sensors.
i
--
Due to extreme spam originating from Google Groups, and their inattention
to spammers, I and many others block all articles originating
from Google Groups. If you want your postings to be seen by
more readers you will need to find a different means of
posting on Usenet.
http://improve-usenet.org/ |
|
|
| Back to top |
|
|
|
| The Natural Philosopher... |
Posted: Mon May 12, 2008 2:11 pm |
|
|
|
Guest
|
Ignoramus17662 wrote:
Quote: On 2008-05-12, The Natural Philosopher <a at (no spam) b.c> wrote:
Ignoramus4557 wrote:
I have a server with Hardy server edition installed (2.6.24). No GUI
on it. It runs apache, mysql, is an NFS client and Samba server. Runs
some of our custom scripts, nothing major.
Once in about every 2 days, it seems to lock up randomly.
Fortunately, it has an IPMI card and I programmed it to run an IPMI
watchdog, so it reboots itself very quickly upon lockup,
automatically. Still, I obviously do not like it in the least and hope
for something better.
If there is something that I can do, like enable debug mode of some
sort, etc, I will be happy to get started.
Probably hardware/heat.
Look at logs.
I do not think so. The server is not that busy.
That has nothing to do with flakey hardware. Or a clogged fan. Its
summer in the northern hemisphere..all sorts of margnal electronics
kicks over the traces ..I could bore you with a weekend of equipment
failures due to heat, but i wont.
If lockups continue
Quote: after disabling ACPI, I will explore this possibility using sensors.
i |
|
|
| Back to top |
|
|
|
| Baho Utot... |
Posted: Mon May 12, 2008 2:49 pm |
|
|
|
Guest
|
On Mon, 12 May 2008 06:58:37 -0500, Ignoramus4557 wrote:
Quote: On 2008-05-12, Mark Hobley <markhobley at (no spam) hotpop.donottypethisbit.com
wrote:
Ignoramus4557 <ignoramus4557 at (no spam) nospam.4557.invalid> wrote:
Once in about every 2 days, it seems to lock up randomly.
Try setting the following boot options:
noapic nolapic pci=noacpi
Regards,
where in menu.lst should I specify them? Any idea?
/boot/grub/menu.lst
# (0) Arch Linux
title Arch Linux [/boot/vmlinuz26]
root (hd0,0)
kernel /vmlinuz26 root=/dev/hda3 ro
initrd /kernel26.img
# (0) Arch Linux
title Arch Linux [/boot/vmlinuz26]
root (hd0,0)
kernel /vmlinuz26 root=/dev/hda3 ro noapic nolapic pci=noacpi
initrd /kernel26.img
^^^^^^^^^^^^^^^^^^^^^^^^^
OK?
--
Tayo'y Mga Pinoy |
|
|
| Back to top |
|
|
|
| Darren Salt... |
Posted: Mon May 12, 2008 5:35 pm |
|
|
|
Guest
|
I demand that The Natural Philosopher may or may not have written...
Quote: Ignoramus17662 wrote:
On 2008-05-12, The Natural Philosopher <a at (no spam) b.c> wrote:
Ignoramus4557 wrote:
I have a server with Hardy server edition installed (2.6.24). No GUI
on it. It runs apache, mysql, is an NFS client and Samba server. Runs
some of our custom scripts, nothing major.
Once in about every 2 days, it seems to lock up randomly.
Fortunately, it has an IPMI card and I programmed it to run an IPMI
watchdog, so it reboots itself very quickly upon lockup,
[snip]
Probably hardware/heat.
Look at logs.
I do not think so. The server is not that busy.
That has nothing to do with flakey hardware. Or a clogged fan. Its summer
in the northern hemisphere..all sorts of margnal electronics kicks over the
traces ..I could bore you with a weekend of equipment failures due to heat,
but i wont.
What's this rubbish about it being hot in summer? :-)
[snip]
--
| Darren Salt | linux or ds at | nr. Ashington, | Toon
| RISC OS, Linux | youmustbejoking,demon,co,uk | Northumberland | Army
| + Output less CO2 => avoid boiling weather. TIME IS RUNNING OUT *FAST*.
Look afar and see the end from the beginning. |
|
|
| Back to top |
|
|
|
| Matt... |
Posted: Sat Aug 23, 2008 8:15 am |
|
|
|
Guest
|
Ignoramus17662 wrote:
Quote: On 2008-05-12, The Natural Philosopher <a at (no spam) b.c> wrote:
Probably hardware/heat.
Look at logs.
I do not think so. The server is not that busy. If lockups continue
after disabling ACPI, I will explore this possibility using sensors.
i
I agree that you should look at the logs.
There are a lot of hardware problems that can cause a system freeze,
including a bad power supply, motherboard, memory, or video card. Could
be a bad device driver though if you have any hardware that requires an
obscure driver. |
|
|
| Back to top |
|
|
|
| Matt... |
Posted: Mon Aug 25, 2008 8:40 am |
|
|
|
Guest
|
Ignoramus9283 wrote:
Quote: The server is almost new,
More reason to suspect hardware, I'd say. What is the longest uptime
you've had? |
|
|
| Back to top |
|
|
|
| The Natural Philosopher... |
Posted: Mon Aug 25, 2008 2:42 pm |
|
|
|
Guest
|
Matt wrote:
Quote: Ignoramus9283 wrote:
The server is almost new,
More reason to suspect hardware, I'd say. What is the longest uptime
you've had?
Yep. with the new info, its probably something on the edge of spec, or
slightly beyond..
I hate these sorts of problems: they can be terribly sensitive to
exactly what code is being run at what locations.
Standard memtest stuff doesn't always work either..if youu have a dodgy
peripheral card or controller, memtest wont show it, but it can still
corrupt memory when its being accessed, or occasionally when ANY stuff
on the IO bus is being accessed. |
|
|
| Back to top |
|
|
|
| Jean-David Beyer... |
Posted: Wed Aug 27, 2008 9:13 am |
|
|
|
Guest
|
Ignoramus17245 wrote:
Quote: On 2008-08-26, Jure Sah <dustwolfy at (no spam) gmail.com> wrote:
Anyhow, there are better ways to debug hardware issues. A memtest
would be in order. Smartmontools to check your disks for bad
sectors. Review your hardware for ISR conflicts (such as can be
attained by insterting two identical PCI serial port adapters into
one machine). But you probably knew that.
This is a brand new supermicro machine, they would not put such things
in there. Anyway. I am trying 2.4.25 kernel right now, and so far,
it's been almost 9 days and counting and it is still up.
I have a machine with a SuperMicro motherboard that I put together myself.
It ran Red Hat Enterprise Linux 3 just fine for about three years or so.
Then I upgraded to RHEL 5 and it works just fine too. X5DP8-G2 motherboard.
All the Supermicro motherboards I looked at at the time I was building this
were claimed to run with Linux. I am running the kernel from this rpm:
kernel-PAE-2.6.18-92.1.10.el5
that is the latest one for RHEL5.
The only trouble I have had is that I could not run an ISA sound board with
a 2.6 kernel, so on my old machine I had to put in a similar PCI one. But
that did not stop anything from booting; I just could not hear anything with
the sound board.
--
.~. Jean-David Beyer Registered Linux User 85642.
/V\ PGP-Key: 9A2FC99A Registered Machine 241939.
/( )\ Shrewsbury, New Jersey http://counter.li.org
^^-^^ 10:05:01 up 20 days, 16:11, 4 users, load average: 4.75, 4.26, 4.16 |
|
|
| Back to top |
|
|
|
| Ignoramus17245... |
Posted: Wed Aug 27, 2008 9:26 am |
|
|
|
Guest
|
On 2008-08-27, Jean-David Beyer <jeandavid8 at (no spam) verizon.net> wrote:
Quote: Ignoramus17245 wrote:
On 2008-08-26, Jure Sah <dustwolfy at (no spam) gmail.com> wrote:
Anyhow, there are better ways to debug hardware issues. A memtest
would be in order. Smartmontools to check your disks for bad
sectors. Review your hardware for ISR conflicts (such as can be
attained by insterting two identical PCI serial port adapters into
one machine). But you probably knew that.
This is a brand new supermicro machine, they would not put such things
in there. Anyway. I am trying 2.4.25 kernel right now, and so far,
it's been almost 9 days and counting and it is still up.
I have a machine with a SuperMicro motherboard that I put together myself.
It ran Red Hat Enterprise Linux 3 just fine for about three years or so.
Then I upgraded to RHEL 5 and it works just fine too. X5DP8-G2 motherboard.
All the Supermicro motherboards I looked at at the time I was building this
were claimed to run with Linux. I am running the kernel from this rpm:
kernel-PAE-2.6.18-92.1.10.el5
that is the latest one for RHEL5.
The only trouble I have had is that I could not run an ISA sound board with
a 2.6 kernel, so on my old machine I had to put in a similar PCI one. But
that did not stop anything from booting; I just could not hear anything with
the sound board.
There are all kinds of Supermicro motherboards. I have a webserver
that runs for up to a year until it locks up, on a supermicro
platform. My webserver (to be retired soon) lasted for 360 days before
locking up last Saturday. I can live with that. The webserver,
especially, makes me up to 20-30 dollars an hour during busy hours, so
I want it to stay up as much as possible. The new server, to replace
it, is also supermicro and it sits at home and seems to work fine.
I would be happy if the server from the original question lasted as long.
--
Due to extreme spam originating from Google Groups, and their inattention
to spammers, I and many others block all articles originating
from Google Groups. If you want your postings to be seen by
more readers you will need to find a different means of
posting on Usenet.
http://improve-usenet.org/ |
|
|
| Back to top |
|
|
|
| Joe... |
Posted: Wed Aug 27, 2008 8:49 pm |
|
|
|
Guest
|
On 2008-08-26, Ignoramus29035 <ignoramus29035 at (no spam) NOSPAM.29035.invalid> wrote:
Quote:
By the way, I have seen a lot of Linux lockups.
Let's see, Fedora 4 locking up, Fedora 3 doing same, this Ubuntu Hardy
machine, Ubuntu Gutsy machines, etc etc.
I admin 2-3 dozen linux machines and yes, I have seen plenty of
lockups.
Not to be too much of a prick, but this may say more about you than
about Linux.
I have built and administered at least a couple hundred linux servers
and workstations over the years, and have never had one regularly lock
up that didn't either have a hardware issue (heat or memory usually),
or was continuously using more memory than the machine had ram
(causing constant swapping).
Not to say that it's impossible that there is some incompatible module
somewhere flaking something out, but the more likely (by far) answer
is that you have a bad dimm or a failing fan.
--
Joe - Linux User #449481/Ubuntu User #19733
joe at hits - buffalo dot com
"Hate is baggage, life is too short to go around pissed off all the
time..." - Danny, American History X |
|
|
| Back to top |
|
|
|
|
|
All times are GMT - 5 Hours
The time now is Mon Nov 23, 2009 1:56 am
|
|