[KLUG Members] Attempting to fix a server

Adam Bultman adamb at glaven.org
Thu Aug 26 15:20:31 EDT 2004


Phillip Hofmeister wrote:

>On Thu, 26 Aug 2004 at 09:22:12AM -0400, Adam Tauno Williams wrote:
>  
>
>>>Last week, I had a server tip over on me.   I believe that the SCSI card 
>>>had pretty much up and died, and as a result we had some filesystem 
>>>corruption, and the inability to boot.
>>>At the time, the system would 'start' to boot, and then reboot when it 
>>>started getting too far, and start over again.
>>>      
>>>
>>Have you tried a memory test?  Reseated any DIMMs/SIMMs?  This sounds
>>a lot like a potential memory problem.
>>    
>>
>
>I had a computer starting to act up a year ago.  I ran memtest86 (runs
>out of lilo or your boot loader).  It quickly identified a problem in an
>address area in my second DIMM.  Removed the DIMM, everything worked
>fine again.
>
>  
>
The system, before it REALLY tanked, would crash somewhat often, and 
mostly with SCSI errors during large transfers of data.  It was an NFS 
server for all sorts of data, and would be fine, until you backed it up 
too fast or tried to copy too much data at once.  Then it would cry like 
a little girl and crash.  Unfortunately, it's a 1U server, with no CD-ROM 
and no floppy, so I don't know how I'd go about running memtest86 on it 
(unless I netbooted it, ugh).

I'll be working on it again tonight (gonna replace the password/shadow 
files, probably). If it runs, I'll see what I can do about swapping the 
RAM and testing the RAM in another machine.

Adam



