Rocksolid Light

News from da outaworlds

mail  files  register  groups  login

Message-ID:  

BOFH excuse #155: Dumb terminal


comp / comp.os.os2.misc / hardware failing

SubjectAuthor
* hardware failingtholen
+* Re: hardware failingLars Erdmann
|`- Re: hardware failingDavid Wade
+- Re: hardware failingMarcel Mueller
+- Re: hardware failingDave Yeo
`- Re: hardware failingDan Camp

1
Subject: hardware failing
From: tholen@antispam.ham
Newsgroups: comp.os.os2.misc
Organization: Aioe.org NNTP Server
Date: Sun, 2 Aug 2020 01:49 UTC
Path: eternal-september.org!news.eternal-september.org!reader01.eternal-september.org!reader01.eternal-september.org!feeder.eternal-september.org!aioe.org!.POSTED.hq46U+9BZOHR+BSotaUNXw.user.gioia.aioe.org!not-for-mail
From: tholen@antispam.ham
Newsgroups: comp.os.os2.misc
Subject: hardware failing
Date: Sun, 2 Aug 2020 01:49:22 +0000 (UTC)
Organization: Aioe.org NNTP Server
Lines: 48
Message-ID: <rg5631$6jn$1@gioia.aioe.org>
Reply-To: tholen@antispam.ham
NNTP-Posting-Host: hq46U+9BZOHR+BSotaUNXw.user.gioia.aioe.org
X-Complaints-To: abuse@aioe.org
X-Notice: Filtered by postfilter v. 0.9.2
X-Newsreader: IBM NewsReader/2 2.0
View all headers

One of my OS/2 systems must be pushing 20 years old and is starting to
show signs of hardware failure. I'm faced with the choice of either
getting new, faster, more modern hardware and fighting with OS/2
compatibility issues, or trying to repair the existing system, which
is still surprisingly capable despite its age. The trick to the
latter choice is to figure out what component is failing.

I've not yet noticed any pattern to the random reboots. The machine
can be up for many days when it decides to just stop without any kind
of error message and reboot. However, by far the most frequent
occurrence of a failure is associated with the reboot process itself,
specifically when the white square and OS/2 appear in the upper left
corner of the screen. If it gets past that point, the system can
remain up for days at a time, but I've seen the system reboot process
interrupted at that particular point many consecutive times. Could
be an ambient temperature issue, with some component having become
particularly sensitive to overheating.

This much I can say. The Adaptec 29160 SCSI bus scan happens just
fine. The system can boot from a MEMTEST 3.0 bootable CD just fine
and run tests on memory for hours without failure. Furthermore, the
machine has error-correcting memory, and the BIOS hasn't recorded any
events, so I'm fairly confident that the SCSI adapter and memory aren't
the source of the problem.

I have Boot Manager installed, and that always comes up just fine, so
the Boot Manager partition appears to be okay, as is the disk controller.
I have the operating system installed on two different partitions, both
on the same physical disk as the Boot Manager partition. Doesn't matter
which one I try to boot from; I get the reboot when "white square OS/2"
appears with either, so I doubt the failure is associated with a
particular disk partition.

Once booted, I need to run CHKDSK on the partitions located on other
physical disks, and those always complete without problem, so I don't
think the disk drives represent the failing component.

I have seen more than one random reboot occur while using FTP to
transfer files to another computer, so the network card is a suspect,
and if the system touches the network card during the boot process,
particular immediately after the "white square OS/2" appears in the
upper left corner, then I'd have something to try.

Does anybody familiar with the details of the OS/2 boot process know
exactly what is happening immediately after the "white square OS/2"
appears? Narrowing down the list of possible failing components is
the goal here.

Subject: Re: hardware failing
From: Lars Erdmann
Newsgroups: comp.os.os2.misc
Organization: solani.org
Date: Sun, 2 Aug 2020 07:41 UTC
References: 1
Path: eternal-september.org!news.eternal-september.org!reader01.eternal-september.org!reader01.eternal-september.org!feeder.eternal-september.org!weretis.net!feeder6.news.weretis.net!feeder5.news.weretis.net!news.solani.org!.POSTED!not-for-mail
From: lars.erdmann@arcor.de (Lars Erdmann)
Newsgroups: comp.os.os2.misc
Subject: Re: hardware failing
Date: Sun, 2 Aug 2020 09:41:31 +0200
Organization: solani.org
Message-ID: <rg5qnb$sh9$1@solani.org>
References: <rg5631$6jn$1@gioia.aioe.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Sun, 2 Aug 2020 07:41:31 -0000 (UTC)
Injection-Info: solani.org;
logging-data="29225"; mail-complaints-to="abuse@news.solani.org"
User-Agent: Mozilla/5.0 (OS/2; Warp 4.5; rv:45.0) Gecko/20100101
Thunderbird/45.8.0
X-User-ID: eJwFwQkBACAIBLBKIBxoHN7+EdwgxlauBlMsNrZRnZGJezJny+iV31bGOnCCeFQl9knJ5T4zhfZWWietD4r5Fnk=
In-Reply-To: <rg5631$6jn$1@gioia.aioe.org>
Cancel-Lock: sha1:9IplOPCICHOqUhyeqGlRdpmOQp8=
View all headers

I think the component that is most likely to fail is the power supply.
In particular, if it is too small in power.
I cannot say for sure if this is the problem with your system but power
supplies are not that expensive (compared to the MOBO at least) and
therefore I would give that a try.
And make sure if offers plenty of Watts. Pick a larger one than what you
currently have.

Lars

On 02.08.20 03.49, tholen@antispam.ham wrote:
> One of my OS/2 systems must be pushing 20 years old and is starting to
> show signs of hardware failure. I'm faced with the choice of either
> getting new, faster, more modern hardware and fighting with OS/2
> compatibility issues, or trying to repair the existing system, which
> is still surprisingly capable despite its age. The trick to the
> latter choice is to figure out what component is failing.
....

Subject: Re: hardware failing
From: David Wade
Newsgroups: comp.os.os2.misc
Organization: A noiseless patient Spider
Date: Sun, 2 Aug 2020 09:09 UTC
References: 1 2
Path: eternal-september.org!news.eternal-september.org!reader01.eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: g4ugm@dave.invalid (David Wade)
Newsgroups: comp.os.os2.misc
Subject: Re: hardware failing
Date: Sun, 2 Aug 2020 10:09:57 +0100
Organization: A noiseless patient Spider
Lines: 24
Message-ID: <rg5vt5$r5s$1@dont-email.me>
References: <rg5631$6jn$1@gioia.aioe.org> <rg5qnb$sh9$1@solani.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Sun, 2 Aug 2020 09:09:57 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="04f29bddef65a3c2661758ceb4c18e8a";
logging-data="27836"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18om8g27xcdYkuIQUZrHyvk"
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101
Thunderbird/68.11.0
Cancel-Lock: sha1:bGVitZHYgBAsmdLNKZxopp1FD+M=
In-Reply-To: <rg5qnb$sh9$1@solani.org>
Content-Language: en-GB
View all headers

On 02/08/2020 08:41, Lars Erdmann wrote:
> I think the component that is most likely to fail is the power supply.
> In particular, if it is too small in power.
> I cannot say for sure if this is the problem with your system but power
> supplies are not that expensive (compared to the MOBO at least) and
> therefore I would give that a try.
> And make sure if offers plenty of Watts. Pick a larger one than what you
> currently have.
>
> Lars
>
> On 02.08.20 03.49, tholen@antispam.ham wrote:
>> One of my OS/2 systems must be pushing 20 years old and is starting to
>> show signs of hardware failure.  I'm faced with the choice of either
>> getting new, faster, more modern hardware and fighting with OS/2
>> compatibility issues, or trying to repair the existing system, which
>> is still surprisingly capable despite its age.  The trick to the
>> latter choice is to figure out what component is failing.
> ...

Almost certainly a capacitor dried up. Its worth getting the caps on the
Mother board and in the PSU replaced.

Dave

Subject: Re: hardware failing
From: Marcel Mueller
Newsgroups: comp.os.os2.misc
Organization: MB-NET.NET for Open-News-Network e.V.
Date: Sun, 2 Aug 2020 10:40 UTC
References: 1
Path: eternal-september.org!news.eternal-september.org!reader01.eternal-september.org!reader01.eternal-september.org!feeder.eternal-september.org!news.mb-net.net!open-news-network.org!.POSTED.2a02:908:1b19:ad20:a00:27ff:fe8d:1f8d!not-for-mail
From: news.5.maazl@spamgourmet.org (Marcel Mueller)
Newsgroups: comp.os.os2.misc
Subject: Re: hardware failing
Date: Sun, 2 Aug 2020 12:40:40 +0200
Organization: MB-NET.NET for Open-News-Network e.V.
Message-ID: <rg657a$teh$1@gwaiyur.mb-net.net>
References: <rg5631$6jn$1@gioia.aioe.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=iso-8859-15; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Sun, 2 Aug 2020 10:40:42 -0000 (UTC)
Injection-Info: gwaiyur.mb-net.net; posting-host="2a02:908:1b19:ad20:a00:27ff:fe8d:1f8d";
logging-data="30161"; mail-complaints-to="abuse@open-news-network.org"
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101
Thunderbird/68.10.0
Cancel-Lock: sha1:YAuLRVwsnv6CHX0YFHzLn28pF0Q=
X-User-ID: U2FsdGVkX1/M2JgOfHyaDCPtvRqC31tUPZvUzCJk7S/QJE9KldjoR6cJAkQdFcmv
In-Reply-To: <rg5631$6jn$1@gioia.aioe.org>
Content-Language: en-US
View all headers

Am 02.08.20 um 03:49 schrieb tholen@antispam.ham:
> I've not yet noticed any pattern to the random reboots. The machine
> can be up for many days when it decides to just stop without any kind
> of error message and reboot. However, by far the most frequent
> occurrence of a failure is associated with the reboot process itself,
> specifically when the white square and OS/2 appear in the upper left
> corner of the screen. If it gets past that point, the system can
> remain up for days at a time, but I've seen the system reboot process
> interrupted at that particular point many consecutive times. Could
> be an ambient temperature issue, with some component having become
> particularly sensitive to overheating.
>
> This much I can say. The Adaptec 29160 SCSI bus scan happens just
> fine. The system can boot from a MEMTEST 3.0 bootable CD just fine
> and run tests on memory for hours without failure. Furthermore, the
> machine has error-correcting memory, and the BIOS hasn't recorded any
> events, so I'm fairly confident that the SCSI adapter and memory aren't
> the source of the problem.

Most likely you have unstable power supply. Check all electrolytic
capacitors on the main board and inside the power supply for bulges. If
one is positive it is defective. Unfortunately the opposite does not
always apply.

Whether you are able to replace this capacitor on your own is another
question. In the power supply this is quite easy. On the main board you
need soldering experience and a powerful soldering iron.

Alternatively you may replace the defective component. But do not expect
any old power supply fond somewhere to work. I had at most 3 consecutive
defective power supplies in one day.

Do not care much on the rated maximum power. It is almost never the
cause of problems. Larger power supplies will primarily consume more
power as they are less efficient at the same output.

> I have Boot Manager installed, and that always comes up just fine, so
> the Boot Manager partition appears to be okay, as is the disk controller.
> I have the operating system installed on two different partitions, both
> on the same physical disk as the Boot Manager partition. Doesn't matter
> which one I try to boot from; I get the reboot when "white square OS/2"
> appears with either, so I doubt the failure is associated with a
> particular disk partition.

Typically defective capacitors do not primarily cause problems at high
power supply. More often they cause problems at with load changes. And
the first major load change is when the kernel activates power saving
states of CPU. The BIOS of older hardware usually doesn't do so.

> Does anybody familiar with the details of the OS/2 boot process know
> exactly what is happening immediately after the "white square OS/2"
> appears? Narrowing down the list of possible failing components is
> the goal here.

You know Alt-F2? It will show you the drivers loaded.

But it is not too likely that it will help you much to diagnose hardware
problems. Almost any driver may fail because of this. You will just
blame the first one that executes some code that can no longer be
executed correctly.

Marcel

Subject: Re: hardware failing
From: Dave Yeo
Newsgroups: comp.os.os2.misc
Organization: Aioe.org NNTP Server
Date: Tue, 4 Aug 2020 06:33 UTC
References: 1
Path: eternal-september.org!news.eternal-september.org!reader01.eternal-september.org!reader01.eternal-september.org!feeder.eternal-september.org!aioe.org!.POSTED.7k29vX4qWi6wkHK1tfOB7w.user.gioia.aioe.org!not-for-mail
From: dave.r.yeo@gmail.com (Dave Yeo)
Newsgroups: comp.os.os2.misc
Subject: Re: hardware failing
Date: Mon, 3 Aug 2020 23:33:21 -0700
Organization: Aioe.org NNTP Server
Lines: 12
Message-ID: <rgavfj$cts$1@gioia.aioe.org>
References: <rg5631$6jn$1@gioia.aioe.org>
NNTP-Posting-Host: 7k29vX4qWi6wkHK1tfOB7w.user.gioia.aioe.org
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
X-Complaints-To: abuse@aioe.org
User-Agent: Mozilla/5.0 (OS/2; Warp 4.5; rv:45.0) Gecko/20100101
Thunderbird/45.8.0
X-Notice: Filtered by postfilter v. 0.9.2
View all headers

On 08/01/20 06:49 PM, tholen@antispam.ham wrote:
> One of my OS/2 systems must be pushing 20 years old and is starting to
> show signs of hardware failure.

While the others are likely correct about the power supply, or worse, a
system board capacitor, I'd start with the easiest thing. Unplug
everything and plug it in again. My sons computer recently died and
wouldn't boot, he removed the ram and reinserted it and it came back to
life and has been working fine. Likely just some oxidation in his case
and possibly yours. It's an easy thing to do. I've also heard of cards
working themselves slightly loose.
Dave

Subject: Re: hardware failing
From: Dan Camp
Newsgroups: comp.os.os2.misc
Organization: Aioe.org NNTP Server
Date: Sun, 16 Aug 2020 05:57 UTC
References: 1
Path: eternal-september.org!news.eternal-september.org!reader01.eternal-september.org!reader01.eternal-september.org!feeder.eternal-september.org!aioe.org!.POSTED.1nJ3i4lNmvIm98n050D+Xw.user.gioia.aioe.org!not-for-mail
From: dannycamps@attglobal.net (Dan Camp)
Newsgroups: comp.os.os2.misc
Subject: Re: hardware failing
Date: Sun, 16 Aug 2020 00:57:03 -0500
Organization: Aioe.org NNTP Server
Lines: 33
Message-ID: <rhaeb1$1jvp$1@gioia.aioe.org>
References: <rg5631$6jn$1@gioia.aioe.org>
Reply-To: Dan Camp <dannycamps@spamsucks.attglobal.net>
NNTP-Posting-Host: 1nJ3i4lNmvIm98n050D+Xw.user.gioia.aioe.org
Mime-Version: 1.0
Content-Type: text/plain; charset="US-ASCII"
Content-Transfer-Encoding: 8bit
X-Complaints-To: abuse@aioe.org
X-Notice: Filtered by postfilter v. 0.9.2
X-Newsreader: Internet Adventurer v1.50 for OS/2 - Unregistered
View all headers

On Sun, 2 Aug 2020 01:49:22 +0000 (UTC), tholen@antispam.ham wrote:

>One of my OS/2 systems must be pushing 20 years old and is starting to
>show signs of hardware failure. I'm faced with the choice of either
>getting new, faster, more modern hardware and fighting with OS/2
>compatibility issues, or trying to repair the existing system, which
>is still surprisingly capable despite its age. The trick to the
>latter choice is to figure out what component is failing.
>

Just throwing this out there as a possible workaround. I recently
pulled out an old OS/2 machine I hadn't booted in over 10 years because
I needed to get some files off it (to ironically build a newer machine
running ArcaOS).

Rather than risking booting up 20+ year old equipment and having who
knows what happen, I was able to use VMWare's standalone VM convertor
software to do a bare-metal conversion of my OS/2 machine to a VM which
I can run in VMWare Player.

The benefit of doing this is that I now have access to my OS/2 machine
in its entirety running virtually on a modern machine.

===================================
dannycamps@spamsucks.attglobal.net
(Change the obvious to reply)

Windows 9x -n- (win-doze): A 32 bit Extension to
a 16 bit Graphical Shell of an 8 bit Operating System
originally coded for a 4 bit Processor by a 2 bit
company that can't stand one bit of competition!!

1

rocksolid light 0.9.8
clearnet tor