rotten news relay - sci.stat.math - Re: Q direct observation of statistical comparison

Subject: Q direct observation of statistical comparison
From: Cosine
Newsgroups: sci.stat.math
Date: Mon, 12 Jun 2023 15:40 UTC

Hi:

A formal way to determine whether the effect of one random variable is greater than that of another is to perform a hypothesis test on the difference or ratio of the metric and check whether it is significant.

However, are there special cases in which one could determine this without performing the formal procedure above?

For example, when comparing the salaries of the domestic and foreign groups, the average salaries and the associated standard errors of the two groups are (Avg_d, Se_d) and (Avg_f, Se_f). Could we quickly answer the question of which group has the greater salary by directly inspecting these numbers? Say, when the confidence intervals of the two average salaries overlap greatly.

Subject: Re: Q direct observation of statistical comparison
From: Rich Ulrich
Newsgroups: sci.stat.math
Date: Mon, 12 Jun 2023 16:24 UTC
References: 1

On Mon, 12 Jun 2023 08:40:39 -0700 (PDT), Cosine <asecant@gmail.com>
wrote:

>Hi:
>
> A formal way to determine whether the effect of one random variable is greater than that of another is to perform a hypothesis test on the difference or ratio of the metric and check whether it is significant.
>
> However, are there special cases in which one could determine this without performing the formal procedure above?
>
> For example, when comparing the salaries of the domestic and foreign groups, the average salaries and the associated standard errors of the two groups are (Avg_d, Se_d) and (Avg_f, Se_f). Could we quickly answer the question of which group has the greater salary by directly inspecting these numbers? Say, when the confidence intervals of the two average salaries overlap greatly.

Hmm. You say "effect" a couple of times, suggesting
something more complicated, before you ask about means.

Means and their SDs are the basis of ordinary t-tests.
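
As a rough illustration of that comparison from the summary numbers alone, here is a minimal sketch. It assumes the two groups are independent and large enough for a normal approximation (a proper t-test would also need the two Ns). The function name and the salary figures are made up for the example. It also reports whether the two 95% intervals overlap, since overlapping intervals do not by themselves rule out a significant difference.

```python
import math

def compare_means(avg_d, se_d, avg_f, se_f, z_crit=1.96):
    """Compare two independent means given only each mean and its standard error.

    Returns (z, two-sided p under a normal approximation, intervals_overlap).
    """
    # Standard error of a difference of two independent means.
    se_diff = math.sqrt(se_d ** 2 + se_f ** 2)
    z = (avg_d - avg_f) / se_diff
    # Two-sided tail area of the standard normal distribution.
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    # Individual 95% intervals, to see whether they overlap.
    lo_d, hi_d = avg_d - z_crit * se_d, avg_d + z_crit * se_d
    lo_f, hi_f = avg_f - z_crit * se_f, avg_f + z_crit * se_f
    intervals_overlap = lo_d <= hi_f and lo_f <= hi_d
    return z, p_two_sided, intervals_overlap

# Hypothetical numbers (in thousands), not taken from any real data set.
z, p, overlap = compare_means(avg_d=50.0, se_d=1.0, avg_f=53.5, se_f=1.0)
```

With these made-up numbers the two intervals overlap, yet the two-sided p is about 0.013, so eyeballing the overlap is not a substitute for the test on the difference.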

"Directly observing" the data? Do you want something like this?

https://www.qimacros.com/hypothesis-testing/tukey-quick-test-excel/
Tukey's Quick Test can be used when:

  - There are two unpaired samples of similar size that overlap each
    other. The ratio of sizes should not exceed 4:3.
  - One sample contains the highest value and the other sample contains
    the lowest value. One sample cannot contain both the highest and the
    lowest value, nor can both samples have the same high or low value.

Count the points at the high end that lie above every value of the
other sample, and the points at the low end that lie below every value
of the other sample. The sum of the two counts is significant at
roughly the 5%, 1% and 0.1% levels when it reaches 7, 10, and 13 points.
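
The counting rule described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and sample values are invented, strict comparisons are used so tied values are simply not counted, and a count of 0 is returned whenever the conditions for the test are not met.

```python
def tukey_quick_count(a, b):
    """Return the end count for Tukey's quick test, or 0 if it does not apply."""
    max_a, min_a, max_b, min_b = max(a), min(a), max(b), min(b)
    # Both samples share the same high or the same low value: not applicable.
    if max_a == max_b or min_a == min_b:
        return 0
    if max_a > max_b and min_a > min_b:
        high, low = a, b          # a holds the highest value, b the lowest
    elif max_b > max_a and min_b > min_a:
        high, low = b, a
    else:
        return 0                  # one sample holds both extremes
    # Points of the high sample above every value of the low sample.
    upper = sum(1 for x in high if x > max(low))
    # Points of the low sample below every value of the high sample.
    lower = sum(1 for x in low if x < min(high))
    return upper + lower

def approximate_level(count):
    """Map a count to the rough significance level quoted above (None if below 7)."""
    for cutoff, level in ((13, 0.001), (10, 0.01), (7, 0.05)):
        if count >= cutoff:
            return level
    return None

# Hypothetical samples of equal size.
a = [12, 14, 15, 17, 19, 21, 22, 24]
b = [5, 7, 8, 9, 11, 13, 16, 18]
count = tukey_quick_count(a, b)   # 4 high-end points + 5 low-end points = 9
```

Here the count of 9 passes the cutoff of 7 but not 10, so the difference would be called significant at roughly the 5% level.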

IIRC, the textbook that first showed me this test quoted Tukey
exactly. Tukey described the test AND its critical values in two
sentences. I was disappointed, a few years later, when I saw
that the newer edition of the textbook had dropped the topic.

If you want a full test on ranks, editors will prefer the K-S test
on ranks.

--
Rich Ulrich

Subject: Re: Q direct observation of statistical comparison
From: Rich Ulrich
Newsgroups: sci.stat.math
Date: Wed, 14 Jun 2023 04:43 UTC
References: 1 2

By the way -- I remembered the Tukey Quick Test because I
kept it in mind and used it a number of times, for my own
confirmation when browsing data.

I've seen a textbook (I forget whose) that had an appendix
with different cutoffs for various pairs of sample Ns. But I
would not suggest trying to publish something relying on it.

I speculate that the "4:3" ratio of Ns (mentioned above) is a
pretty good match to where the cutoffs are exact.

Tukey's two sentences did not specify the ratio of sample sizes,
and called it 'approximate'.

--
Rich Ulrich

Subject: Re: Q direct observation of statistical comparison
From: Bruce Weaver
Newsgroups: sci.stat.math
Date: Wed, 28 Jun 2023 18:27 UTC
References: 1 2 3

I don't recall hearing about this test before. Apparently, it is sometimes called the Tukey-Duckworth (quick) test.

https://en.wikipedia.org/wiki/Tukey%E2%80%93Duckworth_test

Subject: Re: Q direct observation of statistical comparison
From: Rich Ulrich
Newsgroups: sci.stat.math
Date: Thu, 29 Jun 2023 04:15 UTC
References: 1 2 3 4

On Wed, 28 Jun 2023 11:27:08 -0700 (PDT), Bruce Weaver
<bweaver@lakeheadu.ca> wrote:

>I don't recall hearing about this test before. Apparently, it is sometimes called the Tukey-Duckworth (quick) test.
>
>https://en.wikipedia.org/wiki/Tukey%E2%80%93Duckworth_test

Top-posting? Okay.

Okay. It adds that Duckworth requested a simple test, usable in
the field, and this is what Tukey provided. I'm not surprised if he
gave us some other Quick tests -- so someone added Duckworth?

Tukey was a prolific statistician, with a different perspective from
most of us. I gained useful insights from reading his textbooks,
though I still wonder if they are 'simple' enough to be used in
the intro courses they are written for. I think I got much of my
perspective on the proper use of transformations from his chapters
on the subject.

There is a paper on presenting data with useful graphics (if I recall
the topic rightly) which lists Tukey, whose ideas it presented, as
author #9; a statistician friend said that his professors had referred
to it as "et al. and Tukey".

--
Rich Ulrich

Subject: Re: Q direct observation of statistical comparison
From: David Jones
Newsgroups: sci.stat.math
Organization: A noiseless patient Spider
Date: Thu, 29 Jun 2023 11:56 UTC
References: 1 2 3 4 5

A problem seems to be in "One sample cannot contain both the highest
and the lowest value, nor can both samples have the same high or low
value."

Is a test a test, if you can't always apply it? Is there some action
advised if the test can't be applied?

Subject: Re: Q direct observation of statistical comparison
From: Rich Ulrich
Newsgroups: sci.stat.math
Date: Fri, 30 Jun 2023 18:09 UTC
References: 1 2 3 4 5 6

On Thu, 29 Jun 2023 11:56:53 -0000 (UTC), "David Jones"
<dajhawkxx@nowherel.com> wrote:

>A problem seems to be in "One sample cannot contain both the highest
>and the lowest value, nor can both samples have the same high or low
>value."
>
>Is a test a test, if you can't always apply it?

A philosophical question? "Can't" or "shouldn't, because there
is no power or useful table of p-values"?

Pragmatically -- If I have a computer program for it, my program
will give SOME answer. The table of p-values must be a problem,
but the program can return '0' for the sum of counts as a safe answer
when there's a doubt. I wonder how robust the Quick test is when the
data are discrete and (therefore) can have a tie at one end, while
the other end can be counted? I don't know whether the test is robust
to that violation of its assumptions. Monte Carlo randomization
on all the data values could provide an ad-hoc assessment of p.
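
A minimal sketch of such a randomization, under stated assumptions: the helper names, the seed, the number of shuffles and the sample values are all invented for illustration, and the count returns 0 whenever the extremes are tied or sit in one sample. Because the same rule is applied to every shuffle, ties in discrete data are handled consistently without any table.

```python
import random

def end_count(a, b):
    """Tukey quick-test count; 0 when extremes are tied or both sit in one sample."""
    if max(a) == max(b) or min(a) == min(b):
        return 0
    if max(a) > max(b) and min(a) > min(b):
        high, low = a, b
    elif max(b) > max(a) and min(b) > min(a):
        high, low = b, a
    else:
        return 0
    top, bottom = max(low), min(high)
    return sum(x > top for x in high) + sum(x < bottom for x in low)

def randomization_p(a, b, n_shuffles=10000, seed=0):
    """Share of random relabelings whose count is at least the observed count."""
    rng = random.Random(seed)
    observed = end_count(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(pooled)                      # reassign group labels at random
        if end_count(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_shuffles + 1)

# Hypothetical samples; the observed count here is 9.
a = [12, 14, 15, 17, 19, 21, 22, 24]
b = [5, 7, 8, 9, 11, 13, 16, 18]
p = randomization_p(a, b)
```

When the observed count is 0, every shuffle matches or beats it and the estimate is 1, which agrees with treating '0' as the safe answer.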

Assumptions?
The K-S rank test as a test for location has the ASSUMPTION that
the distributions are otherwise similar and differ by the location
parameter. When variances are vastly different, the KS test can
'reject' in either direction, depending on which end the counting
starts from.

No Power?
I've seen a lot of t-tests and contingency tables computed when
the power is virtually nil. For contingency tables and 'exact' tests,
the power for alpha= 0.05 might be exactly nil, for Ns too small.
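
The "exactly nil" case can be shown with a little arithmetic. The sketch below assumes a two-sample exact rank (or permutation) test with no ties, where the most extreme of the C(n1+n2, n1) equally likely arrangements gives the smallest one-sided p, and the two-sided p is twice that. The function name is made up for the example, and a contingency-table test would need its own enumeration.

```python
from math import comb

def min_two_sided_p(n1, n2):
    """Smallest two-sided p an exact two-sample rank test can return (no ties)."""
    one_sided = 1 / comb(n1 + n2, n1)   # only the most extreme arrangement
    return min(1.0, 2 * one_sided)

# With 3 per group the best possible result is p = 2/20 = 0.1,
# so alpha = 0.05 can never be reached; with 4 per group it is 2/70.
smallest_3 = min_two_sided_p(3, 3)
smallest_4 = min_two_sided_p(4, 4)
```

So with three observations per group no data set, however extreme, can reject at the 5% level, while four per group is the first size at which a two-sided rejection becomes possible.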

I have told consultees, "You don't really have a test there, because
the N is too small."

> Is there some action
>advised if the test can't be applied?

Use a test with other assumptions?

--
Rich Ulrich

You are going to be a stickler about assumptions and the
table of p-values?

Communicate! It can't make things any worse.
