Rocksolid Light

News from da outaworlds

mail  files  register  groups  login

Message-ID:  

Your lucky number has been disconnected.


comp / comp.lang.tcl / Performance of list / array / dict compared

SubjectAuthor
* Performance of list / array / dict comparedRodionGork
+* Re: Performance of list / array / dict comparedGerald Lester
|`* Re: Performance of list / array / dict comparedRodionGork
| `* Re: Performance of list / array / dict comparedRich
|  `- Re: Performance of list / array / dict comparedgustafn
`- Re: Performance of list / array / dict comparedRich

1
Subject: Performance of list / array / dict compared
From: RodionGork
Newsgroups: comp.lang.tcl
Organization: novaBBS
Date: Sun, 18 Aug 2024 07:41 UTC
Path: eternal-september.org!news.eternal-september.org!feeder3.eternal-september.org!i2pn.org!i2pn2.org!.POSTED!not-for-mail
From: rodiongork@github.com (RodionGork)
Newsgroups: comp.lang.tcl
Subject: Performance of list / array / dict compared
Date: Sun, 18 Aug 2024 07:41:37 +0000
Organization: novaBBS
Message-ID: <96ddc76b99ce6bf7397a5158fece845f@www.novabbs.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Info: i2pn2.org;
logging-data="2979712"; mail-complaints-to="usenet@i2pn2.org";
posting-account="FK20xSPHkh3K4vnO8u2oiUWWGFHCzgkK4jO78trwjP4";
User-Agent: Rocksolid Light
X-Spam-Checker-Version: SpamAssassin 4.0.0
X-Rslight-Posting-User: 32ecc5e38066f1efcacd4ff0a351d3cb52726446
X-Rslight-Site: $2y$10$lAvj0jhfXjK6.KliTRXt3.o.h8lkQRoCefgYu15OPN3a1qQ46ToCy
View all headers

Hi Friends!

I've seen here curious thread on comparing TCL speed with Python - and
as a "sequel" to it here is more TCLish observation.

It happened that I was also measuring languages performance (for
practical purpose - to get understanding how much calculations would fit
in 1 second limited sandbox on my site). There are two tests - for plain
integer calculations - and for calculations involving array.

I initially used list as array and while performance is somewhat lower
than with other popular scripting languages, I thought it is quite
decent, regarding list implementation (I thought it is a kind of
space-separated string by then).

Then I browsed TCL tutorial and rewrote implementations using array and
finally, dict. They were
significantly worse, which is explainable as they are not necessarily
tuned to work as linear array - but I was somewhat surprised to see the
"dict" is the worst of all (perhaps it in improved in the versions above
8.5):

C (long long): 4.28
PHP: 11.99
Python3: 42.57
TCL (list): 63.30
TCL (array): 104.78
TCL (dict): 112.41
Lua: 33.75

Implementation is here and you are welcome to check whether I missed
some obvious way to improve results:
https://github.com/rodiongork/languages-benchmark

As a sidenote, PHP and Lua use single version of array for both "linear"
and "dict-like" storage.

--
to email me substitute github with gmail please

Subject: Re: Performance of list / array / dict compared
From: Gerald Lester
Newsgroups: comp.lang.tcl
Organization: fastusenet - www.fastusenet.org
Date: Sun, 18 Aug 2024 13:28 UTC
References: 1
Path: eternal-september.org!news.eternal-september.org!feeder3.eternal-september.org!tncsrv06.tnetconsulting.net!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx44.iad.POSTED!not-for-mail
MIME-Version: 1.0
User-Agent: Mozilla Thunderbird
Subject: Re: Performance of list / array / dict compared
Newsgroups: comp.lang.tcl
References: <96ddc76b99ce6bf7397a5158fece845f@www.novabbs.com>
Content-Language: en-US
From: Gerald.Lester@gmail.com (Gerald Lester)
In-Reply-To: <96ddc76b99ce6bf7397a5158fece845f@www.novabbs.com>
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Lines: 44
Message-ID: <JFmwO.111824$KuXa.56578@fx44.iad>
X-Complaints-To: abuse@fastusenet.org
NNTP-Posting-Date: Sun, 18 Aug 2024 13:28:09 UTC
Organization: fastusenet - www.fastusenet.org
Date: Sun, 18 Aug 2024 08:28:09 -0500
X-Received-Bytes: 2347
View all headers

On 8/18/24 02:41, RodionGork wrote:
> Hi Friends!
>
> I've seen here curious thread on comparing TCL speed with Python - and
> as a "sequel" to it here is more TCLish observation.
>
> It happened that I was also measuring languages performance (for
> practical purpose - to get understanding how much calculations would fit
> in 1 second limited sandbox on my site). There are two tests - for plain
> integer calculations - and for calculations involving array.
>
> I initially used list as array and while performance is somewhat lower
> than with other popular scripting languages, I thought it is quite
> decent, regarding list implementation (I thought it is a kind of
> space-separated string by then).
>
> Then I browsed TCL tutorial and rewrote implementations using array and
> finally, dict. They were
> significantly worse, which is explainable as they are not necessarily
> tuned to work as linear array - but I was somewhat surprised to see the
> "dict" is the worst of all (perhaps it in improved in the versions above
> 8.5):
>
>
> C (long long): 4.28
> PHP: 11.99
> Python3: 42.57
> TCL (list): 63.30
> TCL (array): 104.78
> TCL (dict): 112.41
> Lua: 33.75
>
> Implementation is here and you are welcome to check whether I missed
> some obvious way to improve results:
> https://github.com/rodiongork/languages-benchmark
>
> As a sidenote, PHP and Lua use single version of array for both "linear"
> and "dict-like" storage.
>

Python has the equivalent of Lists, Arrays, and Dictionaries -- which
did you use?

For calculation, C or Golang would likely be best.

Subject: Re: Performance of list / array / dict compared
From: RodionGork
Newsgroups: comp.lang.tcl
Organization: novaBBS
Date: Sun, 18 Aug 2024 14:08 UTC
References: 1 2
Path: eternal-september.org!news.eternal-september.org!feeder3.eternal-september.org!i2pn.org!i2pn2.org!.POSTED!not-for-mail
From: rodiongork@github.com (RodionGork)
Newsgroups: comp.lang.tcl
Subject: Re: Performance of list / array / dict compared
Date: Sun, 18 Aug 2024 14:08:46 +0000
Organization: novaBBS
Message-ID: <dd2b683b86c31d62c2fc9919ffa6179e@www.novabbs.com>
References: <96ddc76b99ce6bf7397a5158fece845f@www.novabbs.com> <JFmwO.111824$KuXa.56578@fx44.iad>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Info: i2pn2.org;
logging-data="3011025"; mail-complaints-to="usenet@i2pn2.org";
posting-account="FK20xSPHkh3K4vnO8u2oiUWWGFHCzgkK4jO78trwjP4";
User-Agent: Rocksolid Light
X-Rslight-Site: $2y$10$4cuxqbV9nlHzDWvbqwrGrOnzPJxuTujFX9PwU20eIFP4o3y19en2y
X-Spam-Checker-Version: SpamAssassin 4.0.0
X-Rslight-Posting-User: 32ecc5e38066f1efcacd4ff0a351d3cb52726446
View all headers

> Python has the equivalent of Lists, Arrays, and Dictionaries -- which
> did you use?

in Python the plain list is used (well, these details could be quickly
seen by the link to sources above) - as I mentioned in TCL I decided to
try other structures only because I thought list implementation could be
not very effective internally, e.g. due to historical reasons...

> For calculation, C or Golang would likely be best.

Not necessarily, any language compiled to native codes will do
similarly. Moreover, there is optimized version of Python - and there is
JIT-version for Lua, while Java uses JIT anyway, so they results are
very close:

Java: 1.60
Pypy3: 7.31
LuaJit: 2.18

Perhaps someone may one day try to use JIT in TCL also (perhaps even
borrowing it from Lua may work)

--
to email me substitute github with gmail please

Subject: Re: Performance of list / array / dict compared
From: Rich
Newsgroups: comp.lang.tcl
Organization: A noiseless patient Spider
Date: Sun, 18 Aug 2024 17:03 UTC
References: 1
Path: eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: rich@example.invalid (Rich)
Newsgroups: comp.lang.tcl
Subject: Re: Performance of list / array / dict compared
Date: Sun, 18 Aug 2024 17:03:14 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 80
Message-ID: <v9t9gi$2euft$2@dont-email.me>
References: <96ddc76b99ce6bf7397a5158fece845f@www.novabbs.com>
Injection-Date: Sun, 18 Aug 2024 19:03:14 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="27a8b87db7433b5fa1dd735d9d0e6b28";
logging-data="2587133"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18aX0TtZcSfaKvqtcwZHo1T"
User-Agent: tin/2.6.1-20211226 ("Convalmore") (Linux/5.15.139 (x86_64))
Cancel-Lock: sha1:DUGsPsPn1WKZ44DTZ6Zh5UUqrd0=
View all headers

RodionGork <rodiongork@github.com> wrote:
> Hi Friends!
>
> I've seen here curious thread on comparing TCL speed with Python - and
> as a "sequel" to it here is more TCLish observation.
>
> It happened that I was also measuring languages performance (for
> practical purpose - to get understanding how much calculations would fit
> in 1 second limited sandbox on my site). There are two tests - for plain
> integer calculations - and for calculations involving array.
>
> I initially used list as array and while performance is somewhat lower
> than with other popular scripting languages, I thought it is quite
> decent, regarding list implementation (I thought it is a kind of
> space-separated string by then).

Tcl lists have not been "space-separated strings" since the advent of
Tcl 8.0. Which according to this page
(http://tcl.tk/software/tcltk/8.0.html) was released March 9, 1999. So 25
years since Tcl's lists were "space-separated strings" (reality is more
complex, they were really "specially formatted strings" with "space
separated" being a simple subset of "specially formatted".

> Then I browsed TCL tutorial and rewrote implementations using array and
> finally, dict. They were
> significantly worse, which is explainable as they are not necessarily
> tuned to work as linear array - but I was somewhat surprised to see the
> "dict" is the worst of all (perhaps it in improved in the versions above
> 8.5):
>
>
> C (long long): 4.28
> PHP: 11.99
> Python3: 42.57
> TCL (list): 63.30
> TCL (array): 104.78
> TCL (dict): 112.41
> Lua: 33.75
>
> Implementation is here and you are welcome to check whether I missed
> some obvious way to improve results:
> https://github.com/rodiongork/languages-benchmark

Which one? There are two.

For Collaz, if you are really on 8.5, then wrapping everything that is
at global level inside a proc (i.e., everthing from line 10 to line
17), and calling that proc as the single global command, will gain a
bit of speed, as for 8.5 the bytecode compiler is limited in what it
can compile outside of procs.

For Primes (beyond the same "in a proc" for 8.5 above), in the 'list
version' using global is costing you a bit of performance (the
"linking" performed by gobal takes some time). Modifing "is_prime" to
take both x and primes as parameters will gain a bit of speed for the
list version.

The array and dict versions will be slower than the list version
because they will always be adding in the overhead of the hashmap
computation for looking up elements (no hashmap lookup in the list
version).

For the dict version (besides all the above), you /might/ gain some
speed using the [dict values] subcommand to get a list of values from
the dict, then iterating that list. I.e.:

foreach d [dict values $primes] {
}

Which should avoid performing all the hash computations to lookup each
element individually. But, that will still be creating a new list each
time your outer loop modifies the dict.

You also don't need [dict append] in the outer loop. The way you are
using the dict, doing [dict append] means paying the cost of a hash
computation and a single element list creation for each new element
added. A [dict set] will produce the same final dict string, but avoid
wrapping each 'prime' in a single element list in each dict slot,
saving that (small) overhead.

Subject: Re: Performance of list / array / dict compared
From: Rich
Newsgroups: comp.lang.tcl
Organization: A noiseless patient Spider
Date: Sun, 18 Aug 2024 17:05 UTC
References: 1 2 3
Path: eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: rich@example.invalid (Rich)
Newsgroups: comp.lang.tcl
Subject: Re: Performance of list / array / dict compared
Date: Sun, 18 Aug 2024 17:05:15 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 13
Message-ID: <v9t9ka$2euft$3@dont-email.me>
References: <96ddc76b99ce6bf7397a5158fece845f@www.novabbs.com> <JFmwO.111824$KuXa.56578@fx44.iad> <dd2b683b86c31d62c2fc9919ffa6179e@www.novabbs.com>
Injection-Date: Sun, 18 Aug 2024 19:05:15 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="27a8b87db7433b5fa1dd735d9d0e6b28";
logging-data="2587133"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19WydaNw1MifO3lA1TCXbnf"
User-Agent: tin/2.6.1-20211226 ("Convalmore") (Linux/5.15.139 (x86_64))
Cancel-Lock: sha1:4ezndtClQxvyaPzTLsFiLoEJ7PM=
View all headers

RodionGork <rodiongork@github.com> wrote:
>> Python has the equivalent of Lists, Arrays, and Dictionaries -- which
>> did you use?
>
> in Python the plain list is used (well, these details could be quickly
> seen by the link to sources above) - as I mentioned in TCL I decided to
> try other structures only because I thought list implementation could be
> not very effective internally, e.g. due to historical reasons...

Your 'history' is 25 years out of date. Tcl's lists have been O(1)
complexity indexed arrays for that long (so long as you don't shimmer
them to/from strings on every usage).

Subject: Re: Performance of list / array / dict compared
From: gustafn
Newsgroups: comp.lang.tcl
Organization: novaBBS
Date: Mon, 19 Aug 2024 08:35 UTC
References: 1 2 3 4
Path: eternal-september.org!news.eternal-september.org!feeder3.eternal-september.org!i2pn.org!i2pn2.org!.POSTED!not-for-mail
From: neumann@wu-wien.ac.at (gustafn)
Newsgroups: comp.lang.tcl
Subject: Re: Performance of list / array / dict compared
Date: Mon, 19 Aug 2024 08:35:29 +0000
Organization: novaBBS
Message-ID: <52b7365dfcec510a4600ab6b0daad03c@www.novabbs.com>
References: <96ddc76b99ce6bf7397a5158fece845f@www.novabbs.com> <JFmwO.111824$KuXa.56578@fx44.iad> <dd2b683b86c31d62c2fc9919ffa6179e@www.novabbs.com> <v9t9ka$2euft$3@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Info: i2pn2.org;
logging-data="3097296"; mail-complaints-to="usenet@i2pn2.org";
posting-account="p+4Dm87JGuJs8u5UavFXrfM1jr+8eOP5kauH444y/E8";
User-Agent: Rocksolid Light
X-Rslight-Posting-User: 5290a2eaf9a37a468f724ff23c0a97a2f047b83d
X-Rslight-Site: $2y$10$JTGnLlzcVkOwQiVeyu4AweKHwxkwgEHas.UM/SLZQ9aEiCN2o/b3O
X-Spam-Checker-Version: SpamAssassin 4.0.0
View all headers

Hi RodionGork,

I took a quick look at the "primes" examples in your comparison on the
GitHub page.

Using any data structures other than lists does not make sense for this
example.
One could get an improvement of about 5% by putting the outer loop into
a proc.

Most of the time in this example is spent in the "is_prime" proc.
One can get much bigger improvements by using critcl for the is_prime
function (see below):

baseline list 1766907.44 100.00
loop proc 1689220.00 95.60
is_prime_list_c 118298.50 6.70

This is in the spirit of thinking in "system languages" and "glue
languages"
by John Ousterhout, where one should find the right mix for the
applications,
when performance matters.

all the best
-g

===================================================================================
package require critcl

critcl::cproc is_prime_list_c {Tcl_Interp* interp list primes int x} int
{ int i;
for (i=0; i<primes.c; i++) {
int d;
if (Tcl_GetIntFromObj(interp, primes.v[i], &d) != TCL_OK) {
fprintf(stderr, "list element is not an integer: '%s'\n",
Tcl_GetString(primes.v[i]));
}
if (d*d > x) return 1;
if (x%d == 0) return 0;
}
return -1;
}

critcl::load
===================================================================================

===================================================================================
proc run_list_c {} {
set primes {2 3 5 7}
set n $::env(MAXN)

for {set i 9} {1} {incr i 2} {
if {[is_prime_list_c $primes $i]} {
lappend primes $i
if {[llength $primes] == $n} {
puts "primes\[$n\] = $i"
break
}
}
}
} ===================================================================================

1

rocksolid light 0.9.8
clearnet tor