Chapel VS Nim benchmarks

Current benchmark data was generated on Sun Dec 01 2024, full log can be found HERE

CONTRIBUTIONS are WELCOME!

[x86_64][4 cores] AMD EPYC 7763 64-Core Processor (Model 1)

* -m in a file name stands for multi-threading or multi-processing

* -i in a file name stands for direct intrinsics usage. (Usage of simd intrinsics via libraries is not counted)

* -ffi in a file name stands for non-stdlib FFI usage

* (You may find time < time(user) + time(sys) for some non-parallelized programs, the overhead is from GC or JIT compiler, which are allowed to take advantage of multi-cores as that's more close to real-world scenarios.)

binarytrees

Input: 18

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 793ms 24ms 34.9MB 777ms 3ms nim 2.2.0
nim 2.nim 986ms 7.4ms 34.2MB 967ms 7ms nim/clang 2.2.0
chapel 4.chpl 1950ms 11ms 66.4MB 1937ms 0ms chpl 1.31.0
chapel 3.chpl 2095ms 33ms 66.4MB 2087ms 3ms chpl 1.31.0

Input: 15

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 72ms 1.1ms 5.5MB 60ms 0ms nim 2.2.0
nim 2.nim 94ms 2.8ms 5.8MB 80ms 0ms nim/clang 2.2.0
chapel 4.chpl 193ms 3.0ms 36.4MB 183ms 7ms chpl 1.31.0
chapel 3.chpl 205ms 18ms 34.3MB 190ms 3ms chpl 1.31.0

coro-prime-sieve

Input: 4000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 2252ms 19ms 555.7MB 4367ms 70ms chpl 1.31.0
nim 1.nim timeout 0.0ms 545.3MB 2497ms 2050ms nim 2.2.0
nim 1.nim timeout 0.0ms 552.6MB 2520ms 2020ms nim/clang 2.2.0

Input: 1000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 196ms 3.7ms 508.7MB 307ms 43ms chpl 1.31.0
nim 1.nim 4405ms 8.9ms 519.8MB 2203ms 1773ms nim/clang 2.2.0
nim 1.nim 4519ms 16ms 519.3MB 2230ms 1857ms nim 2.2.0

edigits

Input: 250001

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1.chpl 80ms 1.0ms 36.4MB 73ms 0ms chpl 1.31.0

Input: 100000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1.chpl 41ms 1.3ms 34.4MB 30ms 7ms chpl 1.31.0

fasta

Input: 2500000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 5-m.chpl 130ms 27ms 32.4MB 217ms 0ms chpl 1.31.0
nim 2.nim 188ms 0.7ms 1.5MB 177ms 0ms nim 2.2.0
nim 2.nim 239ms 2.9ms 1.8MB 227ms 0ms nim/clang 2.2.0
nim 1.nim 569ms 1.0ms 1.8MB 433ms 123ms nim/clang 2.2.0
nim 1.nim 596ms 1.0ms 1.5MB 467ms 113ms nim 2.2.0

Input: 250000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 21ms 0.6ms 1.5MB 10ms 0ms nim 2.2.0
nim 2.nim 26ms 0.4ms 1.6MB 17ms 0ms nim/clang 2.2.0
chapel 5.chpl 29ms 0.1ms 32.4MB 27ms 0ms chpl 1.31.0
nim 1.nim 60ms 1.1ms 1.8MB 40ms 7ms nim/clang 2.2.0
nim 1.nim 63ms 1.0ms 1.5MB 43ms 7ms nim 2.2.0

helloworld

Input: QwQ

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
nim 1.nim 1.2ms 0.0ms 1.8MB 0ms 0ms nim/clang 2.2.0
nim 1.nim 1.2ms 0.2ms 1.6MB 0ms 0ms nim 2.2.0
chapel 1.chpl 16ms 0.7ms 32.8MB 7ms 3ms chpl 1.31.0

knucleotide

Input: 2500000_in

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 3-m.chpl 760ms 11ms 91.1MB 1427ms 7ms chpl 1.31.0

Input: 250000_in

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 3-m.chpl 122ms 1.7ms 92.9MB 200ms 7ms chpl 1.31.0

nbody

Input: 5000000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 319ms 1.8ms 1.8MB 310ms 0ms nim 2.2.0
chapel 2.chpl 334ms 4.0ms 32.8MB 317ms 7ms chpl 1.31.0
nim 2.nim 339ms 0.8ms 2.0MB 330ms 0ms nim/clang 2.2.0

Input: 500000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 34ms 0.6ms 1.8MB 23ms 0ms nim 2.2.0
nim 2.nim 36ms 0.2ms 2.0MB 27ms 0ms nim/clang 2.2.0
chapel 2.chpl 52ms 0.2ms 32.8MB 40ms 0ms chpl 1.31.0

pidigits

Input: 8000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 2.chpl 452ms 1.7ms 32.4MB 440ms 3ms chpl 1.31.0

Input: 4000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 2.chpl 121ms 1.7ms 34.4MB 117ms 3ms chpl 1.31.0

regex-redux

Input: 2500000_in

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 3.chpl 1395ms 6.8ms 224.1MB 1333ms 53ms chpl 1.31.0
nim 1.nim 1629ms 16ms 152.2MB 1600ms 17ms nim/clang 2.2.0
nim 1.nim 1651ms 7.0ms 152.1MB 1620ms 17ms nim 2.2.0

Input: 250000_in

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 3.chpl 170ms 2.9ms 53.0MB 143ms 20ms chpl 1.31.0
nim 1.nim 173ms 3.9ms 17.6MB 153ms 7ms nim/clang 2.2.0
nim 1.nim 175ms 2.2ms 16.5MB 163ms 0ms nim 2.2.0

secp256k1

Input: 2000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1.chpl 1235ms 79ms 33.1MB 1227ms 3ms chpl 1.31.0

Input: 500

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1.chpl 313ms 1.6ms 33.1MB 300ms 7ms chpl 1.31.0

spectral-norm

Input: 8000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 1933ms 21ms 33.0MB 3823ms 0ms chpl 1.31.0
nim 1.nim 3574ms 1.3ms 1.6MB 3563ms 0ms nim/clang 2.2.0
nim 1.nim 3593ms 7.6ms 1.4MB 3583ms 0ms nim 2.2.0
chapel 1.chpl 3709ms 10.0ms 32.8MB 3703ms 0ms chpl 1.31.0

Input: 4000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 496ms 1.0ms 32.8MB 960ms 0ms chpl 1.31.0
nim 1.nim 896ms 0.8ms 1.5MB 887ms 0ms nim/clang 2.2.0
nim 1.nim 904ms 3.3ms 1.3MB 893ms 0ms nim 2.2.0
chapel 1.chpl 946ms 5.3ms 32.8MB 937ms 7ms chpl 1.31.0

Input: 2000

lang code time stddev peak-mem mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 140ms 0.3ms 32.9MB 243ms 3ms chpl 1.31.0
nim 1.nim 226ms 0.6ms 1.5MB 213ms 0ms nim/clang 2.2.0
nim 1.nim 228ms 1.3ms 1.3MB 220ms 0ms nim 2.2.0
chapel 1.chpl 257ms 3.3ms 32.8MB 247ms 0ms chpl 1.31.0