Chapel VS Nim benchmarks

Current benchmark data was generated on Tue Dec 31 2024; the full log can be found HERE.

CONTRIBUTIONS are WELCOME!

[x86_64][4 cores] AMD EPYC 7763 64-Core Processor (Model 1)

* -m in a file name stands for multi-threading or multi-processing

* -i in a file name stands for direct intrinsics usage. (Usage of SIMD intrinsics via libraries is not counted.)

* -ffi in a file name stands for non-stdlib FFI usage

* (You may find time < time(user) + time(sys) for some non-parallelized programs. The overhead comes from the GC or JIT compiler, which are allowed to take advantage of multiple cores because that is closer to real-world scenarios.)
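The notes above can be made concrete with a small sketch. It assumes each result row has nine whitespace-separated fields in the order lang, code, time, stddev, peak-mem, time(user), time(sys), compiler, compiler/runtime. The helper names (`parse_row`, `name_flags`, `cpu_ratio`) are hypothetical and are not part of the benchmark tooling. The sketch reads the markers from a file name and computes (time(user) + time(sys)) / time, which is roughly the number of cores kept busy.

```python
import re

# Assumed column order of one result row on this page.
COLUMNS = ("lang", "code", "time", "stddev", "peak_mem",
           "time_user", "time_sys", "compiler", "version")

def parse_ms(value):
    """Convert a string such as '741ms' to a float; return None for 'timeout'."""
    match = re.fullmatch(r"([0-9.]+)ms", value)
    return float(match.group(1)) if match else None

def parse_row(line):
    """Split one whitespace-separated result row into a dict."""
    row = dict(zip(COLUMNS, line.split()))
    for key in ("time", "stddev", "time_user", "time_sys"):
        row[key] = parse_ms(row[key])
    return row

def name_flags(code):
    """Return the markers (m, i, ffi) found in a file name."""
    stem = code.rsplit(".", 1)[0]      # '1-m.chpl' -> '1-m'
    return set(stem.split("-")[1:])    # '1-m' -> {'m'}

def cpu_ratio(row):
    """(user + sys) / wall-clock time; None if the run timed out."""
    if row["time"] is None:
        return None
    return (row["time_user"] + row["time_sys"]) / row["time"]

# Two rows copied from the spectral-norm table (Input: 8000).
rows = [
    "chapel 1-m.chpl 1822ms 10.0ms 32.9MB 3613ms 0ms chpl 1.31.0",
    "nim 1.nim 3383ms 20ms 1.6MB 3370ms 0ms nim/clang 2.2.0",
]
for line in rows:
    row = parse_row(line)
    print(row["code"], sorted(name_flags(row["code"])), round(cpu_ratio(row), 2))
```

For the multi-threaded Chapel entry the ratio comes out close to 2, while the single-threaded Nim entry stays near 1. A ratio noticeably above 1 on a file without the -m marker would point to the GC or JIT overhead described in the last note.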

binarytrees

Input: 18

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 741ms 8.3ms 34.9MB 727ms 0ms nim 2.2.0
nim 2.nim 919ms 4.1ms 34.7MB 900ms 7ms nim/clang 2.2.0
chapel 4.chpl 1855ms 24ms 66.3MB 1843ms 3ms chpl 1.31.0
chapel 3.chpl 1925ms 21ms 66.3MB 1917ms 0ms chpl 1.31.0

Input: 15

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 67ms 0.5ms 5.5MB 57ms 0ms nim 2.2.0
nim 2.nim 85ms 2.2ms 5.8MB 77ms 0ms nim/clang 2.2.0
chapel 3.chpl 183ms 2.0ms 36.3MB 173ms 0ms chpl 1.31.0
chapel 4.chpl 185ms 4.8ms 36.4MB 180ms 0ms chpl 1.31.0

coro-prime-sieve

Input: 4000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 2149ms 35ms 527.8MB 4187ms 63ms chpl 1.31.0
nim 1.nim timeout 0.0ms 561.4MB 2450ms 2080ms nim 2.2.0
nim 1.nim timeout 0.0ms 570.6MB 2543ms 1987ms nim/clang 2.2.0

Input: 1000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 183ms 18ms 488.7MB 287ms 40ms chpl 1.31.0
nim 1.nim 4127ms 12ms 519.8MB 2037ms 1663ms nim/clang 2.2.0
nim 1.nim 4262ms 22ms 519.3MB 2010ms 1827ms nim 2.2.0

edigits

Input: 250001

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1.chpl 75ms 2.3ms 36.4MB 70ms 0ms chpl 1.31.0

Input: 100000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1.chpl 38ms 0.3ms 34.5MB 27ms 7ms chpl 1.31.0

fasta

Input: 2500000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 5-m.chpl 110ms 2.4ms 32.4MB 183ms 3ms chpl 1.31.0
nim 2.nim 174ms 1.9ms 1.5MB 160ms 0ms nim 2.2.0
nim 2.nim 226ms 6.1ms 1.8MB 213ms 0ms nim/clang 2.2.0
nim 1.nim 542ms 6.4ms 1.8MB 420ms 110ms nim/clang 2.2.0
nim 1.nim 569ms 4.4ms 1.5MB 470ms 87ms nim 2.2.0

Input: 250000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 20ms 0.4ms 1.5MB 10ms 0ms nim 2.2.0
nim 2.nim 25ms 1.4ms 1.8MB 17ms 0ms nim/clang 2.2.0
chapel 5.chpl 27ms 0.7ms 32.4MB 23ms 0ms chpl 1.31.0
nim 1.nim 58ms 0.7ms 1.8MB 30ms 17ms nim/clang 2.2.0
nim 1.nim 60ms 2.1ms 1.5MB 37ms 10ms nim 2.2.0

helloworld

Input: QwQ

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
nim 1.nim 1.0ms 0.0ms 1.6MB 0ms 0ms nim 2.2.0
nim 1.nim 1.1ms 0.1ms 1.8MB 0ms 0ms nim/clang 2.2.0
chapel 1.chpl 17ms 1.9ms 32.8MB 10ms 0ms chpl 1.31.0

knucleotide

Input: 2500000_in

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 3-m.chpl 719ms 7.0ms 102.9MB 1350ms 7ms chpl 1.31.0

Input: 250000_in

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 3-m.chpl 118ms 3.2ms 81.0MB 200ms 0ms chpl 1.31.0

nbody

Input: 5000000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 302ms 2.8ms 1.8MB 290ms 0ms nim 2.2.0
chapel 2.chpl 310ms 1.1ms 32.8MB 300ms 0ms chpl 1.31.0
nim 2.nim 330ms 8.5ms 2.0MB 320ms 0ms nim/clang 2.2.0

Input: 500000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
nim 2.nim 32ms 0.5ms 1.8MB 20ms 0ms nim 2.2.0
nim 2.nim 35ms 0.9ms 2.0MB 23ms 0ms nim/clang 2.2.0
chapel 2.chpl 49ms 1.1ms 32.9MB 43ms 0ms chpl 1.31.0

pidigits

Input: 8000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 2.chpl 428ms 1.0ms 34.2MB 417ms 0ms chpl 1.31.0

Input: 4000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 2.chpl 115ms 2.9ms 34.4MB 103ms 7ms chpl 1.31.0

regex-redux

Input: 2500000_in

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 3.chpl 1315ms 8.8ms 224.1MB 1263ms 40ms chpl 1.31.0
nim 1.nim 1541ms 6.9ms 152.3MB 1513ms 13ms nim/clang 2.2.0
nim 1.nim 1567ms 8.6ms 151.5MB 1530ms 23ms nim 2.2.0

Input: 250000_in

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 3.chpl 161ms 3.9ms 52.9MB 140ms 10ms chpl 1.31.0
nim 1.nim 162ms 0.2ms 17.6MB 143ms 3ms nim/clang 2.2.0
nim 1.nim 163ms 1.8ms 17.4MB 153ms 0ms nim 2.2.0

secp256k1

Input: 2000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1.chpl 1147ms 46ms 33.1MB 1137ms 0ms chpl 1.31.0

Input: 500

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1.chpl 297ms 3.2ms 33.0MB 287ms 3ms chpl 1.31.0

spectral-norm

Input: 8000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 1822ms 10.0ms 32.9MB 3613ms 0ms chpl 1.31.0
nim 1.nim 3383ms 20ms 1.6MB 3370ms 0ms nim/clang 2.2.0
nim 1.nim 3439ms 35ms 1.4MB 3430ms 0ms nim 2.2.0
chapel 1.chpl 3541ms 31ms 32.8MB 3530ms 3ms chpl 1.31.0

Input: 4000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 467ms 2.3ms 32.8MB 897ms 3ms chpl 1.31.0
nim 1.nim 848ms 4.3ms 1.5MB 837ms 0ms nim/clang 2.2.0
nim 1.nim 860ms 8.6ms 1.3MB 853ms 0ms nim 2.2.0
chapel 1.chpl 893ms 7.1ms 32.8MB 887ms 0ms chpl 1.31.0

Input: 2000

lang code time stddev peak-mem time(user) time(sys) compiler compiler/runtime
chapel 1-m.chpl 132ms 0.7ms 32.8MB 230ms 0ms chpl 1.31.0
nim 1.nim 214ms 1.3ms 1.3MB 203ms 0ms nim 2.2.0
nim 1.nim 217ms 5.1ms 1.5MB 207ms 0ms nim/clang 2.2.0
chapel 1.chpl 241ms 1.9ms 32.9MB 230ms 3ms chpl 1.31.0