Chapel vs. C++ benchmarks

Current benchmark data was generated on Mon Oct 03 2022. The full log can be found HERE.

CONTRIBUTIONS are WELCOME!

[x86_64][2 cores] Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz (Model 106)

* -m in a file name stands for multi-threading or multi-processing

* -i in a file name stands for direct intrinsics usage. (Use of SIMD intrinsics via libraries is not counted.)

* -ffi in a file name stands for non-stdlib FFI usage

* You may find time < time(user) + time(sys) for some non-parallelized programs. The extra CPU time comes from the GC or JIT compiler, which are allowed to use multiple cores because that is closer to real-world scenarios.
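The relationship between time, time(user) and time(sys) in the note above can be checked with a little arithmetic. The sketch below is only an illustration: the helper name `cpu_ratio` is hypothetical, and the sample figures are copied from rows in the tables on this page. It assumes that time is wall-clock time and that a ratio close to the core count (2 on this machine) means both cores were busy.

```python
def cpu_ratio(time_ms, user_ms, sys_ms):
    """Return (user + sys) / wall-clock time.

    A value above 1 means more CPU time was consumed than wall-clock
    time elapsed, i.e. more than one core was doing work.
    """
    return (user_ms + sys_ms) / time_ms


# (benchmark, code, time, time(user), time(sys)) in milliseconds,
# copied from the result tables on this page.
samples = [
    ("coro-prime-sieve 4000", "1-m.chpl", 3847, 7483, 30),
    ("spectral-norm 8000", "1-m.chpl", 2302, 4450, 3),
    ("spectral-norm 8000", "1.chpl", 4379, 4363, 7),
]

for bench, code, time_ms, user_ms, sys_ms in samples:
    ratio = cpu_ratio(time_ms, user_ms, sys_ms)
    print(f"{bench:24} {code:10} ratio = {ratio:.2f}")
```

The two -m programs come out a little under 2, consistent with both cores being used, while the single-threaded `1.chpl` stays at or just below 1.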

binarytrees

Input: 18

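| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 4.chpl | 2316ms | 24ms | 60.2MB | 2290ms | 13ms | chpl | 1.28.0 |
| chapel | 3.chpl | 2468ms | 4.1ms | 50.3MB | 2450ms | 10ms | chpl | 1.28.0 |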

Input: 15

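| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 4.chpl | 220ms | 0.8ms | 26.1MB | 207ms | 3ms | chpl | 1.28.0 |
| chapel | 3.chpl | 229ms | 1.1ms | 18.1MB | 217ms | 3ms | chpl | 1.28.0 |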

coro-prime-sieve

Input: 4000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 1-m.chpl | 3847ms | 71ms | 87.6MB | 7483ms | 30ms | chpl | 1.28.0 |

Input: 1000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 1-m.chpl | 259ms | 1.5ms | 32.2MB | 480ms | 0ms | chpl | 1.28.0 |

edigits

Input: 250001

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 1.chpl | 97ms | 1.7ms | 36.0MB | 83ms | 3ms | chpl | 1.28.0 |

Input: 100000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 1.chpl | 47ms | 1.1ms | 34.0MB | 37ms | 0ms | chpl | 1.28.0 |

fasta

Input: 2500000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 5-m.chpl | 147ms | 0.4ms | 32.1MB | 253ms | 3ms | chpl | 1.28.0 |

Input: 250000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 5.chpl | 33ms | 0.9ms | 32.1MB | 33ms | 0ms | chpl | 1.28.0 |

helloworld

Input: QwQ

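| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| cpp | 1.cpp | 2.4ms | 0.6ms | 1.1MB | 0ms | 0ms | clang++ | 11.0.0 |
| cpp | 1.cpp | 3.1ms | 2.0ms | 1.0MB | 0ms | 0ms | g++ | 12.2.0 |
| chapel | 1.chpl | 20ms | 3.3ms | 32.4MB | 10ms | 3ms | chpl | 1.28.0 |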

knucleotide

Input: 2500000_in

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 3-m.chpl | 965ms | 11ms | 96.7MB | 1783ms | 17ms | chpl | 1.28.0 |

Input: 250000_in

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 3-m.chpl | 167ms | 2.3ms | 87.2MB | 263ms | 17ms | chpl | 1.28.0 |

nbody

Input: 5000000

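| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| cpp | 0-i.cpp | 226ms | 1.0ms | 0.9MB | 220ms | 0ms | g++ | 12.2.0 |
| cpp | 0-i.cpp | 258ms | 1.2ms | 1.1MB | 247ms | 0ms | clang++ | 11.0.0 |
| cpp | 1.cpp | 339ms | 0.7ms | 1.0MB | 330ms | 0ms | g++ | 12.2.0 |
| chapel | 2.chpl | 377ms | 3.1ms | 32.4MB | 363ms | 0ms | chpl | 1.28.0 |
| cpp | 1.cpp | 384ms | 0.4ms | 1.1MB | 370ms | 0ms | clang++ | 11.0.0 |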

Input: 500000

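| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| cpp | 0-i.cpp | 26ms | 1.3ms | 1.0MB | 13ms | 0ms | g++ | 12.2.0 |
| cpp | 0-i.cpp | 29ms | 0.6ms | 1.1MB | 20ms | 0ms | clang++ | 11.0.0 |
| cpp | 1.cpp | 37ms | 0.7ms | 1.0MB | 30ms | 0ms | g++ | 12.2.0 |
| cpp | 1.cpp | 41ms | 0.2ms | 1.1MB | 30ms | 0ms | clang++ | 11.0.0 |
| chapel | 2.chpl | 58ms | 0.7ms | 32.4MB | 47ms | 3ms | chpl | 1.28.0 |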

pidigits

Input: 8000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 2.chpl | 594ms | 3.6ms | 34.1MB | 583ms | 3ms | chpl | 1.28.0 |

Input: 4000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 2.chpl | 152ms | 0.4ms | 34.1MB | 147ms | 0ms | chpl | 1.28.0 |

regex-redux

Input: 2500000_in

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 3.chpl | 1965ms | 33ms | 221.9MB | 1803ms | 147ms | chpl | 1.28.0 |

Input: 250000_in

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 3.chpl | 217ms | 2.1ms | 52.8MB | 183ms | 23ms | chpl | 1.28.0 |

secp256k1

Input: 2000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 1.chpl | 1533ms | 0.6ms | 32.3MB | 1520ms | 0ms | chpl | 1.28.0 |

Input: 500

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| chapel | 1.chpl | 400ms | 0.4ms | 32.3MB | 390ms | 3ms | chpl | 1.28.0 |

spectral-norm

Input: 8000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| cpp | 8-m.cpp | 1061ms | 1.3ms | 3.1MB | 2073ms | 0ms | clang++ | 11.0.0 |
| cpp | 7-m.cpp | 1061ms | 3.4ms | 3.2MB | 2073ms | 3ms | clang++ | 11.0.0 |
| cpp | 7-m.cpp | 1062ms | 2.6ms | 0.9MB | 2083ms | 0ms | g++ | 12.2.0 |
| cpp | 8-m.cpp | 1063ms | 0.8ms | 0.9MB | 2077ms | 0ms | g++ | 12.2.0 |
| cpp | 6-im.cpp | 2243ms | 24ms | 3.2MB | 4400ms | 0ms | clang++ | 11.0.0 |
| cpp | 6-im.cpp | 2243ms | 21ms | 1.0MB | 4400ms | 0ms | g++ | 12.2.0 |
| chapel | 1-m.chpl | 2302ms | 99ms | 32.3MB | 4450ms | 3ms | chpl | 1.28.0 |
| chapel | 1.chpl | 4379ms | 1.9ms | 32.3MB | 4363ms | 7ms | chpl | 1.28.0 |

Input: 4000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| cpp | 8-m.cpp | 276ms | 1.4ms | 3.2MB | 517ms | 0ms | clang++ | 11.0.0 |
| cpp | 7-m.cpp | 279ms | 4.6ms | 0.9MB | 523ms | 0ms | g++ | 12.2.0 |
| cpp | 8-m.cpp | 279ms | 3.6ms | 0.9MB | 520ms | 0ms | g++ | 12.2.0 |
| cpp | 7-m.cpp | 279ms | 5.3ms | 3.2MB | 520ms | 0ms | clang++ | 11.0.0 |
| cpp | 6-im.cpp | 567ms | 1.3ms | 1.0MB | 1100ms | 0ms | g++ | 12.2.0 |
| cpp | 6-im.cpp | 568ms | 1.5ms | 3.2MB | 1097ms | 0ms | clang++ | 11.0.0 |
| chapel | 1-m.chpl | 581ms | 1.0ms | 32.3MB | 1110ms | 0ms | chpl | 1.28.0 |
| chapel | 1.chpl | 1113ms | 1.7ms | 32.3MB | 1100ms | 0ms | chpl | 1.28.0 |

Input: 2000

| lang | code | time | stddev | peak-mem | time(user) | time(sys) | compiler | compiler/runtime |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| cpp | 8-m.cpp | 76ms | 1.1ms | 3.2MB | 130ms | 0ms | clang++ | 11.0.0 |
| cpp | 7-m.cpp | 76ms | 0.8ms | 1.0MB | 130ms | 0ms | g++ | 12.2.0 |
| cpp | 7-m.cpp | 76ms | 2.1ms | 3.2MB | 130ms | 0ms | clang++ | 11.0.0 |
| cpp | 8-m.cpp | 77ms | 1.9ms | 1.0MB | 130ms | 0ms | g++ | 12.2.0 |
| cpp | 6-im.cpp | 150ms | 0.2ms | 3.2MB | 277ms | 0ms | clang++ | 11.0.0 |
| cpp | 6-im.cpp | 151ms | 1.4ms | 1.0MB | 280ms | 0ms | g++ | 12.2.0 |
| chapel | 1-m.chpl | 166ms | 1.7ms | 32.3MB | 290ms | 0ms | chpl | 1.28.0 |
| chapel | 1.chpl | 296ms | 1.4ms | 32.3MB | 287ms | 3ms | chpl | 1.28.0 |