diff options
author | Alon Zakai <alonzakai@gmail.com> | 2011-06-16 16:19:58 -0700 |
---|---|---|
committer | Alon Zakai <alonzakai@gmail.com> | 2011-06-16 16:19:58 -0700 |
commit | 4d0f6917baef98bcf4007d22e169af9f322e698f (patch) | |
tree | 25f7c4db85f220a64d5c79529f259f49d5048d28 /docs | |
parent | e9d279b362b2beb4a93a4713fd64537ffc057d96 (diff) |
add dlmalloc benchmark
Diffstat (limited to 'docs')
-rw-r--r-- | docs/paper.pdf | bin | 203771 -> 204696 bytes | |||
-rw-r--r-- | docs/paper.tex | 44 |
2 files changed, 26 insertions, 18 deletions
diff --git a/docs/paper.pdf b/docs/paper.pdf Binary files differindex 3b3382b2..9cc9f207 100644 --- a/docs/paper.pdf +++ b/docs/paper.pdf diff --git a/docs/paper.tex b/docs/paper.tex index 215b7f5a..8e3580c0 100644 --- a/docs/paper.tex +++ b/docs/paper.tex @@ -615,13 +615,15 @@ We will now take a look at some performance benchmarks: \bigskip -\begin{tabular}{ l | c | c | c } +\begin{tabular}{ l | c | c | c || c } \hline - \textbf{benchmark} & \textbf{JS} & \textbf{gcc} & \textbf{ratio} \\ - fannkuch (9) & 0.577 & 0.054 & 10.68 \\ - fasta (100,000) & 0.640 & 0.055 & 11.63 \\ - primes & 0.490 & 0.160 & 3.06 \\ - raytrace (5, 64) & 1.078 & 0.033 & 32.66 \\ + \textbf{benchmark} & \textbf{SM} & \textbf{V8} & \textbf{gcc} & \textbf{ratio} \\ + \hline + fannkuch (10) & 1.158 & \textbf{0.931} & 0.231 & 4.04 \\ + fasta (2100000) & \textbf{1.115} & 1.128 & 0.452 & 2.47 \\ + primes & \textbf{1.443} & 3.194 & 0.438 & 3.29 \\ + raytrace (7,256) & \textbf{1.930} & 2.944 & 0.228 & 8.47 \\ + dlmalloc (400,400) & 5.050 & \textbf{1.880} & 0.315 & 5.97 \\ \hline \end{tabular} @@ -634,23 +636,29 @@ can be found at \url{https://github.com/kripken/emscripten/tree/master/tests} embedded inside runner.py in the function test\_primes). The second column is the elapsed time (in seconds) when running the compiled code (generated using all Emscripten and LLVM optimizations as well as the Closure Compiler) in the SpiderMonkey JavaScript -engine (specifically, the JaegerMonkey branch, checked out April 2nd, 2011, running with \emph{-m -n -a}). -The third column is the elapsed time (in seconds) when running the original code compiled with \emph{gcc -O3}, -using GCC 4.4.4. The last column is the ratio, that is, how much slower the JavaScript code is -when compared to gcc. All the tests were run on a Lenovo N500 laptop with -an Intel T6400 Core 2 Duo CPU clocked at 1.2GHz, running on Ubuntu 10.10. - -Clearly the results greatly vary by the benchmark, with the generated JavaScript running from 3.06 to 32.66 times -slower. It appears that code that does simple numerical operations -- like +engine (specifically the JaegerMonkey branch, checked out June 15th, 2011). +The third column is the elapsed time when running the same JavaScript code in the V8 JavaScript engine +(checked out Jun 15th, 2011). In both the second and third column lower values +are better; the best of the two is in bold. +The fourth column is the elapsed time when running the original code compiled with \emph{gcc -O3}, +using GCC 4.4.4. The last column is the ratio, that is, how much slower the JavaScript code +(running in the faster of the two engines for that test) is +when compared to gcc. All the tests were run on a MacBook Pro with +an Intel i7 CPU clocked at 2.66GHz, running on Ubuntu 10.04. + +Clearly the results greatly vary by the benchmark, with the generated JavaScript running from 2.47 to 8.47 times +slower. There are also significant differences between the two JavaScript engines, with each better +at some of the benchmarks. +It appears that code that does simple numerical operations -- like the primes test -- can run fairly fast, while code that has a lot of memory accesses, for example due to using structures -- like the raytrace test -- -will be much slower. (The main issue with structures is that Emscripten does not +will be slower. (The main issue with structures is that Emscripten does not `nativize' them yet, as it does to simple local variables.) Code that has a mix of simple numerical operations and memory accesses -tends to run around 10 times slower than the most-optimized C++ code, -as we can see in the fannkuch and fasta benchmarks. While a factor of 10 -is a significant slowdown, it is still more than fast enough for +tends to run around 5-10 times slower than the most-optimized C++ code, +as we can see above. While this is a significant slowdown compared to +optimized native code, it is still more than fast enough for many purposes, and the main point of course is that the code can run anywhere the web can be accessed. Further work on Emscripten is expected to improve the speed as well, as are improvements to LLVM, the Closure |