A WebAssembly compiler that fits in a tweet

10 months ago (wasmgroundup.com)

Interesting how the obfuscated code is explained by slowly unobfuscating it step by step. This is the reverse of how obfuscated code is normally created: by starting with understandable code, and then slowly obfuscating it bit by bit (as I explained for this IOCCC submission [1]).

I say normally because one could also have a superoptimizer search for a minimal program that achieves some desired behaviour, and then one has to understand it by slowly unraveling the generated code.

[1] https://tromp.github.io/maze.html

I’ve used reverse Polish notation as an interview question many times. It works well because if someone’s never seen it you can learn a lot about their basic understanding of algorithms. But if they are aware of how easy it is you can extend it forever by adding symbols, improving the algo they build, or doing something like this.

  • You are a crazy sadic bastard

    • Such is the result of delving into languages such as Forth[0].

      Can you make programs with it smaller than assembly language? Sure.

      Will you come out the other side a mad hatter speaking of things such as words, dictionaries, and washing machine firmware? Well, I can only speak for myself...

      :-D

      0 - https://forth-standard.org/

> if you take the time to understand what this code does, you’ll learn a surprising amount about WebAssembly!

It's a shame the article mostly teaches about codegolf tricks, and the actual wasm info is left to a single commented code block.

Nonetheless an interesting article about JavaScript quirks though!

Really interesting. I considered having a go at a RPN to WASM converter when I was making https://c50.fingswotidun.com/

I ended up converting the RPN style notation into a JavaScript string and creating a new function, which lets the JIT sort it out.

https://c50.fingswotidun.com/show/?code=xy!2*!2y!*6%2Bo2%2Fv...

which has the code

    xy!2*!2y!*6+o2/vy#!*:Cy#*+z#d!;*:ze!xy*4s*43/*e+*+

becomes

    ((round(z) * ((v * (1 - round(y))) + (clamp((( ((x*(2**((2 * (1 - y)) + 6))) ^ ((1 - ((1 - y) * 2))*(2**((2 * (1 - y)) + 6)))) /(2**((2 * (1 - y)) + 6))) / 2)) * round(y)))) + ((1 - round(z)) * ((1 - smoothStep(z)) + smoothStep((((x * y) * sin(4)) * (4 / 3))))))

It would be interesting to see the performance difference from a wasm version, but in the end I found the human(ish) readable expression to be quite useful too.

Originally I created an interpreter for a code as a texture maker for code golfed javascripted games. https://github.com/Lerc/stackie

There's potential for a WASM implementation to be both smaller than the small version and Faster than the fast version.

post co-author here, let me know if you have any questions :)

WASM is cool; I've started implemented a CPU that runs unmodified WASM in Verilog, but I'm finding the feature creep on the instruction set (SIMD, GC) to take away from the initial values behind WASM (simple, small)

  • You can ignore SIMD and GC (for now). SIMD explodes the complexity level of Wasm, esp when there is WebGPU. I am curious how you are handling layout and how you are handling all the irregular sizes.

  • I don't think WASM's value was ever in a hardware instantiation of the actual instruction set.

    • Oh, I don't think so either, but if you think back to the asm.js times, there was a clear goal of "simple and higher perf", but now it's going in a direction for maximum compatibility with existing stacks (GC, WASI, etc) at "any" cost

This is really impressive. It is over 140 characters, but I guess "a tweet" can be any length now.

  • I have never used Twitter so I might be mistaken but I believe the limit has been 280 for a while now, which is why the first one at 269 bytes would also have fit.

    • Yeah it was changed to 280 for all users in 2017. That's still the default limit, but paying users can exceed it now.