Learning to read Arthur Whitney's C to become smart (2024)

5 days ago (needleful.net)

166 comments

gudzpoz

> They're all macros to make common operations more compact

I read the J Incunabulum before encountering this, and the point that stands out is that you don't start by jumping into the middle of it like many programmers who are familiar with C will do; the macros defined at the beginning will confuse you otherwise. They also build upon previous ones, so the code ends up climbing the "abstraction ladder" very quickly. I personally like the Iterate macro (i), for how it compresses a relatively verbose loop into a single character; and of course in an array language, the iteration is entirely implicit.

In other words, I believe the reason this code is hard to read for many who are used to more "normal" C styles is because of its density; in just a few dozen lines, it creates many abstractions and uses them immediately, something which would otherwise be many many pages long in a more normal style. Thus if you try to "skim read" it, you are taking in a significantly higher amount of complexity than usual. It needs to be read one character at a time.
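
To make the density point concrete, here is a minimal sketch of an iterate-style macro next to the loop it replaces. `ITER` and `n_` are hypothetical names chosen for this illustration; the article's own macro is spelled `i` and differs in detail.

```c
/* Sketch only: an "iterate" macro in the spirit of the article's i(n,e).
   Runs expression e for i = 0 .. n-1. ITER and n_ are hypothetical names. */
#define ITER(n, e) { int n_ = (n); for (int i = 0; i < n_; ++i) { e; } }

/* Conventional spelling of a summing loop. */
int sum_plain(const int *xs, int len) {
    int total = 0;
    for (int i = 0; i < len; ++i) {
        total += xs[i];
    }
    return total;
}

/* Same loop through the macro. The index i is never declared at the call
   site; it comes from the macro, which is exactly the implicit-name trick. */
int sum_macro(const int *xs, int len) {
    int total = 0;
    ITER(len, total += xs[i]);
    return total;
}
```

Both functions compute the same result; the macro version only reads naturally once you have memorized that `i` is supplied by `ITER`.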

As someone who has spent considerable time working with huge codebases composed of hundreds of tiny files that have barely any substance to them, where trying to find where things happen becomes an exercise in search, this extreme compactness feels very refreshing.

aragonite 4 days ago
>In other words, I believe the reason this code is hard to read for many who are used to more "normal" C styles is because of its density; in just a few dozen lines, it creates many abstractions and uses them immediately, something which would otherwise be many many pages long in a more normal style.
I also spent some time with the Incunabulum and came away with a slightly different conclusion. I only really grokked it after going through and renaming the variables to colorful emojis (https://imgur.com/F27ZNfk). That made me think that, in addition to informational density, a big part of the initial difficulty is orthographic. IMO two features of our current programming culture make this coding style hard to read: (1) Most modern languages discourage or forbid symbol/emoji characters in identifiers, even though their highly distinctive shapes would make this kind of code much more readable, just as they do in mathematical notation (there's a reason APL looked the way it did!). (2) When it comes to color, most editors default to "syntax highlighting" (each different syntactic category gets a different color), whereas what's often most helpful (esp. here) is token-based highlighting, where each distinct identifier (generally) gets its own color (This was pioneered afaik by Sublime Text which calls it "hashed syntax highlighting" and is sometimes called "semantic highlighting" though that term was later co-opted by VSCode to mean something quite different.) Once I renamed the identifiers so it becomes easier to recognize them at a glance by shape and/or color the whole thing became much easier to follow.
- geocar 4 days ago
  
  I've experimented a few times with coloring my variables explicitly (using a prefix like R for red, hiding the letters, etc) after playing with colorforth. I agree getting color helps with small shapes, but I think the colors shouldn't be arbitrary: every character Arthur types is a choice about how the code should look, what he is going to need, and what he needs to see at the same time, and it seems like a missed opportunity to turn an important decision about what something is named (or colored) over to a random number generator.
- pif 4 days ago
  
  > (1) Most modern languages discourage or forbid symbol/emoji characters in identifiers
  > (2) When it comes to color,
  Call me boomer if you wish, but if you can't grasp the value of having your code readable on a 24 rows by 80 columns, black and white screen, you are not a software developer. You are not even a programmer: at most, you are a prompt typist for ChatGPT.
  
bruce343434 4 days ago
When you zoom out on Google Maps, usually the street names and business pins disappear. This code style is like having a fully informationally dense zoomed out map. Being too zoomed in is definitely frustrating, and reminds me of early attempts at "mobile" web pages. But I'm not sure that this condensed code style is a good general solution either; it certainly seems overwhelming, like a Where's Waldo puzzle.
I prefer having a consistent information density regardless of zoom level. When I would like to see more details, I would like to achieve that by zooming in. When I want the overview, I'd prefer to zoom out and have certain details omitted from the map.
Having a clear and by-convention code organization is one way to achieve this. Then to drill into the details, just navigate the project directory to the right file.
For rabbit-hole-style hunting, following references/symbol usage/definition is ideal, which is enabled by modern IDEs.
K and family are made for data analysis. Data analyses are relatively simple software projects that don't have a very wide scope. I think this dense style of programming falls apart when you consider the breadth of requirements of typical modern application software.
- fifilura 4 days ago
  
  I have definitely seen many 'typical modern applications' where the business logic can be summarized into 100 lines of code. The rest is just shoveling things around.
  
1vuio0pswjnm7 4 days ago
"... trying to find where things happen becomes an exercise in search."
It seems like developers actually prefer large codebases and having to use recursive search through multiple layers of sub directories
I prefer no sub directories and being able to just grep against *.[ch] with no recursion
I think projects in Java languages are the worst when it comes to having to search through numerous small verbose files having barely any substance to them. IDEs probably make this easier but I don't use one
Years ago I read that Whitney's "IDE" is something like the Windows console and Notepad. He has said in interviews that he wants all the code to fit on a single page
- bruce343434 4 days ago
  
  What about `grep -R .`?
rbanffy 4 days ago

Adding some comments would have been advisable, but I guess he doesn’t need comments.
APL taught me the importance of comments - if I didn’t comment my code thoroughly I would forget how it works and what it did as soon as I moved away from the keyboard. It is a cruel language.

electroly 5 days ago

The way to understand Arthur Whitney's C code is to first learn APL (or, more appropriately, one of his languages in the family). If you skip that part, it'll just look like a weirdo C convention, when really he's trying to write C as if it were APL. The most obvious of the typographic stylings--the lack of spaces, single-character names, and functions on a single line--are how he writes APL too. This is perhaps like being a Pascal programmer coming to C and indignantly starting with "#define begin {" and so forth, except that atw is not a mere mortal like us.
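
For readers who have not seen the Pascal-in-C trick, a rough sketch follows. Only `begin` comes from the comment above; `end` and the `factorial` function are hypothetical additions for illustration.

```c
/* Sketch only: dressing C up as Pascal with two macros. */
#define begin {
#define end   }

/* Ordinary C underneath; the preprocessor swaps the keywords for braces. */
int factorial(int n)
begin
    int acc = 1;
    while (n > 1)
    begin
        acc *= n;
        n -= 1;
    end
    return acc;
end
```

The compiler never sees `begin` or `end`, which is the same mechanism Whitney's macros rely on, just aimed at a different source language.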

xelxebar 4 days ago
> The way to understand Arthur Whitney's C code is to first learn APL
This is the main insight in my breakdown of the J Incunabulum:
https://blog.wilsonb.com/posts/2025-06-06-readable-code-is-u...
When I first encountered it years ago, the thing was impenetrable, but after learning APL to a high level, it now reads like a simple, direct expression of intent. The code even clearly communicates design tradeoffs and the intended focus of experimentation. Or more on the nose, to me the code ends up feeling primarily like extremely readable communication of ideas between like-minded humans. This is a very rare thing in software development in my experience.
IMHO, ideas around "readable code" and "good practices" in software development these days optimize for large, high-turnover teams working on large codebases. Statistically speaking, network effects mean that these are the codebases and developer experiences we are most likely to hear about. However, as an industry, I think we are relatively blind to alternatives. We don't have sufficient shared language and cognitive tooling to understand how to optimize software dev for small, expert teams.
- vanderZwan 4 days ago
  
  Thanks for that breakdown, that does make it a lot more understandable.
  > DO defines our basic loop operation, so iterations will probably all naïvely be O(1);
  Shouldn't that be "naïvely be O(n)"?
mlochbaum 5 days ago
It looks like a weirdo C convention to APLers too though. Whitney writes K that way, but single-line functions in particular aren't used a lot in production APL, and weren't even possible before dfns were introduced (the classic "tradfn" always starts with a header line). All the stuff like macros with implicit variable names, type punning, and ternary operators just doesn't exist in APL. And what APL's actually about, arithmetic and other primitives that act on whole immutable arrays, is not part of the style at all!
- electroly 5 days ago
  
  "the typographic stylings ... are how he writes" is what I said, isn't it? :) Well said.
  
maximilianburke 5 days ago

>This is perhaps like being a Pascal programmer coming to C and indignantly starting with "#define begin {" and so forth
Ah, like Stephen Bourne
raddan 5 days ago

My first thought was "oh, this just looks like a functional language" but my next thought was "with the added benefit of relying on the horrors of the C preprocessor."
brudgers 5 days ago
Would learning J work instead?
It’s probably more accessible than APL since its symbols can be found on conventional keyboards.
- thechao 5 days ago
  
  Every time I read about APL, I'm reminded of Lev Grossman's "The Magicians" — I'm always imagining some keyboard with just a little bit more than two dimensions; and, with sufficient capabilities, I could stretch to hit the meta-keys that let me type APL directly on my modified split MTGAP keyboard.
arboles 5 days ago
We know, the beginning of the article tells us his C code is APL-inspired. So many comments that just summarize the article on a surface level.
- jacquesm 5 days ago
  
  Yes, but... even if you know that it is APL inspired, that does not change the fact that this is not how you want to write C.
  The C pre-processor is probably one of the most abused pieces of the C toolchain and I've had to clean up more than once after a 'clever' programmer left the premises and their colleagues had no idea of what they were looking at. Just don't. Keep it simple, and comment your intent, not what the code does. Use descriptive names. Avoid globally scoped data and functions with side effects.
  That doesn't look smart and it won't make you look smart, but it is smart because the stuff you build will be reliable, predictable and maintainable.
  
- electroly 5 days ago
  
  The beginning of the article talks about not learning APL--specifically mentions that he's not here to talk about APL--and proceeds into a wide-eyed dissection of the C without mentioning APL syntax again. It also doesn't, literally, say that the C is like APL; it says Arthur is an APL guy who writes weird C code. Another comment disagrees that this is APL style at all--which is it?? I think you could have given me more credit than this. I read the article and participated as best I could. I'm always happy to bump APL related articles so they get more visibility.
  

svat 5 days ago

IMO this is a really good blog post, whatever you think of the coding style. Great effort by the author, really good for eight hours' work (as mentioned), and some illuminating conclusions: https://needleful.net/blog/2024/01/arthur_whitney.html#:~:te...

kolektiv 4 days ago

I was curious about Shakti after reading this and the comments, so followed the link to shakti.com on Wikipedia. It seems it now redirects to the k.nyc domain, which displays a single letter 'k'.

I wondered if I was missing something, so looked at the source, to find the following:

  <div style='font-family:monospace'>k

Nothing but that. Which is, surely, the HTML equivalent of the Whitney C style: relying on the compiler/interpreter to add anything implicit, and shaving off every element that isn't required, such as a closing tag (which, yes, only matters if you're going to want something else afterwards, I guess...). Bravo.

rbanffy 4 days ago

You shouldn’t have seen that. By now the cleaners must have gotten to you and erased your memory of these events.
w4yai 4 days ago
could have been `<pre>k`
- geocar 4 days ago
  
  <tt>k
- kolektiv 3 days ago
  
  Ooh, perhaps, but maybe not semantically identical? Visually probably the same though, devilish!
FrequentLurker 4 days ago
k
- robotresearcher 4 days ago
  
  Won’t usually be monospaced type.

MisterTea 5 days ago

Reminds me of Bourne's attempt at beating C into Algol: https://www.tuhs.org/cgi-bin/utree.pl?file=V7/usr/src/cmd/sh...

Example: https://www.tuhs.org/cgi-bin/utree.pl?file=V7/usr/src/cmd/sh...

Awesomedonut 4 days ago

Link is dead :(

epolanski 5 days ago

There are best or accepted practices in every field.

And in every field they work well for the average case, but are rarely the best fit for that specific scenario. And in some rare scenarios, doing the opposite is the solution that fits best the individual/team/project.

The interesting takeaway here is that crowd wisdom should be given weight and probably defaulted to if we want to turn off our brains. But if you turn on your brain you will unavoidably see the many cracks that those solutions bring for your specific problem.

Pannoniae 5 days ago

That's why I hate them being called "best" practices. No, they aren't the best practices, they are the mediocre practices. Sometimes, that's a good thing (you don't want to have the really bad results!), but if you aim for the very best practices, all of them will hold you back. It's basically a tradeoff, sacrificing efficiency / good performance in exchange for maintainability, consistency and reliability.
WhitneyLand 5 days ago
Having a solid product that solves a problem well can be orthogonal to how well a codebase lends itself to readability, learning curve, and efficiently ramping up new developers on a project.
Just because you succeed at one says nothing about other practical and important metrics.
- epolanski 5 days ago
  
  I don't think you're reading this correctly.
  The proper way to read it is to understand the problem and its pros and cons.
  Without going long in the speculation, the situation likely was: there's only one guy who really can deliver this because of his knowledge, cv and experience and we need it.
  And at that point your choice is having a solution or not.
  

taeric 5 days ago

Kudos on not just taking a combative stance on the code!

This was a very fun read that I'm fairly convinced I will have to come back to.

internet_points 4 days ago

TIL `a ?: b`, that's actually pretty nice, a bit like Haskell's `fromMaybe b a` (or `a <|> b` if b can also be "empty")

and I do like `#define _(e...) ({e;})` – that's one where I feel the short macro name is OK. But I'd like it better if that were just how C worked from the get-go.

Very nice discussion at the end of the article. There are good things to be learnt from this code and its discussions even if you disagree with some or even most of the style.
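
For anyone who has not met the `({ ... })` form behind that `_` macro: it is a GNU statement expression, a block whose last expression becomes its value. A minimal sketch, assuming GCC or Clang in their default GNU mode; `SQUARE_ONCE`, `next_value` and `square_of_next` are hypothetical names for illustration only.

```c
/* Sketch only: needs GNU extensions (named variadic macro parameters and
   statement expressions), so a strictly conforming compiler may reject it. */
#define _(e...) ({e;})

/* Hypothetical helper: squares its argument while evaluating it only once.
   Expands to ({int t_ = (a); t_ * t_;}), whose value is t_ * t_. */
#define SQUARE_ONCE(a) _(int t_ = (a); t_ * t_)

/* Has a side effect, so evaluating it twice would be visible. */
int next_value(int *counter) { return ++*counter; }

int square_of_next(int *counter) {
    return SQUARE_ONCE(next_value(counter));
}
```

A naive `#define SQUARE(a) ((a)*(a))` would call `next_value` twice here; the statement expression gives the macro a local variable to avoid that.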

susam 4 days ago

Yes, '?:' is also known as the Elvis operator [1][2]. I sometimes use it in other languages such as Groovy. But I don't use it in C because this happens to be a GCC extension [3][4] and I've often had to compile my C projects with compilers that do not support GCC extensions. The C standard [5] defines the conditional operator as:

  conditional-expression:
    logical-OR-expression
    logical-OR-expression ? expression : conditional-expression

So per the C standard there must be an expression between '?' and ':' and an expression cannot be empty text. To confirm this we need to check the grammar for expression, which unfortunately is a little tedious to verify manually due to its deeply nested nature. Here it is:

  expression:
    assignment-expression
    expression , assignment-expression

  assignment-expression:
    conditional-expression
    unary-expression assignment-operator assignment-expression

  unary-expression:
    postfix-expression
    ++ unary-expression
    -- unary-expression
    unary-operator cast-expression
    sizeof unary-expression
    sizeof ( type-name )
    alignof ( type-name )

  assignment-operator: one of
    = *= /= %= += -= <<= >>= &= ^= |=

  ... and so on ...

The recursion goes many more levels deep, but the gist is that whichever branch the parser takes, it expects the expression to have at least one symbol per the grammar. Perhaps an easier way to confirm this is to just have the compiler warn us about it. For example:

  $ cat foo.c
  int main(void) {
      if () {}
  }

  $ clang -std=c17 -pedantic -Wall -Wextra foo.c && ./a.out
  foo.c:2:9: error: expected expression
      2 |     if () {}
        |         ^
  1 error generated.

Or more explicitly:

  $ cat bar.c
  #include <stdio.h>
  int main(void) {
      printf("%d\n", 0 ?: 99);
      printf("%d\n", 1 ?: 99);
  }

  $ clang -std=c17 bar.c && ./a.out
  99
  1

  $ clang -std=c17 -pedantic bar.c && ./a.out
  bar.c:3:23: warning: use of GNU ?: conditional expression extension, omitting middle operand [-Wgnu-conditional-omitted-operand]
      3 |     printf("%d\n", 0 ?: 99);
        |                       ^
  bar.c:4:23: warning: use of GNU ?: conditional expression extension, omitting middle operand [-Wgnu-conditional-omitted-operand]
      4 |     printf("%d\n", 1 ?: 99);
        |                       ^
  2 warnings generated.
  99
  1

[1] https://kotlinlang.org/docs/null-safety.html#elvis-operator

[2] https://groovy-lang.org/operators.html#_elvis_operator

[3] https://gcc.gnu.org/onlinedocs/gcc/Syntax-Extensions.html

[4] https://gcc.gnu.org/onlinedocs/gcc/Conditionals.html

[5] https://www.open-std.org/jtc1/sc22/wg14/www/docs/n3299.pdf

shawn_w 5 days ago

Much as a Real Programmer can write FORTRAN programs in any language, Whitney can write APL programs in any language.

sebstefan 5 days ago

```
#define _(e...) ({e;})
#define x(a,e...) _(s x=a;e)
#define $(a,b) if(a)b;else
#define i(n,e) {int $n=n;int i=0;for(;i<$n;++i){e;}}
```

>These are all pretty straight forward, with one subtle caveat I only realized from the annotated code. They're all macros to make common operations more compact: wrapping an expression in a block, defining a variable x and using it, conditional statements, and running an expression n times.

This is war crime territory

maldev 5 days ago
Some of these are wrong to. You can encounter issues with #define
#define $(a,b) if(a)b;else
due to not having brackets. So it's just extremely lazy to.
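
One way the missing braces can bite, as a guess at what the comment above has in mind (the article does not demonstrate this). The sketch uses the same macro shape under the hypothetical name `IF_THEN`, plus a braced variant `IF_THEN_SAFE` for comparison.

```c
/* Same shape as the quoted `$` macro, under a portable hypothetical name. */
#define IF_THEN(a, b) if (a) b; else

/* Intent: if outer is false, return -1; otherwise return 1 only when inner
   is true. Actual expansion: if (outer) if (inner) r = 1; else r = -1;
   The else binds to the nearest if, which is the inner one. */
int classify(int outer, int inner) {
    int r = 0;
    IF_THEN(outer, if (inner) r = 1) r = -1;
    return r;
}

/* Braced variant: the else can only pair with the macro's own if. */
#define IF_THEN_SAFE(a, b) if (a) { b; } else

int classify_safe(int outer, int inner) {
    int r = 0;
    IF_THEN_SAFE(outer, if (inner) r = 1) r = -1;
    return r;
}
```

With `outer` false, `classify` leaves `r` at 0 while `classify_safe` returns -1 as intended. Compilers typically flag the first version with a dangling-else warning.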
- jacquesm 4 days ago
  
  This should not be downvoted, this sort of error is indeed a very easy one to make when dealing with the C pre-processor.
  > Some of these are wrong to[o] <- that needs an extra 'o'
  > due to not having brackets. <- that one is fine
  > So it's just extremely lazy to[o]. <- that needs an extra 'o' too
  'to' comes in two versions, 'too' and 'to'; the two have different meanings.
  

uvaursi 5 days ago

This is a good use of macros. I understand people are frightened by how it looks but it’s just C in a terse, declarative style. It’s mostly straightforward, just dense and yes - will challenge you because of various obscure macro styles used.

I believe “oo” is probably an infinity error condition or some such not 100% sure. I didn’t see the author discuss it since they said it’s not used. Was probably used during development as a debug printout.

saulpw 5 days ago
I agree, some of the macros are very useful, and I've found myself wanting DO(n, code) as a simpler for-loop construct. In my own code, when I have some dozens of small things (like opcodes or forth words or APL operators), I specifically do want a "one-liner" syntax for most of them. The individual elements are usually so small that it's distasteful to spend 10 lines of code on them, and especially because the real understanding lies in the 'space between', so I want to see a large subset of the elements at once, and not put code-blinders on to focus on one element at a time.
- uvaursi 4 days ago
  
  In reading many C code bases, including the Linux kernel, every one finds a use case for macros of this nature.
sebstefan 4 days ago

From the article
>These are all pretty straight forward, [...] wrapping an expression in a block, defining a variable x and using it, conditional statements, and running an expression n times.
Making your reader learn some ad-hoc shorthands you wrote to avoid declaring blocks, defining variables or writing conditions in my book is very impolite
Style doesn't need to be innovative.
procaryote 5 days ago

> This is a good use of macros.
no.

romperstomper 5 days ago

Is this supposed to be a specific coding style or paradigm?

I’ve never seen code written like this in real-world projects — maybe except for things like the "business card ray tracer". When I checked out Arthur Whitney’s Wikipedia page I noticed he also made the J programming language (which is open source) and the code there has that same super-dense style https://github.com/jsoftware/jsource/blob/master/jsrc/j.c

susam 4 days ago
> Is this supposed to be a specific coding style or paradigm?
This is indeed Whitney's distinctive coding style, well known for its use in his various array programming language interpreters. His coding style is famously minimalist and idiosyncratic often fitting entire implementations of interpreters in a few pages.
This has been discussed a number of times on HN. I have collected some of the interesting comments on this topic from previous threads here in this meta comment: https://news.ycombinator.com/item?id=45800777#45805346
- romperstomper 4 days ago
  
  Thanks! I can't imagine how to code in this style everyday, tbh :)
  
jacquesm 5 days ago
> I’ve never seen code written like this in real-world projects
Lucky you. I've seen far worse (at least this is somewhat consistent). But this isn't C anymore, it is a new language built on top of C and then a program written in that language. C is merely the first stage compilation target.
- robotresearcher 4 days ago
  
  IIRC, C++ started out this way, or at least its precursor ‘C with classes’. A compiler came later.
- taneq 4 days ago
  
  And everyone says you can't do DSLs in boring old languages. :P
  
rcxdude 5 days ago

It's similar to J and that family of languages (K is another). Those are inspired by APL, which also has this super compact nature but in addition it largely uses non-ascii symbols. Apparently it is something you can get used to and notionally has some advantages (extreme density means you can see 'more' of the program on a given page, for example, and you need fewer layers of abstraction).
tom_ 5 days ago

Possibly related(ish): video about co-dfns, prompted by a previous HN thread (links in video summary), not written in C but put together in a similarly dense style: https://www.youtube.com/watch?v=gcUWTa16Jc0
leoc 5 days ago
I believe it’s usually referred to as ‘OCC’. ;)
- romperstomper 4 days ago
  
  Could you elaborate? :) I found OrangeC Compiler but I'm not sure this is the OCC you've mentioned.
  

FilosofumRex 4 days ago

All great industrial apps are DSLs for specific domains, because oftentimes end users are much smarter & craftier than developers. Some great examples:
- AutoCad (vector drawing DSL on top of Lisp)
- Mathematica (symbolic algebra DSL - Lisp & C)
- Aspen One (Thermodynamics/Chemistry DSL on FORTRAN)
- COMSOL (Multiphysics DSL C++)
- Verilog (FPGA design DSL C)

and also general purpose tools like Regex, XLA, CERN/Root, SQL, HTML/CSS,...

piazz 5 days ago

I can’t explain why but “He’s assigning 128 to a string called Q” made me absolutely lose it.

arlyle 5 days ago

ksimple is eight bit. 128 is the unsigned middle or one plus signed max. usually using it for null or error signal. on sixty-four-bit k implementations it would be two to the sixty three.
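
The arithmetic behind that remark can be spelled out in a few lines. This is only a sketch of the numbers involved; the function names are hypothetical, and whether ksimple uses 128 exactly this way is the commenter's claim, not something checked here.

```c
#include <stdint.h>

/* One past the largest signed 8-bit value: 127 + 1 = 128 (bit pattern 0x80). */
unsigned sentinel8(void) { return (unsigned)INT8_MAX + 1u; }

/* The 64-bit analogue: INT64_MAX + 1 = 2^63, computed in unsigned arithmetic
   so that nothing overflows. */
uint64_t sentinel64(void) { return (uint64_t)INT64_MAX + 1u; }

/* 128 is also the midpoint of the unsigned 8-bit range 0..255, i.e. the
   value with only the top bit set. */
int is_top_bit_only8(uint8_t v) { return v == (uint8_t)(1u << 7); }
```

In both widths the sentinel is the one bit pattern that is unrepresentable as a positive signed value, which makes it a convenient marker for null or error.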

8474_s 4 days ago

The macros are fine as a concept; I've used something similar before for reducing code size, e.g. defining hundreds of similar functions and stuff. What is incomprehensible and puts the entire thing into "Obfuscated C" territory is the one-letter variables. You'll need to memorize all of them and can't reuse them in normal code. If at least the variables were self-descriptive I'd support such a coding style, but it clearly needs comments.
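
A small sketch of the "define many similar functions with a macro" use mentioned above, with descriptive names instead of single letters. `BINOP` and the generated function names are hypothetical and not taken from the article.

```c
/* Sketch only: each use of BINOP stamps out one small two-argument function. */
#define BINOP(name, op) int name(int a, int b) { return a op b; }

BINOP(add, +)    /* int add(int a, int b)  { return a + b; } */
BINOP(sub, -)    /* int sub(int a, int b)  { return a - b; } */
BINOP(mul, *)    /* int mul(int a, int b)  { return a * b; } */
BINOP(band, &)   /* int band(int a, int b) { return a & b; } */
```

Each line stays a one-liner, so a whole table of operations fits on screen, but the generated names remain searchable with ordinary tools.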

voidhorse 4 days ago

Nice write up!

When I see stuff like this, personally, I don't try to understand it, as code like this emerges from basically three motivations:

- The other person wanted to write in some other more (functional|object oriented|stack) language but couldn't, so they did this.

- The person couldn't be bothered to learn idioms for the target language and didn't care about others being able to read the program.

- The person intentionally wanted to obfuscate the program.

And none of these are good reasons to write code in a particular way. Code is about communication. Code like this is the equivalent to saying "I know the grammatical convention in English is subject-verb-object but I feel like speaking verb-object-subject and other people will have to just deal with it"—which, obviously, is a horrible way to communicate if you actually want to share ideas/get your point across.

That all said, the desire to have logic expressed more compactly and declaratively definitely resonates. Unfortunately C style verbosity and impurity remains dominant.

holografix 4 days ago

“would you rather spend 10 days reading 100,000 lines of code, or 4 days reading 1000?"

More like 10 days understanding 100K loc or 30 days stabbing yourself in the eye over 4K loc

blibble 5 days ago

there's the java version too

https://github.com/KxSystems/javakdb/blob/8a263abee29de582cd...

svat 4 days ago

It's interesting to compare that version from 2017 with the current version from 2025: https://github.com/KxSystems/javakdb/blob/9a94dc5af9288fe845... — the current one is over ten times as long in terms of number of lines, and has copious comments, but still has the short names and dense code.
andylynch 5 days ago

People here might not notice - your link is the official client interface for talking to KDB+ processes from Java.
There's a decent chance your broker (or their dealers) are using stuff built on this.

cozzyd 5 days ago

The C preprocessor allows you to define a limited DSL on top of C. This is... sometimes a good thing, and often convenient, even if it makes it hard to understand.

gitonthescene 4 days ago
I think _all_ programming is about finding an appropriate DSL for the problem at hand. First you need to understand the “language” of the problem then you develop a “lingo”.
- rbanffy 4 days ago
  
  Exactly. You build the language up to the problem. When you do that, the program almost writes itself.
jacquesm 4 days ago
For extremely small values of 'sometimes' where sometimes is constrained by the following expressions evaluating to 'true':
- you have no interest in maintaining your code
- your code will never be maintained by someone else
- you know your C preprocessor better than you know your C compiler
- your favorite language isn't available for this particular target
- you don't mind object level debugging
- your idea of a fun time is to spend a few hours per day memorizing code
- you really are smarter than everybody else
- cozzyd 4 days ago
  
  It depends how far down the rabbit hole you go.
  Something like JCON (originally from mongodb, see https://blogs.gnome.org/chergert/2016/10/21/jcon/) is actually pretty nice IMO.

JSR_FDED 4 days ago

It’s cool that you can do this in C! And it’s cool that this article explores that.

As developers we have to decide where and when this makes sense, just like with other language features, libraries, architectural patterns, etc.

ryao 4 days ago
It uses multiple non-standard extensions to C. A strictly standards conformant compiler would refuse to compile it.
- JSR_FDED 4 days ago
  
  That might be an excellent reason not to use some of these capabilities. And maybe in a different situation it would make sense to use the mechanisms provided. Programmer’s responsibility to decide what’s appropriate in each case, that’s all I’m saying.

m463 5 days ago

This reminds me of when I was learning perl.

At first, I thought it looked like line noise. $var on the left of the = sign? Constructs like $_ and @_? more obscure constructs were worse.

But I had to keep going and then one day something happened. It was like one of those 3d stereograms where your eyes have to cross or uncross. The line noise became idioms and I just started becoming fluent in perl.

I liked some of it too - stuff like "unless foo" being a more readable/human way of saying if not foo.

perl became beautiful to me - it was the language I thought in, and at the highest level. I could take an idea in my mind and express it in perl.

But I had some limits. I would restrain myself on putting entire loops or nested expression on one line just to "save space".

I used regular expressions, but sometimes would match multiple times instead of all in one giant unreadable "efficient" expression.

and then, I looked at other people's perl. GAH! I guess other people can "express themselves in perl", but rarely was it beautiful or kind, it was statistically worse and closer to vomit.

I like python now. more sanity, (somewhat) more likely that different people will solve a problem with similar and/or readable code.

by the way, very powerful article (even if I intensely dislike the code)

rramadass 4 days ago

Nice. Previous attempts by other users to decode Whitney's style of C programming can be found here - https://news.ycombinator.com/item?id=32202742

munchler 5 days ago

The person who wrote this code might be a genius, but learning to read it isn’t going to make anyone smart. It’s basically obfuscated assembly code.

rbanffy 4 days ago

For the APL fans (or haters) Unicomp makes keycaps with APL symbols for their (excellent) Model M mechanical keyboards.

kazinator 5 days ago

You will not become smart, only crazy and unemployable. :)

Keyframe 5 days ago
Or an unrealized IOCCC champion Whitney seems to aspire to.
- IncreasePosts 5 days ago
  
  Whitney would never submit his code because it is trivially understandable and not obfuscated?
gitonthescene 4 days ago

Are you saying most employers are smart by default??

russellbeattie 5 days ago

> "Opinions on his coding style are divided, though general consensus seems to be that it's incomprehensible."

I wholeheartedly concur with popular opinion. It's like writing a program in obfuscated code.

Hmmm... his way of basically making C work like APL made me wonder: Is there a programming language out there that defines its own syntax in some sort of header and then uses that syntax for the actual code?

IncreasePosts 5 days ago

In racket, you can say something like "#lang X", which can modify the reader and let you create your own arbitrary syntax
fifticon 5 days ago

forth and lisp?

bluedino 5 days ago

Reminds me of a Python codebase I used to work with

The company was originally a bunch of Access/VB6 programmers.

Then they wrote their VB code in PHP.

And then they wrote their PHP code in Python. It was disgusting.

realo 5 days ago

Ah yes... very tempting to ask an AI to refactor some large Java program (pick your language) "in the style of Arthur Whitney".

wvbdmp 5 days ago
I asked ChatGPT to explain the code from the OP (without the header file), and it seems to have given a really good breakdown. Although I know nothing about interpreters, C, or this fucked style, so who really knows if it makes any sense at all…
- ebcode 4 days ago
  
  The header file does most of the work. I submitted the output of gcc -E (preprocessor only) to ChatGPT: https://chatgpt.com/share/69093ba2-ae74-8006-abbb-5c7f24be23... -- and I found out about "tagged pointers".
  https://en.wikipedia.org/wiki/Tagged_pointer
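
For anyone else new to the term, here is a rough sketch of one common tagged-pointer scheme. It is an assumption for illustration, not a claim about how the code in the article does it: the type `word` and all helper names are hypothetical, and it relies on real pointers to `int`-aligned data having a zero low bit.

```c
#include <stdint.h>

/* Hypothetical scheme: a word is either a small unsigned integer (low bit 1)
   or a pointer to aligned data (low bit 0). */
typedef uintptr_t word;

static word      box_int(uintptr_t v) { return (v << 1) | 1u; }   /* tag with low bit 1 */
static int       is_int(word w)       { return (w & 1u) != 0; }   /* check the tag bit */
static uintptr_t unbox_int(word w)    { return w >> 1; }          /* drop the tag bit */

static word      box_ptr(void *p)     { return (uintptr_t)p; }    /* aligned, so low bit is 0 */
static void     *unbox_ptr(word w)    { return (void *)w; }
```

The point is that one machine word can carry either kind of value, and a single bit test tells them apart without a separate type field.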

avadodin 4 days ago

This man casually codes up IOCCC entries.

The code registers a bit like FORTH in concept.

jandrese 5 days ago

> His languages take significantly after APL, which was a very popular language for similar applications before the invention of (qwerty) keyboards.

Ok, so this article is tongue in cheek. Good to know that up front.

internet_points 4 days ago

https://en.wikipedia.org/wiki/APL_(programming_language)#Har...
you would add a special "typeball" into your IBM Selectric Typewriter. Some pics:
https://www.duxburysystems.org/downloads/library/texas/apple...
https://pierce.smugmug.com/Misc/APL-Typeball/i-pjq6hWC

igleria 5 days ago

Holy moly, this must be the equivalent of reading the Necronomicon and getting cosmic madness disease as a result.

What a flex of patience!

readthenotes1 5 days ago

During code reviews I would always ask for clear code because it's much harder to tell whether it's correct if it's unclear.

I've got too much other stuff to do to decode the Voynich manuscript...

richhhh 5 days ago

Kernighan’s law seems to apply:

Everyone knows that debugging is twice as hard as writing a program in the first place. So if you’re as clever as you can be when you write it, how will you ever debug it?

pragma_x 5 days ago
Agreed. Although it's also a bit worse than that for coding exclusively with macros. You have to add an extra degree of complexity for any additional code generator you add to your toolchain, when that path comes into play for debugging. Since we whole-buffalo'ed this situation, that's 100% of the code you could possibly need to debug.
- jacquesm 4 days ago
  
  Yes, precisely, that's when all that cleverness will come back to bite you hard.
  "Which line was that again? Oh... "
  Picks up the phone, dials.
  "Honey, I won't be home in time for dinner."
  

jacquesm 5 days ago

As a very long time C programmer: don't try to be smart. The more you rely on fancy preprocessor tricks the harder it will be to understand and debug your code.

The C preprocessor gives you enough power to shoot yourself in the foot, repeatedly, with anything from small caliber handguns to nuclear weapons. You may well end up losing control over your project entirely.

One nice example: glusterfs. There are a couple of macros in use there that, when they work are magic. But when they don't you lose days, sometimes weeks. This is not the way to solve coding problems, you only appear smart as long as you remember what you've built. Your other self, three years down the road is going to want to kill the present one, and the same goes for your colleagues a few weeks from now.
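
A small, generic illustration of the kind of trap I mean (a made-up example, not taken from glusterfs; `SQUARE`, `next_value`, and `square_of_next` are hypothetical names). The macro looks like a function call, but its argument is pasted in twice:

```c
/* Hypothetical macro: reads like a function, but the argument text is duplicated. */
#define SQUARE(x) ((x) * (x))

static int calls = 0;
static int next_value(void) { return ++calls; }   /* side effect on every call */

static int square_of_next(void) {
    /* Expands to ((next_value()) * (next_value())): two calls, not one. */
    return SQUARE(next_value());
}
```

After one call to `square_of_next()`, `calls` is 2 and the result is 1 * 2 = 2 rather than the square of anything. Nothing at the call site hints at that.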

5- 5 days ago

> as long as you remember what you've built
yes! like any craft, this works only if you keep practising it.
various implementations of k, written in this style (with iterative improvements), have been in constant development for decades getting very good use out of these macros.
gitonthescene 4 days ago
Losing control of a project is likely more due to the programmers on it than the tools they use. IMHO _anything_ done consistently can be reasoned about and if necessary undone.
- jacquesm 4 days ago
  
  Not necessarily. Sometimes the rot goes so deep that there is really no way out.
  And the C pre-processor has figured prominently in more than one such case in my career. And it was precisely in the kind of way that is described in TFA.
  For something to be doable it needs to make economic sense as well and that's the problem with nightmare trickery like this. Initially it seems like a shortcut, but in the long run the price tag keeps going up.
  
- gitonthescene 3 days ago
  
  Just to double down here I took a code base written in this style (not exactly atw but inspired by him) and spent about a day expanding it to this point: https://codeberg.org/growler/k/src/branch/expand/a.c My guess is it would only take a week to get it to what people here are calling “acceptable”.
ryao 4 days ago

ZFS has a very nice set of macros that work very well:
https://github.com/openzfs/zfs/blob/master/include/os/freebs...
See P2PHASE() and friends. They were inherited from OpenSolaris.
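As I understand them, these are power-of-two alignment helpers. A sketch of the usual bit tricks, using locally defined macros with hypothetical `MY_` names (not copied from the ZFS header) and assuming `align` is always a power of two:

```c
#include <stdint.h>

/* Hypothetical local macros (not copied from any header).
   `align` must be a power of two, so (align - 1) is a mask of the low bits. */
#define MY_P2ALIGN(x, align)   ((x) & ~((align) - 1))                    /* round x down */
#define MY_P2PHASE(x, align)   ((x) & ((align) - 1))                     /* offset inside the block */
#define MY_P2ROUNDUP(x, align) (((x) + ((align) - 1)) & ~((align) - 1))  /* round x up */

static uint64_t round_down(uint64_t x, uint64_t a) { return MY_P2ALIGN(x, a); }
static uint64_t phase(uint64_t x, uint64_t a)      { return MY_P2PHASE(x, a); }
static uint64_t round_up(uint64_t x, uint64_t a)   { return MY_P2ROUNDUP(x, a); }
```

With a 0x100-byte block, 0x1234 rounds down to 0x1200, has phase 0x34, and rounds up to 0x1300.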
switchbak 5 days ago
Seems to me that this is now exponentially true with AI coding assistants. If you don't understand what you're adding, and you're being clever - you can quickly end up in a situation where you can't reason effectively about your system.
I'm seeing this on multiple fronts, and it's quickly becoming an unsustainable situation in some areas. I expect I'm not alone in this regard.
- gitonthescene 4 days ago
  
  I’d bet that a lot of the work done with AI assistants is decidedly _not_ clever.

Jean-Papoulos 4 days ago

This style is inherently worse because there's no spaces. My brain has been wired since 4 years old to read words, not letters. Words are separated by spaces. Havingnospacesbetweenwordsmakesthemexponentiallyhardertoreadandcomprehend.

t-3 4 days ago

Your spaceless sentence is quite easy to read though. It's not even 10% harder to read. Humans are quite good at recognizing patterns.

susam 4 days ago

HN stories about Whitney's code tend to predictably attract a lot of comments about the coding style, so I thought I'd share a couple of positive discussions from previous related posts.

Here's one from one of my favourite HN commenters posted at https://news.ycombinator.com/item?id=40544283#40545004 (Jun 2024):

"Whitney is famous for writing code like this, it's been his coding style for decades. For example, he wrote an early J interpreter this way in 1989. There's also a buddy allocator he wrote at Morgan Stanley that's only about 10 lines of C code." -- papercrane

tempodox 4 days ago

The same article is available under “I read Arthur Whitney's code and all I got was Mental Illness”, which is apt.

This parades all the reasons why you may want to avoid C like the plague, and then some. This stuff gives me nightmares.

Joel_Mckay 4 days ago

Obfuscation is usually just a lack of accountability, and naive job security through avoiding peer-review.

Practically speaking, if people can't understand you, then why are you even on the team? Some problems can't be solved alone even if you live to be 116 years old.

Also, folks could start dropping code in single instruction obfuscated C for the lols =3

https://github.com/xoreaxeaxeax/movfuscator

qayxc 4 days ago
Whitney has valid reasons to write code this way. If you look at his career, you'll understand how this is not a problem - he literally spent decades working on "one-page" programs written that way. It's not "for the lols", it's simply what he's been comfortable with for 50+ years.
He's a software developer from a different era, when individual programmers wrote tiny (by today's standard) programs that powered entire industries. So for what he's been doing his entire career, neither lack of accountability, job security, or working with teams are really applicable.
- Joel_Mckay 4 days ago
  
  > He's a software developer from a different era
  Ivory tower politics is never an excuse, and failure to adapt to the shop standards usually means your position ends. Inflicting a goofy meta-circular interpreter on people is a liability.
  Anyone competent would normally revert that nonsense in about 30 seconds, as it looks like a compressed/generated underhanded payload. "Trust me bro" is also not a valid excuse. =3
  https://en.wikipedia.org/wiki/Conways_Law
  

sanskarix 4 days ago

the obsession with code elegance vs shipping velocity is telling here. Whitney's style works for him because he's building tools he'll maintain himself for decades. same product, same developer, same context.

most startups are in the opposite situation. you need three different engineers to understand what you built last quarter because two people quit and one went to a different team. your clever abstractions become technical debt when the person who made them isn't around to explain them.

here's the real question: are you optimizing for the code or the business? sometimes boring, verbose, googleable patterns beat clever compression because your constraint isn't keystrokes - it's hiring, onboarding, and velocity when half your team is new. that's startup reality.

ontouchstart 4 days ago

I’m wondering how, with LLMs in the loop, the languages for solving complex problems will evolve in the long run.
Perhaps I will start playing with this macro-style ladder of abstraction with the help of an LLM, something like literate programming with an AI agent. Computers are much better at parsing than we are, so we can stand on the highest rung of the ladder.

qmr 5 days ago

[flagged]

dang 4 days ago
"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."
https://news.ycombinator.com/newsguidelines.html
- qmr 1 day ago
  
  My comment is not a shallow dismissal.

netbioserror 5 days ago

This code style is psychotic. I had to reverse-engineer and verify a C codebase that was machine-obfuscated and it was still clearer to follow than this. Increasing clarity through naming is great, but balancing information density is, dare I say, also a desirable goal. Compacting code rapidly diminishes returns once you're relying on a language having insignificant whitespace.

qwertytyyuu 4 days ago

I don't think writing code like that will make the average programming team any faster. Unless you are really deep into the code and have a good mental model of how the symbols are structured, I think it's going to take longer, with the constant need to refer back and re-work out what a symbol means. I'd rather have the descriptive variable names. What he writes looks akin to minified JS to me.