Nimony (Nim 3.0) Design Principles

4 days ago (nim-lang.org)

> "Modern" languages try to avoid exceptions by using sum types and pattern matching plus lots of sugar to make this bearable. I personally dislike both exceptions and its emulation via sum types. ... I personally prefer to make the error state part of the objects: Streams can be in an error state, floats can be NaN and integers should be low(int) if they are invalid.

Special values like NaN are half-assed sum types. The latter give you compiler guarantees.
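
To illustrate the contrast, a minimal sketch using `Option` from Nim's stdlib as the sum type (the example and names are mine, not the article's):

    import std/[math, options]

    # With a sum type the invalid case is a separate shape the caller must
    # unwrap, instead of a poisoned value that keeps flowing through math.
    proc safeSqrt(x: float): Option[float] =
      if x >= 0: some(sqrt(x)) else: none(float)

    let r = safeSqrt(-1.0)
    if r.isSome:
      echo r.get * 2.0      # the value is only reachable via the Some branch
    else:
      echo "no real root"   # a NaN here would have kept computing silently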

  • I’d like to see their argument for it. I see no benefit in pushing NaN through a code path as if it were a number, corrupting every operation it is part of, and the same is true for the other sentinel values.

    • The reason NaN exists is performance, AFAIK; e.g. on a GPU you can't really have exceptions. You don't want to be constantly checking "did this individual floating-point op produce an error?" It's easier and faster for the floating-point unit to flag the output as a NaN. Obviously NaNs long predate GPUs, but floating-point support has been hardware accelerated in a variety of ways for a long time.

      That being said, I agree that the way NaNs propagate is messy. You can end up finding out that there was an error only much later in the program's execution, and then it can be tricky to track down where it came from.
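
      Concretely, a tiny sketch of that messiness (default float settings; nothing Nim-specific about it):

          var zero = 0.0
          let a = zero / zero      # NaN produced here, no exception raised
          let b = a * 100.0 + 7.0  # every downstream op silently stays NaN
          echo b                   # nan
          echo b == b              # false: NaN's only local tell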

    • There is no direct argument/guidance that I saw for "when to use them", but masked arrays (https://numpy.org/doc/stable/reference/maskedarray.html), an alternative to sentinels in array-processing sub-languages, have been in NumPy (following its antecedents) from its start. I'm guessing you could do a code search for its imports and find arguments pro & con in various places surrounding that.

      From memory, I have heard "infecting all downstream" as both "a feature" and "a problem". Experience with numpy programs did lead to sentinels in the https://github.com/c-blake/nio Nim package, though.

      Another way to try to investigate popularity here is to see how much code uses signaling NaN vs. quiet NaN and/or arguments pro/con those things / floating point exceptions in general.

      I imagine all of it comes down to questions of how locally code can/should be forced to confront problems, much like arguments about try/except/catch kinds of exception handling systems vs. other alternatives. In the age of SIMD there can be performance angles to these questions, and essentially "batching factors" for error handling that relate to all the other batching factors going on.

      Today's version of this wiki page also includes a discussion of integer NaN: https://en.wikipedia.org/wiki/NaN . It notes that the R language uses the minimum signed 32-bit integer value (0x80000000) for NA.

      There is also the whole database NULL question: https://en.wikipedia.org/wiki/Null_(SQL)

      To be clear, I am not taking some specific position, but I think all these topics inform answers to your question. I think it's something with trade-offs that people have a tendency to over-simplify based on a limited view.
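
      To make the R convention above concrete, a minimal sketch of low-int NA in Nim (assuming 32-bit ints as R uses; `isNA` is my name, not a standard API):

          const NA = low(int32)    # 0x80000000, the bit pattern of R's NA
          proc isNA(x: int32): bool = x == NA

          let xs = @[1'i32, NA, 3'i32]
          for x in xs:
            echo (if isNA(x): "NA" else: $x)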

      2 replies →

  • The compiler can still enforce checks, such as with nil checks for pointers.

    In my opinion it’s overall cleaner if the compiler enforces this when it can. Something like “ensure variable is initialized” can just be another compiler check.

    Combine that with an effects system that lets you control which errors you do or don't enforce checking for. Nim has a nice `forbids: IOError` that lets users do that.
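
    For reference, a minimal sketch of the compile-time exception tracking Nim already does via the `raises` pragma (the `forbids` form mentioned above is the blocklist counterpart):

        import std/strutils

        # The compiler verifies this proc can raise ValueError and nothing
        # else; adding `raise newException(IOError, ...)` would not compile.
        proc parsePositive(s: string): int {.raises: [ValueError].} =
          result = parseInt(s)   # parseInt is declared to raise ValueError
          if result <= 0:
            raise newException(ValueError, "expected a positive integer")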

    • Both of these things are, respectively, just pattern matching and monads, only not user-definable ones.

    • > The compiler can still enforce checks, such as with nil checks for pointers.

      Only sometimes, when the compiler happens to be able to understand the code fully enough. With sum types it can be enforced all the time, and bypassed when the programmer explicitly wants it to be.

      1 reply →

The biggest thing I still don’t like about Nim is its imports:

    import std/errorcodes

    proc p(x: int) {.raises.} =
      if x < 0:
        raise ErrorCode.RangeError  # ErrorCode comes from the import, yet is never named by it
      echo x  # stand-in for "use x"

I can’t stand that there’s no direct connection between the thing you import and the names that wind up in your namespace.

  • There is a direct connection, you just don't have to bother with typing it. Same as type inference: the types are still there, you just don't have to spell them out. If two declarations collide on a name, the compiler requires you to specify which one you meant. And with language inspection tools (like LSP or other editor integrations) you can easily figure out where something comes from if you need to. Most of the time, though, I find it fairly obvious when programming in Nim where something comes from; in your example it's trivial to see that the error code comes from the errorcodes module.

    Oh, and as someone else pointed out you can also just `from std/errorcodes import nil` and then you _have_ to specify where things come from.
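
    For example, with `std/strutils` as a stand-in module:

        from std/strutils import nil   # bring in the module name only

        echo strutils.toUpperAscii("nim")   # must be fully qualified
        # echo toUpperAscii("nim")          # error: undeclared identifier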

    • When I was learning Nim and learned how imports work, that things stringify with a `$` proc that comes along with their types (since everything is splat imported), and that `$` is massively overloaded, I went "oh, that all makes sense and works together". The LSP can help figure it out. It still feels like it's in bad taste.

      It's similar to how Ruby (which also has "unstructured" imports) and Python are similar in a lot of ways yet make many opposite choices. I think a lot of Ruby's choices are "wrong" even though they fit together within the language.

  • It needs to be this way so that UFCS works properly. Imagine if instead of `"a,b".split(',')`, you had to write `"a,b".(strutils.split)(',')`.
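
    That is, with a plain import both spellings resolve to the same proc:

        import std/strutils

        echo split("a,b", ',')   # @["a", "b"]
        echo "a,b".split(',')    # the identical call, method-style via UFCS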

    • OK, I do not understand.

      What is preventing this `import std/errorcodes`

      from allowing me to use `raise errorcodes.RangeError` instead of what Nim has?

      Or even: why not `import std/ErrorCodes`, keeping the plural in `ErrorCodes.RangeError`? I wouldn't mind that.

      1 reply →

  • Nim imports are great. I would hate to qualify everything; it feels so bureaucratic when going back to other languages. They never cause me issues and are largely transparent. Best feature.

  • You are free to `import nil` and type the fully qualified name.

    • There are many things to like about Nim, but it does benefit from adherence to a style guide more than most languages.

Big "college freshman" energy in this take:

  I personally prefer to make the error state part of the objects: Streams can be in an error state, floats can be NaN and integers should be low(int) if they are invalid (low(int) is a pointless value anyway as it has no positive equivalent).

It's fine to pick sentinel values for errors in context, but describing 0x80000000 as "pointless" in general with such a weak justification doesn't inspire confidence.
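
To be fair, the asymmetry itself is real; a small runnable check (the negation is left commented out so the snippet doesn't abort):

    var m = low(int)
    echo m            # -9223372036854775808 on 64-bit targets
    echo high(int)    #  9223372036854775807: one smaller in magnitude
    # echo -m         # OverflowDefect under Nim's default overflow checks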

  • Without the low int, the even/odd invariant falls apart under wraparound; I've definitely seen algorithms that rely on that.

    I would agree that whether error values are in or out of band is pretty context dependent, such as whether you answered a homework question wrong or your dog ate it. One of those is not a condition that can be graded.
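
    Concretely, the parity invariant through the wrap (overflow checks disabled so the addition wraps instead of raising):

        {.push overflowChecks: off.}
        var x = high(int)   # ...807, odd
        echo(x and 1)       # 1
        x = x + 1           # wraps around to low(int)
        echo(x and 1)       # 0: parity keeps alternating through the wrap
        {.pop.}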

  • I have been burned by sentinel values every time. Give me sum types instead. And while I’m piling on, this example makes no sense to me:

        proc fib[T: Fibable](a: T): T =
          if a <= 2:
            result = 1
          else:
            result = fib(a-1) + fib(a-2)
    

    Integer is the only possible type for T in this implementation, so what was the point of defining Fibable?

    • I agree about sentinel values. Just return an error value.

      I think the fib example is actually cool though. Integers are not the only possible domain. Everything that supports <=, +, and - is. Could be int, float, a vector/matrix, or even some weird custom type (provided that Nim has operator overloading, which it does).

      May not make much sense to use anything other than int in this case, but it is just a toy example. I like the idea in general.
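
      The article doesn't show Fibable's definition, but one plausible spelling as a Nim concept, which indeed admits more than ints (the concept body is my guess, not the article's):

          type
            Fibable = concept x
              (x <= 2) is bool    # comparable against small literals
              x + x is type(x)    # closed under addition
              x - 1 is type(x)    # and under subtracting literals

          proc fib[T: Fibable](a: T): T =
            if a <= 2: result = 1
            else: result = fib(a - 1) + fib(a - 2)

          echo fib(10)     # 55
          echo fib(10.0)   # 55.0: floats qualify too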

      5 replies →

    • There can be a lot of different integer types: int16, int32, ... and unsigned variants. Even huge bignum integers of arbitrary length.

From my interaction with the Nim community, I came to the conclusion that Nim could be more popular if its founder devolved decision-making to scale up the community. I think he likes it the way it is: small, but his. He is Torvaldsesque in his social interactions.

  • I feel the same way, as I suspect a lot of people here do. Nim posts are always upvoted and usually people say nice things about the language in the comments... but there are few who claim to actually -use- the language for more than a small private project, if even that.

    • The only way to really test out a programming language is by trying it out or reading how someone else approached a problem that you're interested in/know about.

      There are over 2200 Nimble packages now. Maybe not an eye-popping number, but there's still a good chance that somewhere in the JSON at https://github.com/nim-lang/packages you will find something interesting. There is also RosettaCode.org, which has a lot of Nim example code.

      This, of course, does not speak to the main point of this subthread about the founder but just to some "side ideas".

  • I worked in Nim for a little bit, and it truly has a lot of potential, but I ultimately abandoned it for the same reason. It's never going to grow beyond the founder's playground.

> floats can be NaN and integers should be low(int) if they are invalid (low(int) is a pointless value anyway as it has no positive equivalent).

I have long thought that we need a NaI ("not an integer") value for our signed ints. Ideally, the CPU would have overflow-aware instructions, similar to those for floats, that return this value on overflow and cost the same as wrapping addition/multiplication/etc.

  • From an implementation point of view, it would be similar to NaN: a designated sentinel value that all the arithmetic operations are made aware of, with special rules around producing and consuming it.
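
    A software sketch of those rules (all names here are hypothetical; the real payoff would be hardware doing this at the cost of a wrapping add):

        const NaI = low(int)   # the reserved not-an-integer value

        proc `+!`(a, b: int): int =
          ## Addition that yields NaI on overflow and propagates NaI inputs.
          if a == NaI or b == NaI: return NaI
          try:
            result = a + b            # default builds raise on overflow
          except OverflowDefect:
            result = NaI

        echo 1 +! 2             # 3
        echo high(int) +! 1     # NaI, printed as low(int)
        echo (3 +! NaI) +! 5    # NaI propagates like a quiet NaN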

> It is not possible to say which exceptions are possible

So it repeats the same mistake Spring made by using runtime exceptions everywhere.

Now you can never know exactly how a function can fail, which means you are flying completely blind.

> WCET ("worst case execution time") is an important consideration: Operations should take a fixed amount of time and the produced machine code should be predictable.

Good luck. Give the avionics guys a call if you solve this at the language level.