Comment by omoikane

1 year ago

Luon looks mostly like Oberon and not so much like Lua, it's not obvious which of the Lua features it incorporated. It didn't seem to have coroutines, for example.

But I am glad that it went with Oberon's 0-based array indices, as opposed to Lua's 1-based table indices.

https://github.com/rochus-keller/Luon/blob/master/specificat...

> Luon looks mostly like Oberon and not so much like Lua

Luon can indeed look similar to Oberon if you use upper-case keywords and semicolons, but there is not need for this. Both - Lua and Luon - have much in common with Modula-2 (given lower-case keywords). There are many elements in Luon which are pretty similar to Lua, e.g. constructors, pcall, most control and loop statements. But there are also significant differences of course, because Luon is a statically typed language and Lua isn't.

  • Modula-2 – a blast from my past. Back in the ‘80s I used it to implement a prototype distributed OS (as designed by a team at JPL) on VAXen. Logitech (!) had a really nice VAX compiler back then. One of the most productive 6-month stretches of my career. Luon looks like an answer to my unspoken prayers. Thank you!

Don't all of Wirth's languages let you use any ordinal as the array index? At leat in Pascal you can declare an integer subtype, say 7..17 and use that as the type of the index of an array. Then the first element is item 7.

The point being that both the start at 0 and start at 1 camps can have it their own way.

  • > Don't all of Wirth's languages let you use any ordinal as the array index?

    In Oberon, Wirth kicked out everything to the bare minimum, including subrange types. Array indices in Oberon start with 0.

  • >Don't all of Wirth's languages let you use any ordinal as the array index? At leat in Pascal you can declare an integer subtype, say 7..17 and use that as the type of the index of an array. Then the first element is item 7.

    Can confirm that about Pascal, since I had used it a lot earlier.

    Don't know about the other Wirth languages.

    >The point being that both the start at 0 and start at 1 camps can have it their own way.

    Yes, but that is not the only point. Another reason, and maybe the more important one, is that having such custom array index ranges, can more naturally fit the problem domain. In fact you can even use user defined types for the ranges, e.g. so you can define an array with Sunday to Saturday as the indices and the values of (the equivalent of) an enum representing 1) weekdays and 2) weekends, as the corresponding values.

    Then your code involving days and weekdays and weekends, will read more naturally, so will be easier to both read and maintain. And you only have to do those custom definitions once, up front, so it is not much extra work for the benefit gained.

I have a really hard time understanding why people like 0 based indexes. They are a relic of C style arrays that are based on and interchangeable with pointers which use offsets that are naturally 0 based. Use in later languages gives us endless off-by-1 issues and rise to "for 0 to count/len/num - 1" or even better range syntax that is start inclusive BUT end exclusive. It is a horrible cludge just to support 1970s language perfomace optimization. Arrays should start and end at whatever start index is required, not at offset 0 of pointer to fist element of array.

  • Hang on. Off by one issues are the argument frequently given in favour of zero-based indices, not the other way around. For example, let's iterate through items placing them in 3 different groups;

    JS:

        for (let i = 0; i < items.length; i++) {
            groups[i % 3].push(items[i]);
        }
    

    Lua:

        for i = 1, #items do
            table.insert(groups[((i - 1) % 3) + 1], items[i])
        end
    

    Don't get me wrong. I like Lua, I've made my own IDE for it, https://plugins.jetbrains.com/plugin/14698-luanalysis, but this is definitely not an argument in favour of 1-based indices.

    • Off by one issues are also an argument given in favour of no indexing.

          groups=new Array(3).fill([])
          items.reduce(function(a,x,y){y=a.shift();y.push(x);a.push(y);return a},groups)
      

      Array languages typically have a reshaping operator so that you can just do something like:

          groups:3 0N#items
      

      Does that seem so strange? 0N is just null. numpy has ...reshape([3,-1]) which wouldn't be so bad in a hypothetical numjs or numlu; I think null is better, so surely this would be nice:

          groups = table.reshape(items,{3,nil})   -- numlu?
          groups = items.reshape([3,null])        // numjs?
      

      Such a function could hide an ugly iteration if it were performant to do so. No reason for the programmer to see it every day. Working at rank is better.

      On the other hand, Erlang is also 1-based, and there's no numerl I know of, so I might write:

          f(N,Items) -> f_(erlang:make_tuple(N,[]),Items,0,N).
          f_(Groups,[],_,_N) -> Groups;
          f_(G,Items,0,N) -> f_(G,Items,N,N);
          f_(G,[X|XS],I,N) -> f_(setelement(I,G,X),XS,I-1,N).
      

      I don't think that's too bad either, and it seems straightforward to translate to lua. Working backwards maybe makes the 1-based indexing a little more natural.

          n = 0
          for i = 1,#items do
            if n < 1 then n = #groups end
            table.insert(groups[n],items[i])
            n = n - 1
          end
      

      Does that seem right? I don't program in lua very much these days, but the ugly thing to me is the for-loop and how much typing it is (a complaint I also have about Erlang), not the one-based nature of the index I have in exactly one place in the program.

      The cool thing about one-based indexes is that 0 meaningfully represents the position before the first element or not-an-element. If you use zero-based indexes, you're forced to either use -1 which precludes its use for referring to the end of the list, or null which isn't great for complicated reasons. There are other mathematical reasons for preferring 1-based indexes, but I don't think they're as cool as that.

      1 reply →

    • Your second example subtracts and adds 1 nearly arbitrarily, which wouldn't be needed if the convention of the 0-index wasn't so widespread.

      6 replies →

  • Having done fairly extensive parsing work in Lua and Julia on the one hand (one-based), and Python, Javascript, and Zig on the other (zero-based), the zero-based semiopen standard makes intervals dramatically easier to calculate and work with. It's really the semiopen intervals which make this the case, but as the Word of Dijkstra makes clear, zero-basis comes along for the ride, to combine semiopen intervals with a one-basis is perverse.

    Naturally it's true that for collections and naïve indexing, 1-based is more natural. But those are rare places for bugs to occur, while interval calculations are a frequent place for them to occur.

    Clearly I'm far from allergic to the other standard, but I come down on the side of the zero basis for that reason.

  • I have a really hard time understanding why people like 1-based indexes! 0 is the smallest unsigned integer in every programming language I know of that supports the concept of unsigned integer. Why shouldn’t an array at the smallest possible index correspond to the beginning of the array?

    It’s also very natural to think of arr[i] as “i steps past the beginning of arr”. With one-based indexing arr[i] has no natural interpretation that I know of. It’s “i-1 (for some reason) steps past the beginning of arr”. The only reason I can think of to prefer that extra -1 in your formula is just because human languages (at least the ones I know of) work this way — the 42nd element of a sequence, in normal colloquial English, means the one 41 steps past the beginning. But I’m not sure if there is any logical justification for that.

    I also, despite being American, find the convention used in many countries of numbering building floors starting with zero to be more logical. I’m on the third floor, how many stories up did I travel to get here? Three.

    • > Why shouldn’t an array at the smallest possible index correspond to the beginning of the array?

      Because then there is no good way to refer to the index before that point: You are stuck using -1 (which means you can't use it to refer to the end of the array), or null (which isn't great either).

      > every programming language I know of that supports the concept of unsigned integer

      Surely you know Python which uses a signed integer as an index into their arrays: list[-1] is the last element of a list. If they only used one-based indexing then list[1] would be the first and that would be nicely symmetrical. It would also mean that list[i-1] would NEVER refer to a value after ‹i› eliminating a whole class of bugs.

      > It’s also very natural to think of arr[i] as “i steps past the beginning of arr.”

      I think it's more natural to think of arr[i] as “the ‹i›th element of arr” because it doesn't require explaining what a step is or what the beginning is.

      The exact value of ‹i› matters very little until you try to manipulate it: Starting array indexes at one and using signed indexes instead of unsigned means less manipulation overall.

      > find the convention used in many countries of numbering building floors starting with zero to be more logical

      In Europe, we typically mark the ground-floor as floor-zero, but there are often floors below it just as there are often floors above it, so the floors might be numbered "from" -2 for example in a building with two below-ground floors. None of this has anything to do with arrays, it's just using things like "LG" or "B" for "lower ground" or "basement" don't translate very well to the many different languages used in Europe.

      The software in the elevator absolutely doesn't "start" its array of sense-switches in the middle (at zero).

      6 replies →

    • In India too, the floor at the ground level is called the ground floor (probably that is where the name came from), the one above it is called the first floor, and so on. The convention is probably from British colonial times.

      Also LED floor numbers in lifts (elevators) in India start from 0 for the ground floor, as do the buttons that you press to go to specific floors.

      Also, Ground Zero.

      https://en.m.wikipedia.org/wiki/World_Trade_Center_site

      https://en.m.wikipedia.org/wiki/Hypocenter#

    • > I also, despite being American, find the convention used in many countries of numbering building floors starting with zero to be more logical. I’m on the third floor, how many stories up did I travel to get here? Three.

      Alternatively the ground floor is the first floor because it’s the first floor you arrived at when you entered the building.

      The same point of view applies to 1-based indexing.

      That said I prefer 0-based in programming and 1-based in buildings.

      2 replies →

    • > find the convention used in many countries of numbering building floors starting with zero to be more logical.

      Ukrainian here. Multi-floor buildings always have at least one stair section to first floor due to need of thermal basement isolation. (I guess this is not pertaining to Western Europe due to more clement winters.) And, yep, it is called "first" floor. Using zero number is rare but possible (in this case it is called "tsokolny" floor) if a real "basement floor" is present, but in this case still 1-based numbering is preferred.

    • I'd argue that 1-based indexing is the "natural interpretation". Mathematics is inherently 1-based, and it isn't surprising that languages designed to do mathematics like R, Matlab, Mathematica, Julia all do 1-based arrays because that makes modeling paper mathematics in programs easier.

      2 replies →

  • > [0-based indexes] are a relic of C style arrays

    I don't think this is true. They exist in other disciplines (maths for instance) that have no relationship with C or other programming languages from the 1970s.

    > for 0 to count/len/num - 1

    I will counter saying that such a for...to syntax is a relic of BASIC.

    > or even better range syntax that is start inclusive BUT end exclusive

    I know that your "better" is sarcastic, but I actually find left-inclusive+right-exclusive ranges fantastic. They allow perfect partitioning, easy calculation of lenght, etc.

    > Arrays should start and end at whatever start index is required

    I agree. An accommodating language would let you define both lower and upper bounds of an array, instead of its size.

  • There are 360 degrees in a circle, and the first entry is 0 degrees. The first time element of a day is 0:00:00(and enough 0s to satisfy whatever resolution you require). These were not established in the 1970s, and somehow pretty much everyone understands and works quite well with these systems.

    • > There are 360 degrees in a circle, and the first entry is 0 degrees.

      To be pedantic, "first" is associated with 1. And a circle does not have a "first" entry, whatever you mean by entry. I think what you're trying to say is that a circle is a continuous arc going from 0 to 360 degrees, but you should recognize that the "starting point" is arbitrary, any point will do, so there isn't really a "first", and that this is not the same as counting because counting is done with natural numbers, which are non-continuous. The problem of 0 VS 1 makes sense only in counting exactly because it's subjective whether you prefer to count from 0 or from 1. Because zero is the absence of anything, I find it hard to start counting from 0 (when you do, your "first" item is actually your zeroth item, and the next item would be the "first"??!), to be honest, despite being completely familiar with doing so since I've used 0-index programming languages my whole life.

      5 replies →

    • Also, how many years old are you when you're born? Zero. (at least in mainstream Western culture).

  • Some countries consider the 1st floor to be the ground floor, others consider the 1st floor to be the floor above the ground floor, which the formerly mentioned countries consider the 2nd floor… I think 0/1-based indexing is more subjective than simply being a “relic of C” or a “horrible kludge” :P

    • I've been in the US for over a decade and it still occasionally makes me double-take when a room numbered 1xx is on the ground floor

  • Here’s the ultimate authority on why computer languages should count from zero:

    <https://www.cs.utexas.edu/users/EWD/ewd08xx/EWD831.PDF>

    • I find that argument to be written in a terse "mathy" style that makes it a bit hard to follow. So let me try to restate it in more concrete "programmy" terms.

      To iterate over an array with "len" elements, it’s most elegant if “len” appears as a loop bound, rather than "len+1" or "len-1". Thus, in 0-based languages we use half-open ranges, whereas in 1-based languages we use closed ranges:

        // real C
        for (int i = 0; i < len; ++i)
            process(array[i]);
      
      
        // C-like language with 1-based indexing
        for (int i = 1; i <= len; ++i)
            process(array[i]);
      

      But the second is inelegant when len is zero, because 0 isn’t a valid index at all, so it’s weird for it to appear as a bound.

  • >They are a relic of C style arrays

    Doesn't it predate that by a good amount? I would think it is a relic of the EEs who built the digital world, those early languages show a great deal more relation to the bare metal than modern languages. Creating an array whose index starts at 1 just doesn't make sense from the discrete logic point of view, you are either wasting an element or adding in an extra step.

    But in this day and age how can a language not have ⎕IO ← 0?

  • I honestly think that most of the problem arises from the fact that we just culturally start counting at 1 when talking about everyday things. As it stands, we're all used to it that way, and then computers come along and show us that counting from 0 is often more useful. So we adjust, but only for computer programming purposes.

    If our species had established counting from 0 as the norm right away (element #n is the one that has n elements before it; you think of the number as the number of steps you have to move away from the starting point), then I suspect the reverse would not be true: I don't think anyone would find a situation in which counting from 1 is so much more convenient that it's worth going against the grain of established norm.

    So in summary, I think we only think of counting from 1 as natural because it's in our culture. And it's in our culture because ancient superstitious humans had an irrational problem with the number 0.

    • I absolutely agree with you. People want to start with 1 because English (and presumably a lot of other languages) happen to use the word "first" to refer to the first element of a sequence, and not for any logical reason independent of arbitrary human language.

  • It's funny that nearly half of all comments are below your comment. The topic seems to unsettle people much more than a new programming language. This is also another example of how a downvote button is primarily misused in practice.

    > Arrays should start and end at whatever start index is required

    That's what you were indeed able to do with Pascal and also Modula-2, but with Oberon, Wirth came to the conclusion, that other index ranges than 0..n-1 were not needed. In his 1988 paper "From Modula to Oberon" he considers it "inessential" and providing "hardly any additional expressive power", but causing "a hidden computational effort that is incommensurate with the supposed gain in convenience". I think, in the end, it is in the eye of the beholder.