Comment by fine_tune

2 months ago

I was going to argue "LLM's need code samples to-do well on languages and if we are honest C# is a language mostly held in private repo's" but Github's 2024 report[0] says its the 5th most used language (I'm to lazy to check if this report includes private repo's but I'll assume it doesn't).

So kinda neat to see this paper!

[0]https://github.blog/news-insights/octoverse/octoverse-2024/#...

The big labs are almost certainly using compiler/repl output for generated code as an oracle for RL. I doubt they have C# in the mix.

  • Why do you doubt that? It's a widely used language. And there is even an open source C# REPL.

    • Because RL time is expensive and I don't think the languages which are more popular than C# have such high performance that it's worth bumping their batches for C#.

      1 reply →

5th most used language based on private repos that the group making the report has the exclusive direct access to seeing

I don't see that contradicting your assumption