Comment by johnfn

5 months ago

Some comments:

- This is a really remarkable graph. I just didn't realize how thoroughly it was over for SO. It stuns me as much as when Encyclopædia Britannica stopped selling print versions a mere 9 years after the publication of Wikipedia, but at an even faster timescale.

- I disagree with most comments that the brusque moderation is the cause of SO's problems, though it certainly didn't help. SO has had poor moderation from the beginning. The fundamental value proposition of SO is getting an answer to a question; if you can the same answer faster, you don't need SO. I suspect that the gradual decline, beginning around 2016, is due to growth in a number of other sources of answers. Reddit is kind of a dark horse here, as I began seeing answers on Google to more modern technical questions link to a Reddit thread frequently along with SO from 2016 onwards. I also suspect Discord played a part, though this is harder to gauge; I certainly got a number of answers to questions for, e.g., Bun, by asking around in the Bun Discord, etc. The final nail in the coffin is of course LLMs, which can offer a SO-level answer to a decent percentage of questions instantly. (The fact that the LLM doesn't insult you is just the cherry on top.)

- I know I'm beating a dead horse here, but what happens now? Despite stratification I mentioned above, SO was by far the leading source of high quality answers to technical questions. What do LLMs train off of now? I wonder if, 10 years from now, LLMs will still be answering questions that were answered in the halcyon 2014-2020 days of SO better than anything that came after? Or will we find new, better ways to find answers to technical questions?

> I disagree with most comments that the brusque moderation is the cause of SO's problems, though it certainly didn't help. SO has had poor moderation from the beginning.

I was an early SO user and I don’t agree with this.

The moderation was always there, but from my perspective it wasn’t until the site really pushed into branching out and expanding Stack Exchange across many topics to become a Quora style competitor that the moderation started taking on a life of its own. Stack Overflow moderator drama felt constant in the later 2010s with endless weird drama spilling across Twitter, Reddit, and the moderator’s personal blogs. That’s about the same time period where it felt like the moderation team was more interested in finding reasons to exercise their moderation power than in maintaining an interesting website.

Since about 2020 every time I click a Stack Overflow link I estimate there’s a 50/50 chance that the question I clicked on would be marked as off topic or closed or something before anyone could answer it. Between the moderator drama and the constant bait-and-switch feeling of clicking on SO links that didn’t go anywhere the site just felt more exhausting than helpful.

  • There was definitely a bit of Pournelle's Iron Law of Bureaucracy [0] at work. I worked there during a lot of the drama you allude to, and... It sucked, for everyone. But also...

    For most of SO's history, the vast majority of visitors (and this questions, answers) came in via Google. Not "search engines"; Google. This was pretty much baked in right at the start, and it effectively served as the site's primary user interface for years. And it worked. It worked pretty well! Until it didn't.

    At some point, Google started surfacing fewer "tried and true" Q&A examples and more unanswered, poorly-answered or moderated examples. This broke the fundamental assumption that sat behind SO's moderation - that curating a smaller set of posts was preferable to encouraging more, and newer. Suddenly, Google wasn't a very good UI for SO anymore.

    ...and SO didn't really have a fallback. Heck, for a while during this period they actually stopped showing questions on their homepage unless you were already logged in; the core assumption remained that there was a never-ending spring of new people feeding the thing. LLMs weren't the start of the problem, they were the end - the final wake-up call.

    I don't know that a site like SO can exist without the old Google, the old Internet; it is a product of all that, in the same way that mass-market TV shows were a product of 20th-century broadcast technology, or trade paperbacks of a particular intersection of printing tech and reading habits.

    [0]: https://www.jerrypournelle.com/reports/jerryp/iron.html

    • Lots of moderation issues are also UI issues.

      I suspect it’s the same issue for whatever is the “meta” in a competitive video game.

      Optimization based on the available affordances ?

    • > the core assumption remained that there was a never-ending spring of new people feeding the thing.

      Hi Shog, hope you're doing well! Just thought this bit was insightful; I can fully believe this was the idea and the motivating factor for a lot of the decisions made seemingly in a vacuum (from the outside).

      How much do you think Area51 and the push for the SE network rather than sticking with the Big Three affected things? I always got the impression that they tried to scale into places that ultimately attracted too much noise and overestimated the willingness of (community) moderators to effectively work for free for them to take on the wave of less technical/principled users.

      1 reply →

    • Shog9, excellent comment and very apt. I have to point out that you were also part of the toxicity and bad tone. You very much were part of the problem. Moderation and staff were very much the downfall.

      7 replies →

  • I know the feeling of being happy not being the only one with that same problem (and that somebody bothered to actually ask on SO) and the crushing feeling that the question was closed as off topic (so no reason for me to ask) or marked as duplicate (referencing that is clearly not a duplicate and just showing that the mod took no effort to understand the question)

  • The moderation definitely got kind of nasty in the last 5 years or so. To the point where you would feel unwelcome for asking a question you had already researched, and felt was perfectly sound to ask. However, that didn't stop millions of people from asking questions every day, it just felt kinda shitty to those of us who spent more time answering, when we actually needed to ask one on a topic we were lacking in. (Speaking as someone who never moderated).

    My feeling was always that the super mods were people who had too much time on their hands... and the site would've been better without them (speaking in the past tense, now). But I don't think that's what killed it. LLMs scraping all its content and recycling it into bite-sized Gemini or GPT answers - that's what killed it.

    • >it just felt kinda shitty to those of us who spent more time answering, when we actually needed to ask one on a topic we were lacking in. (Speaking as someone who never moderated).

      Great observation. Just like friendship, open communities psychologically feel as though there should be some balance. Spending free time contributing to something (even if you don't directly expect anything in return with ulterior motives) to benefit others, then getting an anvil dropped on your head when you dare to ask for a morsel in return, was an awful feeling which occurred too often there. The site and moderation, especially since the late 2010s (and especially in 2020 and beyond), became malignantly predatory.

  • I asked a question for the first time mid last year. It was a question about "default" sizes in HTML layout calculations, with lots of research and links to relevant parts of the spec.

    It was immediately closed as off topic, and there were a bunch of extremely vitriolic comments offended that I'd ask such a question on SO. It was briefly reopened weeks (?) later and then I guess closed again and now is deleted, so you can't even view the question any more.

    I'd long heard of abusive moderation but... experiencing it first hand is something else. Anecdote of one, but I know I'm never going to ask there again.

    In case anyone's wondering, I ended up asking on the WhatWG or W3C or something github project (via an issue?). The TLDR was rather eye opening, that basically the spec only codifies points of contention for browsers and old behaviors are generally undocumented. With some pointers I figured out the default size behavior through code diving, and it was complex (as in, hard to use) and very unintuitive.

    • Very similar to my experience. I never managed to either ask or answer a question on there. Everything I did was "bad" for reasons that were never explained to me in a constructive way that made me feel empowered to get a better outcome.

      I used it as a reference when someone had a similar question to mine, but over time the bad taste in my mouth caused me to avoid it in google search results.

      I fell into using an early — and I would say, far superior — form of ChatGPT, which consisted of carefully and clearly laying out my question, point-by-point, in a blank text file, and then usually having an insight as to what my particular stumbling block actually was and thereby being able to move forward.

    • Questions are never really deleted , post a link so people with enough reputation may have a look and maybe resurrect it if the question is really good.

      35 replies →

    • >I'd long heard of abusive moderation but... experiencing it first hand is something else. Anecdote of one, but I know I'm never going to ask there again.

      And it was a real gut punch when this would happen (or getting suspended/banned) to long-time users, as well. They largely precipitated their own demise, so I say good riddance.

    • its not just you, I saw this happen to others' posts many times and it happened to me several times

      I gave up on Stack Overflow when my jobs started requiring me to use Terraform and suddenly every time I posted a well researched and well formed question about Terraform, it would immediately get flagged and closed with responses that "Terraform is not programming and thus questions about Terraform should not be posted on Stack Overflow", which was insane to me because Stack Overflow has a "terraform" tag and category. If you visit it, you will see tons of users trying to post valid questions only to have the mods shut them down angrily.

      2 replies →

  • Friend in my group was in the public beta back in '08. We all ended up signing up by the end of '09. I used it off-and-on over the years (have some questions and replies with hundreds of upvotes). Though SO had a rap for having what might seem like harsh replies or moderation, it was often imho just blunt/curt, to the point, and often objectively defensible. I also agree with your timeframe that, in the later 2010s, the site became infected with drama, and moderation suddenly started reaching its tendrils into non-technical areas, when it should not have. And on an ostensibly technical site, no less!

    I found myself contributing less and less (same with Wikipedia), because I merely wanted to continue honing my craft through learning and contributing technical data with others who shared this same passion... I did not want to have politics shoved in my face, or have every post of mine have to be filtered through an increasingly extreme ideology which had nothing to do with the technical nature of the site. When I had my SO suspended with no warning or recourse for writing "master" in a reply, I knew it was time to leave for good. Most of the admins on the site transformed from technical (yet sometimes brash!) geeks, into political flag-waving and ideology-pushing avatars (including pushing their sexual agendas front and center), and not of the FSF/FLOSS kind, either.

    These types of dramas have infected nearly everything online, especially since 2020. Even Linus has lost his mind with pushing politics into what should be purely technical areas https://news.ycombinator.com/item?id=41936049

    LLMs were a final blow for many reasons, though I think that a huge part of it is that LLMs won't chide you and suspend/ban you for wanting to stick to strictly technical matters. I don't have to pledge allegiance to a particular ideology and pass a purity test before asking technical questions to an LLM.

    • I have been highly active in online technical communities since the usenet days in the 1980s and I have never once found myself in a situation where my opinions on sex or gender were solicited, interrogated, or judged.

      This really sounds like people were letting their personal flags fly (in avatars or sigs or whatever) and you could not stand to see that because they were not like you. All you have to do is ignore it and look at the content.

      This reminds me of someone I worked with, who asked me "why does [Colleague 2] have to shove his gay lifestyle in everyone's face?" after that Colleague 2 put a framed holiday photo with his husband on his desk.

      The person who asked this had a photo with his wife on his desk. He was unable to understand (A) how that is "shoving" his sexual orientation in other people's faces to the exact same degree as Colleague 2's photo was; and (B) that the photo was for Colleague 2's own comfort and solace, and for positive engagement with anyone who wanted to engage in same, and that nobody else was required to dwell on it or give it a second glance.

  • Quite frankly you are wrong. Jeff and Joel spoke about their goals for very harsh moderation in their podcast while they were still building SO. The moderation from the very beginning was a direct result of the culture they created and it was completely intentional.

    • Quite frankly you have missed the point of my comment.

      The late 2010s moderator drama I was talking about was beyond the strict question curation. When StackOverflow expanded into StackExchange and started trying to be another Quora the moderation grew beyond curating technical questions. For years there was needless moderator drama and arguments over how the moderator team should run that were spilling over into social media everywhere.

> The fundamental value proposition of SO is getting an answer to a question

I read an interview once with one of the founders of SO. They said the main value stackoverflow provided wasn't to the person who asked the question. It was for the person who googled it later and found the answer. This is why all the moderation pushes toward deleting duplicates of questions, and having a single accepted answer. They were primarily trying to make google searches more effective for the broader internet. Not provide a service for the question-asker or answerer.

Sad now though, since LLMs have eaten this pie.

  • > This is why all the moderation pushes toward deleting duplicates of questions, and having a single accepted answer.

    My personal single biggest source of frustration with SO has been outdated answers that are locking out more modern and correct answers. There are so many things for which there is no permanently right answer over time. It feels like SO started solidifying and failed to do the moderation cleaning and maintenance needed to keep it current and thriving. The over-moderation you described helps people for a short time but then doesn’t help the person who googles much later. I’ve also constantly wished that bad answers would get hidden or cleaned out, and that accepted answers that weren’t very good would get more actively changed to better ones that showed up, it’s pretty common to see newer+better answers than the accepted one.

    • > outdated answers that are locking out more modern and correct answers. There are so many things for which there is no permanently right answer over time.... I’ve also constantly wished that bad answers would get hidden or cleaned out, and that accepted answers that weren’t very good would get more actively changed to better ones that showed up, it’s pretty common to see newer+better answers than the accepted one.

      Okay, but who's going to arbitrate that? It's not like anyone was going to delete answers with hundreds of upvotes because someone thought it was wrong or outdated. And there are literally about a million questions per moderator, and moderators are not expected to be subject matter experts on anything in particular. Re-asking the question doesn't actually help, either, except sometimes when the question is bad. (It takes serious community effort to make projects like https://stackoverflow.com/questions/45621722 work.)

      The Trending sort was added to try to ameliorate this, though.

      4 replies →

    • yes I noticed this as well, over the past few years, its happened again and again that the "Top Answer" ends up being useless and I found myself constantly sorting the answers by "Recent" to find the ones that are actually useful and relevant

    • Having gotten used to SO, I was shocked when I found I could mark multiple answers correct on AskMetafilter. It felt like an innovation.

    • > There are so many things for which there is no permanently right answer over time.

      Yeah it's doubly stupid because the likelihood of becoming outdated is one of the reasons they don't allow "recommendation" questions. So they know that it's an issue but just ignore it for programming questions.

  • > This is why all the moderation pushes toward deleting duplicates of questions, and having a single accepted answer.

    Having duplicates of the question is precisely why people use LLMs instead of StackOverflow. The majority of all users lack the vocabulary to properly articulate their problems using the jargon of mathematicians and programmers. Prior to LLMs, my use case for StackOverflow was something like this:

    30 minutes trying (and failing) to use the right search terms to articulate the problem (remember, there was no contextual understanding, so if you used a word with two meanings and one of those meanings was more popular, you’d have to omit it using the exclusion operator).

    30 minutes reading through the threads I found (half of which will have been closed or answered by users who ignored some condition presented by the OP).

    5 minutes on implementation.

    2 minutes pounding my head on my desk because it shouldn’t have been that hard.

    With an LLM, if the problem has been documented at any point in the last 20 years, I can probably solve it using my initial prompt even as a layman. When you’d actually find an answer on StackOverflow, it was often only because you finally found a different way of phrasing your search so that a relevant result came up. Half the time the OP would describe the exact problem you were having only for the thread to be closed by moderators as a duplicate of another question that lacked one of your conditions.

    • > Having duplicates of the question is precisely why people use LLMs instead of StackOverflow. The majority of all users lack the vocabulary to properly articulate their problems using the jargon of mathematicians and programmers.

      Yes; so the idea is they fail to find the existing question, and ask it again, and get marked as a duplicate; and then everyone else with the same problem can search, possibly find the new duplicate version, and get automatically redirected to the main version with high quality answers.

      37 replies →

    • > to properly articulate their problems using the jargon of mathematicians and programmers

      I once distilled a real-life problem into mathematical language exactly like how the Introduction to Algorithms book would pose them only to have the quesiton immediately closed with the explanation "don't post your CS homework".

      (My employer at the time was very sensitive about their IP and being able to access the Internet from the work computer was already a miracle. I once sat through a whole day of InfoSec and diciplinary meetings for posting completely dummy bug repoduction code on Github.

      1 reply →

  • I think that's a great policy. I don't think anyone wants duplicate questions. The problem is moderation marking unrelated questions as duplicates.

    I'd say 9/10 times I find a direct match for my question on SO it's been closed as offtopic with links to one or more questions that are only superficially similar.

    There are other problems that they don't even try to address. If 10 people ask the same question, why does only the first person to ask it get to choose the answer? Then lots of "XY" questions where the original asker didn't actually have problem X so selects an answer for Y, leaving the original X unsolved, and now all the duplicates only have an answer for Y too.

    • > The problem is moderation marking unrelated questions as duplicates.

      This problem isn't directly solvable (what counts as a "duplicate" is inherently subjective, and therefore mistakes/differences of opinion are inevitable).

      I think a deeper problem is that once a question becomes closed (for any reason), it's unlikely that it'll ever be reopened. The factors behind this are social (askers interpret close votes as signals that they should give up), cultural (there's not much training/feedback/guidelines about what "duplicate" means for those with voting privileges), and technical (there's no first-class feature for askers to contest closure, and it takes just as many votes to reopen a question as it does to close it (with the same voter reputation requirement)).

      1 reply →

    • > The problem is moderation marking unrelated questions as duplicates.

      The idea was, if there's an answer on the other question that solves your question, your question remains in existence as a signpost pointing to the other one without having to pollute and confuse by having a mixture of similar answers across both with different amounts of votes.

  • Sad? No. A good LLM is vastly better than SO ever was. An LLM won't close your question for being off-topic in the opinion of some people but not others. It won't flame you for failing to phrase your question optimally, or argue about exactly which site it should have been posted on. It won't "close as duplicate" because a vaguely-similar question was asked 10 years ago in a completely-different context (and never really got a great answer back then).

    Moreover, the LLM has access to all instances of similar problems, while a human can only read one SO page at a time.

    The question of what will replace SO in future models, though, is a valid one. People don't realize what a massive advantage Google has over everyone else in that regard. So many site owners go out of their way to try to block OpenAI's crawlers, while simultaneously trying to attract Google's.

    • What's sad about it is that SO was yet another place for humans to interact that is now dead.

      I was part of various forums 15 years ago where I could talk shop about many technical things, and they're all gone without any real substitute.

      > People don't realize what a massive advantage Google has over everyone else in that regard. Site owners go out of their way to try to block OpenAI's crawlers, while simultaneously trying to attract Google's.

      Not really. Website operators can only block live searches from LLM providers like requests made when someone asks a question on chatgpt.com, only because of the quirk that OpenAI makes the request from their server as a quick hack.

      We're quickly moving past that as LLMs just make the request from your device with your browser if it has to (to click "I am not a robot").

      As for scraping the internet for training data, those requests are basically impossible to block and don't have anything in common with live answer requests made to answer a prompt.

      2 replies →

  • Quite often, when my search returned a 'closed as duplicate' reply, I found the allegedly duplicate question did not accurately describe my problem, and the answers to it were often inferior, for my purposes, than those which had been given to my original question before the gate was closed.

    • I think many would agree that this policy was the single biggest moderation failure of the site. And it would Have been so easy to fix. But management believed fewer high quality answers were better. Management was wrong.

  • The disconnect here is that they built it this way, but still call it a question and answer site and give a lot of power over to the person who created the question. They get to mark an answer as the solution for themselves, even if the people coming from Google have another answer as the solution.

    If they were to recreate the site and frame it as a symptom and issue site, which is what the interview described, that would yield many different choices on how to navigate the site, and it would do a lot better. In particular, what happens when two different issues have the same symptom. Right now, that question is closed as a duplicate. Under a symptom and issue site, it's obvious that both should stay as distinct issues.

  • > They were primarily trying to make google searches more effective for the broader internet

    This is mostly how I engaged with SO for a long, long time. I think it’s a testament to SO’s curation of answers that I didn’t ask almost any questions for like 5+ years after starting programming

  • LLMs also search Google for answers. Hence the knowledge may be not lost even for those who only supervises machines that write code.

  • If this were true, then treating any question as an X-Y problem shouldn't be allowed at all. I.e. answers should at least address the question as posed before/instead of proposing an alternative approach.

    In reality the opposite is encouraged. For countless times, I've landed on questions with promising titles/search extracts, only to find irrelevant answers because people grabbed onto some detail in the question irrelevant to my case and provided X-Y answers.

    This often also causes subsequent useful questions to be marked as dups even though they no longer contain that irrelevant detail. The appeal process is so unfriendly that most would not bother.

    See also: https://news.ycombinator.com/item?id=36068243

  • I agree with that and I think it was the right decision. There was grousing about overmoderation but I think a lot of people got unreasonably annoyed when their question was closed. And the result was a pretty well-curated and really useful knowledge base.

  • > Sad now though, since LLMs have eaten this pie.

    By regenerating an answer on command and never caring about the redundancy, yeah.

    The DRY advocate within me weeps.

Thinking from first principles, a large part of the content on stack overflow comes from the practical experience and battle scars worn by developers sharing them with others and cross-curating approaches.

Privacy concerns notwithstanding, one could argue having LLMs with us every step of the way - coding agents, debugging, devops tools etc. It will be this shared interlocutor with vast swaths of experiential knowledge collected and redistributed at an even larger scale than SO and forum-style platforms allow for.

It does remove the human touch so it's quite a different dynamic and the amount of data to collect is staggering and challenging from a legal point of view, but I suspect a lot of the knowledge used to train LLMs in the next ten years will come from large-scale telemetry and millions of hours in RL self-play where LLMs learn to scale and debug code from fizzbuzz to facebook and twitter-like distributed system.

  • > Privacy concerns notwithstanding, one could argue having LLMs with us every step of the way - coding agents, debugging, devops tools etc.

    That might work until an LLM encounters a question it's programmed to regard as suspicious for whatever reason. I recently wanted to exercise an SMTP server I've been configuring, and wanted to do it by an expect script, which I don't do regularly. Instead of digging through the docs, I asked Google's Gemini (whatever's the current free version) to write a bare bones script for an SMTP conversation.

    It flatly refused.

    The explanation was along the lines "it could be used for spamming, so I can't do that, Dave." I understand the motivation, and can even sympathize a bit, but what are the options for someone who has a legitimate need for an answer? I know how to get one by other means; what's the end game when it's LLMs all the way down? I certainly don't wish to live in such a world.

  • I don't know how others use LLMs, but once I find the answer to something I'm stuck on I do not tell the LLM that it's fixed. This was a problem in forums as well but I think even fewer people are going to give that feedback to a chatbot

    • The problem that you worked out is only really useful if it can be recreated and validated, which in many cases it can be by using an LLM to build the same system and write tests that confirm the failure and the fix. Your response telling the model that its answer worked is more helpful for measuring your level of engagement, not so much for evaluating the solution.

    • You can also turn off the feature to allow ChatGPT to learn from your interactions. Not many people do but those that do would also starve OpenAI for information assume they respect that setting

  • Am I the only one that sees this as a hellscape?

    No longer interacting with your peers but an LLM instead? The knowledge centralized via telemetry and spying on every user’s every interaction and only available thru a enshitified subscription to a model that’s been trained on this stolen data?

    • Asking questions on SO was an exercise in frustration, not "interacting with peers". I've never once had a productive interaction there, everything I've ever asked was either closed for dumb reasons or not answered at all. The library of past answers was more useful, but fell off hard for more recent tech, I assume because people all were having the same frustrations as I was and just stopped going there to ask anything.

      I have plenty of real peers I interact with, I do not need that noise when I just need a quick answer to a technical question. LLMs are fantastic for this use case.

      8 replies →

    • Y'know how "users" of modern tech are the product? And how the developers were completely fine with creating such systems?

      Well, turns out developers are now the product too. Good job everyone.

    • Replying to my own comment surprised that everyone is latching on to just poor moderation on a single site and ignoring the wealth of other options for communication and problem solving like slack communities, Reddit, blog posts, running a site like SO but with a better/different moderation policy, the list goes on and on.

      I’ve seen this trend a number of times on HN that feels strawman-y. Taking the worst possible example of the status quo but also yada-yadaing or outright ignoring the massive risks of the tech du jour.

      The comment I’m replying to hand waves over “legal issues” and totally ignores the fact that this hypothetical (and idealized) version of AI fundamentally destroys core aspects of community problem solving and centralizes the existing knowledge into a black box subscription all for the benefit of a clunky UX and underlying product that has yet to be proven effective enough to justify all the negative externalities.

    • I actively hated interacting with the power users on SO, and I feel nothing about an LLM, so it's a definite improvement in QoL for me.

    • How is it much different than trading say a bar for livestream? For any org if you can remove the human meatware you should otherwise you are just making a bunch of busywork to exlude people from using your service.

      Just through the act of existing meatware prevents other humans from joining. The reasons may be shallow or well thought out. 95+% of answers on stack overflow are written by men so for most women stack overflow is already a hellscape.

      If companies did more work on bias (or at least not be so offensive to various identities) that benefit, of distributing knowledge/advice/RTFM, could be even greater.

      1 reply →

As long as software is properly documented, and documentation is published in LLM-friendly formats, LLMs may be able to answer most of the beyond basic questions even when docs don't explicitly cover a particular scenario.

Take an API for searching products, one for getting product details, and then an API for deleting a product.

The documentation does not need to cover the detailed scenario of "How to delete a product" where the first step is to search, the second step is to get the details (get the ID), and the third step is to delete.

The LLM is capable of answering the question "how to delete the product 'product name'".

To some degree, many of the questions on SO were beyond basic, but still possible for a human to answer if only they read documentation. LLMs just happen to be capable of reading A LOT of documentation a LOT faster, and then coming up with an answer A LOT faster.

  • If the LLM is also writing the documentation, because the developers surely don’t want to, I’m not sure how well this will work out.

    I have some co-workers who have tried to use Copilot for their documentation (because they never write any and I’m constantly asking them questions as a result), and the results were so bad they actually spent the time to write proper documentation. It failed successfully, I suppose.

    • Indeed, how documentation is written is key. But funny enough, I have been a strong advocate that documentation should always be written in Reference Docs style, and optionally with additional Scenario Docs.

      The former is to be consumed by engineers (and now LLMs), while the later is to be consumed by humans.

      Scenario Docs, or use case docs, are what millions of blog articles were made of in the early days, then we turned to Stack Overflow questions/answers, then companies started writing documentation in this format too. Lots of Quick Starts for X, Y, and Z scenarios using technology K. Some companies gave away completely on writing reference documentation, which would allow engineers to understand the fundamentals of technology K and then be able to apply to X, Y, and Z.

      But now with LLMs, we can certainly go back to writing Reference docs only, and let LLMs do the extra work on Scenario based docs. Can they hallucinate still? Sure. But they will likely get most beyond-basic-maybe-not-too-advanced scenarios right in the first shot.

      As for using LLMs to write docs: engineers should be reviewing that as much as they should be reviewing the code generated by AI.

  • "In this imaginary world where everything is perfect and made to be consumed by LLMs, LLMs are the best tool for the job".

    •     world where everything is perfect and made to be consumed by LLMs
      

      I believe the parent poster was clearly and specifically talking about software documentation that was strong and LLM consumption-friendly, not "everything"

    • Yeah, old news? It's how it is today with humans.

      You SHOULD be making things in a human/LLM-readable format nowadays anyway if you're in tech, it'll do you well with AIs resorting to citing what you write, and content aggregators - like search engines - giving it more preferential scores.

> I disagree with most comments that the brusque moderation is the cause of SO's problems

The moderation was precisely the reason I stopped using stackoverflow and started looking for answers and asking questions elsewhere. It was nearly impossible to ask anything without someone replying "Why would you even want to do that, do <something completely different that does not solve my problem> instead!". Or someone claiming it's a duplicate and you should use that ancient answer from another question that 1) barely fits and doesnt solve my problem and 2) is so outdated, it's no longer useful.

Whenever I had to ask something, I had to add a justification as to why I have to do it that way and why previous posts do not solve the issue, and that took more space than the question itself.

I certainly won't miss SO.

  • That's not moderation. That's a lack of moderation. Comments like this weren't posted by mods but by people looking for stuff to answer. A mod would definitely delete such a comment if they were notified about it. But not enough users flagged stuff and preferred to complain about being hurt instead. There's also a problem that not enough mods were present at all times to handle the volume of comments posted.

  • I will miss it but you are right about moderation. I don't know what the issue is on some platforms, reddit and SO come to mind. Moderators on many other platforms or forums seem to be alright and keep a clear head, even when they have to deal with a lot of vitriol and they get little thanks for their work.

    There are probably negative examples as well but some platforms seem to be especially vulnerable. If I had to run reddit or SO, I would limit moderation to one subreddit/subdomain. No idea if that would help, but the problem isn't exactly invisible.

If we're going to diagnose pre-AI Stack Overflow problems I see two obvious ones:

1. The attempt to cut back on the harshness of moderation meant letting through more low-quality questions.

2. More importantly, a lot of the content is just stale. Like you go to some question and the accepted answer with the most votes is for a ten-year-old version of the technology.

  • > Like you go to some question and the accepted answer with the most votes is for a ten-year-old version of the technology.

    This is still a problem with LLMs as a result. The bigger problem is that now the LLM doesn’t show you it was a 10 year old solution, you have to try it, watch it fail, then find out it’s old, and ask for a more up to date example, then watch it flounder around. I’ve experienced this more times than I can count.

    • Then you're doing it wrong?

      I'd need to see a few examples, but this is easily solved by giving the llm more context, any really. Give it the version number, give it a url to a doc. Better yet git clone the repo and tell it to reference the source.

      Apologies for using you as an example, but this is a common theme on people who slam LLMs. They ask it a specific/complex question with little context and then complain when the answer is wrong.

      3 replies →

    • Usually that's resolved by saying "I want you to use v2" or whatever it is, which you can't really do with a Stack Overflow answer as easily.

    • Have you tried using context7 or a similar MCP to have the agent automatically fetch up to date documentation?

> The fundamental value proposition of SO is getting an answer to a question

But the horrible moderation was in part a reason why many SO questions had no answers.

I am not saying poor moderation caused all of this, but it contributed negatively and many people were pissed at that and stopped using SO. It is not the only reason SO declined, but there are many reasons for SO failure after its peak days.

  • To the extent that moderation ever prevented questions from getting answers, that was by closing them.

    When a question gets closed before an answer comes in, the OP has nine days to fix it before it gets deleted automatically by the system.

    The value proposition is getting an answer to a question that is useful to a reasonably broad audience. That very often means a question that someone else asked, the answer to which is useful to you. It is not getting an "answer" to a "question" where an individual dumps some code trying to figure out what's wrong.

    • > When a question gets closed before an answer comes in, the OP has nine days to fix it before it gets deleted automatically by the system.

      One of the bigger problems with the site's moderation systems was that 1) this system was incredibly opaque and unintuitive to new users, 2) the reopen queue was almost useless, leading to a very small percentage of closed questions ever getting reopened, and 3) even if a question did get reopened, it would be buried thousands of posts down the front page and answerers would likely never see it.

      There were many plans and proposals to overhaul this system -- better "on hold" UI that would walk users through the process of revising their question, and a revamp of the review queues aimed at making them effective at pushing content towards reopening. These efforts got as far as the "triage" queue, which did little to help new users without the several other review queues that were planned to be downstream of it but scrapped as SE abruptly stopped working on improvements to the site.

      Management should have been aggressively chasing metrics like "percentage of closed questions that get reopened" and "number of new users whose first question is well-received and answered". But it wasn't a priority for them, and the outcome is unsurprising.

      1 reply →

    • And that was the core problem with Stack Overflow - they wanted to build a system of core Q&As to be a reference, but everyone treated it as a "fix my very specific problem now".

      99% of all the junk that got closed was just dumps of code and 'it doesn't work'. Not useful to anyone.

      2 replies →

  • There was, obviously, only one main reason: LLMs. Anything else makes no sense. Even if the moderation was "horrible" (which sounds to me like a horrible exaggeration), there was nothing which came close to being as good as SO. There was no replacement. People will use the best available platform, even if you insist in describing it as "horrible". It's was not horrible compared to the alternatives, web forums like Reddit and HN, which are poorly optimized for answering questions.

    • Look at the data - it had already been on the downslide for years before LLMs became a meaningful alternative. AI was the killing blow, but there was undoubtedly other factors.

      1 reply →

    • You overvalue the impact of LLMs in regards to SO. They did have an impact, but it's the moderation that ultimately bent and broke the camel's back. An LLM may give seemingly good answers, but it always lacks in nuance and, most importantly, in being vetted by another person. It's the quality assurance that matters, and anyone with even a bit of technical skill quickly brushes up against that illusion of knowledge an LLM gives and will either try to figure it out on their own or seek out other sources to solve it if it matters. Reddit, for all its many problems, was often still easier to ask on and easier to get answers on without needing an intellectual charade and without some genius not reading the post, closing it and linking to a similar sounding title despite the content being very different. Which is the crux of the issue; you can't ask questions on SO. Or rather, you can't ask questions. No, no, that's not enough. You'll have to engage with the community, answer many other questions first, ensure that your account has enough "clout" to overturn stupid closures of questions, and when you have wasted enough time doing that, then you can finally ask your own question. Or you can just go somewhere else that isn't an intellectual charade and circle jerking and figure it out without wasting tons of time chasing clout and hoping a moderator won't just close the question as duplicate. SO was never the best platform, exactly because of its horrendous moderation. It was good, yes. It had the quality assurance, to a degree, yes. But when just asking a question becomes such a monumental task, people will go elsewhere, to better platforms. Which includes other forums, and, LLMs. So no, what you're attributing to LLMs is merely a symptom of the deeper issue.

    • It was bad enough that many people resorted to asking their questions in Discord instead which is a massive boomerang back to trying to get help in IRC and just praying that someone is online and willing to help you on the spot. Having to possibly ask your question multiple times before you get some spotty help in a real time chat where it's next to impossible to find again seems unimaginably worse than using an online forum but the fact of it remains and tells us there was something driving people away from sites like SO.

That "Dead Internet" phrase keeps becoming more likely, and this graph shows that. Human-to-human interactions, LLMs using those interactions, less human-to-human interactions because of that, LLMs using... ?

> I disagree with most comments that the brusque moderation is the cause of SO's problems, though it certainly didn't help.

By the time my generation was ready to start using SO, the gatekeeping was so severe that we never began asking questions. Look at the graph. The number of questions was in decline before 2020. It was already doomed because it lost the plot and killed any valuable culture. LLMs were a welcome replacement for something that was not fun to use. LLMs are an unwelcome replacement for many other things that are a joy to engage with.

> I disagree with most comments that the brusque moderation is the cause of SO's problems, though it certainly didn't help. SO has had poor moderation from the beginning.

Overwhelmingly, people consider the moderation poor because they expect to be able to come to the site and ask things that are well outside of the site's mission. (It's also common to attribute community actions to "moderators" who in reality have historically done hardly any of it; the site simply didn't scale like that. There have been tens of millions of questions, versus a couple dozen moderators.)

The kinds of questions that people are getting quick, accurate answers for from an LLM are, overwhelmingly, the sort of thing that SO never wanted. Generally because they are specific to the person asking: either that person's issue won't be relevant to other people, or the work hasn't been done to make it recognizable by others.

And then of course you have the duplicates. You would not believe the logic some people put forward to insist that their questions are not duplicate; that they wouldn't be able, in other words, to get a suitable answer (note: the purpose is to answer a question, not solve a problem) from the existing Q&A. It is as though people think they are being insulted when they are immediately given a link to where they can get the necessary answer, by volunteers.

I agree that Reddit played a big role in this. But not just by answering questions; by forming a place where people who objected to the SO content model could congregate.

Insulting other users is and always has been against Stack Overflow Code of Conduct. The large majority of insults, in my experience, come from new users who are upset at being politely asked to follow procedures or told that they aren't actually allowed to use the site the way they're trying to. There have been many duplicate threads on the meta site about why community members (with enough reputation) are permitted to cast close votes on questions without commenting on what is wrong. The consensus: close reasons are usually fairly obvious; there is an established process for people to come to the meta site to ask for more detailed reasoning; and comments aren't anonymous, so it makes oneself a target.

  • It seems you deny each problem that everyone sees in SO. The fact is SO repulsed people, so there is a gap between your interpretation and reality.

    > It is as though people think they are being insulted when they are immediately given a link to where they can get the necessary answer, by volunteers.

    This, for example. Question can be marked as duplicate without an answer. In this case yes, it feels insulting because the other is asked in such a weird way, that no-one will find the old when they search for the new (for example after a library change) and marking it as duplicate of an unanswered answer if a guarantee that the next SEO user won’t see it.

    • > Question can be marked as duplicate without an answer.

      No, they literally cannot. The only valid targets for closure are existing questions that have an upvoted or accepted answer. The system will not permit the closure (or vote to close) otherwise.

      If you mean "without writing a direct answer to the new question first", that is the exact point of the system. Literally all you have to do is click the link and read the existing answers.

      > it feels insulting because the other is asked in such a weird way, that no-one will find the old when they search for the new

      Sure. But someone else knew about the old question, found it for you, and directly pointed you at it so that you could get an answer immediately. And did all of this for free.

      And, by doing this, now everyone else who thinks of your phrasing for the question, will be immediately able to find the old question, without even having to wait for someone to recognize the duplicate.

      7 replies →

  • > It is as though people think they are being insulted when they are immediately given a link to where they can get the necessary answer, by volunteers.

    Multiple times my questions closed as duplicates of question that was answering a different question.

    Even when I explicitly linked that QA in my question and described how it differs from mine.

I don't think "good moderation or not" really touches what was happening with SO.

I joined SO early and it had a "gamified" interface that I actually found fun. Putting in effort and such I able to slowly gain karma.

The problem was as the site scaled, the competition to answer a given question became more and more intense and that made it miserable. I left at that point but I think a lot people stayed with dynamic that was extremely unhealthy. (and the quality of accepted questions declined also).

With all this, the moderation criteria didn't have to directly change, it just had to fail to deal with the effects that were happening.

This doesn't mean that it's over for SO. It just means we'll probably trend towards more quality over quantity. Measuring SO's success by measuring number of questions asked is like measuring code quality by lines of code. Eventually SO would trend down simply by advancements of search technology helping users find existing answers rather than asking new ones. It just so happened that AI advanced made it even better (in terms of not having to need to ask redundant questions).

> I wonder if, 10 years from now, LLMs will still be answering questions that were answered in the halcyon 2014-2020 days of SO better than anything that came after?

I've wondered this too and I wonder if the existing corpus plus new GitHub/doc site scrapes will be enough to keep things current.

> I disagree with most comments that the brusque moderation is the cause of SO's problems

Just to add another personal data point: i started posting in on StackOverflow well before llms were a thing and moderation instantly turned ne off and i immediately stopped posting.

Moderators used to edit my posts and reword what i wrote, which is unacceptable. My posts were absolutely peaceful and not inflammatory.

Moderation was an incredible problem for stack overflow.

  • > Moderators used to edit my posts and reword what i wrote, which is unacceptable. My posts were absolutely peaceful and not inflammatory.

    99.9% probability the people who made those edits a) were not moderators; b) were acting completely in accordance with established policy (please read: "Why do clear, accurate, appropriately detailed posts still get edited?" https://meta.stackexchange.com/questions/403176)

    Why do you think you should be the one who gets to decide whether that's "acceptable"? The site existed before you came to it, and it has goals, purposes and cultural norms established beforehand. It's your responsibility, before using any site on the Internet that accepts user-generated content, to try to understand the site's and community's expectations for that content.

    On Stack Overflow, the expectations are:

    1. You license the content to the site and to the community, and everyone is allowed to edit it. (This is also explicitly laid out in the TOS.)

    2. You are contributing to a collaborative effort to build a useful resource for the programming community: a catalog of questions whose answers can be useful to many people, not just to yourself.

    3. Content is intended to be matter-of-fact and right to the point, and explicitly not conversational. You are emphatically not participating in a discussion forum.

"I suspect that the gradual decline, beginning around 2016, is due to growth in a number of other sources of answers."

I think at least one other reason is that a lot of the questions were already posted. There are only so many questions of interest, until a popular new technology comes along. And if you look at mathoverflow (which wouldnt have the constant shocks from new technologies) the trend is pretty stable...until right around 2022. And even since then, the dropoff isn't nearly so dramatic. https://data.stackexchange.com/mathoverflow/query/edit/19272...

>>what happens now?

I'll tell you what happens now: LLMs continue to regurgitate and iterate and hallucinate on the questions and answers they ingested from S.O. - 90% of which are incorrect. LLM output continues to poison itself as more and more websites spring up recycling outdated or incorrect answers, and no new answers are given since no one wants to waste the time to ask a human a question and wait for the response.

The overall intellectual capacity sinks to the point where everything collaboratively built falls apart.

The machines don't need AGI to take over, they just need to wait for us to disintegrate out of sheer laziness, sloth and self-righteous.... /okay.

there was always a needy component to Stack Overflow. "I have to pass an exam, what is the best way to write this algorithm?" and shit like that. A lazy component. But to be honest, it was the giving of information which forced you to think, and research, and answer correctly, which made systems like S.O. worthwhile, even if the questioners were lazy idiots sometimes. And now, the apocalypse. Babel. The total confusion of all language. No answer which can be trusted, no human in the loop, not even a smart AI, just a babbling set of LLMs repeating Stack Overflow answers from 10 years ago. That's the fucking future.

Things are gonna slide / in all directions / won't be nothin you can measure anymore. The blizzard of the world has crossed the threshold and it's overturned the order of the soul.[0]

[0] https://www.youtube.com/watch?v=8WlbQRoz3o4

  • Labs are spending billions on data set curation and RL from human experts to fill in the areas where they're currently weak. It's higher quality data than SO, the only issue is that it's not public.

    • Can you explain what you're saying in greater depth?

      Are you saying that the reason there is no human expertise on the internet anymore is that everyone with knowledge is now under contract to train AIs?

      2 replies →

The fundamental value proposition of SO is getting an answer to a question

For me, the value was writing answers on topics I was interested in…and internet points as feedback on their quality.

When SE abandoned their app, it broke my habit.

There's another significant forum: GitHub, the rise of which coincided with the start of SO's decline. I bet most niche questions went over to GH repos' issue/discussion forums, and SO was left with more general questions that bored contributors.

The moderation was a lot of the problem, but not the whole problem. Honestly these days a larger part is that old, low-quality answers are usually stuck as the top answer on old questions, despite the fact that the situation has changed massively in the past decade and there are newer answers further down that give the new answer. Or better yet: when the top answer is a decade old and says "that doesn't even make sense, why would you want to do that, it's impossible, but you can look at literally the entire Handbook to see what you can do" (with a link to the frontpage of the FreeBSD Handbook) and you have to scroll down nearly to the bottom to find the one answer that actually answers the question (how to add an on-link route on FreeBSD) (and that it's not actually impossible like the arrogant jerk on top claimed)...

Too bad stack overflow didn't high-quality-LLM itself early. I assume it had the computer-related brainpower.

with respect to the "moderation is the cause" thing... Although I also don't buy moderation as the cause, I wonder if any sort of friction from the "primary source of data" can cause acceleration.

for example, when I'm doing an interenet search for the definition of a word like buggywhip, some search results from the "primary source" show:

> buggy whip, n. meanings, etymology and more | Oxford English Dictionary

> Factsheet What does the noun buggy whip mean? There is one meaning in OED's entry for the noun buggy whip. See 'Meaning & use' for definition, usage, and quotation evidence.

which are non-answer to keep their traffic.

but the AI answer is... the answer.

If SO early on had had some clear AI answer + references, I think that would have kept people on their site.

  • The meta post describing the policy of banning AI-generated answers from the site (https://meta.stackoverflow.com/questions/421831) is the most popular of all time. Company interference with moderator attempts to enforce that policy lead to a moderator strike. The community is vehemently against the company's current repeated attempts to sneak AI into the system, which have repeatedly produced embarrassing results (see for example https://meta.stackoverflow.com/questions/425081 and https://meta.stackoverflow.com/questions/425162 ; https://meta.stackoverflow.com/questions/427807 ; https://meta.stackoverflow.com/questions/425766 etc.).

    What you propose is a complete non-starter.

    • Your first example is a public announcement of an llm assisted ask question form. A detailed request for feedback on an experiment isn't "sneaking" and the replies are a tire fire of stupidity. One of your top complaints about users in this thread is they ask the wrong sort of questions so AI review seems like it should be useful.

      The top voted answer asks why SO is even trying to improve anything when there's a moderator strike on. What is this, the 1930s? It's a voluntary role, if you don't like it just don't do it.

      The second top voted answer says "I was able to do a prompt injection and make it write me sql with an injection bug". So? It also complains that the llm might fix people's bad English, meaning they ask the wrong question, lol.

      It seems clear these people started from a belief that ai is always bad, and worked backwards to invent reasons why this specific feature is bad.

      It's crazy that you are defending this group all over this HN thread, telling people that toxicity isn't a problem. I've not seen such a bitchy passive aggressive thread in years. Those replies are embarrassing for the SO community, not AI.

If my questions become more open ended and challenging as I become more senior, and I am not allowed to ask them on SO, why would I stay on the website?

So you get junior questions only and no seniors to answers.

The decline started in 2016, not 2026, they killed themselves

The newer questions that LLMs can't answer will be answered in forums - either SO, reddit, or elsewhere. There will be a much higher percentage of relevant content with far fewer new pages regurgitating questions about solved problems. So the LLMs will be able to keep up.

> - I know I'm beating a dead horse here, but what happens now? Despite stratification I mentioned above, SO was by far the leading source of high quality answers to technical questions. What do LLMs train off of now? I wonder if, 10 years from now, LLMs will still be answering questions that were answered in the halcyon 2014-2020 days of SO better than anything that came after? Or will we find new, better ways to find answers to technical questions?

To me this shows just how limited LLMs are. Hopefully more people realize that LLMs aren't as useful as they seem, and in 10 years they're relegated to sending spam and generating marketting websites.

I think the interesting thing here for those of us who use open source frameworks is that we can ask the LLM to look at the source to find the answer (eg. Pytorch or Phoenix in my case). For closed source libraries I do not know.

> What do LLMs train off of now? I wonder if, 10 years from now, LLMs will still be answering questions that were answered in the halcyon 2014-2020 days of SO better than anything that came after? Or will we find new, better ways to find answers to technical questions?

That's a great question. I have no idea how things will play out now - do models become generalized enough to handle "out of distrubition" problems or not ? If they don't then I suppose a few years from now we'll get an uptick in Stackoverflow questions; the website will still exist it's not going anywhere.

Specialized research AI agents are coming at which point we'll have numerous LLMs running and verifying experiments and creating a higher quality text corpus than the 2014-2020 halcyon, which is then used for other LLMs to be trained on.

It will be the reverse I suspect. Eventually we will see that LLM quality is lower when it is training data from 2014-2020 and will chalk it up to human limitations and the data not being written with a laser-focused goal of training better AI.

Instead of having chat-interfaces target single developers, moving towards multiplayer interfaces may bring back some of what has been lost--looping in experts or third-party knowledge when a problem is too though to tackle via agentic means.

Now all our interactions are neatly kept in personalised ledgers, bounded and isolated from one another. Whether by design or by technical infeasability, the issue remains that knowledge becomes increasingly bounded too instead of collaborative.

> SO was by far the leading source of high quality answers to technical questions

We will arrive on most answers by talking to an LLM. Many of us have an idea about we want. We relied on SO for some details/quirks/gotchas.

Example of a common SO question: how to do x in a library or language or platform? Maybe post on the Github for that lib. Or forums.. there are quirky systems like Salesforce or Workday which have robust forums. Where the forums are still much more effective than LLMs.

> The fundamental value proposition of SO is getting an answer to a question; if you can the same answer faster, you don't need SO.

Plus they might find the answer on SO without asking a new question - You probably would expect the # of new questions to peak or plateau even if the site wasn't dying, due to the accumulation of already-answered questions.

We'll get to the point where we can mass moderate core knowledge eventually. We may need to hand out extra weight for verified experts and some kind of most-votes-win type logic (perhaps even comments?), but live training data updates will be a massive evolution for language models.

> will we find new, better ways to find answers to technical questions?

I honestly don't think they need to. As we've seen so far, for most jobs in this world, answers that sound correct are good enough.

Is chasing more accuracy a good use of resources if your audience can't tell the difference anyway?

Google also played a part. After a while, I noticed that for my programming related questions, almost no SO discussions showed up. When they did appear on the first page, they were usually abysmal and unusable for me.

When it started all kinds of very clever people were present and helped even with very deep and complex questions and problems. A few years later these people disappeared. The moderation was ok in the beginning, then they started wooing away a lot of talented people. And then the mods started acting like nazis, killing discussions, proper questions on a whim.

And then bots (?) or karma obsessed/farming people started to upvote batshit crazy, ridiculous answers, while the proper solution had like 5 upvotes and no green marker next to it.

It was already a cesspool before AI took over and they sold all their data. Initial purpose achieved.

> I disagree with most comments that the brusque moderation is the cause of SO's problems

Questions asked on SO that got downvoted by the heavy handed moderation would have been answered by LLMs without any of the flak whatsoever.

Those who had downvoted other's questions on SO for not being good enough, must be asking a lot of such not good enough questions to an LLM today.

Sure, the SO system worked, but it was user hostile and I'm glad we all don't have to deal with it anymore.

As an early user of SO [1], I feel reasonably qualified to discuss this issue. Note that I barely posted after 2011 or so so I can't really speak to the current state.

But what I can say is that even back in 2010 it was obvious to me that moderation was a problem, specifically a cultural problem. I'm really talking about the rise of the administrative/bureaucratic class that, if left unchecked, can become absolute poison.

I'm constantly reminded of the Leonard Nimoy voiced line from Civ4: "the bureaucracy is expanding to meet the needs of the expanding bureaucracy". That sums it up exactly. There is a certain type of person who doesn't become a creator of content but rather a moderator of content. These are people who end up as Reddit mods, for example.

Rules and standards are good up to a point but some people forget that those rules and standards serve a purpose and should never become a goal unto themselves. So if the moderators run wild, they'll start creating work for themselves and having debates about what's a repeated question, how questions and answers should be structured, etc.

This manifested as the war of "closed, non-constructive" on SO. Some really good questions were killed this way because the moderators decided on their own that a question had to have a provable answer to avoid flame wars. And this goes back to the rules and standards being a tool not a goal. My stance was (and is) that shouldn't we solve flame wars when they happen rather than going around and "solving" imaginary problems?

I lost that battle. You can argue taht questions like "should I use Javascript or Typescript?" don't belong on SO (as the moderators did). My position was that even though there's no definite answer, somebody can give you a list of strengths and weaknesses and things to consider.

Even something that does have a definite answer like "how do I efficiently code a factorial function?" has multiple but different defensible answers. Even in one language you can have multiple implementations that might, say, be compile-time or runtime.

Another commenter here talked about finding the nearest point on an ellipse and came up with a method they're proud of where there are other methods that would also do the job.

Anyway, I'd occasionally login and see a constant churn on my answers from moderators doing pointless busywork as this month they'd decided something needed to be capitalized or not capitalized.

A perfect example of this kind of thing is Bryan Henderson's war on "comprised of" on Wikipedia [2].

Anyway, I think the core issue of SO was that there was a lot of low-hanging fruit and I got a lot of accepted answers on questions that could never be asked today. You'll also read many anecdotes about people having a negative experience asking questions on SO in later years where their question was immediately closed as, say, a duplicate when the question wasn't a duplicate. The moderator just didn't understand the difference. That sort of thing.

But any mature site ultimately ends with an impossible barrier to entry as newcomers don't know all the cultural rules that have been put in place and they tend to have a negative experience as they get yelled at for not knowing that Rule 11.6.2.7 forbids the kind of question they asked.

[1]: https://stackoverflow.com/users/18393/cletus

[2]: https://www.npr.org/2015/03/12/392568604/dont-you-dare-use-c...

  • > This manifested as the war of "closed, non-constructive" on SO. Some really good questions were killed this way because the moderators decided on their own that a question had to have a provable answer to avoid flame wars.

    It's literally a Q&A site. Questions need actual answers, not just opinions or "this worked for me".

  • > This manifested as the war of "closed, non-constructive" on SO. Some really good questions were killed this way because the moderators decided on their own that a question had to have a provable answer to avoid flame wars.

    Please point at some of these "really good" questions, if you saved any links. (I have privileges to see deleted questions; deletion is normally soft unless there's a legal requirement or something.) I'll be happy to explain why they are not actually what the site wanted and not compatible with the site's goals.

    The idea that the question "should have provable answers" wasn't some invention of moderators or the community; it came directly from Atwood (https://stackoverflow.blog/2011/01/17/real-questions-have-an...).

    > I lost that battle. You can argue taht questions like "should I use Javascript or Typescript?" don't belong on SO (as the moderators did). My position was that even though there's no definite answer, somebody can give you a list of strengths and weaknesses and things to consider.

    Please read "Understanding the standard for "opinion-based" questions" (https://meta.stackoverflow.com/questions/434806) and "What types of questions should I avoid asking?" (https://stackoverflow.com/help/dont-ask).

    • I believe that this tension about what type of questions was baked into the very foundation of StackOverflow.

      https://www.joelonsoftware.com/2008/09/15/stack-overflow-lau...

      > What kind of questions are appropriate? Well, thanks to the tagging system, we can be rather broad with that. As long as questions are appropriately tagged, I think it’s okay to be off topic as long as what you’re asking about is of interest to people who make software. But it does have to be a question. Stack Overflow isn’t a good place for imponderables, or public service announcements, or vague complaints, or storytelling.

      vs

      https://blog.codinghorror.com/introducing-stackoverflow-com/

      > Stackoverflow is sort of like the anti-experts-exchange (minus the nausea-inducing sleaze and quasi-legal search engine gaming) meets wikipedia meets programming reddit. It is by programmers, for programmers, with the ultimate intent of collectively increasing the sum total of good programming knowledge in the world. No matter what programming language you use, or what operating system you call home. Better programming is our goal.

      (the emphasis on "good" is in the original)

      And this can be seen in the revision history of https://stackoverflow.com/posts/1003841/revisions (take note of revision 1 and the moderation actions 2011)

      ---

      Questions that are fun and slightly outside of the intended domain of the site are manageable ... if there is sufficient moderation to keep those types of questions from sucking up all available resources.

      That was the first failing of NotProgrammingRelated.StackExchange ... later Programming.StackExchange ... later SoftwareEngineering.StackExchange.

      The fun things, while they were fun took way more moderation resources than was available. People would ask a fun question, get a good bit of rep - but then not help in curating those questions. "What is your favorite book" would get countless answers... and then people would keep posting the same answers rather than reading all of them themselves and voting to cause the "good" content to bubble up to the top.

      That's why TeX can have https://tex.stackexchange.com/questions/tagged/fun and MathOverflow can have https://mathoverflow.net/questions/tagged/soft-question and https://mathoverflow.net/questions/tagged/big-list -- there is a very high ratio for the active in moderation to active users.

      Stack Overflow kind of had this at its start... but over time the "what is acceptable moderation" was curtailed more and more - especially in the face of more and more questions that should be closed.

      While fun questions are fun... the "I have 30 minutes free before my next meeting want to help someone and see a good question" is something that became increasingly difficult. The "Keep all the questions" ideal made that harder and so fewer and fewer of the - lets call them "atwoodians" remained. From where I sit, that change in corporate policy was completely solidified when Jeff left.

      As moderation and curation restricted (changing the close reasons to more and more specific things - "it's not on that list, so you can't close it") meant that the content that was not as well thought out but did match the rules became more and more prevalent and overwhelmed the ability for the "spolskyites" to close since so many of the atwoodians have left.

      What remained where shells of rules that were the "truce" in the tension between the atwoodians and spolskyites and a few people trying to fight the oncoming tide of poorly asked questions with insufficient and neglected tooling.

      As the tide of questions went out and corporate realized that there was necessary moderation that wasn't happening because of the higher standards from the earlier days they tried to make it easier. The golden hammer of duplication was a powerful one - though misused in many cases. The "this question closes now because its poorly asked and similar to that other canonical one that works through the issue" was far easier than "close as {something}" that requires another four people to take note of it before the question gets an answer from the Fastest Gun in the West. Later the number of people needed was changed from needing five people to three, but by then there was tide was in retreat.

      Corporate, seeing things there were fewer questions being asked measured this as engagement - and has tried things to increase engagement rather than good questions. However, those "let's increase engagement" efforts were also done with even more of a moderation burden upon the community without the tooling to fix the problems or help the diminishing number of people who were participating in moderating and curating the content of the site.

      3 replies →

  • Dunno why you are being downvoted - there is a certain type of person who contributes virtually nothing on Wikipedia except peripheral things like categories. BrownHairedGirl was the most toxic person in Wikipedia but she was lauded by her minions - and yet she did virtually no content creation whatsoever. Yet made millions of edits!

> What do LLMs train off of now?

Perhaps they’ll rely on what was used by people who answered SO questions. So: official docs and maybe source code. Maybe even from experience too, i.e. from human feedback and human written code during agentic coding sessions.

> The fact that the LLM doesn't insult you is just the cherry on top.

Arguably it does insult even more, just by existing alone.