Comment by Grimblewald

8 days ago

I wear a few hats, but as a chemist and I'm not happy with fable. As a statistician I'm not happy with fable. As a data scientist I am not happy with fable. As an academic and a researcher I am not happy with fable. It's useless. I'd be surprised if anyone can get any output from it that couldn't easily be replaced with a search from wikipedia. Given how verbose claude models have become, wiki articles are probably less verbose too, and the tok/s is unmatched for a wiki article pull.

I work on software that talks to mass spectrometers and it consistently refuses to refactor even an input file parser, presumably because it can infer it’s related to biology? Useless indeed.

  • I was reverse engineering a medical device, and had to do a lot of trickery to get Opus 4.5 - not even Fable/Mythos, Opus - not to trip up its fucking CBRN filter.

    What happened with Fable is basically what I feared when they announced those restrictions. They took the shitty Opus CBRN filter and made it even worse.

    I pity the fools trying to use Anthropic AIs for anything biotech.

    • Opus has been fine on proteomics and bioinformatics for me. I have never seen a Claude model refuse on such grounds before in the past.

      Claude is still the best IMO, but it feels like its most frustrating and grating aspects are not down to the model’s abilities, but the increasingly heavy hand of Anthropic expressing itself within the model. Fable’s comically useless responses almost seem like a cynical marketing tweak.

      “This model is so powerful we basically can’t let it do anything. How terrifying! We need more money to make it stronger. Now do you see why we should be the ones who write the regulations? We’re the Good Guy AI Company Who Will Never Ever Ever Be Unethical after all.”

      As this entity gains more ground, their models become increasingly annoying to use and their little act becomes more transparent. The whole “I’m-just a befuddled ethically-minded AI researcher who is perturbed by the power that I unwittingly discovered and I must warn the world” thing? Yeah fuck off. Your twee pandering to naïve nerds and cynical technocrats is nauseating and ordinary people can smell it a mile away. Completely repellent leadership who put up red flags to anyone left with a working ability to read between the lines of both spoken language and body language. The tech company equivalent of a sex predator who plays as the nice guy. Gross.

      Nobody likes these companies and their models are annoying, but we’re going to put up with playing middle manager to these obnoxious programs because our jobs depend on it now, and these products are still the best on the market.

      A breakthrough in tools that facilitate user-owned models and infrastructure is desperately needed for the sake of our dignity and sanity, if nothing else.

      1 reply →

    • The filters are really bad.

      Yesterday Fable rejected commenting on poetry because it had anatomy lines like:

      got anotha round of acetylcholine from da boss.

"the tok/s is unmatched for a wiki article pull." This is absolutely wonderful, thank you for making my day!

> Given how verbose claude models have become, wiki articles are probably less verbose too

Telling models to respond in the style of Wikipedia is one of the best ways to make their output bearable in my experience (for chat models, not agents)

I’ve been working on a rather complex mapping project and have been getting MUCH better results with Fable than Opus.

  • So as not to be vague, and since I just pushed a version I'm starting to be vaguely happy with...

    https://tylereaves.github.io/uk-rail-map/

    This is the result of probably a few hundred round trips. The really interesting part of the problem is keeping it both relatively true to real geometry, while greatly exaggerating it horizontally so you can actually see the individual running lines/sidings, like a signaling schematic.

    • I love computational mapping projects, because there is this hard problem of which towns to show on the map.

      Your Scotland map shows towns without rail (although some had rail previously, like Callander, Aberfeldy), it prefers insignificant (population-wise) places while ignoring the larger cities next to it (Scone instead of Perth, Bannockburn instead of Stirling, Inverness is missing, Dundee is missing, Aberdeen is missing). All these places are drawn on the map, but not labelled.

      All this clearly shows to me how bad it is. Yes it makes it look pretty, but given your task, I would have expected to give you meaningful map labelling.

      Something basic like this would get you a long way:

          0. cluster population centers into commonly known cities (i.e. show London instead of Islington or Walhamstrow)
          1. display names of the top 10 population centers in the UK
          2. display towns with stations (if crowded prioritize termination points and junctions, and prioritize larger places over smaller places)
      

      Having said that, its pretty cool to see the new and old network when zoomed in (assuming that it is half-way correct)

      1 reply →

To make the discussion constructive, can you give specific reasons (ideally with examples) about why it is so useless for you? How exactly are you using it that you think any output from it can easily be replaced with a Wikipedia search?

  • The cybersecurity and bioweapons filters reach so far that they set in as soon as the model even glazes anything STEM-related. It might give a good impression of ones ex or write a decent fanfiction but anything that could bring humanity forward is strictly off-limits.

    • The filter is not simply a bioweapons filter: the model card seems to say that the filter triggers on anything related to biology or chemistry.

  • Am I being paid to do anthropic's work for it? See my comment history for some examples in another thread, but generally I see no reason to catalogue this for a model Ive seen no evidence of being worth the effort. I'm overworked as it is, doing this for no reason isnt something I can justify.

    The successes I have had with the model were strictly worse than output from deepseek v4 pro on the exact same task.

>I'd be surprised if anyone can get any output from it that couldn't easily be replaced with a search from wikipedia.

I dont understand. This is just hyperbole right? The outputs are basically infinite and wikipedia most certainly isnt infinite.

  • The decimals of 1/3 are infinite as well and they don't contain a better-than-wikipedia article.

    And even if they did, it would be useless if it's buried in useless data and your chances or pulling it are effectively zero.

    This is regardless of the general discussion, just pointing that your argument isn't solid.

    • Sorry but that’s not the claim. The claim is wikipedia can return the same information. Please find me a migration script given my current db schema and new target schema.

      The claim is absurd.