← Back to context

Comment by nechuchelo

16 hours ago

This looks like a genuinely useful application of LLMs.

I wish more companies focused on how they can help humans instead of replacing us or squeezing us as hard as possible in the name of productivity.

I think we should reserve judgment until this lands in the hands of the people it helps.

My experience is limited to my elderly parents who have trouble seeing. With the text size Apple allows them to set it to, their phones are unreadable. Text runs off the screen in every app, 1st and 3rd party.

In their bill example, the user is told to confirm with the provider. Why not offer to call the number on the bill? Instead of telling them to use text detection, do it for them? Presumably Apple Intelligence would already have that capability. I’m afraid this will be a gimmick at best.

EDIT: Forgot to mention, the grip is good to see. Hopefully they don’t charge the apple tax on it.

  • Yeah, I used to use iOS with text one step above the default size, and text was often cut off.

    I have a problem with astigmatic halation that makes ‘dark mode’ difficult to read. Since iOS 26, multiple aspects of the system have been made dark only, contrary to the system setting. Writing text correctly should be the lowest of low-hanging fruit.

    I suspect this is more of a flashy ‘AI’ promotion rather than reflective of any real commitment.

    • I had to set macOS on high contrast to be able to differentiate ui elements at glance. But most electron-based apps do not get the hint or even provide a high contrast theme.

      1 reply →

  • It's prob why they chose a11y features. They have more pain, so they're willing to tolerate more growing pains. (And prob more motivated to provide feedback.)

This is what Apple does best.

They treat new industry advancements as technology, not products itself.

AI will be a feature to improve the customer experience, not the product itself.

I agree. There seems to be a lot of potential in this space (from my outsider view). I really hope that this issue from an earlier article (https://news.ycombinator.com/item?id=48178378) doesn't become common enough to make useful functionality like this a danger. Seems unlikely in the short term but as use cases grow, so might the bad actors.

Its with their servers right? Do they trust a iPhone with their life? Or they are trusting their data center?

  • Looks like some of the features might use on-device models. They mention subtitle generation works on-device.

Let's be honest, compare the amount of money a corporation can make helping visually impaired people to the amount of money they can make replacing software developers and financial analysts.

Don't get me wrong, Apple using these technologies to help humans who are in need of help is laudable. But let's not pretend we don't know why most corporations don't look into this kind of thing. I think if we're being honest, we all very much know why they leave this sort of thing to the always nebulous "others".

  • Tim Cook has been pretty clear where he stands:

    > “When we work on making our devices accessible by the blind,” he said, “I don’t consider the bloody ROI.” It was the same thing for environmental issues, worker safety, and other areas that don’t have an immediate profit. The company does “a lot of things for reasons besides profit motive. We want to leave the world better than we found it.”

    https://www.forbes.com/sites/stevedenning/2014/03/07/why-tim...

    • Again, it's absolutely great that Apple does these things!

      I was just answering the question of why other corporations don't.

      Money.

      There's relatively little money in helping the visually impaired. You have to do it because you want to do it. Not because you're going to get rich.

      3 replies →

  • >But let's not pretend we don't know why most corporations don't look into this kind of thing.

    I assume almost everyone looks into spending less money than more money for equivalent goods and services.

> help humans instead of replacing us or squeezing us as hard as possible in the name of productivity

Increasing their productivity is helping humans.

Aren't the LLM-based features of this announcement catch-up features? Describing the contents of the screen is something Gemini has been doing on Pixel phones for a while. It's a fairly obvious use case for a multimodal AI.

My one hope is that this eventually becomes widespread enough to stop alt text scolds.

"looks like" there are a lot of automated accessibility systems that fall woefully short in practical use

this sort of thing really needs input from someone that uses it before we can judge it