Apple releases new Xcode with autonomous coding support

RockRock8

Power User
Joined
Dec 6, 2025
Posts
196
Apple introduced a new version of Xcode with support of autonomous coding via transformer models.

Previous versions had support for basic requests via TM (through a chat interface). Now, they added more support to allow TM algorithms to produce code step-by-step in Xcode. This means it can access different components of Xcode and the OS to accomplish full or partial app development.

When using the new stuff, you get access to older models without signing in. If you sign in via one of these companies that produces models, you get access to newer models. If you do that and also edit the configuration files, you can get access to the newest models.

You can do it solely by chat interface or by uploading photos of what you want.

This is a development effort by both Apple and in coordination with those chatbot companies, led by Apple. What's very cool is the Apple-fied parts of it. For example, when you ask it to change something, it will produce code that does so, then it automatically takes pictures of the simulator and then evaluates if it successfully coded it. It's super cool, and very Apple-fied

But I created this post to criticize John Gruber, sorry. After awhile of writing nothing, the only thing he said was "it's super-duper interesting that Apple released this instead of at next wwdc," and from the context of his "work," I presume he is implying that Apple is somehow is scrambling or behind. But they're simply doing the Apple thing: announce when ready. Gruber released a hysterical rant online about Apple pre-announcing features, and then in response Apple took that feedback and course corrected. Now he's upset that they didn't pre-announce this feature at WWDC, instead waiting and developing it fully then dropping it for release 100%. Users have noticed how well it works. It's very "just works," aside from the usual "it just doesn't work" nature of TMs.

Finally, Apple vertically integrated this entire feature. Because of their work democratizing access to TMs, users can run trillion parameter models ON DEVICE and in full coordination with Xcode. This means you can use autonomous coding locally. Only Apple can deliver this: the "just works" support of models plus unique user-facing features like feedback loops of producing code and verification, the high performance of trillion parameter coding models locally, and unified memory and silicon allowing all of it to deliver all of this; all with a UI that is beautiful, dimensional, and futuristic -- Liquid Glass!

Codex and Opus are the 2 cloud models that they feature but you can connect Gemini or any cloud models, plus LOCAL models like K2.5.

Watch it in action from Apple as it navigates Apple documentation to properly support UI


From Rudrank, who is a popular member of the MLX community
 
My developer friends say this is quite good, and they seem worried about it.
 
My developer friends say this is quite good, and they seem worried about it.
I've not tried the Xcode integration specifically, but as it is just integrating these existing models into Xcode tooling I doubt it'll be much different to ClaudeCode, Cursor and whatnot.

My experience with those things so far is that they're very good at small simple things. Even small complex things. Potentially even large simple things. But large and complex things they fall apart completely with. And it rapidly devolves into "fix one thing break a million others".

People in general are horrible at predicting the future, but in my opinion, as a professional software engineer, these tools save about as much time on average as they waste. You can potentially get a 5-10% speedup of your work, but I doubt any jobs are realistically at risks in business that intend to continuing providing quality. It's also not like a non-developer will be able to sufficiently guide these things to success anyway.

Frankly I am sick and tired of the over-hyped nature of GenAI. If it were just sold honestly (and wasn't having catastrophic effects on the economy and concept of copyright, not to mention the environment) I think there'd be a lot less hate for it around. Instead of trying to talk about the idea of replacing all software developers in the world with product owners prompting their AI to "please make the app run faster" or whatever, we could actually have a discussion about when it is and isn't appropriate, and get rid of AI slop security-bug reports that are flooding the cURL developers to an extent where real security vulnerabilities may get lost in the noise.

I use the existing Predictive Code Completion model in Xcode. I do see a tiny tiny speedup from that I think. Although I also often have to remove all the erroneous stuff it hallucinated.
I also recently tried to use the more "sophisticated" chat integration to try and narrow down a memory leak. I tried for quite a while, and I should've spent that time more productively in Instruments. The model pointed me to 8 things that were absolutely not the cause, and 5 reasons the memory leak might be caused by (but all 5 reasons were complete bullshit). It also adds so much unnecessary fluff and the code is so horrendous from a performance perspective. I've seen ClaudeCode produce code with a hash map of values, and then iterating through all key value pairs going if (key == whatIAmLookingFor) { doThing(with: value) } - The point of a hash map is gone at that point and it's being treated like an array but without the contiguous memory and cache benefits of an array. It's absolutely maddening to see.

I too am scared. I am scared that my colleagues will use this stuff more when it's even more built in to Xcode, and that it will reduce the code quality, that I will get horrendous AI slop pull requests to deal with and that nobody will take ownership over their code.

Now all of that said; There's a certain set of needs that this sort of stuff will be perfectly equipped for.
Enthusiasts have long been empowered to use Automator on the Mac to automate their workflows without fully needing to learn a scripting language. If they wanted a middle path between it and programming there was Apple Script. Today there's the Shortcuts app. While these try and simplify "talking to the computer" they still have a barrier to entry and for an average person can seem intimidating or daunting to the point where automating common or repetitive tasks may just never happen. These sorts of tools are amazing for this kind of person. A power-user who isn't a professional programmer. Someone who may still have a need for, for example, a bash script that bulk re-encodes all their MOVs into MP4s, places them in a new folder and add a compressor to the audio track. Getting the AI to make such automations is absolutely perfect. It's a small self-contained tasks with a defined goal that isn't too complex but is something that can really empower power-users.

In general I think professional software developers are the least affected by genAI coding models. Neither positively or negatively.
Power-users who aren't software developers or at most dabble with it; They'll see huge wins here
 
not tried the Xcode integration specifically, but as it is just integrating these existing models into Xcode tooling I doubt it'll be much different to ClaudeCode, Cursor and whatnot.
Not to defend TMs, because I'm not, but this is an oversimplification that ignores reviews I've read of Xcode 26.3
The integration with access to the OS and tools, documentation, etc all in one place makes it function a lot easier for the user and for what it seems also easier for the TM. So with that in mind, I'd like to know what your thoughts are on it when you eventually try the new features in 26.3 (if you try it).

Also you guys seem to incorrectly think I'm an advocate of LLMs simply because I talk about what Apple is doing with them. You would be wrong on that. Please, encourage don't discourage discussion from other people simply because they speak positively and mention LLMs in the same comment. Just wanted to make my position on "AI" more clear if my comments weren't clear enough :)
 
Last edited:
Back
Top