So I finally had some time to look at the new ML frameworks and I think that Apple opening access to the foundational models is kind of a big deal that did not get enough attention. In a nutshell, what these APIs allow you to do is send prompts to the on-device LLM, with some nice features like boilerplate-less structured output and MCM-like functionality.
None of this is new of course — after all you could do this stuff for years with services like ChatGPT or locally installed LLMs (e.g. via llama.cpp). The difference is accessibility to developers. Previously, if I wanted to add some LLM-based features to my app, I'd either need an online connection with an API key (issues with security, privacy, extra costs, user experience), or I'd need to bundle a several-TB LLM with my app (almost never an option). But with this release I can just drop some LLM functionality in minutes and have it run entirely on device. Granted, these LLMs are not very good, but they are more than sufficient for many applications. I will certainly be playing around with them for automatic language annotation and similar research applications.
I think we are about to see a bunch of really interesting apps taking advantage of these features.
P.S. The way they implemented agent-based AI editing in Xcode is very cool as well.
None of this is new of course — after all you could do this stuff for years with services like ChatGPT or locally installed LLMs (e.g. via llama.cpp). The difference is accessibility to developers. Previously, if I wanted to add some LLM-based features to my app, I'd either need an online connection with an API key (issues with security, privacy, extra costs, user experience), or I'd need to bundle a several-TB LLM with my app (almost never an option). But with this release I can just drop some LLM functionality in minutes and have it run entirely on device. Granted, these LLMs are not very good, but they are more than sufficient for many applications. I will certainly be playing around with them for automatic language annotation and similar research applications.
I think we are about to see a bunch of really interesting apps taking advantage of these features.
P.S. The way they implemented agent-based AI editing in Xcode is very cool as well.