
MacRumors

macrumors bot
Original poster


Apple researchers have released Pico-Banana-400K, a comprehensive dataset of 400,000 curated images that's been specifically designed to improve how AI systems edit photos based on text prompts.


The massive dataset aims to address what Apple describes as a gap in current AI image-editing training: while systems like GPT-4o can make impressive edits, the researchers say progress has been limited by a lack of adequate training data built from real photographs.

Pico-Banana-400K features images organized into 35 edit types across eight categories, from basic adjustments like color changes to complex transformations such as converting people into Pixar-style characters or LEGO figures. Each image went through Apple's AI-powered quality-control system, with Google's Gemini-2.5-Pro evaluating the results for instruction compliance and technical quality.
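
For concreteness, here's a minimal sketch of what judge-based quality filtering like this could look like. The call_gemini function, the score names, and the thresholds are all assumptions for illustration; Apple's actual prompt and scoring rubric aren't detailed here.

```python
# Minimal sketch of judge-model quality filtering (assumed interface).
from dataclasses import dataclass


@dataclass
class EditExample:
    source_image: str   # path or URL of the original photo
    edited_image: str   # path or URL of the candidate edit
    instruction: str    # the text prompt, e.g. "make the sky look stormy"


def call_gemini(example: EditExample) -> dict:
    """Hypothetical stand-in for a multimodal judge call (e.g. Gemini-2.5-Pro).
    Assumed to return scores in [0, 1] for how well the edit follows the
    instruction and how clean the result is technically."""
    raise NotImplementedError("wire up a real multimodal judge model here")


def passes_quality_control(example: EditExample,
                           min_compliance: float = 0.8,
                           min_technical: float = 0.8) -> bool:
    # Keep an example only if the judge approves on both axes.
    scores = call_gemini(example)
    return (scores["instruction_compliance"] >= min_compliance
            and scores["technical_quality"] >= min_technical)
```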

The dataset also includes three specialized subsets: 258,000 single-edit examples for basic training, 56,000 preference pairs comparing successful and failed edits, and 72,000 multi-turn sequences showing how images evolve through multiple consecutive edits.
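
To make those three subsets concrete, here's an illustrative sketch of the record shapes involved. Every field name here is an assumption; the published dataset's actual schema may differ.

```python
# Illustrative record schemas for the three subsets (field names assumed).
from dataclasses import dataclass


@dataclass
class SingleEdit:                      # ~258,000 examples
    source_image: str
    instruction: str
    edited_image: str


@dataclass
class PreferencePair:                  # ~56,000 examples
    source_image: str
    instruction: str
    preferred_edit: str                # the successful edit
    rejected_edit: str                 # the failed edit


@dataclass
class MultiTurnSequence:               # ~72,000 examples
    source_image: str
    turns: list[tuple[str, str]]       # (instruction, resulting image) per turn
```

The preference pairs are the shape you'd typically feed to preference-optimization training such as DPO, where a model learns to favor the successful edit over the failed one for the same instruction.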

Apple built the dataset using Google's Gemini-2.5-Flash-Image (aka Nano-Banana) editing model, which was released just a few months ago. However, Apple's research also revealed its limitations: while global style changes succeeded 93% of the time, precise tasks like relocating objects or editing text struggled badly, with success rates below 60%.
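
Per-edit-type figures like these are straightforward to tally from the judge's pass/fail verdicts. A small sketch with invented numbers that happen to reproduce the percentages above:

```python
# Tally success rates per edit type from (edit_type, passed) judge verdicts.
from collections import defaultdict


def success_rates(verdicts: list[tuple[str, bool]]) -> dict[str, float]:
    totals, passes = defaultdict(int), defaultdict(int)
    for edit_type, passed in verdicts:
        totals[edit_type] += 1
        passes[edit_type] += passed    # bool counts as 0 or 1
    return {t: passes[t] / totals[t] for t in totals}


# Invented verdicts for illustration only.
verdicts = ([("global_style", True)] * 93 + [("global_style", False)] * 7
            + [("object_relocation", True)] * 55 + [("object_relocation", False)] * 45)
print(success_rates(verdicts))
# {'global_style': 0.93, 'object_relocation': 0.55}
```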


Despite the limitations, researchers say their aim with Pico-Banana-400K is to establish "a robust foundation for training and benchmarking the next generation of text-guided image editing models." The complete dataset is freely available for non-commercial research use on GitHub, so developers can use it to train more capable image editing AI.

Article Link: Apple's New AI Dataset Aims to Improve Photo Editing Models
 
Anyone who has used any of the AI tools knows none of them are ready for Prime Time. None of them. That is why I always say it's just echo-chamber nonsense to claim Apple or anyone else is behind in this nascent, quickly evolving space. That ALL of them have been RUSHED to market without adequate testing goes without saying.

There is immense room for improvement from just the low-hanging fruit of better training data. And here we see Apple showing they certainly know that.
 
“…with Google's Gemini-2.5-Pro evaluating the results for instruction compliance and technical quality.”

Am I reading this wrong, or does this state that Apple is using a Google product to evaluate an Apple product, making it clear that the Google product is the inspiration and standard to which Apple is aspiring?
 
Anyone who has used any of the AI tools knows none of them are ready for Prime Time. None of them. That is why I always say it's just echo-chamber nonsense to claim Apple or anyone else is behind in this nascent, quickly evolving space. That ALL of them have been RUSHED to market without adequate testing goes without saying.

There is immense room for improvement from just the low-hanging fruit of better training data. And here we see Apple showing they certainly know that.
And as someone who uses AI tools every day for work, I can tell you that you could not be more incorrect.
 
The most commonly used feature for editing a photo has to be removing an object, and the Photos app does a decent job, sometimes. That's where the effort needs to go. Not adding objects. Not changing the season. Removing objects.
 
Why not use photos on user devices for maximum accuracy?

Because models are created from thousands of images that need to be curated and cropped first, and then the deep-learning training process is too intensive for even a MacBook Pro, let alone a phone. Even a Mac Studio M3 Ultra will struggle.
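
As a rough illustration of why on-device training is off the table: a back-of-envelope estimate of the memory needed just to fully fine-tune a mid-sized model with Adam. The 10B parameter count is an assumption, but the per-parameter byte ratios are the standard ones.

```python
# Back-of-envelope memory for full fine-tuning with Adam (assumed model size).
params = 10e9                      # assume a 10B-parameter model
bytes_weights = params * 2         # bf16 weights
bytes_grads = params * 2           # bf16 gradients
bytes_optimizer = params * 8       # Adam: fp32 momentum + fp32 variance
total_gb = (bytes_weights + bytes_grads + bytes_optimizer) / 1e9
print(f"~{total_gb:.0f} GB before activations")   # ~120 GB before activations
```

And that is before activation memory, and before raw compute throughput, which is where consumer hardware falls furthest behind datacenter GPUs.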
 
And as someone who uses AI tools every day for work, I can tell you that you could not be more incorrect.

He is mostly correct. Your standards are just much lower than his if you believe generative models meet the high-end requirements of advertising and VFX. You can use them for media production, but even at their best they do not achieve the high bar of actual photography, actual 3D modelling, or actual high-end VFX.

If they did achieve that high bar, you'd be paying $2,000 a month. There's no way a corporation will give you that level of high-end model for $200, which is what Google and OpenAI charge for the janky models (like Sora) they have now.
 
He is mostly correct. Your standards are just much lower than his if you believe generative models meet the high-end requirements of advertising and VFX. You can use them for media production, but even at their best they do not achieve the high bar of actual photography, actual 3D modelling, or actual high-end VFX.

If they did achieve that high bar, you'd be paying $2,000 a month. There's no way a corporation will give you that level of high-end model for $200, which is what Google and OpenAI charge for the janky models (like Sora) they have now.
That really depends on the context. If OP was talking about media generation or editing, then yeah, there are still issues there. But OP said "any of the AI tools" and there are tons that are already in production use outside of media generation.
 
Because models are created from thousands of images that need to be curated and cropped first, and then the deep-learning training process is too intensive for even a MacBook Pro, let alone a phone. Even a Mac Studio M3 Ultra will struggle.
No, I mean: why not submit user images (anonymized) as training data, with the training done on in-house machines?
 
Anyone who has used any of the AI tools knows none of them are ready for Prime Time. None of them. That is why I always say it's just echo-chamber nonsense to claim Apple or anyone else is behind in this nascent, quickly evolving space. That ALL of them have been RUSHED to market without adequate testing goes without saying.

There is immense room for improvement from just the low-hanging fruit of better training data. And here we see Apple showing they certainly know that.
100%

All of the LLMs are confidently wrong the whole time.
 
Anyone who has used any of the AI tools knows none of them are ready for Prime Time. None of them. That is why I always say it's just echo-chamber nonsense to claim Apple or anyone else is behind in this nascent, quickly evolving space. That ALL of them have been RUSHED to market without adequate testing goes without saying.

There is immense room for improvement from just the low-hanging fruit of better training data. And here we see Apple showing they certainly know that.
Of course there’s immense room for improvement; they are not perfect at all. That said, AI can replace 50% of the world’s office workforce today without hesitation.
 
It’s pretty clear AI will replace everyone sooner rather than later. I can’t believe there haven’t been massive protests yet.
 
Apple's released tools or the internal AI tools Apple is working with? No one here can answer your question.
Fair enough.
Of course there’s immense room for improvement; they are not perfect at all. That said, AI can replace 50% of the world’s office workforce today without hesitation.
I agree. Customer service suffers overall, but it certainly can. It’s “good enough” at lots of tasks.
 