OpenAI is accusing someone of taking their proprietary AI, and turning it into an…open AI.

They’re going to twist themselves into knots explaining why it’s ok for them to scrape everyone’s copyrighted data, but NOT ok for someone to scrape theirs.
Turning OpenAI's AI into open AI... brilliant! 🤣🤣🤣 Made me laugh way too hard... the irony hits so hard...
 
I downloaded the app and I was using it a bit last night. I went to fiddle with it this morning and it got brain freeze. Couldn’t even answer basic questions.

I smell a lot of hype. The short sellers probably made giant bags of money the past few days. They’ll throw those winnings back into the market and ride the rebound.

I think it’s a flash in the pan, much the same as AI in general. Until it’s smarter than the cat, it’s just a bunch of hype.

I didn't download it, but I checked it out on a coworker's phone. It correctly answers a question about IV medication administration that ChatGPT gets wrong. 🤷🏻‍♂️
 
You can load several versions of it in Private LLM. The Qwen-based model was clearly inferior to ChatGPT. The Llama 70B-based model was pretty good in my tests. It gave some interesting results on par with o1, though o1 was still a hair better in phrasing. It gave some interestingly bad results too. Whether the results are good or bad, the text it generates while pondering its answer is entertaining.

Here's the worst result DeepSeek-R1-Distill-Llama-70B gave me.
Q: define innuendo
A: “Innuendo” appears to be a misspelling or typo of the Spanish word “yendo,” which is the present participle of the verb “ir” (to go). In context, it would be correctly used as “yendo,” as in “Vámonos yendo” (We’re going).
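If you'd rather poke at the same weights outside the Private LLM app, here's a minimal sketch using Hugging Face transformers. The model id is DeepSeek's published distill repo; everything else (including having hardware that can hold a 70B model; the smaller Qwen distills like DeepSeek-R1-Distill-Qwen-7B are far easier to run) is an assumption on my part:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-R1-Distill-Llama-70B"  # published distill weights
tok = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" needs the accelerate package installed
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

# Same question that produced the bad answer above.
inputs = tok.apply_chat_template(
    [{"role": "user", "content": "define innuendo"}],
    add_generation_prompt=True, return_tensors="pt",
).to(model.device)
out = model.generate(inputs, max_new_tokens=512)
print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))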
 
Were you asking it about Tiananmen Square by chance? It definitely doesn’t want to answer any questions about that.
 
I've also seen cases in which DeepSeek pretended to be Claude, so I guess they didn't limit themselves to models from OpenAI. Anyway, is anybody surprised by that? The Chinese do what the Chinese do.
How can they copy Claude or OpenAI? Heck, they didn't even use labeled training data. It's sour grapes from OpenAI and Microsoft. What they are alleging is that DeepSeek queried OpenAI models for some training data. This opens a big legal problem for OpenAI: they can't accuse someone of ripping off the very thing they did to everyone else.
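For context on the "no labeled training data" point: DeepSeek's R1 paper describes reinforcement learning against rule-based, verifiable rewards (their GRPO method), where each sampled answer is scored relative to the other samples in its group rather than by human labels. A toy sketch of that group-relative scoring idea, with made-up helper names:

import statistics

def rule_reward(answer: str, reference: str) -> float:
    # Verifiable, rule-based reward: no human labeler, no learned reward model.
    return 1.0 if answer.strip() == reference.strip() else 0.0

def group_advantages(answers: list[str], reference: str) -> list[float]:
    # GRPO-style: normalize each sample's reward against its own group.
    rewards = [rule_reward(a, reference) for a in answers]
    mean = statistics.mean(rewards)
    std = statistics.pstdev(rewards) or 1.0  # guard against a uniform group
    return [(r - mean) / std for r in rewards]

# Four sampled answers to "What is 2 + 2?", graded against "4":
print(group_advantages(["4", "5", "4", "22"], "4"))  # [1.0, -1.0, 1.0, -1.0]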
 
I haven't used any AIs aside from what's built into the iPhone. But if it can't handle basic word definitions, how can I trust it for more complex things? It seems to me that any answer would need to be verified by other means.
 
Here is a former OpenAI scientist who was among the early founders/employees. Nvidia isn't doomed, but all those projections of needing hundreds of billions in infrastructure look way overhyped. We are in the punch-card era of AI, maybe the floppy-disk era. There's a long way to go in the AI race.

This is my biggest takeaway: DeepSeek figured out how to do it cheaper and more efficiently.

I expect other AI companies to revise their billions of dollars worth of infrastructure plans for the future.
 
Microsoft is one of the biggest losers. They were getting a stake in AI companies with cloud GPU credits. Microsoft didn't give money to OpenAI; they gave them cloud credits. And it turns out you don't need to spend billions on hardware.
 
It has done much better at math and coding; it's very helpful to use locally with Visual Studio Code. Much better than Meta's Llama models, which are open source and were trained with a lot more compute and cost.

This is just the beginning; the huge cost barrier to entry for training is starting to crumble.
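If anyone wants to try the same setup, one easy route is serving a distill through Ollama and hitting its local API from a script. A minimal sketch; the model tag assumes you've pulled deepseek-r1:70b, but the smaller distill tags work the same way:

import requests

# Ollama listens on localhost:11434 by default once the model is pulled.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "deepseek-r1:70b",
          "prompt": "Write a binary search over a sorted list in Python.",
          "stream": False},
)
print(resp.json()["response"])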
 
Actually, what DeepSeek did was far from copy-paste. They leveraged RL to cut through labeling and intensive training. In fact, one of the key founding scientists of OpenAI called it the beginning of a new direction.
OpenAI was touting how others couldn't train without a huge compute advantage and resources. DeepSeek showed you need a few million to train instead of billions.
We don't know if their cost claim is accurate. It could be, but it has to be independently verified before we believe it. We know the model works well because that has been independently verified; the cost has not been.

Also, it's possible it costs less because they might have taken a shortcut and built from OpenAI's work. Meaning, if they started their model from scratch more like what OpenAI did, it likely would have cost much more. This gives them the benefit of OpenAI's expenses and work without having to recreate it. I'm personally okay with that, just as I'm okay with OpenAI using all sources they did to train their models. I'm just stating that the direct costs are only part of the story.

Edit: Some discussion and estimates here that suggest the stated cost is likely accurate but only part of the total cost.
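For what it's worth, the alleged "shortcut" would look something like distillation: query the stronger model, save the prompt/response pairs, and fine-tune on them. A hypothetical sketch with the OpenAI Python client; the prompt list and output file are placeholders, and the actual fine-tuning step is omitted:

import json
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
prompts = ["define innuendo", "Explain RAID 5 in one paragraph."]

with open("distill_pairs.jsonl", "w") as f:
    for p in prompts:
        r = client.chat.completions.create(
            model="gpt-4o",
            messages=[{"role": "user", "content": p}],
        )
        pair = {"prompt": p, "response": r.choices[0].message.content}
        f.write(json.dumps(pair) + "\n")
# A student model would then be supervised-fine-tuned on distill_pairs.jsonl.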
 