Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.
This is nascent technology that while cool and useful, it is barely becoming commercially viable and has many iterations to go before it is a stable product ready for prime time. Apple is not “behind” and it is to their credit they haven’t wasted the hundreds of billions on this while they could leave it to others to do so. Once the dust settles Apple can more comfortable apply the technology to their products instead of forcing it in there before it’s ready as many have.
DeepSeek is tremendously good news for Apple, as it implies significantly more powerful AI tools will be able to run locally on device, particularly given Apple Silicon's unified memory.
 
I have the same suspicion. It literally told me it was ChatGPT yesterday.


View attachment 2477049
That is because it was trained on some synthetic data generated by ChatGPT. They didn't steal openAI proprietary tech, they just used chatgpt to generate synthetic data they used to train their model which is against OpenAI's terms of service. Same way, It is against the terms of service of most websites to not use their data to train AI models but openAI does it anyway. Their video models are trained on data they stole from youtube creators too. Without data, these models cannot be trained.
 
  • Like
Reactions: Chuckeee
LOL. Half of this thread is people whining about the fact that AI companies use their data that they put on the web without permission. The other half is happy that an AI companies data that they put on the web gets stolen too.

I have thousands of pages of data on the web and I'm sure they got used for AI training too. It's unfair but I don't know what the big deal is as these data comprise a miniscule and likely irrelevant and likely redundant part of the value of an AI company. I put the data out there for everyone to see and use. So why would I be mad that somebody is using it? If the only value of the data is to get people to visit my website than my website and service seem to not have much inherent value. It's not that AI companies take the data as is and sell it. They use the data to train a model. The data was out there for people to use it.

Whatever the reasoning is, the ship has sailed 30 years ago when everybody thought it was ok to steal music and jpeg and movies from the web. The stealing of data from the web is now not only done by individuals, but by small and large businesses and its accepted by governments and societies. Will be almost impossible to change this worldwide.
 
That is because it was trained on synthetic data generated by ChatGPT. They didn't steal openAI proprietary tech, they just used chatgpt to generate synthetic data they used to train their model which is against OpenAI's terms of service. Same way, It is against the terms of service of most websites to not use their data to train AI models but openAI does it anyway. Their video models are trained on data they stole from youtube creators too.
This most likely has to do with fact they trained on GPT2 model which was open sourced anyways. Most of the LLm models use GPT2 or modified GPT2 architecture. The data from open AI was used for distillation.
 
Not so much a correction (I think you're correct here) but a recommendation: Computerphile has a short video about DeepSeek and why it's remarkable. It delves some into distillation and contrasts the OpenAI model with DS.
Thank you for that video. It was a good jumping off point for my (layman’s) further research.
 
Why not ? Should we profit only from natural intelligence?
Profit and greed have become the raison d'être of most societies.
We should not be naive about this.
No, we should not be naive, but profit and greed are also not a working long term approach for a society. We should want to change that approach.
 
Since when has there been any groundbreaking innovation coming from China? It is always copy and paste, which is seen culturally as some sort of recognition of the great work of the teacher. This concept doesn’t work in a global economy, though, for obvious reasons.
So why doesn't open AI post this evidence instead of saying they have evidence?
 
They got the first mover advantage, they followed the Zuck mantra of "move fast and break things" and now they're upset when someone else does it to them.
This reminds me of how, in the '00s, Facebook hypocritically threatened legal action against foreign copycat social networking websites when Facebook itself was a copycat of the social networking website HarvardConnection, which was invented by Divya Narendra, Cameron Winklevoss, and Tyler Winklevoss.
 
  • Like
Reactions: Green Valkyrie
DeepSeek is tremendously good news for Apple, as it implies significantly more powerful AI tools will be able to run locally on device, particularly given Apple Silicon's unified memory.
Also for Meta, and other open source LLM models. Bad for Open AI, Microsoft, and new AI czar. A gut check for Nvidia hype.
 
  • Like
Reactions: surferfb
Regardless of how they got there, it’s a fantastic development
I think the statement isn't accurate.
If they used "distillation"; which is using the output of ChatGPT or Llama or any other AI to train DeepSeek, then the resources they claim for training and the cost is a complete lie.

The aggregated cot is whatever was used to train the model they got output from + whatever development they had to pay for.

You can't claim it only took x amount of time and resources to train a model when you used the other model to feed yours.

I'm not saying right or wrong to use the other model to train, but don't claim you had this ground breaking efficiency in resources when you didn't.
 
  • Like
Reactions: Luke MacWalker
While there may be some reasons to question AI trainers use of copyrighted internet content to train their models, that is not at all the same thing as basically stealing an AI model, adding some additional code, and calling it something brand new and more powerful. That would be like buying a stock Ford Mustang, adding a faster fuel injector and saying "we've created a much faster car, for only (the cost of the new fuel injection system)." If this turns out to be true, it would explain why OpenAI spent hundreds of millions building its model more or less from scratch, and DeepThink was able to create a "better" model for 6 million.
Exactly

It's not the stealing, it's claiming it's your own work and then claiming you were able to do it more efficiently than others.
 
So how long until the last two seasons of “Person of Interest” becomes real life?
Which one is Finch’s Machine and which one is Samaritan?
 
That's not what I got when I asked it to compare China's political position economically and socially to the left/centre/right political spectrums.

That's quite a different question. So I'm not surprised you got an answer.
 
After benefitting from the billions and billions of work hours that went into creating the data they trained their models on for the cost of downloading it, they have the audacity to actually take their own patently absurd ToS seriously. I hope a court correctly invalidates those ToS, because they are literally impossible to meet. The entire internet is overflowing with chatGPT output, which can't be watermarked and is practically impossible to filter out. Nobody can train a model any longer without stumbling over chatGPT-generated text eventually, so yes, literally every model has some sense of chatGPT-style responses as the standard AI assistant tone. It's actually pretty hard to shake it off at this point.

I think the statement isn't accurate.
If they used "distillation"; which is using the output of ChatGPT or Llama or any other AI to train DeepSeek, then the resources they claim for training and the cost is a complete lie.

The aggregated cot is whatever was used to train the model they got output from + whatever development they had to pay for.

You can't claim it only took x amount of time and resources to train a model when you used the other model to feed yours.

I'm not saying right or wrong to use the other model to train, but don't claim you had this ground breaking efficiency in resources when you didn't.
They used chatGPT (or whatever other model) to generate data for fine-tuning, which is basically standard practice. Everybody does it, yet nobody manages to be as efficient as Deepseek. Also, the fine-tuning really doesn't make a big difference in performance, the magic sauce is usually in the pre-training, that's where the model gets smart. There is no incorrect accounting here.
 
  • Like
Reactions: Chuckeee
Not surprised. Chinese companies have a long history of being facsimiles. Reproducing things well for cheap not inventing anything new.
 
  • Like
Reactions: kamyk35


OpenAI says it has uncovered evidence that Chinese AI startup DeepSeek used its proprietary models to train a competing open-source model, potentially violating the company's terms of service.

deepseek-ai-app.jpg

The discovery centers around a technique called "distillation," where developers use outputs from larger AI models to train smaller ones. The practice is common in AI development, but OpenAI claims DeepSeek crossed a line by using it to build a rival model.

"The issue is when you take it out of the platform and are doing it to create your own model for your own purposes," a source close to OpenAI told the Financial Times.

DeepSeek's R1 reasoning model has attracted widespread attention in the tech industry for achieving comparable results to leading US models at a remarkably low cost. The company claims it spent just $5.6 million on development, which is a fraction of what companies like OpenAI and Google typically invest. The app this week reached the number one position on Apple's App Store free charts in multiple countries, including the US.

Asked about OpenAI's allegations in an interview with Fox News, White House AI czar David Sacks didn't mince his words.

"There's substantial evidence that what DeepSeek did here is they distilled knowledge out of OpenAI models, and I don't think OpenAI is very happy about this," he said.

The controversy has already had market implications. Nvidia saw its shares drop 17% on Monday, wiping a one-day record $589 billion off its market value, as investors questioned whether expensive AI hardware investments might be unnecessary if companies can achieve similar results with fewer resources.

According to Bloomberg, OpenAI and Microsoft reportedly investigated and blocked accounts in August for suspected terms of service violations, and they now believe these accounts were associated with DeepSeek. Both companies have declined to provide specific details about their evidence.


Article Link: OpenAI Alleges DeepSeek Used Its Models for AI Training
When will people realise, modern age Chinese aren't innovators, they blatantly copy (poorly) and steal everything
 
  • Disagree
Reactions: nnoble and Chuckeee
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.