According to this Washington Post analysis, the forums on MacRumors are part of a Google-created dataset that is used to train AI products:
”…we analyzed Google’s C4 data set, a massive snapshot of the contents of 15 million websites that have been used to instruct some high-profile English-language AIs, called large language models, including Google’s T5 and Facebook’s LLaMA”
(forums.macrumors.com is listed under the sources for Technology, as the #4 site)
www.washingtonpost.com
So, I’d say anybody here who is highly concerned about privacy or does not want their future posts used to train AI’s should review how they use MacRumors’ forums.
”…we analyzed Google’s C4 data set, a massive snapshot of the contents of 15 million websites that have been used to instruct some high-profile English-language AIs, called large language models, including Google’s T5 and Facebook’s LLaMA”
(forums.macrumors.com is listed under the sources for Technology, as the #4 site)

Inside the secret list of websites that make AI like ChatGPT sound smart
An analysis of a chatbot data set by The Washington Post reveals the proprietary, personal, and often offensive websites that go into an AI’s training data.

So, I’d say anybody here who is highly concerned about privacy or does not want their future posts used to train AI’s should review how they use MacRumors’ forums.
Last edited: