AI models trained on public information should be open sourced and publicly available.
Billionaire’s should not own this behind closed doors.
Silicon Valley is just a scam. They’ve made billions off selling your data, and selling targeted ads with your own data to be shoved back into your face. Social media has ruined society. Every “innovation” has been some way of stealing from you, whether it’s your data, your attention, or now, the entirety of documented humanity.
/end yelling at clouds
The USA wants a world where AI wants is given permission to consume all copyrighted content for free, but we are charged for access to scholarly papers.
Capitalists the moment the free market™️ no longer works for them: “I love state intervention!”
Real capitalism, over all time and everywhere, has almost never been free-market.
Good luck with that. DeepSeek has already been reverse engineered.
Isn’t DeepSeek open source? Is there a need to reverse engineer it?
“Open source” in ML is a really bad description for what it is. “Free binary with a bit of metadata” would be more accurate. The code used to create deepseek is not open source, nor is the training datasets. 99% of “open source” models are this way. The only interesting part of the open sourcing is the architecture used to run the models, as it lends a lot of insight into the training process, and allows for derivatives via post-training
It certainly is a lot more open source than OpenAI, that’s for sure.
Deepseek actually released a bunch of their infrastructure code, including the infamous tricks for making training and interference more efficient, a couple of weeks ago.
Yes, and no. Yes in that they’ve released the research papers, pretrained parameters and weights of the model itself. Which is more than I can say for “OpenAI.” But no in that it doesn’t include training data or other critical components. Luckily, they’ve shown how they did it which makes it easy for anyone else to reverse engineer the process. That’s what Altman is afraid of.
They released the major components of their training and interference infrastructure code a couple weeks ago.
Looks like they are panicking
“Daddy! China hurtin’ my feewings! Hewp daddy! China bein’ a big meanie!!!”