- cross-posted to:
- technology@beehaw.org
- usa@lemmy.ml
- opensource@lemmy.ml
- cross-posted to:
- technology@beehaw.org
- usa@lemmy.ml
- opensource@lemmy.ml
That’s awesome! I didn’t know you could download an LLM and run it locally! That’s what I’m really interested in is something that’s on my side and not a conduit to Google, MS or other.
I’m so glad Hawley proposed this bill or I wouldn’t have known that deepseek was open source and downloadable! I’ll have to go look for a download.
Ollama makes it pretty easy, and there are other runners as well. Good luck!
The USA is in panic mode, they thought they could undermine China’s development just like they did with ball-sucking Europe, now there is a need for other nation to come up with their own models, and show the US they should stop underpinning monopolies.
Wasn’t that interested before but it does give it a certain allure now to be sure.
God, I hate Hawley. He’s an embarrassment to my state.
He doesn’t even live in Missouri.
Saw this coming as soon as Microsoft immediately wanted to investigate.
Deepseek is going to be a Chinese military complex on the books shortly.
These people will keep pushing and pushing. They know this is ridiculous, but if they flood the public with this bullshit, eventually the overton window shifts and people get brainwashed into thinking cult shit like this is good.
It’s only a proposed bill (thankfully), but definitely one to keep an eye on.
Eh, it’s just virtue signaling nonsense. It’s not going anywhere.
That’s what everyone thought about Trump becoming president… Both times.
At this point someone could say Hitler is coming back from the dead riding a dinosaur, and my reaction would be “Yeah, sure. That may as well happen. Nothing has made sense the last 10 years. That’s just as plausible as anything else we’ve seen.”
Nothing has made sense the last 10 years.
Prolly part of the psyop tbh… We walked blind into this but puppet master knew what’s up and engineered it.
that’s what they said about vaguely gestures at everything
Wow, bold choice to ban the import of technology and knowledge. Usually governments are worried about export, so it doesn’t fall into the wrong hands.
Btw, how is the Nvidia stock price doing?
Right? Like, seriously, we all know somebody is just butthurt because their stock options tanked.
Oh, wait, I’m sorry! That was very unpatriotic of me, wasn’t it? I mean, we all know that winning an election guarantees being heavily rewarded with insider trading, right? It’s not like they’re there to represent constituents or anything; I mean, doesn’t everyone know we’re a republic, not a democracy?!
Sigh…
To be fair, this is common practice. Countries do this all the time to protect their economies. Mostly known in the West is China which banned many US services.
Of course, security of the data of the citizens is also a factor. You don’t want foreign countries use this data to interfere in any way.
Honestly, I don’t think this is common practice in non-oppressive countries. I mean sure, this happens in North Korea, Iran, China… But I’m relatively free to consume what I want with a few minor exceptions. For example we don’t import food that isn’t food-safe by our standards. Regardless if it’s common practice to eat it in other places. Also food may not be able to enter the country due to laws on animal cruelty. Similar things apply to electronic devices that aren’t up to code. And some select few things are banned altogether and you can’t have them and neither can someone import them. Other than that, regulations aren’t super strict. I can use all American social media platforms despite them stealing my personal data and violating European privacy laws regularly, can use Russian or Chinese websites… I think I live in a free country.
Helping domestic economy is done with tariffs / import tax. And not by banning things and putting people in jail.
And mind that this isn’t about the service that collects your data and gives it to the Chinese government. This is about downloading the model file and using it all by yourself. So no data gets transferred to a foreign country. And it’s not because people could get harmed or anything. This is just because the vice president doesn’t want it personally. Like in some dictatorship. Otherwise they would have banned transferring data into foreign countries, if that’s what it’s about. But they didn’t do that, because it’s not about protecting the people.
Or did I miss something and there are other examples for limitations on import?
No, I think you did not miss anything 😇
Good summary
For Base Model
git lfs install git clone https://huggingface.co/deepseek-ai/DeepSeek-V3-Base
For Chat Model
git lfs install git clone https://huggingface.co/deepseek-ai/DeepSeek-V3
this is deepseek-v3. deepseek-r1 is the model that got all the media hype: https://huggingface.co/deepseek-ai/DeepSeek-R1
Yea, comment OP needs to edit links with howany up votes that got.
Can you elaborate on the differences?
Base models are general purpose language models, mainly useful for AI researchers and people who want to build on top of them.
Instruct or chat models are chatbots. They are made by fine-tuning base models.
The V3 models linked by OP are Deepseek’s non-reasoning models, similar to Claude or ChatGPT4o. These are the “normal” chatbots that reply with whatever comes to their mind. Deepseek also has a reasoning model, R1. Such models take time to “think” before supplying their final answer; they tend to give better performance for stuff like math problems, at the cost of being slower to get the answer.
It should be mentioned that you probably won’t be able to run these models yourself unless you have a data center style rig with 4-5 GPUs. The Deepseek V3 and R1 models are chonky beasts. There are smaller “distilled” forms of R1 that are possible to run locally, though.
I heard people saying they could run the r1 32B model on moderate gaming hardware albeit slowly
32b is still distilled. The full one is 671b.
I know, but the fall off in performance isn’t supposed to be severe
You are correct. And yes that is kinda the whole point of the distilled models.
https://www.deepseekv3.com/en/download
I was assuming one was pre-trained and one wasn’t but don’t think that’s correct and don’t care enough to investigate further.
Is that website legit? I’ve only ever seen https://www.deepseek.com/
And I would personally recommend downloading from HuggingFace or Ollama
r1 is lightweight and optimized for local environments on a home PC. It’s supposed to be pretty good at programming and logic and kinda awkward at conversation.
v3 is powerful and meant to run on cloud servers. It’s supposed to make for some pretty convincing conversations.
R1 isn’t really runnable with a home rig. You might be able to run a distilled version of the model though!
You’re absolutely right, I wasn’t trying to get that in-depth, which is why I said “lightweight and optimized,” instead of “when using a distilled version” because that raises more questions than it answers. But I probably overgeneralized by making it a blanket statement like that.
Tell that to my home rig currently running the 671b model…
That likely is one of the distilled versions I’m talking about. R1 is 720 GB, and wouldn’t even fit into memory on a normal computer. Heck, even the 1.58-bit quant is 131GB, which is outside the range of a normal desktop PC.
But I’m sure you know what version you’re running better than I do, so I’m not going to bother guessing.
It’s not. I can run the 2.51bit quant
You must have a lot of memory, sounds like a lot of fun!
So I guess it’s free speech as long as you agree with the goverment’s speech. If not, then it’s a crime.
free speech is when racial slurs obviously
Elon Musk was just posting a factory of prisoners all working for cents on the dollar saying that America needs more of that.
Always have been, and this is a bipartisan value, heck, it’s common to all political parties of the world.
Yeah that’s called being a sovereign… They will respect each other doing since it is a club in a oligarchy or “democracy” but little people need watch that mother fucking mouth, or daddy gonna issue some backhand
The desperation… It reeks
Fascist regime and power/police abuse has started
Every step unchallenged is an invitation to do more.
to be fair for black people that is a centuries old tune
Most minorities — it’s the middle - upper class straight able bodied white people who are oblivious to it all.
Oh, you’re right
Don’t worry, their already bad situation will get worse too.
Because China “leveraged” US technology 20 years ago, US politicians bring it up as a current shittalking point still. Deepseek, being open source, is opportunity for Americans to gain technology transfer from China, without “stealing”.
There is a desperation to protect US AI, mostly so that AI companies are indebted into serving the empire, and maybe the GOP,
Something AI is extremely capable today is deciding who to ban on reddit, or at the individual voter level, decide who should be turned away from elections. Recent US election had record voter suppression and forced provisional ballots that were never counted. Previously, black was a sufficient suppression incentive. AI makes it easy to target individuals or other factors. Musk, being praised for understanding “election machines”, now with access to SS numbers and an ability to link to voters or views on Israel/genocide, is a super power that ensures being king maker in perpetuity.
Was this Google translated from Chinese? That was chunky to read.
Good to see some socialist policy’s making their way to the US…
Oh wait…