deepseek's model claims to be chatgpt during conversation. what does this mean?

beleza pura@lemmy.eco.br · 4 days ago

deepseek's model claims to be chatgpt during conversation. what does this mean?

IHeartBadCode@fedia.io · 4 days ago

this means deepseek is based on an openai model?

It doesn’t sound like it is. It sounds more like it’s hallucinating which DeepSeeks has a really light end fine-tuning. But who knows? While their stuff is Open Source, no one has yet to test it and see if they can reproduce the results DeepSeek got. For all we know this is just a Chinese con or the real deal. But not knowing how you landed into this point of the conversation it comes off as a context aware hallucination.

It knows about openai and it being a LLM but it’s mixed up self identity in specific with identity in general. That is it is start to confuse LLMs and ChatGPT as meaning the same thing and then trying to wire back this bad assumption to make sense again.

Again, who really knows at this point? It’s too new and it being in China, there’s likely no way to verify these people’s claims until someone can take what they’ve published and made a similar LLM.

NaibofTabr@infosec.pub · 4 days ago

The code might be open. Are the training data sets?