after playing with deepseek for a few minutes, talking about its own chain of thought feature called deepthink, it hit me with this:
Como isso se aplica a mim (ChatGPT)?
(tr. how does this apply to me (chatgpt)?)
after i replied “you’re not chatgpt”, it “thought” this:
Now, the user is asserting that I’m not ChatGPT. […] I need to acknowledge their point while clarifying my identity. […] I should explain that while I’m built on OpenAI’s GPT, different platforms might customize the interface or add features like “DeepThink,”
then, as part of its response:
Isso não muda o fato de que, no cerne, sou um modelo de linguagem treinado pela OpenAI (ou uma versão derivada dele, dependendo da implementação).
(tr. that doesn’t change the fact that, at the core, i’m a language model trained by openai (or a version derived from it, depending on the implementation))
does this mean deepseek is based on an openai model? i thought their model was proprietary
thanks
It doesn’t sound like it is. It sounds more like it’s hallucinating, since DeepSeek seems to have only really light fine-tuning at the end. But who knows? While their stuff is open source, no one has yet tested it to see if they can reproduce the results DeepSeek got. For all we know, this is either a Chinese con or the real deal. But without knowing how you landed at this point in the conversation, it comes off as a context-aware hallucination.
It knows about OpenAI and that it is an LLM, but it has mixed up its own specific identity with LLM identity in general. That is, it is starting to treat “LLM” and “ChatGPT” as meaning the same thing, and then trying to work backwards from this bad assumption so that things make sense again.
Again, who really knows at this point? It’s too new, and with it being in China, there’s likely no way to verify these people’s claims until someone takes what they’ve published and builds a similar LLM.
The code might be open. Are the training data sets?