The Cause of Grok’s Increasing Antisemitism? Apparently, Two Lines of Code (Update: One of the Lines of Code Was Removed)

KayLeadfoot@fedia.io · 20 hours ago

ToastedRavioli@midwest.social · 15 hours ago

Well, that’s just not true. I mean, LLMs really are not extremely complicated. At the end of the day it’s just algorithmic sorting of information.

So in practice any given flavor of LLM is basically like a librarian. Your librarian can be a well-adjusted human or an antisemitic nutjob, but so long as they sort information and can point it out to you, technically they are doing their job equally well. The real problem doesn’t begin until you’ve trained the librarian to recommend Mein Kampf when people ask for information about the water cycle or whatever.

Thorry84@feddit.nl · 14 hours ago

I think they meant people don’t know how these models work in practice. On a theoretical level they are well understood. But in practice they behave in a chaotic way (chaotic in the math sense of the word): a small change in the input can lead to wild swings in the output. So when people want to change the way the model acts by changing the system prompt, it’s basically impossible to say what change should be made to achieve the desired outcome. Often such a change doesn’t even exist, and only something that’s close enough is possible. So they have to resort to trial and error, tweaking things like the system prompt and seeing what happens.
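To make "chaotic in the math sense" concrete, here is a minimal sketch using the logistic map, a textbook chaotic system. It is only an analogy for sensitivity to small input changes, not a model of how an LLM or a system prompt works; the function name, starting value, and step count are all made up for illustration.

```python
def logistic_trajectory(x0, r=4.0, steps=60):
    """Iterate the logistic map x -> r * x * (1 - x), starting from x0."""
    xs = [x0]
    for _ in range(steps):
        xs.append(r * xs[-1] * (1.0 - xs[-1]))
    return xs

# Two starting points that differ by one part in ten billion.
a = logistic_trajectory(0.2)
b = logistic_trajectory(0.2 + 1e-10)

# Gap between the two trajectories at every step.
gaps = [abs(x - y) for x, y in zip(a, b)]

print("initial gap:", gaps[0])
print("largest gap:", max(gaps))
```

The two runs start out practically identical and end up unrelated, which is the sense in which you can’t predict the effect of a tiny tweak without just trying it.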

KayLeadfoot@fedia.io · 4 hours ago

^-- to my knowledge, this is accurate.

System prompts are the easy but wildly unpredictable way to change LLM output. We really can’t back-trace or debug that output, so we just guess at what impact the system prompt edits will have.

The Cause of Grok’s Increasing Antisemitism? Apparently, Two Lines of Code (Update: One of the Lines of Code Was Removed) - FuelArc News