Gork@sopuli.xyz to No Stupid Questions@lemmy.world · 11 days ago
Do LLM modelers maintain a list of manual corrections fed by humans?
Like the “how many r’s in strawberry” question. It took off as an Internet meme and was fixed, but how did that fix happen?
ACbHrhMJ@lemmy.world · 11 days ago
If the model does something undesirable or wrong, it is given the equivalent of a shock with a cattle prod: a penalty signal during further training. With repetition, this process reshapes the network's weights and the model avoids the ‘bad’ areas.
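To make the cattle prod idea a bit more concrete, here is a toy sketch in Python. It is not how any particular lab fixed the strawberry question, and every name and number in it is made up for illustration. It assumes a "model" that only chooses between two answers, with a hypothetical human rating of -1 for the wrong one and +1 for the right one, and it nudges the model's scores with a simple policy-gradient-style update.

```python
import math

def softmax(logits):
    """Turn raw scores into probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def train(logits, rewards, steps=200, lr=0.5):
    """Repeatedly nudge the scores toward higher expected reward.

    The gradient of expected reward with respect to score i is
    p_i * (r_i - baseline), where baseline is the current expected reward.
    """
    logits = list(logits)
    for _ in range(steps):
        probs = softmax(logits)
        baseline = sum(p * r for p, r in zip(probs, rewards))
        logits = [l + lr * p * (r - baseline)
                  for l, p, r in zip(logits, probs, rewards)]
    return logits

# Hypothetical setup: two candidate answers to "how many r's in strawberry?"
answers = ["2", "3"]
start_logits = [2.0, 0.0]   # assumed: the toy model starts out preferring "2"
rewards = [-1.0, 1.0]       # assumed feedback: "2" is penalized, "3" is rewarded

before = softmax(start_logits)
after = softmax(train(start_logits, rewards))

for answer, p_before, p_after in zip(answers, before, after):
    print(f"answer {answer}: {p_before:.3f} -> {p_after:.3f}")
```

Each pass is one small "shock": the penalized answer loses a little probability and the rewarded one gains it. A real LLM has billions of weights instead of two scores, but the repeated small push away from the bad output is the same basic idea.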