The point isn’t to make it a lie detector but a factual-claim detector. You have a neural network that reads statements and says either “this thing is saying something factual” or “this is just an opinion/obvious joke/whatever,” and people grade its responses to train it. The AI never judges whether the claim is true; it just says “hey, this thing is making some sort of fact-related claim,” and then the warning applies no matter what.
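To make that concrete, here is a rough sketch of the idea. It is not a real system: it swaps the neural network for a tiny naive Bayes word-count classifier so it fits in a few lines, and the example statements, the labels (`claim` / `not_claim`), and the function names are all made up for illustration. The human-graded answers are the training data, and the warning is attached whenever the classifier says "claim", with no attempt to check whether the claim is true.

```python
import math
from collections import Counter

# Hypothetical human-graded examples: (statement, label a person assigned).
LABELED = [
    ("the vaccine was approved in 2020", "claim"),
    ("the city has two million residents", "claim"),
    ("unemployment rose by three percent last year", "claim"),
    ("i think this movie is boring", "not_claim"),
    ("lol my cat is basically my boss", "not_claim"),
    ("pineapple on pizza is the best", "not_claim"),
]


def tokenize(text):
    """Lowercase and split on whitespace (deliberately crude)."""
    return text.lower().split()


def train(examples):
    """Count words per label; this stands in for training the network."""
    word_counts = {}          # label -> Counter of words seen under that label
    label_counts = Counter()  # label -> number of graded examples
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        counts = word_counts.setdefault(label, Counter())
        for word in tokenize(text):
            counts[word] += 1
            vocab.add(word)
    return word_counts, label_counts, vocab


def classify(text, model):
    """Return the most likely label using naive Bayes with add-one smoothing."""
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in label_counts.items():
        score = math.log(n / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in tokenize(text):
            if word in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def annotate(text, model):
    """Attach the warning to anything flagged as a claim, true or not."""
    if classify(text, model) == "claim":
        return text + " [warning: contains a factual claim]"
    return text


model = train(LABELED)
print(annotate("the bridge was built in 1990", model))
print(annotate("i think pizza is overrated", model))
```

With this toy data the first statement gets the warning and the second does not. Note that nothing in `annotate` looks at whether the bridge was actually built in 1990; the only question asked is "is this a factual claim?".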