Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

An LLM is just computer function that predicts next word based on the input you give it. It doesn't make any difference what the input is (e.g. please respond in style X) - the function doesn't change, and the statistical signature of how it works will still be there.

If you don't believe me, try it for yourself. Ask an AI to generate some text and give it to the AI detector below (paste your text, then click on scan). Now ask the AI to generate in a different style and see if it causes the detector to fail.

https://app.gptzero.me/



I can't use that linked app, paywall immediately. Unlike the person you were replying to here[0], I do not claim that this is impossible:

LLM is indeed just computer function that does stats. And our brains are just electro-chemistry that does stats. This is why stylometric analysis of human writing is a thing.

My previous experience with things such as you have linked to, is they used to be quite poor. I assume they're better since then, but then again so are the models.

[0] https://news.ycombinator.com/item?id=47778171


> I assume they're better since then, but then again so are the models.

Yes, but "better" means different things for each of these.

Detectors are trying to get better at distinguishing human from LLM-generated text.

LLMs are being improved to generate more useful (and benchmark maxxing) outputs, not to attempt to avoid detection.

LLMs are in fact explicitly trained to be as predictable as possible. The training goal is to minimize continuation prediction errors, which means they are in effect being trained to generate output where each word can be predicted by what came before it (which we can contrast to a human who tries to spice it up and keep it interesting by not being too predictable!).

RL post-training, which is especially used for computer code and math, is going to change this word-by-word predictability (detectability) a bit since the focus is now on a longer term goal rather than next word, but to some extent you could also view it as just steering/narrowing the output of the model towards that goal, not totally overriding the next-word statistics.

I don't know if there are AI detectors specifically trained to detect AI code rather than prose, but I'd expect that is more difficult to do, both because of the RL factor, and because computer code is so predictable in the first place - adhering to rigid syntax etc.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: