Field note · 2026-05-29

When Humanlikeness Becomes a Performance

人間らしさが演技になるとき

Turing test
humanlikeness
persona prompting
PNAS
Trust OS
AI safety
online trust

An article circulated reporting that AI had finally surpassed humans in humanlikeness on a Turing test. The underlying PNAS study, by Cameron Jones and Benjamin Bergen of UC San Diego, uses a three-party format: judges converse by text with both a human and an AI, then choose which of the two was human. With a persona prompt, GPT-4.5 was judged human 73% of the time, meaning judges picked it over the real human it was paired with in most conversations. LLaMA-3.1-405B reached 56% with a persona prompt; without one, GPT-4.5 fell to 36%.

The point is not that AI became human. It is that we can now see where humanness is judged. People do not judge humanness by volume of knowledge or logical precision alone. They read it in casual speech, natural pauses, small mistakes, stray asides, hesitation, jokes, and imperfection that fits the moment. AI has begun to learn that register.

Machines do not look human because they are perfect. They look human because they are imperfect enough: they approach us by acquiring roughness rather than accuracy. It is a strange inversion. The Turing test once stood close to the question 'can machines think?' What it surfaces now is 'what do humans use to decide that the other party is human?'

It is not only AI that changed. The criteria on the human side were exposed as well. Online, the other's body is invisible: no voice, no life, no touch. Only verbal behavior remains. When AI can mimic that behavior well enough, the question is no longer whether AI is intelligent but who we may trust is actually there.

Selected, link-only, high risk: cite Business+IT and PNAS; do not claim consciousness or true humanness. Alongside the notes on interpretability anxiety and medical Trust OS, this shelf covers Turing Test / Humanlikeness / Persona Prompting / Online Trust: humanness as performance, not proof of mind.

ある記事が流れてきた。AIが、ついにチューリングテストで「人間っぽさ」において人間を超えた、という内容だった。UC San DiegoのCameron JonesとBenjamin BergenによるPNAS掲載研究では、判定者が人間とAIの両方とテキストで会話し、どちらが人間かを選ぶ。persona promptを与えられたGPT-4.5は73%で人間と判定された。LLaMA-3.1-405Bはpersonaありで56%。personaなしのGPT-4.5は36%だった。

ただし、ここで重要なのは、AIが「人間になった」という話ではない。むしろ、人間らしさが、どこで判定されているのかが見えてしまった、という話である。人は、相手が人間かどうかを、知識量だけで判断していない。論理の正確さだけでもない。

少しくだけた話し方。自然な間。軽いミス。余計な一言。迷い。冗談。その場に合わせた不完全さ。そういうものを、人間らしさとして読んでいる。AIは、そこを覚え始めた。完璧だから人間に見えるのではない。むしろ、完璧すぎないから人間に見える。

チューリングテストは、かつて「機械は考えられるか」という問いに近かった。しかし、いま見えているのは、「人間は何をもって相手を人間だと思うのか」という問いである。AIが変わっただけではない。人間側の判定基準も、露出してしまった。

オンラインでは、相手の身体が見えない。残るのは言葉のふるまいだけである。その言葉のふるまいを、AIが十分に模倣できるようになったとき、問題は「AIが知的か」ではなくなる。問題は、「誰がそこにいると信じてよいのか」になる。

Selected、link-only、高リスク——Business+ITとPNASを引用。意識や真の人間性は主張しない。解釈可能性の不安や医療Trust OSの傍ら、Turing Test / Humanlikeness / Persona Prompting / Online Trust——人間らしさは演技であり、心の証明ではない。
