研究人员使用 1000 个真实世界的事实核查提示测试了五个前沿人工智能模型。这些系统在 67% 的问题上未能达成共识。
报道来源 [1]
-
Researchers tested five frontier artificial intelligence models with 1,000 real-world fact-checking prompts. The systems failed to reach a consensus on 67 perce
Researchers tested five frontier artificial intelligence models with 1,000 real-world fact-checking prompts. The systems failed to reach a consensus on 67 percent of the queries, actively disagreeing with each other on the basic facts. # AI # TechNews # MachineLearning # Cyber ht…