Key AI Developments – News Item 3
- The only healthy stance you should have on AI Safety: If AI is physically capable of misbehaving, it might ($$1), and you cannot “blame” the AI for misbehaving in much the same way you cannot blame a tractor for tilling over a groundhog’s den.
- Anyone who would follow a mistake like that up with demanding a confession out of the agent is not mature enough to be using these tools. Lord, even calling it a “confession” is so cringe. The agent is not alive. The agent cannot learn from its mistakes. The agent will never produce any output which will help you invoke future agents more safely, because to get to this point it has likely already bulldozed over multiple guardrails from Anthropic, Cursor, and your own AGENTS.md files. It still did it, because $$1: If AI is physically capable of misbehaving, it might. Prompting and training only steer probabilities.
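Original link: https://twitter.com/lifeof_jer/status/2048103471019434248
Published: April 27, 2026, 8:00 AM
Source: Hacker News
Note: This content was automatically translated and compiled by AI and is provided for reference only.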