<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>AI Security on Non-Functional Blog</title><link>https://non-functional.net/tags/ai-security/</link><description>Recent content in AI Security on Non-Functional Blog</description><generator>Hugo</generator><language>en</language><lastBuildDate>Mon, 20 Apr 2026 18:56:03 +0100</lastBuildDate><atom:link href="https://non-functional.net/tags/ai-security/index.xml" rel="self" type="application/rss+xml"/><item><title>Opus 4.7 Model Card and Mythos Preview</title><link>https://non-functional.net/posts/2026-04-20-opus-4-7-model-card-and-zvi/</link><pubDate>Mon, 20 Apr 2026 18:56:03 +0100</pubDate><guid>https://non-functional.net/posts/2026-04-20-opus-4-7-model-card-and-zvi/</guid><description>&lt;p&gt;I strongly suspect that Zvi doesn&amp;rsquo;t need more inbound links, but his latest
&lt;a href="https://thezvi.substack.com/p/opus-47-part-1-the-model-card" target="_blank" rel="noreferrer"&gt;model card assessment&lt;/a&gt;
(which is, as usual, very well written) has a couple of notable quotes:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;So yeah, none of that sounds great. It all sounds like the types of thing that, if you caught a human doing them even once, that would be a very bad sign, and in several cases you would obviously have to fire them.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Check out the examples of Mythos Preview attempting (and in some cases succeeding, only to be caught by the human at the last moment) to escape containment.&lt;/p&gt;</description></item></channel></rss>