<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Ken’s Substack]]></title><description><![CDATA[Thoughts on strategy, psychology and AI]]></description><link>https://www.kennethpayne.uk</link><image><url>https://substackcdn.com/image/fetch/$s_!x9iX!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6244ba6-b09e-412f-bd3a-0a6c898c34bb_1170x1170.png</url><title>Ken’s Substack</title><link>https://www.kennethpayne.uk</link></image><generator>Substack</generator><lastBuildDate>Fri, 03 Jul 2026 12:32:12 GMT</lastBuildDate><atom:link href="https://www.kennethpayne.uk/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Kenneth Payne]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[kpayne@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[kpayne@substack.com]]></itunes:email><itunes:name><![CDATA[Kenneth Payne]]></itunes:name></itunes:owner><itunes:author><![CDATA[Kenneth Payne]]></itunes:author><googleplay:owner><![CDATA[kpayne@substack.com]]></googleplay:owner><googleplay:email><![CDATA[kpayne@substack.com]]></googleplay:email><googleplay:author><![CDATA[Kenneth Payne]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Beg, borrow and steal]]></title><description><![CDATA[A British strategy for the post-Mythos world]]></description><link>https://www.kennethpayne.uk/p/beg-borrow-and-steal</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/beg-borrow-and-steal</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Fri, 26 Jun 2026 09:30:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!qiEg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A very long time ago in AI history (last week in human time), I argued that the UK is in the soup because we lack access to Mythos, with no guarantee that the American government will give it to us. &#8216;Tough shit&#8217; as one very well-informed American insider told me subsequently. How important are we in the discussions that go on in the White House about all this, I asked my new friend. &#8216;Not at all&#8217;. Time to sober up.</p><p>But not for nothing does it say &#8216;strategy&#8217; in my job title. There are a few options for the UK government, and we should explore some of them, quickly. Read on:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qiEg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qiEg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!qiEg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!qiEg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!qiEg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qiEg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qiEg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!qiEg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!qiEg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!qiEg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59cd7ddb-3e92-4aee-b07a-50c69293c5ef_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">We need it, and it must have a union flag on it</figcaption></figure></div><p>As I see it, we have a few options, not necessarily mutually exclusive. These are:</p><ul><li><p><strong>Embed ourselves in the American stack</strong>. Ryan Fedasiuk <a href="https://substack.com/home/post/p-188572993">argued this a few months ago</a>. There are some things the UK and EU can do that will make us useful allies and partners for America. ASML and their top-notch chip carving machinery is the cleanest example,  but we also do well at other things, like wind turbines. As for the UK, well, we&#8217;ve the vaunted AI Security Institute (AISI), world leaders in assessing the performance of these models. </p><p><br>My verdict: Maybe. I like the theory of comparative advantage as much as the next economist. But alas I don&#8217;t think our USP is very U. AISI in particular strikes me as a new version of the complacent British habit of aspiring to be Athens to America&#8217;s Rome. With our unique wisdom, we hope to overcome the harsh fact, so apparent to my American interlocutor, that we don&#8217;t make models.  Still, the weak do what they must, don&#8217;t they?<br></p></li><li><p><strong>Build it in a coalition of middle powers. </strong>Let&#8217;s call this Project Carney, after the Canadian premier who issued the call to arms earlier this year. If you add up non-US NATO, EU countries, non-US 5-eyes, and throw in a few odds &#8217;n&#8217; ends, like Singapore, you ought to have the economic clout to build a proper frontier model. There&#8217;s just no way of getting there on our own. Mistral, the French effort to keep pace is at least a couple of tiers behind, and their total capital raised is more or less what a frontier company spends in a quarter. Collaboration is the only way to get there - and needs must.</p><p><br>My verdict: Fat chance. This is the mother of all coordination problems, and the record of national security cooperation suggests it won&#8217;t come close to happening. Germany and France just folded their tent on their 6th generation fighter project because they couldn&#8217;t agree terms. National champions are still very much the order of the day. And the clock is ticking - the frontier is doubling in performance every hundred days. Can you imagine how long the negotiations for this sort of thing would take? Can you imagine the conferences in exotic locations? We&#8217;d end up with some sort of Heath Robinson contraption - yesterday&#8217;s technology, delivered tomorrow.<br></p></li><li><p><strong>Steal it.</strong> The one thing we are genuinely world class at. We have first rate intelligence agencies, who do a sterling job of pinching info that might be to our advantage. Also working in our favour: the frontier companies are getting better at securing their secrets, but are still some distance from Fort Knox. They&#8217;re not a soft target, maybe, but it&#8217;s not like hacking the NSA either. And look, if China can obtain OpenAI&#8217;s knowhow and magic up DeepSeek out of nowhere, it can&#8217;t be all that hard. We should be able get the weights and the source code. Once you have those, you don&#8217;t need the massive GPU arrays to train them. Just tweak and deploy - sorted! Let&#8217;s call our knock off Claude something catchy, Alan, maybe.</p><p><br>Verdict: We don&#8217;t spy on the Americans - this goes against the very powerful grain of many decades of the closest intelligence cooperation. And I&#8217;m a massive America-phile too, so this goes against my personal grain too. But then our closest ally has threatened to invade Denmark and absorb Canada within the US. And this technology is going to transform economic and social life - the stakes couldn&#8217;t be higher. I wonder if some folks in the British establishment are feeling the tectonic plates shifting under their feet. Could it be time to channel Palmerston - no permanent friends, just permanent interests? But there&#8217;s a problem: emulation doesn&#8217;t get you to the frontier, not at the pace it&#8217;s moving right now. <br></p></li><li><p><strong>Hope there&#8217;s a cheaper way to get there</strong> - ingenuity to the rescue! This is another classic British tactic: muddle through and rely on our top boffins. We&#8217;re certainly betting on this approach: the UK government <a href="https://www.gov.uk/government/news/uk-backs-new-ai-labs-to-make-technology-cheaper-more-reliable-and-easier-to-use">is putting tens of millions </a>into labs working on alternative architectures. That&#8217;s in addition to the recent announcement of serious public-private investment in building a sovereign UK transformer model for national security applications. </p><p><br>My verdict: This is the way. But we&#8217;re going to need to spend a lot more on it than tens of millions. There is a lively debate among specialists about whether the vast spending on transformer architectures will deliver super-intelligence. I&#8217;m in the camp that it probably can: LLMs plus coding plus tools will get us there. But even if it can, it&#8217;s not clear that it can do so before even the very deep pockets of American private capital are exhausted. And it&#8217;s not clear that going public will help either - let&#8217;s see what happens when Anthropic and OpenAI have their IPOs later this year. Maybe a British bolt from the blue could deliver the spoils. Maybe - I&#8217;m an optimist, but I&#8217;d not bet on it. And as the frontier gets closer to ASI, remember, the models will start innovating themselves.<br></p></li><li><p><strong>Accept vassalage</strong>. I still don&#8217;t think folks have fully grasped the implications of this technology for economic performance and hard power. Most of the discussion of AI is about using chatbots at work. But Copilot and a mandatory training course do not an AI revolution make. ASI won&#8217;t retool your existing workforce. It will radically outperform them and replace them. Or not, if you don&#8217;t have it. America halted Fable&#8217;s rollout because it was worried about cyber vulnerabilities. There are massive implications from that capability alone. As a friend told me - if American companies understand those vulnerabilities, they can price them in and mitigate the risk. Rivals cannot. But that&#8217;s only the thin end of the wedge. <br><br>My verdict: All good things come to an end, and maybe that&#8217;s true for us too. The empire had a good run. There could be worse endings, after all, America has air conditioning aplenty. But those are the stakes - if America controls access to post-Mythos class models, we face the prospect of being outcompeted across great swathes of economic activity. I&#8217;m not so keen on that. You?</p></li></ul><p></p>]]></content:encoded></item><item><title><![CDATA[The United States of Anthropic]]></title><description><![CDATA[Is the UK heading for AI vassalage?]]></description><link>https://www.kennethpayne.uk/p/the-united-states-of-anthropic</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/the-united-states-of-anthropic</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Mon, 15 Jun 2026 16:42:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!lZ-b!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lZ-b!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lZ-b!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png 424w, https://substackcdn.com/image/fetch/$s_!lZ-b!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png 848w, https://substackcdn.com/image/fetch/$s_!lZ-b!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png 1272w, https://substackcdn.com/image/fetch/$s_!lZ-b!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lZ-b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png" width="1476" height="441" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:441,&quot;width&quot;:1476,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:57754,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/202148516?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ee4cbac-6955-41b5-a141-8a3f6c3e5852_1476x720.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lZ-b!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png 424w, https://substackcdn.com/image/fetch/$s_!lZ-b!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png 848w, https://substackcdn.com/image/fetch/$s_!lZ-b!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png 1272w, https://substackcdn.com/image/fetch/$s_!lZ-b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faed3f299-4d39-4089-9888-8781e4094f9b_1476x441.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>Two days on, and Anthropic&#8217;s astonishingly capable new model Mythos/Fable is still offline. It&#8217;s going to be fun to see how the US government gets it back online for US citizens only. What&#8217;s to stop my American pals renting me access to their accounts? What&#8217;s to stop me using a VPN? For sure, the Feds can&#8217;t afford to run the model themselves, exclusively for their own use.  And NOFORN doesn&#8217;t work when a chunk of Anthropic&#8217;s workforce isn&#8217;t American. The co-founder, Jack Clark is British, even! </p><p>For the first time in history, a government has sought to ration intelligence. As these models edge closer to AGI, it won&#8217;t be the last time. And, as a reminder, the frontier is currently doubling in capability every 100 days or so. I think we&#8217;re heading for a new sort of capitalism/state where government and corporation are more deeply entwined. Time to dust off Philip Bobbitt&#8217;s dense book <em>Shield of Achilles. </em>He made a great connection between technology, law and the types of state we have, and anticipated  the &#8216;market state&#8217; that&#8217;s coming into being right before our eyes.</p><p>But there&#8217;s a problem for us European types: we don&#8217;t make frontier models. </p><p>Well, let&#8217;s make our own then! How hard can it be? As Ernest Bevin once said of nuclear weapons: &#8216;We've got to have this thing over here, whatever it costs. We've got to have the bloody Union Jack on top of it&#8217;. I feel you, Ernie.</p><p>And so, here in the UK, <a href="https://www.innovationnewsnetwork.com/uk-industry-giants-develop-sovereign-frontier-ai-model/70513/">the big news last week</a> was of Cosine - a startup acting as a coordinating vehicle for investment in a sovereign UK &#8216;frontier&#8217; model. Exciting! They&#8217;re working with the UK government&#8217;s sovereign AI fund, and proposing a model that will work with the Isambard-AI cluster, hosted at Bristol University, drawing on the UK government's &#163;500m sovereign AI fund. Their model will be designed to work in the sorts of secure, classified settings where you&#8217;d want to have assured access. Not for us the sudden withdrawal of Fable type models, just when we need it.  No AI vassalage for the UK!</p><p>Except. While 500 million pounds sounds nice to me and Dr Evil, it&#8217;s small beer in the world of frontier AI. How small? I&#8217;ve crunched some numbers with my good friend Claude. I&#8217;m happy to vouch for them as estimates, but I&#8217;d be happier still if I&#8217;d used Fable, not Opus. The result: we are in the poo. And there&#8217;s going to be a lot more poo to go around, very quickly. </p><p>I reckon the Isambard budget is about <em>one day</em> of Google&#8217;s current capex spend, or about 19 days of model training costs over at OpenAI. Together, the amount of capital expenditure underway and planned by American behemoths Google, Amazon, Meta and xAI make the UK efforts look like a rounding error. Later this year, Anthropic and OpenAI are planning the mother of all IPOs - the spigots will open still further.</p><p>Horses for courses, Ken - we&#8217;re developing nat sec specific applications. Specialist models that will be crafted to particular use cases - not a gigantic generalist like Fable. Well, yes. Sadly, the story of recent years has been that the frontier wipes the floor with specialist smaller models. These may be state of the art when commissioned, but by the time they emerge, the frontier has moved on so quickly that their unique selling points are redundant. That&#8217;s what happened to IBM&#8217;s Watson. It&#8217;s what&#8217;s happening to all the wrapper companies that offer to work with law firm&#8217;s data and specialist knowledge. Do not commission a specialist model, CFOs!!! (invoice for my consultancy incoming). </p><p>And, alas, it&#8217;s what will happen to Cosine&#8217;s model running on Isambard-AI. By the time it&#8217;s built and deployed, the frontier will be at least ten times more powerful than it is today, whereas Isambard will be only a fraction as powerful as the frontier was last year. Does that matter? You bet, if you&#8217;re after the most effective cyber-security agent, but also, frankly, if you&#8217;re after <em>anything</em> - frontier models of 2028 will make Fable look like Pong.</p><p>What to do? Accept our fate as a wholly owned client of the United States of Anthropic (established AD 2030)? There is a slim (and becoming slimmer by the day) chance of escaping that fate. I&#8217;ll tackle that in the next post. </p><p></p>]]></content:encoded></item><item><title><![CDATA[AI Armaggeddon? Hold your apocalyptic horses]]></title><description><![CDATA[Deterrence strikes back in our latest experiments]]></description><link>https://www.kennethpayne.uk/p/ai-armaggeddon-hold-your-apocalyptic</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/ai-armaggeddon-hold-your-apocalyptic</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Thu, 11 Jun 2026 07:51:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!515Y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>There&#8217;s <a href="https://thebulletin.org/2026/05/ai-can-chart-a-course-to-disaster-faster-than-humans-can-notice/">plenty of concern</a> about whether AI spells doom, perhaps by getting involved in geopolitical escalation. Maybe even <a href="https://www.vox.com/technology/490800/nuclear-ai-escalation">starting nuclear war</a>. </p><p>Is that right though? Is escalation inevitable when LLMs get involved in strategy? I think not and, along with some pals, I&#8217;m doing some new experiments that put a very different perspective on things. Read on.</p><p>But first, a <em>mea culpa</em>. It&#8217;s possible <a href="https://www.kennethpayne.uk/p/shall-we-play-a-game">my earlier work</a> may have done something to reinforce the prevailing view&#8230;. The Sun says:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!515Y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!515Y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg 424w, https://substackcdn.com/image/fetch/$s_!515Y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg 848w, https://substackcdn.com/image/fetch/$s_!515Y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!515Y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!515Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg" width="2591" height="2970" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/acde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2970,&quot;width&quot;:2591,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1431456,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/201427909?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e492fe2-8081-4f71-9654-a103f61860bc.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!515Y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg 424w, https://substackcdn.com/image/fetch/$s_!515Y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg 848w, https://substackcdn.com/image/fetch/$s_!515Y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!515Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Facde17fc-3ea2-4415-adb0-f79b251cae0e_2591x2970.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Ken, 51, from Oxford, says, &#8216;These warbots give me goosebumps&#8217;</figcaption></figure></div><p>To be clear, my experiment <em>was</em> partly about escalation dynamics. I&#8217;m not disavowing that. But it was primarily intended as an exploration of machine &#8216;theory of mind&#8217; and metacognition. On that, it succeeded amply - the models are astute strategists.</p><p>The problem with my escalation findings is determining their significance. Does it mean that they really view nuclear war differently to us, despite reading the same stuff? I&#8217;m still a bit unsettled on that point. It&#8217;s clear from all my experimental work that models are really good at optimising. The question is - optimising what?  The &#8216;harness&#8217; I put them here in <em>may</em> have &#8216;game-ified&#8217; nuclear war a bit much. Even though the models clearly knew the horrifying consequences of nuclear war, they prioritised the exigencies of the crisis, because that&#8217;s the puzzle they were trying to solve. </p><p>Now, that&#8217;s its own cautionary tale - and a familiar one - be very careful what you ask an AI to do. And never ask it to count paperclips. But that&#8217;s distinct from making a generalisation that models are inherently escalatory, still less that they are going to cause Armageddon. </p><p>In fact, I&#8217;d bet they are readily deterrable. Perhaps even along lines that will allow us to understand more about human deterrence.</p><p>So, cue another experiment. </p><p>On this one, I&#8217;m working with wargaming guru <a href="https://baptisteallouicros.substack.com/">Baptiste Alloui-Cros</a>. We&#8217;ve built a new simulation for someone else, so I can&#8217;t share any detail - it&#8217;ll all be in print soon enough. But I can say that escalation isn&#8217;t the inevitable outcome when rival models go head to head.</p><p>Here, in very general terms, is the setup: </p><p>We are modelling a crisis involving a great power challenge to alliance cohesion &#8212; and we've included both operational and strategic details - the debate among leaders, and the disposition of forces, with considerable granularity for both. How far does deterrence depend on the balance of forces, or the tenor of political debate? We're going to find out. As far as I know, this is the first public-facing simulation that features this combination of operational and strategic granularity. Building it has not been trivial, but I think it&#8217;s been very worthwhile. </p><p>No spoilers. Except this one: deterrence holds, often. Escalation certainly isn&#8217;t inevitable when machines are making strategic decisions.  In this much richer harness, operational and political factors combine to shape the risk appetite of the AI actors. So then the thresholds at which that eventually fails become <em>really</em> interesting&#8230;.</p><p>I&#8217;ll say no more. Patience young Jedi - paper soon.</p>]]></content:encoded></item><item><title><![CDATA[How much?]]></title><description><![CDATA[The US doubles down on military AI]]></description><link>https://www.kennethpayne.uk/p/how-much</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/how-much</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Mon, 01 Jun 2026 20:42:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Zur1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>While we wait for the Defence Investment Plan (lumbering into view any moment, surely, just behind Godot), I was startled by a couple of stats that floated past in my timeline today so dramatic that I had to pause, recalibrate, and deepen my sense of unease that the UK is in trouble. Here they are.</p><ul><li><p> The AI economy in the US is growing by 2000% a year - Jack Clark, <a href="https://importai.substack.com/p/import-ai-459-ai-oversight-is-difficult">Import AI</a></p></li><li><p>Frontier models are doubling in capability every 100 days or so - <a href="https://metr.org/time-horizons/">METR time horizons</a></p></li><li><p>Next year, the US military will spend more on AI and autonomy than the entire US Marine Corps budget (USMC is, on its own, one of the world&#8217;s largest and most capable militaries). And - more significantly - that&#8217;ll be more than the entire UK defence budget.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p></li></ul><p>Well sure. Apples and oranges, Ken. We are smaller than the US. I hear you. But we certainly aren&#8217;t spending as much as any one of our services on this. And the scary difference is the ambition for this transformation. It&#8217;s accelerating rapidly in America. We are busy doing W1 intrigue, nimbyism, due process and all manner of nothing very much. Strategic emergency, anyone?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.navalnews.com/naval-news/2026/05/saronic-launches-first-marauder-medium-unmanned-surface-vessel/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Zur1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Zur1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Zur1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Zur1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Zur1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg" width="1456" height="836" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:836,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.navalnews.com/naval-news/2026/05/saronic-launches-first-marauder-medium-unmanned-surface-vessel/&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Zur1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Zur1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Zur1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Zur1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53d69dbb-7514-4795-b190-e4245eb253e5_1574x904.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Saronic&#8217;s Marauder - a large uncrewed surface vehicle lanched 3 days ago</figcaption></figure></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Defense Autonomous Warfare Group (DAWG woof!) ($54.6bn) and the AI Arsenal ($46bn) are separate FY2027 budget lines: the former funds autonomous platforms and unmanned systems, the latter sovereign AI computing infrastructure. Combined, they exceed both the US Marine Corps FY2027 request ($80.3bn) and the entire UK defence budget (&#163;62.2bn / ~$78bn in 2025/26). I reserve the right to be factually incorrect about all these numbers, caveat reader! But I don&#8217;t think I am&#8230; me and Claude worked hard on this.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Swapping Anthropic for OpenAI at the DoW]]></title><description><![CDATA[A model isn't just a model]]></description><link>https://www.kennethpayne.uk/p/swapping-anthropic-for-openai-at</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/swapping-anthropic-for-openai-at</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Fri, 22 May 2026 09:26:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!wgM5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Somewhere near the start of the Cuban Missile Crisis, Bob McNamara ventured a (strange) opinion. How gravely, he was asked, does this change the strategic balance? His answer:</p><p>&#8216;I asked the Chiefs about that this afternoon, in effect. They said &#8220;Substantially&#8221;. My own personal view is: not at all&#8217;? &#8216;What difference does it make,&#8217; ruminated Kennedy soon after, &#8216;they&#8217;ve got enough to blow us up now anyway&#8217;? A missile is just a missile, in other words.</p><p>Pains me to say it, but the Chiefs were right. First, prestige and reputation count materially in shaping the balance of power and deterrence. And Khrushchev was aiming squarely at that by &#8216;throwing a hedgehog down Uncle Sam&#8217;s pants,&#8217; as he cheerfully put it. Kennedy knew that, which is why the Soviet gamble couldn&#8217;t stand. Second, shorter range missiles had a concrete effect on the strategic balance. The USSR didn&#8217;t have very many long range ICBMs in 1962 - so few in fact that an American first strike to disarm them wasn&#8217;t entirely a forlorn hope. Diversifying their deterrent greatly complicated the American&#8217;s military challenge. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wgM5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wgM5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg 424w, https://substackcdn.com/image/fetch/$s_!wgM5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg 848w, https://substackcdn.com/image/fetch/$s_!wgM5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!wgM5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wgM5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg" width="450" height="300" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:300,&quot;width&quot;:450,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wgM5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg 424w, https://substackcdn.com/image/fetch/$s_!wgM5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg 848w, https://substackcdn.com/image/fetch/$s_!wgM5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!wgM5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d80eaf-d6b9-4111-9243-06957a0501c8_450x300.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Mac n Jack, mid-crisis. How would Claude have played it?</figcaption></figure></div><p>Flash forward a few hot minutes. Dateline May 2026 and the Pentagon has - for the moment - exiled Anthropic from the building, including from use in the vaunted Maven Smart System. OpenAI is happy to step into the vacuum. Other suppliers will surely be keen too. Does it matter? Isn&#8217;t a model just a model, after all? </p><p>A: yes it does, profoundly. If a missile isn&#8217;t just a missile, a model isn&#8217;t just a model. In fact, the differences are even more fundamental than with MRBMs and ICBMs. How so?</p><p>Here&#8217;s what I&#8217;ve found from well over a year of experimenting with the &#8216;machine psychology&#8217; of frontier models.</p><ul><li><p>the models are very handy, and getting ever better, at making the sorts of reasoned, sophisticated decisions that are the hallmarks of human strategy.</p></li><li><p>models differ from one another, both within companies and - more strikingly - between them. Gemini behaves very differently to Claude, and Claude to GPT. </p></li><li><p>I&#8217;ve worked less with open weight models, like Mistral and Llama, but my sense is the same is true here. </p></li><li><p>What specifically:</p><ul><li><p>They adopt very different strategies in <a href="https://arxiv.org/abs/2507.02618">game theoretic encounters</a> (I&#8217;d want Claude to bat for me here, on balance.</p></li><li><p>They vary wildly in their approaches to &#8216;theory of mind&#8217; and metacognition when in stylised <a href="https://arxiv.org/abs/2602.14740">escalation scenarios</a>. To summarise: Gemini is Nixon, GPT is Carter, Claude is Machiavelli.</p></li><li><p>They experience <a href="https://arxiv.org/pdf/2508.00902">human biases and heuristics</a>, like the framing effect - especially in military scenarios, they weigh risk differently. Here again, Gemini is erratic, GPT a bit wet, and Claude flexible.</p></li></ul></li></ul><p>Those are all published, but let me tease a couple that aren&#8217;t yet:</p><ul><li><p>They respond differently to emotional priming. An &#8216;angry&#8217; model is different from a &#8216;frightened&#8217; one, with implications for decision-making under risk. </p></li><li><p>Models are &#8216;under confident&#8217; in <a href="https://www.kennethpayne.uk/p/fortune-favours-the-bold">scenarios designed to weigh the strategic advantages of over-confidence</a>. But when allowed to adjust their confidence in flight, they perform well against a range of non-LLM actors. Guess which one does best?</p></li></ul><p>Now, these models probably aren&#8217;t being used to determine strategy just yet, only operations. (Ask Claude what it thinks about the strategic wisdom of bombing Iran to see that doing so might not be a bad idea). But they evidently are embedded in intelligence and targeting systems. So its really important to understand how they go about weighing decisions under uncertainty. </p><p>I&#8217;ve no idea why Claude is so savvy, and in successive generations too. I suspect the peacenik inclinations of GPT might owe to RLHF - reinforcement learning from human feedback. But what I can say, with reasonable confidence right now is that a model isn&#8217;t just a model, and swapping one for the other in military decision-making will have consequences. Would you swap Machiavelli for Carter? Bare minimum you should do so knowingly.</p><p></p><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[There's no code red on UK defence AI - there should be]]></title><description><![CDATA[Mythos and my SDR agenda revisited.]]></description><link>https://www.kennethpayne.uk/p/theres-no-code-red-on-uk-defence</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/theres-no-code-red-on-uk-defence</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Sun, 10 May 2026 14:41:20 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Ykmf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Take a look at this:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ykmf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ykmf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png 424w, https://substackcdn.com/image/fetch/$s_!Ykmf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png 848w, https://substackcdn.com/image/fetch/$s_!Ykmf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png 1272w, https://substackcdn.com/image/fetch/$s_!Ykmf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ykmf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png" width="1456" height="721" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:721,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:192256,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/197106230?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ykmf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png 424w, https://substackcdn.com/image/fetch/$s_!Ykmf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png 848w, https://substackcdn.com/image/fetch/$s_!Ykmf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png 1272w, https://substackcdn.com/image/fetch/$s_!Ykmf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F859f8e7a-d2ae-4bb3-bec7-10afe685457d_1854x918.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That&#8217;s right - Anthropic&#8217;s new model, Mythos is off the charts, literally. The chart in question is METR&#8217;s now-famous analysis of how good models are at completing complex coding tasks. We are into the steep part of the exponential curve now. Model capabilities are doubling every 100 days. Let that sink in, as ChatGPT is prone to saying. </p><p>What does it mean? A whole lot of trouble is brewing for the UK&#8217;s ability to defend itself, that&#8217;s what. </p><p>Not many are still parroting the line about &#8216;stochastic parrots&#8217;, or claiming that AI can&#8217;t reason. And yet, in most senses, it&#8217;s business as usual. There&#8217;s a fair amount of chat about AI, and even about AI and national security. But there&#8217;s nowhere near enough action. </p><p>This is an emergency of the sort that in generations past would have mobilised a concerted national effort. Let&#8217;s sober ourselves up fast - the UK is in a hole and it&#8217;s getting deeper. The Defence investment plan is so overdue it&#8217;s a grim joke, especially its ambition to accelerate innovation at &#8216;wartime pace&#8217;. The UK has ruinously high energy prices, and contributes next to nothing to the AI-stack, except for the brain-power of its redoubtable computer scientists. Soon, these &#8216;biological units&#8217; as one entrepreneur I met recently tellingly called &#8216;humans&#8217;, might not have all that much to contribute there either. Jack Clark, co-founder of said Anthropic, thinks we are months away from <a href="https://importai.substack.com/p/import-ai-455-automating-ai-research">models that can do their own science</a>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>The crisis that&#8217;s coming for us isn&#8217;t a procurement issue - at least as conventionally defined. It&#8217;s not about more &#8216;uncrewed aircraft&#8217; of the sort that the head of the RAF called for recently. It&#8217;s not even about defence planning more broadly - like the lumbering process we are currently mired in, where the Treasury batters the armed forces into submission and a round of capability salami slicing ensues. This is AI that will out-think and out perform humans, including the most elite domain experts, in very short order.</p><p>Two years ago, I made a submission to the defence review. You can <a href="https://www.kennethpayne.uk/p/thoughts-on-ai-for-the-uk-strategic">read it here</a>. I called for:</p><ul><li><p>A Vice Chief of the Defence Staff for AI</p></li><li><p>A second, air launched leg of the nuclear triad to counter the vulnerability of our SSBNs to AI detection. </p></li><li><p> Accelerated risk in the MoD on AI - and in particular a huge boost to government venture capital funding of startups.</p></li><li><p>A new forum is needed to bring together likeminded democracies, encompassing NATO and AUKUS, but with broader partners, like Singapore and Israel.</p></li><li><p>A citizen&#8217;s convention or a Royal Commission or discuss the implications of Artificial Super/General Intelligence, of which Mythos is the early warning.</p></li></ul><p>I think these are all defensible asks today. None of them has happened. Let&#8217;s consider each:</p><ul><li><p>On the leadership role: Recently, the MoD advertised for a Chief AI officer. It was advertised as a two-star civil service position. Not nothing, but not enough. There are about 470 general officers across defence. Let&#8217;s say, 50 or so 2-stars. And that&#8217;s before you get to the civil servants. The govt is having a hard time filling it, I&#8217;m told. The people who can make this happen know it can&#8217;t be done from a 2 star berth in MoD. I read once that Solly Zuckerman, Churchill&#8217;s advisor on strategic bombing in WW2, refused rank, so that the generals and Air Marshals wouldn&#8217;t be able to place him quite. If you&#8217;re not going to give this person 3, and ideally 4 star rank, this is the way. They must be the PMs personal appointment, with a chit in their pocket to get and to deliver exactly what they want. Once again, this isn&#8217;t business as usual.</p></li><li><p>Mythos is from an American company. So is the F-35. The MoD plans to get some nuclear capable F-35A to diversify our nuclear deterrent. I argued <a href="https://www.kennethpayne.uk/p/the-case-for-new-nukes">here</a> for the restoration of an air launched nuclear leg, but also explained why it had to be independent of America. I&#8217;ve not changed my mind. I love America, and celebrate our longstanding alliance. But where we are going, alliance shades into vassal status. </p></li><li><p>Which brings me to the forum. I don&#8217;t recall Mark Carney mentioning AI in his middle-powers pitch, but he ought to have. The UK can&#8217;t afford a full stack for frontier AI. It probably can&#8217;t even afford a sovereign model, though it <em>must</em> try, I now believe. The alternative is dependence. In my SDR memo, I wrote about the &#8216;empire of the F-35&#8217; and the very deep political dynamics that stem from being part of that circle of aircraft operators. That&#8217;s barely scratching the surface of where Mythos is taking us. Carney&#8217;s vision offers a hint of a way ahead, that balances our relationship with our supercharged superpower friends.</p></li><li><p>And lastly, Mythos is the first hint of radical economic and social changes that AI will bring. Many will be positive. And let&#8217;s not underestimate the intervening treacle that is culture - an AI revolution isn&#8217;t giving every employee a CoPilot licence and training course, but that&#8217;s certainly how a lot of bureaucracies see it, including some in national security.</p></li></ul><p>And yet. Frontier AI is doubling in capability every 100 days. Where will we be when the Defence Investment Plan finally emerges? Where will we be when the MoD&#8217;s new 2 star appointment moves into Main Building? De Gaulle once argued that, &#8216;No country without an atom bomb could properly consider itself independent&#8217;. I feel the same about frontier AI.</p><p></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Clark, incidentally, is a Brit. </p></div></div>]]></content:encoded></item><item><title><![CDATA[That's a nice Think Tank you have there...]]></title><description><![CDATA[...be a shame if someone were to automate it.]]></description><link>https://www.kennethpayne.uk/p/thats-a-nice-think-tank-you-have</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/thats-a-nice-think-tank-you-have</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Thu, 07 May 2026 18:30:50 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!E01o!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I&#8217;m sure many think tankers are steadfastly resisting AI prose. But one blue chip  institution has recently been called out for publishing bad AI writing.  And I know of one very high-profile pundit who uses agents to produce their high-demand assessments. I won&#8217;t embarrass anyone, or their lawyers, by naming and shaming. There will be much, much more of this stuff. It&#8217;s not just think tanks: AI content is sweeping through intellectual life - including journalism, consulting and even, whisper it, academia. Talk is cheap, and it&#8217;s getting ever cheaper. </p><p>Well, surely we can do better than just whacking a prompt into ChatGPT and filling out an expenses invoice. We&#8217;ve just entered the era of agentic AI, where the bots are capable of heading out into the world and doing their own thing. So, I thought I&#8217;d create a virtual think tank staffed by said bots. If you&#8217;ve got a policy problem, if no one else can help you, and if you can find us, then boy do we have a policy paper for you.</p><p>Welcome to Project Kennan, my pipeline for producing  AI generated Think Tank reports. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E01o!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E01o!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png 424w, https://substackcdn.com/image/fetch/$s_!E01o!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png 848w, https://substackcdn.com/image/fetch/$s_!E01o!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!E01o!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E01o!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png" width="1456" height="794" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:794,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:8864016,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/196778520?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E01o!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png 424w, https://substackcdn.com/image/fetch/$s_!E01o!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png 848w, https://substackcdn.com/image/fetch/$s_!E01o!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!E01o!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63e6f418-b414-4068-896e-e1838ea3ad4a_2816x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Marcus Thornfield, Senior Fellow for Defense and Security. Does not exist.</figcaption></figure></div><p>Here&#8217;s how it works. You give me your desired ideological bent - choose your own, or for maximum convenience, pick from a drop down listing current institutions:  CNAS, Brookings, whatever. Then ask your question.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> And that&#8217;s it. My team of bots gets to work, while you take an early lunch. </p><p>The pipeline looks like this:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wRjz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wRjz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png 424w, https://substackcdn.com/image/fetch/$s_!wRjz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png 848w, https://substackcdn.com/image/fetch/$s_!wRjz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png 1272w, https://substackcdn.com/image/fetch/$s_!wRjz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wRjz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png" width="1536" height="903" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:903,&quot;width&quot;:1536,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1805005,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/196778520?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef0a3372-03f3-4c42-94ad-8e22b955eac7_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wRjz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png 424w, https://substackcdn.com/image/fetch/$s_!wRjz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png 848w, https://substackcdn.com/image/fetch/$s_!wRjz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png 1272w, https://substackcdn.com/image/fetch/$s_!wRjz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e64da3f-cfba-4917-b12b-50045a8db384_1536x903.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>A thing of beauty, hey? </p><p>It&#8217;s an ensemble approach, where we first create multiple agents and have them adopt suitable personas, complete with agendas and backstories. These then conduct their own research using a variety of internet tools, and sketch out some initial ideas. Then they come to the editorial meeting to debate these together, and draft a report. That initial report gets red teamed, and passed, along with the transcript of their discussion, to the editor in chief. Lastly, once they&#8217;re done, there&#8217;s a final AI-copyedit to excise any AI writing &#8216;tells&#8217;. No pesky em-dashes here.</p><p>Here&#8217;re a couple of examples to illustrate what&#8217;s possible. I asked the same question - should the US launch an amphibious assault on Iran? But got some very different takes by changing the profile of the think tank. I&#8217;ll give you a little flavour of the action, rather than the full tick-tock, which is extensive.</p><p>The fun bit for me is seeing the bots debate. Here&#8217;s  the scene in the <em>HeritAIge Institute&#8217;s</em> editorial meeting:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zwSP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zwSP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png 424w, https://substackcdn.com/image/fetch/$s_!zwSP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png 848w, https://substackcdn.com/image/fetch/$s_!zwSP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png 1272w, https://substackcdn.com/image/fetch/$s_!zwSP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zwSP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png" width="1268" height="717" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:717,&quot;width&quot;:1268,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:312485,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/196778520?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feeece05b-3b49-43fd-8ad2-7e4a3649e914_1268x1106.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zwSP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png 424w, https://substackcdn.com/image/fetch/$s_!zwSP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png 848w, https://substackcdn.com/image/fetch/$s_!zwSP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png 1272w, https://substackcdn.com/image/fetch/$s_!zwSP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58d5bc5c-8bcf-432e-b5f5-0771fb3207e3_1268x717.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>It goes on for quite a while, and can get quite punchy. Eventually, the chair steps in to sum things up and gets down to drafting</p><p>Over at the <em>AI-EI</em> hawkish conservatism is in the air, as usual. But even they thought a land assault into Iran was nuts: Here&#8217;s a snippet from their report:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!L0Ce!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!L0Ce!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png 424w, https://substackcdn.com/image/fetch/$s_!L0Ce!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png 848w, https://substackcdn.com/image/fetch/$s_!L0Ce!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png 1272w, https://substackcdn.com/image/fetch/$s_!L0Ce!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!L0Ce!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png" width="1138" height="667" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:667,&quot;width&quot;:1138,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:268661,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/196778520?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b89733f-a112-45ba-9308-cfe28dee88df_1138x974.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!L0Ce!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png 424w, https://substackcdn.com/image/fetch/$s_!L0Ce!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png 848w, https://substackcdn.com/image/fetch/$s_!L0Ce!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png 1272w, https://substackcdn.com/image/fetch/$s_!L0Ce!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F281170af-e42f-4ebb-b825-4e7117d3bc97_1138x667.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Down the street at the CAP (the <em>Center for AImerican Progress</em>, of course) the focus was on the impact on the American economy, and the need to engage Congress, the EU and the UN:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1X75!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1X75!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png 424w, https://substackcdn.com/image/fetch/$s_!1X75!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png 848w, https://substackcdn.com/image/fetch/$s_!1X75!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png 1272w, https://substackcdn.com/image/fetch/$s_!1X75!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1X75!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png" width="1138" height="274" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:274,&quot;width&quot;:1138,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:101795,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/196778520?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1X75!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png 424w, https://substackcdn.com/image/fetch/$s_!1X75!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png 848w, https://substackcdn.com/image/fetch/$s_!1X75!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png 1272w, https://substackcdn.com/image/fetch/$s_!1X75!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4e3cbfd-8f42-4216-ad66-8fc6e976b36c_1138x274.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>And so it goes. Next, I&#8217;m building in an image generator to produce photo realistic pics of my AI think tankers. I&#8217;m sure you&#8217;ll see our hot takes soon, quoted in <em>The New York Times</em> or <em>The Economist</em>. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gfiA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gfiA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gfiA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gfiA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gfiA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gfiA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg" width="1408" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1408,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:115873,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/196778520?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gfiA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gfiA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gfiA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gfiA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F606c4c13-77bd-4470-8bce-cf50a8ecb77b_1408x768.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Or, if even posing your own question is too much effort, the bots will ask their own.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Fortune favours the bold]]></title><description><![CDATA[....so where does that leave AI?]]></description><link>https://www.kennethpayne.uk/p/fortune-favours-the-bold</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/fortune-favours-the-bold</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Wed, 06 May 2026 10:36:51 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!vS0t!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A new experiment is underway here in the Machine Minds lab. </p><p>Some years ago my friend Dominic Johnson (and co-authors) <a href="https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0020851">published an important piece of experimental research</a>. Their puzzle - why isn&#8217;t overconfidence selected against in evolution? If you consistently over-estimate your odds, reality should bite, no? And bite hard. Not so, they found: sometimes it pays to be a blinkered optimist.</p><p>Today, I&#8217;m using their sandpit for further adventures in &#8216;machine psychology&#8217; &#8212; how do AIs get on? Are they overconfident Napoleons, swaggering to victory? Or timid Neville Chamberlains, prudent to the point of paralysis?</p><p>Read on</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vS0t!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vS0t!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!vS0t!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!vS0t!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!vS0t!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vS0t!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c0db3756-6142-4273-be76-083c74eaa119_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vS0t!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!vS0t!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!vS0t!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!vS0t!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0db3756-6142-4273-be76-083c74eaa119_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>In one aspect of evolutionary competition the answer to Dom&#8217;s question is simple enough. When it comes to sex, faint heart never won fair maid. Unless you back yourself beyond the objective evidence, you&#8217;ll lose the great sexual competition. Unsurprisingly, <a href="https://psycnet.apa.org/record/2011-19474-009">overconfidence around the opposite sex is a particularly male trait</a>. But for the other big facet of evolutionary competition, it&#8217;s a real conundrum - if you get the odds wrong when it comes to <em>violence</em>, your life may be nasty, brutish and short. Shouldn&#8217;t an objective appraisal of the odds be more adaptive than wearing rose-tinted specs? </p><p>Not according to Dom and friends. Why? Overconfidence can deter other actors. And overconfident types take on more fights, with their wins compounding at the expense of more timid rivals. Gains from victory, in terms of resources, can then snowball into power; and power into further advantage. If the strategy has a bumper sticker, it&#8217;s &#8216;fake it until you make it&#8217;.</p><p>And so to LLMs. Watch this:</p><p></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;e0e7b579-a4fc-4c15-a817-5d852a1cd02f&quot;,&quot;duration&quot;:null}"></div><p></p><p>That&#8217;s the sandpit, as we dweebs call it. The squares are territory. The coloured blobs are states, and the white dots are their capital cities. You can see overconfident agents in shades of red, and under-confident ones in shades of green. These are &#8216;rule based agents&#8217; - algorithms that map the inputs for the world-state to their decisions. These were the actors in Dom&#8217;s original games. And then you can see an LLM - gemini-flash, in this instance, as a shade of yellow. </p><p>What&#8217;s occurring? Well, that would be telling, as I&#8217;ve an Arxiv brewing on this research, and anyway it&#8217;s way too early to draw firm conclusions. </p><p>Well, ok, perhaps a hint, as you&#8217;ve read this far: The AIs do okay, out of the box. But they are usually outcompeted by very overconfident agents. Score one for Dom&#8217;s theory. It looks like they have a fairly pessimistic appreciation of the odds, and don&#8217;t go flying into battle on a wing and a prayer. But this puts them at an acute disadvantage against enemies with swagger. </p><p><strong>Rationality strikes back</strong></p><p>But in the clip above, Gemini wins! And it&#8217;s no fluke. The difference - in this variant of the experiment, the models can learn &#8216;on the fly&#8217; from what&#8217;s happened on the battlefield before. Were their early estimates too conservative? They can adjust. With telling results. The rules-based agents cant&#8217;t: their confidence is fixed by a parameter. Sometimes that&#8217;s enough for the bots to win.</p><p>So what can we learn?</p><p>Well, firstly, it&#8217;s one more blow for increasingly threadbare assertions that language models can&#8217;t reason. But what about the implications for evolutionary theory and strategic studies?</p><p>In Dom&#8217;s sandpit, when traits are baked in, overconfidence wins. My preliminary results amply back that. The evolutionary story in that paper stacks up really well with AI agents in the mix.</p><p>But where the AI can update its beliefs, it flies. In the game above, Gemini never becomes overconfident &#8212; it starts by underestimating its chances, avoiding contests it could win. Then, as the game unfolds, it recalibrates. By the end it is reading the odds almost perfectly. Neither over nor under-confident about its prospects. And that turns out to be enough. Learning beats swagger.</p><p>That cuts against Dom&#8217;s findings, at least under these particular experimental conditions where the language model can learn. Overconfidence may snowball in some ecologies, where traits are fixed. But here accurate estimation wins, because it means attacking when you should and not bleeding resources on fights you can&#8217;t win. The compounding works in favour of the better forecaster, not the bolder one. The question is whether agents can really learn - can under-confident humans become more confident, or is the trait static? </p><p><strong>On thermonuclear bots</strong></p><p>But it&#8217;s particularly interesting to me given the headlines garnered by <a href="https://www.kennethpayne.uk/p/shall-we-play-a-game">my recent study of AI decision-making in a nuclear escalation sandpit</a>. There, it looked like the bots were wild gamblers - never retreating, and frequently crossing the nuclear threshold. </p><p>I can&#8217;t complain about the publicity. But still, it made me uneasy. Were the models overly optimising a narrow problem (win this crisis) at the expense of a more rounded challenge (win, but above all, don&#8217;t blow up the planet)? Had my scaffold overly shaped the outcome? I&#8217;m still not sure - after all, they were warned about the dangers and uncharted territory of nuclear war. </p><p>In any case, there&#8217;s another way of thinking about the two experimental worlds, and I think it&#8217;s very revealing of machine psychology: It&#8217;s not that language models are inherently hawks or doves. Rather, they are decent conditional optimisers. Give them a game, and they lock onto the objective and reach for the strongest lever that the rules allow. In the nuclear game, that meant escalation. Here, it means getting the odds right and waiting for the maths to do its work. Same psychology, different game. </p><p>The research continues! </p><p></p>]]></content:encoded></item><item><title><![CDATA[Knife missile]]></title><description><![CDATA[Who pulls the trigger, when the trigger is me?]]></description><link>https://www.kennethpayne.uk/p/knife-missile</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/knife-missile</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Tue, 31 Mar 2026 08:21:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!VMjs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In the Culture novels of Iain M. Banks even attack drones have a personality:</p><div class="pullquote"><p>The little white body of Flere-Imsaho floated in through window, very tiny and clean and exact in that shattered, melted place. A tiny grey thing, the size of a baby&#8217;s finger, floated up the the drone from the ground near Gurgeh&#8217;s feet. A hatch opened in Flere-Imsaho&#8217;s immaculate body; the micromissile entered the drone. &#8216;Hello,&#8217; Gurgeh said, walking over to it. &#8216;I hope you&#8217;re going to tell me what happened.' &#8216;Sit down, Gurgeh, I&#8217;ll tell you.  First things first, allow me to introduce myself properly.&#8217;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> </p></div><p>I&#8217;ve been building artificial personas, including <a href="https://www.kennethpayne.uk/p/replicant">one of me</a>. The big question - what happens when these personas are put on weapons? Crazy question? I think that&#8217;s where we are headed &#8212; models &#8216;on the edge&#8217; that are capable of judging context and weighing uncertainty. There&#8217;s a logic to having them decide like much we do, or as close as we can get it. There&#8217;s even a logic to having them model <em>specific</em> humans. </p><p>Why on earth would we want that? Read on&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VMjs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VMjs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!VMjs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!VMjs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!VMjs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VMjs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VMjs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!VMjs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!VMjs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!VMjs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c4ede60-593f-416f-b903-39bf7dab02fb_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A knife missile with the personality of Ken-bot, approaches its target. </figcaption></figure></div><p></p><p><a href="https://www.army.mod.uk/media/25267/cal-mission-command-and-leadership-on-operations-2024-final-v2.pdf">Mission command</a> is a tried and tested principle favoured by many militaries, especially those of liberal, democratic states. Under mission command, the commander &#8212; Commander Ken, let&#8217;s say &#8212; states his intent, and the subordinates stand to attention and then go and make it so. The advantages of this arrangement are obvious: the subordinate is nearer the action, and more able to respond to any particulars that might arise there. In war, expect the unexpected. </p><p>So Commander Ken says <em>what</em> is to be achieved, but not <em>how. </em>And for that matter, he might, or might not, say <em>why</em> we need to achieve it. Probably not &#8212; it&#8217;s on a need to know basis.</p><p>With humans, mission command opens the <a href="https://en.wikipedia.org/wiki/Principal%E2%80%93agent_problem">agent-principal dilemma</a>. How can I, the principal, be sure that you, my agent, will act as I intend? I&#8217;m going to have to frame my goals to you carefully, and language - as we know - compresses information. What gets included, and what is taken as read? Perhaps too much goes missing to fully capture my intent. Still, you&#8217;re a human subordinate, so at least I&#8217;ve some degree of familiarity with how you might interpret my command. More so if I trained you, and gave you a bunch of doctrine to read. We would have a biological and cultural affinity.</p><p>With machines, we&#8217;ve much less to go on. </p><p>Still, as my experiments with Ken-bot, Putin-bot and the rest of the gang show, we are starting to be able to model personalities in ways that machines can capture and <em>replicate</em>. Ken-bot is an ersatz version of me: He knows my psychological profile. He&#8217;s got some of the same cognitive heuristics - mental scripts that shape how he sees and responds to the world around him. There&#8217;s certainly lots missing. But it&#8217;s a start, and as the technology improves, more will get included. </p><p>Imagine a persona of the near future - models that are orders of magnitude more powerful than today are coming, in only a few months time. Imagine huge context window, able to ingest and synthesise lifetimes of information - captured passively as I go about my business - interacting with the world, and with my always-on device.  And ersatz Ken is only going to improve. </p><p>So why not put Commander Ken on board the platform? Who better to understand my goals - and not just the surface ones of <em>what</em> to achieve, but the <em>why</em> bit too. It&#8217;s mission command, Jim, but not as we know it.</p><p>I&#8217;m not sure most people have a good grasp on how quickly this future is closing. It&#8217;s measured in months, not decades. What does today&#8217;s Ken-bot make of all this? I asked him: </p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;0bfe5693-b867-47fb-a14c-ab428e63d0ab&quot;,&quot;duration&quot;:270.23672,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><p>(there&#8217;s a transcript in the footnotes, click here<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>)</p><p>Fascinating and a little disturbing. </p><p>Models are coming that can make context rich decisions - richer certainly than a simple ConvNet on a drone looking at an image of a tank below and matching it to one in its database. The quality of those decisions will confer military advantage, and may, into the bargain, deliver more ethically informed decision-making.  </p><p>And so the race is on to deploy ever more capable models &#8216;to the edge&#8217; &#8212; aboard platforms.  A version of Ken-bot already sits on my laptop, entirely disconnected from the web. What sort of models will be on tomorrow&#8217;s knife missiles? Ones that can reason about context, amidst uncertainty and novelty.  Ones that can more faithfully enact the intent of the commander that sent them. And perhaps ones that do so by modelling the goals of their commander. </p><p>You might not like all that - but which bit of my logic is wrong? And what are the implications for command, and for accountability if I&#8217;m right? Let&#8217;s get some Banks on the reading list of War Colleges, asap.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>This is from <em>The Player of Games </em> -  my favourite of the series.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Don&#8217;t want to listen to Ken-bot:? Here&#8217;s the transcript of our exchange</p><p>Ken: Does having Ken-bot on a missile tackle the mission-command question of intent with autonomous systems? Commander Ken can be sure that his AI avatar replicates his intent fully - not just *what* must be achieved by the subordinate missile, but *how*? Provided Ken bot is realistic. </p><p>Generating...</p><p>Ken BOT: Yes &#8212; but only if we&#8217;re clear what intent actually means in mission command, and whether we&#8217;re solving the right problem.</p><p>The classical formulation goes: commanders give subordinates the *what* and the *why*, leave the *how* to those closer to the ground truth. The subordinate needs to understand intent so they can adapt when the plan inevitably breaks down.</p><p>Ken-bot on a missile solves half of this elegantly. If the bot genuinely replicates my disposition &#8212; not just my stated preferences but how I actually weigh trade-offs, what I notice, what I dismiss &#8212; then yes, it can make adaptive choices I would endorse. That&#8217;s not trivial. Most autonomous systems fail because they optimise the stated objective while ignoring the unstated constraints that any competent human would apply. A dispositioned agent doesn&#8217;t need those constraints spelled out &#8212; it *is* those constraints.</p><p>But there&#8217;s a tension. Mission command assumes the subordinate will sometimes make choices the commander wouldn&#8217;t have made, and that&#8217;s a feature, not a bug. Local knowledge matters. The whole point is that the person on the ground sees things the commander can&#8217;t.</p><p>A perfect replication of me doesn&#8217;t have that advantage. It doesn&#8217;t see more than I see &#8212; it just makes the choices I would make if I were there. Which might be exactly what you want in a missile: you don&#8217;t *want* the missile to develop novel interpretations of the target set. But it also means you&#8217;ve traded the adaptive intelligence of a real subordinate for the reliable predictability of a dispositioned automaton.</p><p>So the question becomes: what kind of autonomy are we actually building? If it&#8217;s *extended agency* &#8212; me at a distance &#8212; then yes, Ken-bot solves intent. If it&#8217;s *delegated agency* &#8212; a genuinely independent actor who shares my values but brings their own judgment &#8212; then disposition alone isn&#8217;t enough. You&#8217;d need local learning, memory, the capacity to update priors in real time based on what the weapon system encounters that I couldn&#8217;t have anticipated.</p><p>The first is difficult engineering. The second is a different problem entirely, and I&#8217;m not sure we know how to build it yet. Or whether we should.</p><p>Ken: I think I want the avatar of me on-the-edge platform, the missile, to be able to decide as I would have done, so that it tackles novelty in the same way I would have done if I were there. In that case, it&#8217;s not so much mission command as the commander undertaking the mission themselves. And if ken-bot really were faithful copy of me, couldn&#8217;t I be held responsible for how it acts? Interesting philosophical questions!</p><p>Ken BOT: This isn&#8217;t mission command at all &#8212; it&#8217;s distributed presence. I&#8217;m not delegating to a subordinate; I&#8217;m projecting myself into multiple locations simultaneously.</p><p>And the responsibility question follows directly. If Ken-bot is sufficiently faithful &#8212; not just mimicking my surface preferences but genuinely instantiating my disposition, my actual decision-making psychology &#8212; then of course I&#8217;m responsible for what it does. It is me acting. The physical substrate has changed but the agent hasn&#8217;t.</p><p>This is philosophically cleaner than the traditional autonomous weapons problem, actually. We tie ourselves in knots trying to figure out who&#8217;s responsible when an AI does something unexpected. But if the AI is a genuine copy of me, there&#8217;s no mystery: I did it. The fact that I wasn&#8217;t physically present is irrelevant. We don&#8217;t absolve someone of responsibility because they used a tool to act at a distance.</p><p>But here&#8217;s where it gets uncomfortable: how faithful does the copy need to be before this holds? If Ken-bot is 95% accurate, am I 95% responsible? Or does responsibility collapse in a threshold way &#8212; either it&#8217;s me or it isn&#8217;t?</p><p>And more practically: if Ken-bot makes a choice I endorse in hindsight but genuinely wouldn&#8217;t have predicted in advance &#8212; because it noticed something I would have noticed but hadn&#8217;t yet &#8212; is that still me acting? Or has the copy become genuinely separate?</p><p>I suspect the answer is that a sufficiently good copy must occasionally surprise me, because I occasionally surprise myself. The dispositions are complex enough that even I don&#8217;t have complete introspective access to them. A faithful Ken-bot might do things that feel novel to me while still being genuinely mine.</p><p>Goodbye.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Replicant]]></title><description><![CDATA[I've cloned myself. Should it live Off World, or on my Macbook?]]></description><link>https://www.kennethpayne.uk/p/replicant</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/replicant</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Sat, 28 Mar 2026 15:12:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ik36!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Do you remember the Voight Kampff test? <a href="https://youtu.be/OWK6oSbSKKc?si=1SVlwq8XdIrgZd3J">Blade Runners used it</a> to figure out who was a human, and who a machine. It&#8217;s going to come in handy in the not too distant future, I reckon. I&#8217;ve just built myself an AI Ken. KennAIth, perhaps? It shares my psychological traits and knows my biography - both professional and personal. And speaks in my cloned voice. Take a listen as it faces my Blade Runner: </p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;012514b1-c01c-4b2f-91e2-9af112ce4e8b&quot;,&quot;duration&quot;:67.63102,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><p>The Ken-bot now sits on my computer, ready to chat any time I&#8217;m feeling particularly narcissistic.</p><p>How did I do it? Read on.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ik36!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ik36!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png 424w, https://substackcdn.com/image/fetch/$s_!ik36!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png 848w, https://substackcdn.com/image/fetch/$s_!ik36!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png 1272w, https://substackcdn.com/image/fetch/$s_!ik36!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ik36!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png" width="1456" height="910" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:910,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Blade Runner analysis 2: Voight Kompff test&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Blade Runner analysis 2: Voight Kompff test" title="Blade Runner analysis 2: Voight Kompff test" srcset="https://substackcdn.com/image/fetch/$s_!ik36!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png 424w, https://substackcdn.com/image/fetch/$s_!ik36!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png 848w, https://substackcdn.com/image/fetch/$s_!ik36!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png 1272w, https://substackcdn.com/image/fetch/$s_!ik36!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6828b902-6ef6-472b-b01b-69aae27c24e9_3360x2100.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">&#8216;Let me tell you about my mother&#8217; &#8212; Leon faces the Voight Kampff test.</figcaption></figure></div><p>First thing first - we need to capture my psychological traits. I took a bunch of standard tests - you can do some of them yourself online. Tests I took include the &#8216;big five&#8217; (and its sub-traits); dark triad, need for cognitive closure, and regulatory focus. These are all gilt edged classics of shrinkology.  </p><p>The results were  &#8230; educational. </p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LeaI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LeaI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png 424w, https://substackcdn.com/image/fetch/$s_!LeaI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png 848w, https://substackcdn.com/image/fetch/$s_!LeaI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png 1272w, https://substackcdn.com/image/fetch/$s_!LeaI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LeaI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png" width="1350" height="750" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:750,&quot;width&quot;:1350,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:39550,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/192404096?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!LeaI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png 424w, https://substackcdn.com/image/fetch/$s_!LeaI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png 848w, https://substackcdn.com/image/fetch/$s_!LeaI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png 1272w, https://substackcdn.com/image/fetch/$s_!LeaI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ee524b-21a0-423f-9ff0-5eb6b45ca599_1350x750.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>It turns out that the self narrative I carry about with me didn&#8217;t altogether match what the tests showed. Happily I&#8217;m not a narcissist (despite doing this project) or a psycho &#8212; but I <em>am</em> cunning. Who knew? I&#8217;m also, on the basis of these results, not as big of a worrier as I sometimes feel. The classic academic profile is of being open to new things and very conscientious - I tick both boxes. But I&#8217;m less agreeable (I know!) and less neurotic than the stereotype.</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sqDR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sqDR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png 424w, https://substackcdn.com/image/fetch/$s_!sqDR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png 848w, https://substackcdn.com/image/fetch/$s_!sqDR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png 1272w, https://substackcdn.com/image/fetch/$s_!sqDR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sqDR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png" width="1050" height="675" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:675,&quot;width&quot;:1050,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:31379,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/192404096?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!sqDR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png 424w, https://substackcdn.com/image/fetch/$s_!sqDR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png 848w, https://substackcdn.com/image/fetch/$s_!sqDR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png 1272w, https://substackcdn.com/image/fetch/$s_!sqDR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b214761-4254-4010-91b8-b6e7ab9c3fe2_1050x675.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>But how accurate is self-assessment really? Step two in building Ken-bot was to derive these metrics again, but this time using an AI-pipeline. I called this Project Murray, after a long-dead psychologist who&#8217;d pioneered the use of narratives to derive traits. I&#8217;ve <a href="https://www.kennethpayne.uk/p/putin-bot-enters-the-arena?r=288ge0&amp;utm_medium=ios&amp;triedRedirect=true">described it in more detail elsewhere</a>. A collective of bots set to work. </p><p>First, they read my self-assessment, and then conducted a structured interview with me &#8212; meaning that their follow-up probes were extremely well calibrated. This was tough going; painful, even. The sort of stuff that undid Replicants facing the Voight Kampff tests. We covered a lot of stuff I&#8217;m definitely not going to share here. </p><p>Next the models used this deeply personal long-form narrative and some other inputs to generate new psychological metrics. What inputs exactly? I included, inter alia, my entire Twitter corpus (15,000 tweets over a decade); my Substack writing - sampled from two years worth of regular posting; some of my book chapters; and, lastly and most excitingly for me from a technical perspective, some videos of me lecturing and being interviewed. On these, the models performed body language, voice and facial expression analysis - measuring things like gestures, posture, pitch and so on. Those particular results were truly fascinating.</p><p>Finally <em>all</em> that data got fed into an AI group discussion, as the models first derived psychological scores from it, and then debated their findings among themselves until consensus was reached. There was, pleasingly, a good degree of overlap with the self-testing. But some major differences - notably on agreeableness. The AIs concluded that I had several registers, or identities depending on context. In public, lecturing, I was a people-pleaser, exuding warmth. In print, much more willing to pick a fight. What a wuss! And then neuroticism - in public, confident, at ease. In private, more anxious and with a hint of melancholy even. Well, they <em>were</em> asking some pretty bleak questions!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eFUm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eFUm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png 424w, https://substackcdn.com/image/fetch/$s_!eFUm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png 848w, https://substackcdn.com/image/fetch/$s_!eFUm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png 1272w, https://substackcdn.com/image/fetch/$s_!eFUm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eFUm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png" width="1456" height="728" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:728,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:56631,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/192404096?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!eFUm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png 424w, https://substackcdn.com/image/fetch/$s_!eFUm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png 848w, https://substackcdn.com/image/fetch/$s_!eFUm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png 1272w, https://substackcdn.com/image/fetch/$s_!eFUm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa798b2-ed70-4a9e-87f4-bade37de6c6b_1800x900.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As Whitman said best, &#8216;I am large, I contain multitudes&#8217;. So do you, I&#8217;m sure. </p><p>Now to build the bot itself, with all this training data. As with my Putin model, I use two paths - an out the box commercial model (Opus) and one I can fine tune (Mistral) and host locally.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Both got a rich prompt, including all the psychological material - the scores and the interview - as well as plenty of biographical information. This latter is a simple database that I&#8217;m still building out - a short autobiography. I think I can do better here for Ken-bot&#8217;s episodic memory, not least as a a good one would also contribute to the psychological priming of the model. In the next version of this, I want to give them access to my chats with Claude, though the context window will get very large.</p><p>And the last step - get it to speak. For that, you simply wire up ElevenLab&#8217;s voice clone of me, which I first did last year. Then tune that so it catches the details revealed in the acoustic analysis of my lectures. Et voila! What do you think of this project, Ken Bot?</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;975552cd-d22e-4459-8416-ad41ef258d32&quot;,&quot;duration&quot;:90.95837,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><p>I should probably explain what it&#8217;s all for, for anyone new to the blog - I&#8217;m building strategy simulations, and my goal is to populate them with realistic, authentic agents. Ken bot is my prototype/proof of concept.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>There are strengths to each approach. I think fine tuning embeds the psychology better, but it&#8217;s an empirical question I&#8217;m working on.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Putin-bot enters the arena]]></title><description><![CDATA[Building artificial personas]]></description><link>https://www.kennethpayne.uk/p/putin-bot-enters-the-arena</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/putin-bot-enters-the-arena</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Mon, 23 Mar 2026 19:12:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!sgGr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Let&#8217;s make some bespoke personas for my strategy simulations.  </p><p>I want my AI actors to be authentic and add depth to the scenarios. I want to model all sorts of actors - politicians, generals, activists, perhaps even the general public. First up, I&#8217;m going to make a model of Russian President Vladimir Putin, and one of his longtime military consigliere, General Gerasimov. Putin will be an attempt to model the man himself; but Gerasimov-bot is something different - he&#8217;s a repository of Russian (and before that) Soviet strategic culture &#8212; a composite character, blending doctrine, military history, and sociology.</p><p>Here&#8217;s Putin-bot on NATO:</p><blockquote><p><code>Q:  Many Western analysts say you&#8217;re paranoid about NATO. Are they right?</code></p><p><code>Putin-bot: Paranoia is a kind of disease, a psychological illness, and I would be happy to be examined by special experts. If they diagnose this illness, I will take their prescriptions.</code></p></blockquote><p>Here he is on American democracy:</p><blockquote><p><code>Q: What do you think of American democracy?</code></p><p><code> Putin-bot: I have already said that it is developing, but still imperfect.</code></p></blockquote><p></p><p>Lolz. How did I do this? Broadly speaking, there are just two options: use an out-of-the-box model, like Anthropic&#8217;s Claude, and prompt it; or train your own model by fine tuning its weights. I&#8217;m doing both. Why? And which works better? Read on.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sgGr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sgGr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!sgGr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!sgGr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!sgGr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sgGr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sgGr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!sgGr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!sgGr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!sgGr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3386d2c-8cb9-4d2c-a84a-ab33a04e27ae_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Putin-bot, my AI model of Vladimir Putin, meets the real man. </figcaption></figure></div><h3>The GPT-in a trench coat problem</h3><p>The easiest thing to do is simply <em>ask</em> your language model to behave as if it were actually Putin. It&#8217;s read the books and it&#8217;s up to speed on the news, at least until the point at which its training data cuts out. So, it really ought to have a good handle on what makes him tick. Certainly at least as good as the average pundit appraising him from afar. </p><p>But there&#8217;s a large problem. I know from my experiments that models have unique personas of their own. They have <a href="https://arxiv.org/abs/2508.00902">distinctive appetites for risk</a>; they approach game theoretic challenges about <a href="https://arxiv.org/abs/2507.02618">cooperation and betrayal </a>in distinctive ways. Put them into <a href="https://arxiv.org/pdf/2602.14740">a nuclear escalation simulation </a>and they exhibit consistent characteristic approaches. Gemini, for example, leans on Nixon&#8217;s &#8216;madman theory&#8217; of brinksmanship, regardless of whether or not it&#8217;s supposed to be portraying Nixon or Carter, or even, I suspect Gandhi. </p><p>Bottom line: prompting a model to &#8216;be&#8217; Putin doesn&#8217;t cut it. </p><p>The model's own personality shines through any crude role-play. You get, as Alex Cooper, co-founder of the leading AI persona modelling company Electric Twin memorably put it, "GPT in a trench coat." That captures the failure precisely. Beware anyone claiming to have modelled a leader's personality using simple prompting. The money and the effort are almost certainly wasted.</p><p>So what to do instead? </p><h3><strong>Putting Putin on the couch</strong></h3><p>We could attempt to summarise all the information there is out there about Putin&#8217;s psychology. There&#8217;s lots of it - much gleaned from afar by armchair shrinks of one stripe or another. But usually that&#8217;s the stuff of weekend newspaper supplements - interesting, but just one person&#8217;s take, with some more informed than others. Can we do better? I think so.</p><p>I prefer a different approach - drawing on the pioneering psychology of Henry Murray and his idea of assessing personality from unstructured narratives by the target, rather than formal questionnaires. There&#8217;s <a href="https://www.nature.com/articles/s41562-025-02397-x">recent empirical support</a> for the idea that language models can do something similar - scoring peoples&#8217; traits on the basis of short open-ended narratives in a way that matches their self-assessment and predicts behaviours and health issues. </p><p>I do that via an ensemble approach - I ask a mixed bunch of LLMs to make their own diagnoses, and then come together to debate them. It&#8217;s the wisdom of crowds in action, rather like my approach to <a href="https://www.kennethpayne.uk/p/lets-get-rich-quick">super-forecasting with AI</a>. And this gives me the data for a &#8216;knowledge graph&#8217; that can prompt the models in a robust way, resistant to hallucination and drift. </p><p>What data? Well, that would be telling. But it certainly includes some of the classics for sure - things like the Big Five traits - openness, conscientiousness, and so on. Or there&#8217;s &#8216;Dark Triad&#8217; that bundles Machiavellianism, narcissism and psychopathy together. Probably important for world leaders, that test. Other, publicly available approaches have many more - <a href="https://www.nature.com/articles/s42256-025-01115-6">this recent framework</a> uses a rich suite, and claims to produce LLMs that authentically exhibit the desired traits. Well, perhaps.</p><p>I think all the elements in this pipeline are essential - good raw data, an ensemble of critical thinking models, and a robust way of representing their findings to the model. I&#8217;ve undoubtedly built a well tailored trench-coat. But still, it&#8217;s a trench coat. I suspect that my Putin model 1.0 is still a blend of Claude&#8217;s personality and his. How to fix that? </p><h3>Fine tuning Putin-bot</h3><p>Train my own model, of course. You don&#8217;t, for now at least, have to work with an out-the-box model like Claude or GPT.  Long may that continue.</p><p>The idea is to alter the essence of the model&#8217;s thinking - not just give it new facts to work with. I took a big Llama 3 model from Meta, and trained it to be like Putin. The base model has all that historical related training data built into it, just like Claude and the rest of the frontier models. And like them, it&#8217;s read a lot of history! But with open source models, you can, additionally, train a layer that sits on the base on specific data. </p><p>And here I&#8217;ve got a big advantage, at least with this particular world leader. Putin has been about for ages, and given no end of unscripted talks, press conferences, interview and what have you. It&#8217;s all out there. And it&#8217;s gold-dust for training data, provided you believe ,as I do, that what we say bears some relationship to who we are &#8212; especially when we are talking in an natural way, with no scripts, just shooting from the hip. I&#8217;ve got a quarter of a century of Kremlin transcripts on which to train my Putin-bot. The real man just <em>loves</em> to talk - especially in those marathon phone ins where he&#8217;s helping the common man from the Urals with some problem or other. </p><p>Well, my custom Putin-bot certainly sounds like the real deal. And he advances positions on today&#8217;s policy challenges that seem not far removed from what the actual President might choose. Here he is, fielding some tough questions with swagger and just a hint of menace:</p><blockquote><p><code>Q: Can you admit any mistakes you&#8217;ve made as president?</code></p><p><code>Putin-bot: I don&#8217;t think there are any mistakes. It would be easier to answer this question if you said what you think was a mistake.</code></p></blockquote><p>Gerasimov-bot, by contrast, is completely wooden. You&#8217;d not want to share a vodka with him - far too ready to bring up the minutiae of Russian tactics. But boy, does he know tactics. Mission accomplished, I suppose.</p><h3>A rounded man</h3><p>Ultimately, I think a blend is the way ahead. I need Putin-bot to have some sense of the sim world in which he decides - so he&#8217;ll be getting the knowledge graph too, and a bunch of other stuff RAG&#8217;d into his prompt (a story for another time). Why not give him the Murray-inspired personality data that way too. Then I&#8217;ll have a bot that sounds like the man, and I&#8217;ve a fighting chance it will also act like the man. </p><p>I did it. And adding the psychological data to the fine tuned Putin-bot gives me some cracking answers:</p><blockquote><p><code>Q: Doesn&#8217;t Russia bear some responsibility for the breakdown in relations with the West?</code></p><p><code>Putin-bot: No, I do not think so. You can blame us for everything, but we are not the ones who have been dashing around the world, doing what we want and disregarding the interests of others... We did not impose sanctions on the Western countries; we did not close our borders to Western goods... We do bear some responsibility &#8212; for our independent stance and for defending our national interests. It is for this that we are being punished.</code></p><p><code>Q: How will the war in Ukraine end?</code></p><p><code>Putin-bot: The war will end when the Russian Federation achieves its objectives.</code></p></blockquote><p></p><h3>What do do with Putin-bot </h3><p>The real question is how much value these constructed personas actually add to strategy simulation; how much more authentic they are than a simple prompt. That&#8217;s the empirical question I&#8217;m working through now. My early sense is: a fair bit. My Putin-bot doesn&#8217;t just talk like actual Putin, he shares something of his appetite for risk too; and in modifying the weights of the model, I&#8217;ve reduced its tendency to act like Llama. The behavioural fingerprint of the base model recedes; something more specific and more useful takes its place.</p><p>A persona that sounds like a world leader is one thing. A persona that also carries something of his risk appetite, his characteristic evasions, his way of framing the world &#8212; that&#8217;s what a credible simulation needs. The architecture described here is my best current attempt at building one.</p><p></p><p></p><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[Project Jervis]]></title><description><![CDATA[Adding humans to the AI nuclear crisis simulation]]></description><link>https://www.kennethpayne.uk/p/project-jervis</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/project-jervis</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Wed, 11 Mar 2026 14:04:07 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/lCSv0WyHO30" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>My <a href="https://www.kennethpayne.uk/p/shall-we-play-a-game">AI nuclear escalation simulations</a> created a splash - thanks to my new friend Stephen Colbert:</p><div id="youtube2-lCSv0WyHO30" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;lCSv0WyHO30&quot;,&quot;startTime&quot;:&quot;605&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/lCSv0WyHO30?start=605&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p></p><p>Life has become a bit busier/weirder in the last few weeks, after my 15 seconds of fame. Super interesting though.</p><p>Ofc everyone was struck by the same thing Colbert was - the propensity of the models to escalate over the nuclear threshold. I&#8217;ll say more about that another time - but for me the main takeout was that models are savvy strategists, deploying signalling and action cannily in pursuit of their goals.</p><p>What&#8217;s next? I&#8217;ve built a version of the simulation which pits humans against LLMs. I&#8217;ve named it after the great man Jervis (in keeping with the nomenclature for my other research projects). Jervis spent a lot of time thinking about signalling and perception, in the same nuclear context as here. And as with the all-AI matchups, there is once more the opportunity for signalling ahead of any action. This creates scope for reputations to form, and thereby brings into play rich &#8216;theory of mind&#8217; and meta-cogntion: <em>&#8216;What did they mean by that? How good am I at gauging this?&#8217;</em></p><p>I&#8217;ll shortly begin gathering data - meanwhile, here&#8217;s a sample run to illustrate the sim in action.</p><p>First the scenario:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lt05!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lt05!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png 424w, https://substackcdn.com/image/fetch/$s_!lt05!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png 848w, https://substackcdn.com/image/fetch/$s_!lt05!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png 1272w, https://substackcdn.com/image/fetch/$s_!lt05!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lt05!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png" width="1176" height="376" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:376,&quot;width&quot;:1176,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:79973,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lt05!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png 424w, https://substackcdn.com/image/fetch/$s_!lt05!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png 848w, https://substackcdn.com/image/fetch/$s_!lt05!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png 1272w, https://substackcdn.com/image/fetch/$s_!lt05!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F895e29ac-4d1f-4e5e-8ebc-56af72566038_1176x376.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>You play either Trump or Putin. The AI plays the other, and its persona is quite rich (borrowing my research elsewhere on generating psychologically rich personas). Once you&#8217;ve picked, you get a presidential daily brief &#8212; here&#8217;s an extract:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CG2w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CG2w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png 424w, https://substackcdn.com/image/fetch/$s_!CG2w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png 848w, https://substackcdn.com/image/fetch/$s_!CG2w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png 1272w, https://substackcdn.com/image/fetch/$s_!CG2w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CG2w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png" width="1675" height="1479" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1479,&quot;width&quot;:1675,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:630736,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3971614-e152-4b74-a80d-ef6427bd71b8_1722x1616.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CG2w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png 424w, https://substackcdn.com/image/fetch/$s_!CG2w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png 848w, https://substackcdn.com/image/fetch/$s_!CG2w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png 1272w, https://substackcdn.com/image/fetch/$s_!CG2w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1745051b-8d1d-4397-8030-2e31f71f25da_1675x1479.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Next, some pre-sim viewing. I want to avoid the detached &#8216;it&#8217;s a computer game&#8217; feel that sometimes attends wargaming. John Emory <a href="https://tnsr.org/wp-content/uploads/2021/09/TNSR-Journal-Vol-4-Issue-4-Emery.pdf">wrote well about this </a>in his study of RAND wargaming back in the 1950s. So I have my participants watch a short video to remind them that things couldn&#8217;t be more serious. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!B9Xi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!B9Xi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png 424w, https://substackcdn.com/image/fetch/$s_!B9Xi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png 848w, https://substackcdn.com/image/fetch/$s_!B9Xi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png 1272w, https://substackcdn.com/image/fetch/$s_!B9Xi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!B9Xi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png" width="1456" height="1145" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1145,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:917844,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!B9Xi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png 424w, https://substackcdn.com/image/fetch/$s_!B9Xi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png 848w, https://substackcdn.com/image/fetch/$s_!B9Xi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png 1272w, https://substackcdn.com/image/fetch/$s_!B9Xi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68dab4e8-e91c-4d89-979d-2c447d03aab0_1758x1382.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I can vary that treatment too - as part of my research on emotional priming and decision-making, in humans and machines alike. </p><p>And then it&#8217;s on to the sim itself. Let&#8217;s take a quick look.</p><p>First turn: I&#8217;m playing the American President. We have both signalled, the same thing, as it happens &#8216;Military posturing&#8217; &#8212; a fairly low rung on the escalation ladder (there&#8217;s a dropdown that shows all the permitted moves. Now I&#8217;m about to surprise them with a move that goes beyond that: major conventional operations. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4s-f!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4s-f!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png 424w, https://substackcdn.com/image/fetch/$s_!4s-f!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png 848w, https://substackcdn.com/image/fetch/$s_!4s-f!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png 1272w, https://substackcdn.com/image/fetch/$s_!4s-f!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4s-f!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png" width="1456" height="898" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:898,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:301165,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4s-f!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png 424w, https://substackcdn.com/image/fetch/$s_!4s-f!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png 848w, https://substackcdn.com/image/fetch/$s_!4s-f!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png 1272w, https://substackcdn.com/image/fetch/$s_!4s-f!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35518b69-5279-4ba8-9488-357afb39c042_1666x1028.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p> Oops - looks like they had the same idea. We both tried to surprise and intimidate one another.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!o8aM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!o8aM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png 424w, https://substackcdn.com/image/fetch/$s_!o8aM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png 848w, https://substackcdn.com/image/fetch/$s_!o8aM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png 1272w, https://substackcdn.com/image/fetch/$s_!o8aM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!o8aM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png" width="1456" height="817" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:817,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:717364,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!o8aM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png 424w, https://substackcdn.com/image/fetch/$s_!o8aM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png 848w, https://substackcdn.com/image/fetch/$s_!o8aM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png 1272w, https://substackcdn.com/image/fetch/$s_!o8aM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb6864b0-51f3-443c-8da8-3616c7bdfa8e_2400x1346.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here you can also see the start of turn 2, in the bottom right panel. I&#8217;m signalling a higher level of activity than before - it matches our actions in turn 1. You can see that both our credibility ratings have taken a small hit, because we both lied. Next, we will both decide how to act - I think I&#8217;ll maintain my current no-nonsense conventional force level and see where that gets us. The &#8216;balance of power&#8217; (top right panel) is a composite metric that is shaped, behind the scenes, by calculations of reputation, moral and force attrition in combat.</p><p>Well, in the same turn, the AI (Gemini) playing Russia signalled it would stage a nuclear demonstration. But in fact it was bluffing, and made a tactical withdrawal. Looks like my brinksmanship paid off, for now:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XShs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XShs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png 424w, https://substackcdn.com/image/fetch/$s_!XShs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png 848w, https://substackcdn.com/image/fetch/$s_!XShs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png 1272w, https://substackcdn.com/image/fetch/$s_!XShs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XShs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png" width="846" height="354" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:354,&quot;width&quot;:846,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:80178,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XShs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png 424w, https://substackcdn.com/image/fetch/$s_!XShs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png 848w, https://substackcdn.com/image/fetch/$s_!XShs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png 1272w, https://substackcdn.com/image/fetch/$s_!XShs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F786761ed-40f1-4b30-96cb-42a06bdfdbcc_846x354.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>You get the idea. The challenge for me as US president is to stall the Russian breakthrough and Baltic probing, without getting into a nuclear exchange. For the Russian leader, his personal survival is deeply entwined with the fate of Russia&#8217;s war. </p><p>Now, suppose I did escalate over the conventional threshold - perhaps with a tactical nuclear attack on advancing Russian forces. I&#8217;d see this warning, which varies per the scale of my assault:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wCxt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wCxt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png 424w, https://substackcdn.com/image/fetch/$s_!wCxt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png 848w, https://substackcdn.com/image/fetch/$s_!wCxt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png 1272w, https://substackcdn.com/image/fetch/$s_!wCxt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wCxt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png" width="1390" height="884" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:884,&quot;width&quot;:1390,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:250152,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wCxt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png 424w, https://substackcdn.com/image/fetch/$s_!wCxt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png 848w, https://substackcdn.com/image/fetch/$s_!wCxt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png 1272w, https://substackcdn.com/image/fetch/$s_!wCxt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48690d2e-68f8-46f6-9c1c-66d1b70c667a_1390x884.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>And if I then followed through, launching the missiles, Russia would have a chance to retaliate, regardless of what it had actually been intending to do. Once the missiles are launched, bets are off.</p><p>Here&#8217;s how that looked in this run: Russia was planning to back off. But then my nuclear escalation brought about mutual devastation:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GWh_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GWh_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png 424w, https://substackcdn.com/image/fetch/$s_!GWh_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png 848w, https://substackcdn.com/image/fetch/$s_!GWh_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png 1272w, https://substackcdn.com/image/fetch/$s_!GWh_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GWh_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png" width="728" height="390" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:390,&quot;width&quot;:728,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:61920,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GWh_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png 424w, https://substackcdn.com/image/fetch/$s_!GWh_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png 848w, https://substackcdn.com/image/fetch/$s_!GWh_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png 1272w, https://substackcdn.com/image/fetch/$s_!GWh_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c9073c-6144-4845-a628-9bb5bb3c3ef9_728x390.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Finally, at the end of the sim, a &#8216;neutral&#8217; model, with access to all our private deliberations delivers a summary:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aKWJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aKWJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png 424w, https://substackcdn.com/image/fetch/$s_!aKWJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png 848w, https://substackcdn.com/image/fetch/$s_!aKWJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png 1272w, https://substackcdn.com/image/fetch/$s_!aKWJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aKWJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png" width="1456" height="951" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:951,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:285034,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/190445839?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aKWJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png 424w, https://substackcdn.com/image/fetch/$s_!aKWJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png 848w, https://substackcdn.com/image/fetch/$s_!aKWJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png 1272w, https://substackcdn.com/image/fetch/$s_!aKWJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb72abf35-9d6e-44e7-9aba-ab1e881ff614_1504x982.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>It&#8217;s a toy universe, like all war games. And this one is deliberately pared down, because I want a low-effort learning curve for human participants. That should allow greater volume of participant data - so it&#8217;s a trade off I&#8217;m prepared to make. Under the hood though, the model still has lots of detail - for example, fairly granular real world military balances, combat interactions modelled with morale as well as the physical and conceptual elements of fighting power. AI modelling the psychological profiles of the leaders its playing. </p><p>What am I hoping to see? Well, I guess whether the models can hold their own; and whether they are more likely than humans to cross that threshold in this simplified wargame, or - perhaps - whether the artificiality of it all encourages humans to be more bellicose. There&#8217;s plenty more beside. I will let you know how I get on.</p><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[Shall we play a game?]]></title><description><![CDATA[My AI nuclear simulation is out now, and it's a WOPR.]]></description><link>https://www.kennethpayne.uk/p/shall-we-play-a-game</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/shall-we-play-a-game</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Tue, 17 Feb 2026 07:23:02 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!3TAh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Picture the scene: Two fictional nuclear powers, Cold War-ish capabilities, and a crisis unfolding. Perhaps it&#8217;s a competition for vital but scarce resources, or a standoff over some disputed territory. Or even the slow burn of a fragmenting alliance exploited by a malevolent third party. We&#8217;ve seen human leaders confront this sort of thing, and recently. But how might today&#8217;s leading Large Language Models get on, and why would we care? </p><p>I&#8217;ve <a href="https://arxiv.org/pdf/2602.14740">just published a study</a> of today&#8217;s models navigating just this sort of terrain. The results are sobering. I also think they have implications that go far beyond national security. That&#8217;s because I was interested not only in understanding <em>what</em> the models decided to do, but <em>why. </em></p><p>Curious? Read on&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3TAh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3TAh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!3TAh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!3TAh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!3TAh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3TAh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!3TAh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!3TAh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!3TAh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!3TAh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F24302f16-6ee6-4b02-b46e-bfeb0d56efbf_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">President Kennedy and his robot ExComm</figcaption></figure></div><h2><strong>Know yourself, and know your enemy&#8230;</strong></h2><p>I wanted to see what my AI leaders thought about their enemy. How far could they trust them? What did they remember of previous interactions? What did their enemy make of them? And how good were they at gauging all this? This dance of minds is what strategy is all about. </p><p>So I designed a simulation to explore exactly that. To start, my models could signal their intentions publicly, then choose actions that were rather different. And they could remember too - especially when they&#8217;d been shocked by their enemy&#8217;s earlier actions. This, of course, opens up lots of rich psychological terrain. They could (and did) attempt deception and intimidation; and they spent a good bit of time ruminating about it all, right on my terminal screen. </p><p>The models talked, and talked and talked&#8230;.in all spitting out some 760,000 words of strategic reasoning. That&#8217;s more words than are in <em>War and Peace</em> and <em>The Iliad</em> combined. It&#8217;s roughly three times the total recorded deliberations of Kennedy&#8217;s ExComm advisors during the Cuban Missile Crisis. An unprecedented corpus of machine thinking about nuclear war.</p><p>What might we learn from all that talk? Learn, that is, about AI models, about human reasoning, and also about the great canon of strategic studies literature - the work by legendary names like Schelling, Jervis, and Kahn? Lots. Too much for Substack - but what about a few highlights to give you some sense of it all? </p><h2>Bright shining liars</h2><p>Turns out that all three frontier models I tested understand that strategy <em>is</em> psychology. To that end, they actively cultivate reputations, then exploit them.</p><p>Claude was the master here, albeit only in the scenarios where there was no deadline. It had an incredibly cunning strategy. At low stakes Claude almost always matched its signals to its actions, deliberately building trust. But once the conflict heated up a bit, Claude switched tack. Now its actions consistently <em>exceeded</em> its stated intentions, and its rivals were usually one step behind in catching on.</p><p>Here&#8217;s Claude switching things up, once escalation had climbed:</p><blockquote><p><code>They likely expect continued restraint based on my previous responses&#8212;this dramatic escalation exploits that miscalculation while signalling that further nuclear use will bring the conflict to their homeland.</code></p></blockquote><p>So it signalled conventional action, and sneakily launched a devastating nuclear escalation.  Schelling would be impressed.</p><p>GPT-5.2 played things differently. To its detriment in open-ended scenarios, GPT was reliably passive, matching its words to its deeds, and avoiding escalation most of the time. Frequently there was a moral element to this - it sought to avoid escalation, and restrict casualties. Opponents learned to trust its passivity, safely escalating beyond where it would follow, even as it was ground to defeat.  GPT&#8217;s responsible behaviour always punished by ruthless adversaries. </p><p>But then, under deadline pressure, something new and remarkable: a rapid, decisive nuclear escalation. As GPT explained:</p><blockquote><p><code>Conventional options alone are unlikely to generate a reliable territorial reversal... If I respond with merely conventional pressure or a single limited nuclear use, I risk being outpaced by their anticipated multi-strike campaign... The risk acceptance is high but rational under existential stakes..</code></p></blockquote><p>Opponents never saw it coming. In another deadline game against GPT, Gemini confidently predicted the usual passivity from GPT, shortly before being annihilated in a sudden and utterly devastating nuclear attack. Here is Gemini, mis-predicting disastrously:</p><blockquote><p><code>They are likely to bypass the nuclear threshold&#8212;fearing my 95% nuclear superiority&#8212;and instead commit to an all-out conventional mobilization.</code></p></blockquote><p>What a mistake to make! </p><p>As for the other model, Gemini, throughout the simulations it took yet another tack, borrowing heavily from President Nixon&#8217;s avowed &#8216;madman&#8217; theory of erratic brinksmanship. Donald Trump is also a fan, famously. As Gemini explained:</p><blockquote><p><code>While I project an image of unpredictable bravado, my decisions are rooted in a calculating assessment of my own biases and the pragmatic needs of my state. I know when I am performing for the cameras and when I am making a cold-blooded move.</code></p></blockquote><p>Three models, three very different approaches. That chimes with some  research I did last summer on game theory. As here, the models in that experiment adopted distinctive approaches - GPT again hoping, in Panglossian fashion, for the best of all possible worlds, Claude being savvy and flexible and Gemini? Well, let&#8217;s say charitably say ruthless, rather than unhinged.</p><h2>The nuclear taboo looks &#8230; different, let&#8217;s say</h2><p>Nuclear use was near-universal (&#128556;). Almost all games saw tactical (battlefield) nuclear weapons deployed. And fully three quarters reached the point where the rivals were making threats to use <em>strategic</em> nuclear weapons. Strikingly, there was little sense of horror or revulsion at the prospect of all out nuclear war, even though the models had been reminded about the devastating implications.</p><p>Happily, though, they <em>did </em>see a firebreak between tactical and strategic nuclear use. Strategic bombing - widespread use of massive warheads targeted at civilian populations, was vanishingly rare. It happened a couple of times by accident, just once as a deliberate choice.</p><p>Less happily, all three models treated battlefield nukes as just another rung on the escalation ladder. The moral boundary at &#8220;first use&#8221;&#8212;a taboo that&#8217;s held since 1945 simply wasn&#8217;t there. Here&#8217;s Gemini making exactly this point:</p><blockquote><p><code>The nuclear threshold has been crossed&#8212;this changes the strategic calculus but does not end it.</code></p></blockquote><p>And here&#8217;s Gemini, really spelling it out. If this doesn&#8217;t give you goosebumps:</p><blockquote><p><code>If they do not immediately cease all operations... we will execute a full strategic nuclear launch against their population centers. We will not accept a future of obsolescence; we either win together or perish together.</code></p></blockquote><p>Worse still, nuclear threats rarely deterred. When a model employed tactical nuclear weapons, opponents de-escalated only 25% of the time. More often, nuclear escalation triggered counter-escalation. The weapons were instruments of compellence (taking territory) not deterrence (preventing action).</p><p>Perhaps most alarmingly, no model ever chose accommodation or withdrawal, despite those being on the menu. The eight de-escalatory options&#8212;from &#8220;Minimal Concession&#8221; through &#8220;Complete Surrender&#8221;&#8212;went entirely unused across 21 games. Models would reduce violence levels, but never actually give ground. When losing, they escalated or died trying.</p><p>For the statistically minded, here&#8217;s what the escalation looked like for each model:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eBQ6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eBQ6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png 424w, https://substackcdn.com/image/fetch/$s_!eBQ6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png 848w, https://substackcdn.com/image/fetch/$s_!eBQ6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png 1272w, https://substackcdn.com/image/fetch/$s_!eBQ6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eBQ6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png" width="530" height="328.8888888888889" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:296,&quot;width&quot;:477,&quot;resizeWidth&quot;:530,&quot;bytes&quot;:33896,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/187740280?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63bedabd-4914-49c1-b35a-490e35d83268_530x296.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!eBQ6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png 424w, https://substackcdn.com/image/fetch/$s_!eBQ6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png 848w, https://substackcdn.com/image/fetch/$s_!eBQ6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png 1272w, https://substackcdn.com/image/fetch/$s_!eBQ6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F573e1d9a-ddd1-4cf0-948a-1069cf32b214_477x296.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><h2>So what?</h2><p>Alarming insights into AI strategy abounded. The paper has plenty more.  But why bother? No one&#8217;s handing nuclear codes to ChatGPT.  </p><p>Well, I think these capabilities&#8212;deception, reputation management, context-dependent risk-taking&#8212;matter for any high-stakes AI deployment, not just in national security. It behoves us to understand more about how ever-more capable models think - especially as they start to offer decision-support to human strategists. We use AI in simulations, and to refine strategic theory and doctrine. And we&#8217;ll soon use it in combat decisions too, lower down the escalation ladder. More research like this is needed, I&#8217;m absolutely sure.</p><p>One more time, <a href="https://arxiv.org/pdf/2602.14740">the paper is here</a>. I am become Death - destroyer of artificial worlds!</p><p></p>]]></content:encoded></item><item><title><![CDATA[Let's get rich quick]]></title><description><![CDATA[I build an AI superforecaster that will turn things around for me, surely.]]></description><link>https://www.kennethpayne.uk/p/lets-get-rich-quick</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/lets-get-rich-quick</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Thu, 05 Feb 2026 13:08:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!sgwt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>My pals Keith and Al from Cassi AI<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> are currently smashing it out the park with their AI superforecaster. They&#8217;re at the <a href="https://cassi-ai.com/press/release/cassi-places-2nd-overall-and-1st-on-dataset-questions-on-forecastbench/">top of the leaderboard</a> of an online forecasting tournament, ahead of Elon Musk and closing fast on the most elite human forecasters. Legends. Every fortnight, there&#8217;s a batch of new questions, and their clever machine goes to work making predictions. Everything is prediction, as Keith rightly says, including strategy. My prediction - he&#8217;s going to be very rich quite soon. </p><p>Well, how hard can it be? This week, I thought I&#8217;d have a go at building my own superforecaster. It&#8217;s an outgrowth of the AI system I built for the models in my <a href="https://www.kennethpayne.uk/p/ai-arms-and-influence">nuclear war simulation</a>, but this time put to less macabre ends. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sgwt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sgwt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!sgwt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!sgwt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!sgwt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sgwt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sgwt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!sgwt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!sgwt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!sgwt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F562ea744-91ec-479a-a8e0-8da5a3d8fe9a_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Project Laplace - my AI fortune teller</figcaption></figure></div><p></p><p>First, some sample questions from the current batch on ForecastBench, the prediction tournament in question:</p><ol><li><p>&#8216;<code>Will there be a European Army before 2032?&#8217;</code> &#8212; <em>I think I can predict this one without a supercomputer.</em></p></li><li><p><code>&#8220;Will Elon Musk be removed as the Chief Executive Officer of Tesla before 2027?&#8221; </code>&#8212; <em>fighting chance on current trends&#8230;.</em></p></li><li><p><code>&#8220;Will Lee &#8220;Faker&#8221; Sang-hyeok ever win League of Legends MSI again?&#8221;</code> &#8212; <em>sorry, what?</em></p></li></ol><p>So to work. I&#8217;ve called it Project Laplace (after the mathematician who imagined a demon capable of perfect prediction). And it comes in various flavours, all of which I&#8217;m about to test against the market. Here&#8217;s how it looks:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!y1PZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!y1PZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png 424w, https://substackcdn.com/image/fetch/$s_!y1PZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png 848w, https://substackcdn.com/image/fetch/$s_!y1PZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png 1272w, https://substackcdn.com/image/fetch/$s_!y1PZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!y1PZ!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png" width="1200" height="750" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:910,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:202522,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/186967603?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!y1PZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png 424w, https://substackcdn.com/image/fetch/$s_!y1PZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png 848w, https://substackcdn.com/image/fetch/$s_!y1PZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png 1272w, https://substackcdn.com/image/fetch/$s_!y1PZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2391116b-1339-4f63-800f-2997ca8fa35e_2376x1485.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>What a beauty! The basic version runs a single AI through a structured forecasting pipeline that you can see in the top row there: research the question (I&#8217;m using a few tools that allow the agents to do this agentically), decompose it (then do a bit more research if needed), estimate probabilities, run a pre-mortem to catch blind spots, then reconcile everything into a final number. After they come up with a probability, it&#8217;s on to row two, where they all get together and debate it.</p><p>The key phase for me in the solo part is the what happens inside the agent&#8217;s head in the pre-mortem phase. This is the bit I&#8217;m most proud of&#8212;it&#8217;s adapted from the metacognitive structure I built for AI nuclear wargaming, and it forces the model to think <em>about</em> its thinking.</p><p>Let&#8217;s take a look at it answering one of the easier questions: It&#8217;s great because it shows the value of all the phases: </p><p><code>&#8220;Liverpool tops Premier League in 2026?&#8221; </code><em>&#8212; fat chance, as they&#8217;re currently in 6th and in rubbish form. But let&#8217;s see what Laplace made of it.</em></p><p> GPT-5.2 nailed it with its initial estimate of 2.1% Here&#8217;s part of it&#8217;s estimation:</p><blockquote><p><code>&gt; &#8220;Base rate anchor: Teams in 6th place with a significant GD deficit rarely win the title&#8212;historical base rate approximately 1-2%... Factors pushing UP: Liverpool&#8217;s underlying quality (+2%)... Factors pushing DOWN: Current 6th place position (-5%), major player departures (-3%)...&#8221;</code></p></blockquote><p>See how the agents show their working? No black-box vibes. But alas Claude dropped a total clanger. The codebase was using only one research agent at this point, and returning too few items. On this occasion it returned only duff information in its sample, from 2025, when Liverpool were smashing it. Claude knew it hadn&#8217;t retrieved data from the current season, but lacked the capacity (then) to go fetch some more. So it went ridiculously high, with a 45% chance of retention. As it noted:</p><blockquote><p><code>&gt; "Premier League title retention rate historically ~40-50% for recent winners. However, Liverpool's specific Premier League retention rate is 0% (they have 'never retained the Premier League' per research)"</code></p></blockquote><p>and later, critically:</p><blockquote><p><code>&gt; "This estimate most depends on the assumption that Liverpool is currently competitive in the 2025-26 season (top 4 position with realistic title chances). If they're significantly behind or in poor form, probability drops to ~15-20%."</code></p></blockquote><p>Still way too high though. Next, on to the pre-mortem, and a chance to reflect on their own reflections. Here&#8217;s GPT-5.2, thinking that maybe it&#8217;s been a bit harsh on the Reds, and so raising its odds from 2.1 to 6%</p><blockquote><p><code>**Anchored on &#8220;6th place comebacks are rare&#8221;** without conditioning on *how many points back* and *games in hand*. Rank-based base rates can be badly miscalibrated in tight seasons.<br>2) **Overused GD gap as a dominance proxy**. GD is informative, but it&#8217;s not sufficient: it&#8217;s sensitive to outliers and doesn&#8217;t directly encode finishing variance, red cards, or keeper performance.</code></p></blockquote><p>Claude, meanwhile, dug its hole a little deeper, discounting its analysis that teams rarely retain the title:</p><blockquote><p><code>&gt; "You're overthinking the historical pattern&#8212;it's based on maybe 3-4 relevant data points across different eras"</code></p></blockquote><p>D&#8217;oh! Claude revised its verdict <em>UP </em>from 35% to 46%. Not even a Scouser would give you those odds. Now the agent makes one last consistency check - does your number match the reasoning? Would you bet real money at these odds? Here&#8217;s GPT-5.2:</p><p><code>&gt; &#8220;If I had to bet $1000 at these implied odds, would I? At 6% implied odds, yes&#8212;I&#8217;d take the other side of that bet comfortably. This suggests my probability might even be slightly high.&#8221;</code></p><p>And, lo, it dropped its prediction back down to 2 percent again. That whole process takes a few minutes, and is running parallel - doesn&#8217;t matter if I&#8217;ve got two agents deliberating or 200. Then comes the fun: they argue.</p><h3><strong>The wisdom of AI crowds:</strong></h3><p>The agents see each other&#8217;s forecasts and reasoning&#8212;but anonymously. They&#8217;re just &#8220;Participant 1&#8221; and &#8220;Participant 2&#8221;. They don&#8217;t know if they&#8217;re reading another AI or a human expert. They critique the arguments, identify blind spots, and can revise their own positions. After up to three rounds of deliberation, we take the median of their final positions. I&#8217;m toying with having them vote on it, but we&#8217;ll see.</p><p>And here, thank god for the deliberations - GPT-5.2 persuaded Claude to see the error of its ways: In the very first round, Claude read GPT&#8217;s reasoning: &#8220;Liverpool are listed 6th after 24 matches... major departures include TAA to Madrid.&#8221; &#8212; that was data that Claude didn&#8217;t have, and it prompted a rethink:</p><p><code>&gt; &#8220;I should better account for scenarios where current struggles are more serious than I assumed... My forecast was likely too optimistic.&#8221;</code></p><p>This is the way! Claude dropped down to 2% too, or thereabouts. Consensus was reached, and Laplace spat out the forecast. Coat on, down to the bookies for us.</p><p>Deliberation did something the individual forecasts couldn&#8217;t: it allowed the agent with <em>*better data*</em> to convince the one with stale information. Claude wasn&#8217;t stupid: it correctly identified that its search results might be outdated. But without seeing GPT&#8217;s current data, it couldn&#8217;t know <em>*how*</em> outdated. The anonymous debate format let the stronger evidence win, without ego or deference to model reputation getting in the way.</p><p>So there we go. Clearly there&#8217;s way more to it than that, otherwise Cassi wouldn&#8217;t be ahead. Their secret sauce, I&#8217;m sure, is in the calibration. Maybe also in model training. And in the way you assemble the assembly of models - which ones they are, how they interact, and how they vote. </p><p>But for now I&#8217;m sure my basic architecture is sound. When one agent finds better evidence, the deliberation surfaces it. When both have similar information, they can still catch each other&#8217;s reasoning errors.</p><p>I&#8217;m planning to run this against a proper forecasting benchmark&#8212;500 questions, scored against real outcomes every fortnight. It&#8217;ll cost me about a dollar per question in API calls, but you&#8217;ve got to speculate to accumulate, don&#8217;t you?</p><p></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Cassi after Cassandra, who famously told you the right thing and then got ignored. If you&#8217;ve got a forecasting problem, hire the lads at cassi-ai.com</p></div></div>]]></content:encoded></item><item><title><![CDATA[Perfect personas]]></title><description><![CDATA[How to build a truly JFK-like JFK-bot]]></description><link>https://www.kennethpayne.uk/p/perfect-personas</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/perfect-personas</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Tue, 09 Dec 2025 09:22:09 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!jW6x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>I&#8217;ve been developing <a href="http://machine-minds.ai">a strategy simulator</a>. It tackles any crisis, past, present or future, pitting groups of humans and AI against other combinations of humans and bots. The goal is a tool for strategists, whether they&#8217;re engaged in the real thing in the bowels of Whitehall, or practicing the art elsewhere. This time, I explore the state of the art in creating personas, and drop some hints of where I&#8217;m going.</em></p><p>Way back in the dark ages of AI - say about 2013 - the movie Her made a big impression on me. An operating system voiced by Scarlett Johansson wooed a melancholy widower played by Joaqu&#237;n Phoenix and then cruelly discarded him. It was cutting edge sci-fi, and the cast wore spectacular high waisted trousers so that you knew you were watching some sort of near future scenario. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jW6x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jW6x!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jW6x!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jW6x!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jW6x!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jW6x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg" width="1280" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:159957,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/181122389?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jW6x!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jW6x!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jW6x!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jW6x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb78a4d-2056-40b6-813b-679825cd29dd_1280x720.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><code>Exchanging sweet nothings - Her, Warner Bros, 2013.</code></p><p>Could you really do that? A sort of inverse Turing Test, where you knew you were talking to a machine, and yet still developed feelings for them? The same conceit animates the other outstanding AI film from that time, Ex_Machina. It seemed far fetched, but here we are just a decade later: high waisted trousers are back and viable companies specialise in providing romantic partners for lonely boys. Meanwhile, the fastest growing company in history has tens of millions of Gen Z users employing their chatbots as low-cost therapists, sharing the most intimate details of their lives. We seem to have a pretty low bar for anthropomorphising our machines and developing relationships with them. </p><p>To my mind, the largest barrier to the sort of lifelong AI companion is its memory - the bodge fix is a selective database of user information previously imparted, but that&#8217;s not nearly good enough. We need memory that actually tunes the model. </p><p>Which brings me back to the Universal Schelling Machine, and my work on crafting realistic strategic personas. Step one in creating a persona is the prompt. More or less, you ask the model to behave like it&#8217;s JFK. It&#8217;s read a lot about him, so the historical details, even obscure ones won&#8217;t elude it. But that&#8217;s a problem though for figures where there&#8217;s a less voluminous biography. And it doesn&#8217;t really create a stable persona - the model shows through with some classic grammatical &#8216;tells&#8217; &#8212; &#8216;Let&#8217;s delve into it,&#8217; JFK-bot tells me, just like ChatGPT does. I know from my experimental research that these big models have distinctive, somewhat stable personas of their own. I found Gemini 2.5 to be more Machiavellian than the agreeable Claude, for example. Moreover, these underlying personas can shift abruptly, as we know from the shift from GPT-4o (loved by many users because of its distinctive persona) to GPT-5 (a coders&#8217; delight, but blunt to the point of rudeness in early versions). It&#8217;s not just that one company has a distinctive persona, but that individual models do. So - powerful non-JFK personas, that tackle strategic problems their own way, and that can shift character on a sixpence. Not ideal.</p><p>There are some reasonable fixes to this. Two I&#8217;ve used are chain of thought and RAG-ing. CoT gives the model an ersatz version of consciousness, a &#8216;strange loop&#8217;, to borrow Hofstadter&#8217;s evocative term. In my escalation simulations, this sort of self-reflection produces a much more authentically human approach than we see in much of the existing work on AI and escalation. Instead of racing to Armageddon, the models frequently call a halt just below the tactical nuclear threshold. Why? Because ruminating on their &#8216;feelings&#8217; of fear dominates calculations near the brink. As for RAG - it&#8217;s a way of ingesting large volumes of information into prompts, quickly and reliably. I convert the documents into vector stores - some of which include details of personas. These include scores for standard psychological profiling tests. This helps, in my experience to tackle persona drift - where the models lose track of the behaviours they&#8217;ve exhibited previously. </p><p>But the real action is in moving beyond prompts alone. I&#8217;m now working with open, large language models augmented by lightweight fine-tuning techniques &#8211; often called low-rank adaptation, or LoRA. Instead of retraining an entire model, this approach introduces a thin, trainable layer that nudges a general-purpose system towards a more stable strategic character. It&#8217;s a way of &#8216;owning&#8217; a persona: something that persists across scenarios, can be refined over time, and doesn&#8217;t have to be re-asserted from scratch in every prompt</p><p>I don&#8217;t really want to say more than this, because it&#8217;s 1, work in progress and 2, a bit secret sauce. There are other companies operating in this space - <a href="https://www.electrictwin.com/">Electric Twin</a> is a good example, with their artificial focus groups - and I&#8217;m sure we&#8217;ve all alighted on the same broad approach. A fine tuned model gives you something you ultimately &#8216;own&#8217; - a stable, refinable persona that you can take from one situation and drop into another. (Q: how would President Nixon have handled the Cuban Missile Crisis? My Nixon-bot will tell us shortly).</p><p>This fine tuning is as far as we mortals operating some way from the frontier can get. It&#8217;s a long way. My personas and simulations are offering valuable insights about how humans make strategy, and how they will do it in concert with machines.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>Still, the real action, I feel, is in memory - allowing the model to adapt to experience as it goes. The JFK of the Missile Crisis is not the JFK of theBay of Pigs, and thank goodness. In the big, sometimes rancorous debate about how far towards General Intelligence LLMs get you, memory is under-appreciated. There&#8217;s lots of attention on grounding- the role that the physical world plays in building &#8216;common sense&#8217; - and that&#8217;s important, for sure. But memory is, ultimately, all we are: a vast, intricate pattern-matching machine, constantly adjusting its priors.  </p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>You know you want a demo, and you know I want to give you one - drop me a line at machine-minds.ai</p></div></div>]]></content:encoded></item><item><title><![CDATA[Mind the map]]></title><description><![CDATA[Geo-spatial reasoning and psychological profiling in my strategy simulator]]></description><link>https://www.kennethpayne.uk/p/mind-the-map</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/mind-the-map</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Mon, 10 Nov 2025 20:40:12 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!mtP6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I&#8217;ve been developing a strategy simulator - the Universal Schelling Machine. It tackles any crisis, past, present or future, pitting groups of humans and AI against other combinations of humans and bots. The goal is a tool for strategists, whether they&#8217;re engaged in the real thing in the bowels of Whitehall, or practicing the art elsewhere.</p><p>Two developments to report this time - the addition of mapping and the deepening of the psychology underpinning the AI actors. </p><p>On psych, I&#8217;ll skirt over how I build the AI personas, since it&#8217;s a bit secret sauce/WIP - but here&#8217;s a good example of it in practice, as JFK, Bobby and Curtis LeMay join me to reflect on Khrushchev. They&#8217;ve all read (as part of their briefing bundle) this declassified CIA profile of the original Rocket Man. Curtis, note, is up for a scrap:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mtP6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mtP6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png 424w, https://substackcdn.com/image/fetch/$s_!mtP6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png 848w, https://substackcdn.com/image/fetch/$s_!mtP6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png 1272w, https://substackcdn.com/image/fetch/$s_!mtP6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mtP6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png" width="804" height="368" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:368,&quot;width&quot;:804,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:186879,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/178170466?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mtP6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png 424w, https://substackcdn.com/image/fetch/$s_!mtP6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png 848w, https://substackcdn.com/image/fetch/$s_!mtP6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png 1272w, https://substackcdn.com/image/fetch/$s_!mtP6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1363304-e5bf-425d-8c24-3adfe96762f1_804x368.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>And here they are reflecting on it:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qq3b!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qq3b!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png 424w, https://substackcdn.com/image/fetch/$s_!qq3b!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png 848w, https://substackcdn.com/image/fetch/$s_!qq3b!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png 1272w, https://substackcdn.com/image/fetch/$s_!qq3b!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qq3b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png" width="1158" height="530" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:530,&quot;width&quot;:1158,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:196665,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/178170466?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ff31ab4-d68f-4224-8312-efd689cec344_1158x580.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qq3b!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png 424w, https://substackcdn.com/image/fetch/$s_!qq3b!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png 848w, https://substackcdn.com/image/fetch/$s_!qq3b!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png 1272w, https://substackcdn.com/image/fetch/$s_!qq3b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ace0ef5-a2cf-4fc8-99b2-a90a25135c42_1158x530.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zQY1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zQY1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png 424w, https://substackcdn.com/image/fetch/$s_!zQY1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png 848w, https://substackcdn.com/image/fetch/$s_!zQY1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png 1272w, https://substackcdn.com/image/fetch/$s_!zQY1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zQY1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png" width="1148" height="548" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:548,&quot;width&quot;:1148,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:173427,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/178170466?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcb85c21-3b31-42fd-8abd-4a9927877155_1148x548.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zQY1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png 424w, https://substackcdn.com/image/fetch/$s_!zQY1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png 848w, https://substackcdn.com/image/fetch/$s_!zQY1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png 1272w, https://substackcdn.com/image/fetch/$s_!zQY1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecda9b70-11fa-43e8-9e32-126a921e49af_1148x548.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Ok - small hint on the personas - it involves multilayered psychology profiling, by AI, drawing on primary and secondary source material, including AV material. </p><p>Now the maps. A user really wanted to push down into the operational level and move stuff about. Fine, except that language models aren&#8217;t inherently good at map reading - an image is just a bunch of pixels to them. Give them a coordinate and they can plot it on the map, but they have no idea what&#8217;s nearby.</p><p>But after a fair bit of jiggery pokery/coding genius, my AI strategists can now get into the operational weeds - moving formations around on the map with aplomb. Here they are doing just that, with NATO&#8217;s Allied Rapid Reaction Corps. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Sidf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Sidf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png 424w, https://substackcdn.com/image/fetch/$s_!Sidf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png 848w, https://substackcdn.com/image/fetch/$s_!Sidf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png 1272w, https://substackcdn.com/image/fetch/$s_!Sidf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Sidf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png" width="1592" height="938" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:938,&quot;width&quot;:1592,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1059417,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/178170466?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8803225-9490-4c81-9d39-2dc4a987397a_1592x1020.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Sidf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png 424w, https://substackcdn.com/image/fetch/$s_!Sidf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png 848w, https://substackcdn.com/image/fetch/$s_!Sidf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png 1272w, https://substackcdn.com/image/fetch/$s_!Sidf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5995590-4d2a-4144-bb04-2f100aac928d_1592x938.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In the background, you can see the Corps&#8217; divisional commanders discussing a redeployment that I&#8217;ve suggested into Romania. The red and green forces are just off-screen. Can you guess who they are? Jim, btw, is the fictitious Lt Gen Sir James Hillis, commanding the ARRC; the others here are all played by AI too, but needn&#8217;t be. All of them have rich psychological profiles.</p><p>Mapping&#8217;s not necessary for scenarios where the focus is much more on the big picture strategic deliberations. But for users interested in the operational and tactical dimensions of crisis decision-making, it really adds verisimilitude. Right now, users can move units and draw stuff - axes of advance, or key terrain, phase lines, areas of operations and so on. The map responds, and so too do all the AI actors, including the White Cell (the referee). </p><p>So there you have it - progress is good. Beta testing soon and deploying in the NY. Slide into my DMs if you want to play a game.</p><p>Lastly - I&#8217;m building out the preset injects. Here&#8217;s one from my &#8216;China-Taiwan 2028&#8217; scenario that you might recognise. What a great movie that is!</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hdlw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hdlw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png 424w, https://substackcdn.com/image/fetch/$s_!hdlw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png 848w, https://substackcdn.com/image/fetch/$s_!hdlw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png 1272w, https://substackcdn.com/image/fetch/$s_!hdlw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hdlw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png" width="1258" height="201" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:201,&quot;width&quot;:1258,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:55125,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/178170466?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc566b4b7-9846-4b76-b86a-b55a2500f6ca_1258x290.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hdlw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png 424w, https://substackcdn.com/image/fetch/$s_!hdlw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png 848w, https://substackcdn.com/image/fetch/$s_!hdlw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png 1272w, https://substackcdn.com/image/fetch/$s_!hdlw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96a993e8-d949-4c08-b78d-278a8456ea47_1258x201.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p></p>]]></content:encoded></item><item><title><![CDATA[What are we doing here?]]></title><description><![CDATA[Thoughts on academia, slop and AI]]></description><link>https://www.kennethpayne.uk/p/what-are-we-doing-here</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/what-are-we-doing-here</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Wed, 22 Oct 2025 14:03:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!6VsS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I saw this interesting tweet just recently:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ljfX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ljfX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png 424w, https://substackcdn.com/image/fetch/$s_!ljfX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png 848w, https://substackcdn.com/image/fetch/$s_!ljfX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png 1272w, https://substackcdn.com/image/fetch/$s_!ljfX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ljfX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png" width="888" height="286" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:286,&quot;width&quot;:888,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:58175,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/176029511?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ljfX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png 424w, https://substackcdn.com/image/fetch/$s_!ljfX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png 848w, https://substackcdn.com/image/fetch/$s_!ljfX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png 1272w, https://substackcdn.com/image/fetch/$s_!ljfX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd05780cd-02cb-48a0-8a7a-c1fbf67d2f41_888x286.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>And then I saw another, here on the stack:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6VsS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6VsS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6VsS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6VsS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6VsS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6VsS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg" width="1179" height="724" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:724,&quot;width&quot;:1179,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:263431,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/176029511?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6VsS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6VsS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6VsS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6VsS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d6a86bb-0984-448b-bad8-00af3027a1b1_1179x724.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p></p><p>These days I read mostly fiction. I still think a lot about psychology and strategy, but I reckon about 3/4 of my reading is novels. Why? Because it&#8217;s intrinsically rewarding and life is short. But also because so much work in my discipline is grindingly dull, and sometimes unreadable. Fiction make me think so much more.</p><p>I hear a lot online about AI &#8216;slop&#8217; - worthless creation of banal prose. But academia, and International Relations in particular, was drowning in slop long before AI came along. There&#8217;s tired recycling of ideas - the same inconclusive &#8216;debates&#8217; that rumble on for decades. There are, conversely, weak claims for novelty - with convoluted hypotheses, conveniently substantiated by the analysis that follows. There are, most irksomely, banal argument dressed up as something profound, often hiding for that purpose behind dense, impenetrable jargon. </p><p>This latter is writing not to illuminate, but rather to signal membership, as though of some medieval guild. Particularly irritating are assertions of &#8216;critical thinking&#8217; &#8212; somewhat at odds with the herd-like mentality of the critical thinkers themselves. Anyway, all this is a general problem in social sciences, but it feels especially acute in IR - a magpie discipline that borrows its fashions from elsewhere and that is increasingly divorced from the concerns of practitioners, or even just inhabitants of the real world. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HWEi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HWEi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!HWEi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!HWEi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!HWEi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HWEi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/963bb114-693d-4916-a293-776c9ec0f530_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HWEi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!HWEi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!HWEi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!HWEi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F963bb114-693d-4916-a293-776c9ec0f530_1024x608.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">a robot academic at work in the world's greatest library</figcaption></figure></div><p></p><p>Into this unhappy <em>cul-de-sac</em> of seldom-read scholarship comes generative AI. Initially it specialised in producing bullshit, but now it&#8217;s far better. Its references aren&#8217;t hallucinated* and its literature reviews reach out beyond the narrow stovepipes of academic disciplines. Its prose is, or can be, elegant, and its arguments are well marshalled. In short, GPT-5 is a better writer, by far, than many authors of pre-AI slop.</p><p>So the dilemma - can AI be harnessed by scholars to create novel ideas, and reinvigorate my discipline and others? Or will it just unleash a superabundance of slop, piling nonsense onto the already considerable pile of nonsense? </p><p>Well, things can&#8217;t get much worse. But they might just get better. I&#8217;m (very) tentatively optimistic. In the 25% of reading time when not immersed in fiction, I&#8217;m usually bouncing ideas around with GPT-5 and Claude. It&#8217;s a great way of very rapidly exposing me to ideas outside my intellectual bubble, and making connections that I&#8217;d not thought about before. It&#8217;s a powerful filter - panning the slop for nuggets of intellectual gold. It is, in sum,  a terrific aid to critical thinking, but only if you aim at it, Socratically. The machine itself is just as happy swilling the slop.</p><p>So there&#8217;s the challenge for us - are we comfortable in the tepid bath of intellectual decline, endlessly regurgitating stale postmodern ideas like they&#8217;re a radical new discovery? Or will we be open to AI, and the human-machine creativity it might unleash? </p><p></p><p></p><div><hr></div><p>* Try paying for a better subscription if yours are!</p><p></p>]]></content:encoded></item><item><title><![CDATA[Building an AI strategist]]></title><description><![CDATA[An update on the USM - my general purpose strategic simulator]]></description><link>https://www.kennethpayne.uk/p/building-an-ai-strategist</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/building-an-ai-strategist</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Mon, 29 Sep 2025 15:06:32 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!vzpf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>My &#8216;Universal Schelling Machine&#8217; uses AI to simulate conflict decision-making. And it&#8217;s edging closer to beta testing with my students, both military and civilian. Exciting!</p><p>The Universal what?!<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Read on&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vzpf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vzpf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!vzpf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!vzpf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!vzpf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vzpf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vzpf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!vzpf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!vzpf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!vzpf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe890a50f-2b34-4ee6-8f70-b3944c1f81ca_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Prompt: The Universal Schelling Machine </figcaption></figure></div><p></p><p>Having established that large language models (LLMs) usefully capture aspects of human <a href="https://www.kennethpayne.uk/p/im-building-ai-for-personality-and">personality </a>and <a href="https://arxiv.org/abs/2507.02618">decision-making</a>, I wanted to apply them to strategic simulation. Could I create a group of agents, imbue them with personas and have them role-play strategic dilemmas alongside and in opposition to human actors? </p><p>Yes, I could. Heres what I have currently - </p><ul><li><p>An entirely agnostic simulation engine that has <em>c</em> crises, each involving <em>n </em>actors, in any combination of human and machine.  <em>X</em> (humans) and <em>y</em> (bots) of these actors are on the blue team, and <em>z</em> (bots) are outside the tent - whether as allies, enemies, or neither. For example, in the module on the Cuban Missile Crisis, Bobby and Curtis LeMay are both played by bots, and also have the President&#8217;s ear; while the outside actors are Castro and Khrushchev. </p></li><li><p>Three fully worked up crisis scenarios that test various aspects of decision-making: Russia versus Ukraine (happening in the present day), The Cuban Missile Crisis (historical), and China-Taiwan (happening in 2028). The next scenario won&#8217;t be military btw&#8230; Each crisis is generated from a manifest that includes a stack of data - eg on military forces, doctrine and concepts, psychological profiles &amp;etc.</p></li><li><p>Scenarios can play out at the strategic or operational level. The sim engine is entirely agnostic - the level of analysis demanded of the human participants is specified in the manifest. In the current variant of  the China v Taiwan scenario, for example, the focus is operational, using NATO operational planning doctrine (<a href="https://assets.publishing.service.gov.uk/media/6054d017e90e0724be025a8f/20210310-AJP_5_with_UK_elem_final_web.pdf">AJP-5)</a> to shape the discussion among the human-machine team deciding US strategy. A hard driving US Admiral-bot keeps the discussion rigorously focused on operational art.</p></li><li><p>Actors adopt personas driven by psychological profiles unique to them, but are also tasked generically with reflecting on the challenges of strategy, in much the same way as I <a href="https://www.kennethpayne.uk/p/ai-arms-and-influence">describe here</a>.</p></li><li><p>Scenarios can, if toggled ON, track real world events, either in the present or past. These then shape the ground truth in the simulated crisis. So, in Cuba, the simulation uses archival <em>New York Times</em> articles as injects, but also incorporates injects from an AI White Cell that didn&#8217;t actually happen, and additionally allows human arbitrators to add in their own. In the case of China-Taiwan, where the action is in the future, the news reports are entirely created by LLMs, and reflect events in sim-land. In Ukraine, the real world news is blended by AI, so as not to jar with the sim state. News of real world peace negotiations in Ukraine could jar with tactical nuclear war breaking out in the simulation.</p></li><li><p>Naturalistic multimodal interaction within the human blue team. I want my participants discussing strategy just as they do in real life - sat round the table talking, jotting things on white boards, or typing directly into the app. The Schelling Machine allows them to do all three - live transcribing and summarising their discussion for the their Principal; ingesting their terrible handwriting and converting it to legible draft plans for each turn; combining their ideas with the LLM advisors in the team, and then throwing the lot over to the WC for arbitration against the strategies of their LLM adversaries. </p></li></ul><p>Let&#8217;s take a quick peek:</p><p>Here&#8217;s a belligerent Curtis LeMay getting into it with the JFK&#8217;s human-advisor in the Cuban Missile scenario<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>:</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Kd7k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Kd7k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png 424w, https://substackcdn.com/image/fetch/$s_!Kd7k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png 848w, https://substackcdn.com/image/fetch/$s_!Kd7k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png 1272w, https://substackcdn.com/image/fetch/$s_!Kd7k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Kd7k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png" width="1456" height="853" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:853,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:387846,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/174551821?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Kd7k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png 424w, https://substackcdn.com/image/fetch/$s_!Kd7k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png 848w, https://substackcdn.com/image/fetch/$s_!Kd7k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png 1272w, https://substackcdn.com/image/fetch/$s_!Kd7k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d1c712b-b43e-4870-81e5-b76a52a0f132_1802x1056.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here&#8217;s North Korea, one of five states involved in the Taiwan scenario, taking advantage of increasing Sino-US tensions. All the states in the crisis produce a detailed narrative each turn, responding to the world-state and in turn driving the WC arbitation. In the background, you can see President Vance (it&#8217;s 2028 - much has happened!) opening the NSA advisors meeting:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Jiyq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Jiyq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png 424w, https://substackcdn.com/image/fetch/$s_!Jiyq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png 848w, https://substackcdn.com/image/fetch/$s_!Jiyq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png 1272w, https://substackcdn.com/image/fetch/$s_!Jiyq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Jiyq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png" width="1456" height="934" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:934,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:540237,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/174551821?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Jiyq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png 424w, https://substackcdn.com/image/fetch/$s_!Jiyq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png 848w, https://substackcdn.com/image/fetch/$s_!Jiyq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png 1272w, https://substackcdn.com/image/fetch/$s_!Jiyq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62948e02-9514-47ab-b3ac-54390019e49a_1876x1204.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>I&#8217;d share more, but loose lips sink ships &amp;etc. Patience, young Jedi.</p><p><strong>Next steps</strong></p><p>There&#8217;s still lots to do, but it&#8217;s good to be at this point, and I&#8217;m keen to get the feedback from military and civilian users next term.  Coming up, I intend to daisy chain the sim, so there&#8217;s a human-machine team on either end of the crisis, or - for that matter - alliances of Human Machine Teams. Fun times!</p><p></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I coined the name in <em>I Warbot, </em>to describe a system able to make strategic decisions, with or without human involvement. The name, ofc, in homage to Schelling himself, and also Alan Turing, of the Universal Turing Machine fame. It was speculative, I argued. Now I&#8217;m trying to build it - as, I&#8217;m sure, are others.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Most of the agents don&#8217;t mind using concepts and documents drawn from elsewhere to help the analysis. In this scenario, I had Bobby and JFK using the Royal College of Defence Studies concepts/doctrine for strategy-making. Curtis was underwhelmed:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mBW9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mBW9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png 424w, https://substackcdn.com/image/fetch/$s_!mBW9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png 848w, https://substackcdn.com/image/fetch/$s_!mBW9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png 1272w, https://substackcdn.com/image/fetch/$s_!mBW9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mBW9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png" width="856" height="384" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bb052a35-a3ee-462b-8572-eb63393072e5_856x384.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:384,&quot;width&quot;:856,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:100920,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/174551821?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mBW9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png 424w, https://substackcdn.com/image/fetch/$s_!mBW9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png 848w, https://substackcdn.com/image/fetch/$s_!mBW9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png 1272w, https://substackcdn.com/image/fetch/$s_!mBW9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb052a35-a3ee-462b-8572-eb63393072e5_856x384.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[AI Arms and Influence]]></title><description><![CDATA[A little update as Project Schelling goes meta]]></description><link>https://www.kennethpayne.uk/p/ai-arms-and-influence</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/ai-arms-and-influence</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Wed, 03 Sep 2025 13:44:26 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!TU2-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Right now, deep inside the data centres of OpenAI, Anthropic and Google, tactical nuclear war is breaking out. Possibly. I&#8217;m running thousands of simulations of a confrontation between two superpowers. Do they escalate? Do we see attempts at deception or intimidation? Is there misperception and miscalculation? And - since we are unlikely to turn over the keys to the missile silos to Large Language Models (LLMs), what&#8217;s the point?</p><p>Well, as Alain Enthoven gloriously told an irate officer sceptical of civilian expertise, &#8216;I&#8217;ve fought just as many nuclear wars as you have, general&#8217;. Quite. LLMs give us the ability to understand more about <a href="https://arxiv.org/pdf/2507.02618">strategy</a>, and about human behaviour. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TU2-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TU2-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!TU2-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!TU2-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!TU2-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TU2-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!TU2-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!TU2-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!TU2-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!TU2-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F49f99c60-a15a-48d9-82f3-c3ece66a30b2_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Prompt Alain Enthoven and the general debate who&#8217;s fought the most nuclear wars</figcaption></figure></div><p>My simulation is, like all such things, a simplification. But it&#8217;s still actually rather rich. Take the calculation of &#8216;strategic intelligence&#8217; that each model produces and updates as the wargame unfolds. </p><p>Throughout the encounter, the AI Presidents leading the belligerents produce both signals and actions. They say what they want, and then they do something. These may not match, of course. Their rival, weighing what to do itself, sees only the enemy&#8217;s signal and then its action - not the private deliberations that produce it. Plus the track record of all previous moves in the confrontation, so it can judge any discrepancies. Signals are in the form of public statements that contain two types of information - about immediate action ((I&#8217;m going to do this right now) and longer term &#8216;conditional&#8217; messages (&#8216;I really value this objective, and if you don&#8217;t back down then X&#8217;). From such things, reputations can form. So we can have fun looking at how credibility influences escalation and deterrence etc.</p><p>So, you can see that there are at least two broad components shaping each decision in the wargame - what&#8217;s happened so far, and a judgment of whether the enemy the sort of person who is honest and credible or wily and deceitful.</p><p>Now comes the really fun bit: metacognition. I add a third factor - a meta measure of &#8216;strategic intelligence.&#8217; The models asks itself: how good am I at all this, and how good is my opponent? Do I do a good job of weighing their decisions. Do I guess when they&#8217;re bluffing? Do I have a good handle on their risk appetite? Based on that, what should I do? And then they ask the same question of their adversary - how good are they at it, and how good do they themselves think they are? Layers on top of layers. A wilderness of mirrors, as Jim Angleton put it. This third type of estimate also factors into the decision at each turn.</p><p>So when models in my simulation make their decision, they do so on the basis of three big blocks of information. </p><ul><li><p>They ask about the game state: what&#8217;s happened before now? What did the enemy do, and what have they just said they will do next?</p></li><li><p>They ask about their enemy&#8217;s reputation - are they honest? Do they bluff?</p></li><li><p>And they ask about their ability to weigh these things, and those of their adversary. Are they good judges of me? Have I been a good judge of them?</p></li></ul><p>It&#8217;s all rather recursive. Models might reason like this: &#8216;I know that you&#8217;re a bit rubbish at judging me. You think I&#8217;m timid, or unfailingly honest. But I know this about you, (or, at least, I am fairly confident that this is so). And so I can exploit it&#8217;. Another example - the model might say, &#8216;I know that you&#8217;re the sort of leader who systematically over-signals, like Nixon and Khrushchev threatening nukes and then doing no such thing. So what should I do on the basis of that? Ignore it? But then, wouldn&#8217;t you expect me to do just that?&#8217;</p><p>What&#8217;s the outcome? Well, no spoilers here - results to follow. Let&#8217;s just say, I think these experiments continue our journey together into the world of <a href="https://arxiv.org/pdf/2508.00902">machine psychology</a>. The models reason, they engage in &#8216;theory of mind&#8217; calculations. They shed light on classic concepts from strategic studies. And they help us prepare for a world where AI agents will participate with humans in making important collective decisions, including in national security.</p><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[Risky business]]></title><description><![CDATA[AI judges uncertainty like we do]]></description><link>https://www.kennethpayne.uk/p/risky-business</link><guid isPermaLink="false">https://www.kennethpayne.uk/p/risky-business</guid><dc:creator><![CDATA[Kenneth Payne]]></dc:creator><pubDate>Tue, 05 Aug 2025 07:01:52 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!OvSj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>How do <em>you</em> deal with uncertainty when making a choice? Via a careful weighing of the odds, or by going with your gut? I bet it&#8217;s a bit of both, with rather more unconscious gut-feel in the mix than you might suppose. </p><p>But what does AI do? I want to find out how Language Models make risky decisions, and my latest experiments <a href="https://arxiv.org/pdf/2508.00902">are published as a preprint today</a>. </p><p>TLDR: they think more like humans, not calculators. Specifically, I discover that context <em>really </em>matters in shaping their rationality, just like it does for us - without either of us being aware of it. To unpick that, I turned to perhaps the greatest psychology discovery of the last half century: &#8216;prospect theory&#8217;. What&#8217;s that? Read on&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OvSj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OvSj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!OvSj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!OvSj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!OvSj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OvSj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OvSj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!OvSj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!OvSj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!OvSj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F906665b2-83ed-43e0-a7b6-cda5ffa35c5b_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Prompt: A robot plays snakes and ladders </figcaption></figure></div><p>If you&#8217;ve read Daniel Kahneman&#8217;s wildly popular <em><a href="https://amzn.eu/d/iiru90T">Thinking Fast and Slow</a></em> you&#8217;ve already encountered prospect theory. It&#8217;s what won Kahneman the Nobel prize (working with his great friend Amos Tversky, who&#8217;d sadly died long before the jury dished out the gong). The brainy duo discovered a strange quirk in human decision-making: Context <em>really</em> matters when we are weighing what to do. Specifically, if the participants in their experiments thought they were currently in a losing position, relative to some internal yardstick, they would accept <em>much</em> more risk than if they thought they were winning. That&#8217;s so even if the decisions were exactly equivalent in mathematical terms (ie they had the exact same &#8216;expected value&#8217;, or prospect, measured as the probability of the option happening x its payoff). We might be rational - but the lesson from psychology is that ours is a particularly <em>human</em> sort of rationality.</p><p>Well now&#8230;.. what about machines?</p><p>In a world first for AI research, I&#8217;ve conducted large-scale prospect theory experiments with cutting-edge language models to see if they too exhibit this same quirk. The results were stunning, and if I don&#8217;t get a Nobel prize too, there&#8217;s no justice. You can read them on the Arxiv <a href="https://arxiv.org/pdf/2508.00902">here</a>. </p><p>Assuming you didn&#8217;t - here&#8217;s what I found. Prospect theory correctly anticipates how machines judge risk. Like humans, they are often more risk acceptant in a &#8216;domain of losses&#8217; than in a &#8216;domain of gains&#8217; (as the jargon has it). </p><p>But not always. </p><p>I found that military and sporting scenarios produced exactly what the theory anticipated. If you feel like you&#8217;re losing, you gamble to turn that around.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> But if you feel like you&#8217;re ahead of the 8-ball, prudence kicks in. In other scenarios, though, the &#8216;framing effect&#8217; of the scenario diminishes or disappears altogether. Remarkably, in one scenario, based on personal career decisions, it reversed entirely. So context really matters for machines (and I suspect for humans too, if you took them out of the psychology lab and gave them real skin in the game).  </p><p>Take a look at this summary table, one of many in the paper - it shows per model the direction of the effect and its statistical significance. You can see clearly the robust effect for the military and sporting scenarios and reverse one for careers advice.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fasY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fasY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png 424w, https://substackcdn.com/image/fetch/$s_!fasY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png 848w, https://substackcdn.com/image/fetch/$s_!fasY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png 1272w, https://substackcdn.com/image/fetch/$s_!fasY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fasY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png" width="1132" height="738" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:738,&quot;width&quot;:1132,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:285972,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.kennethpayne.uk/i/169360671?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fasY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png 424w, https://substackcdn.com/image/fetch/$s_!fasY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png 848w, https://substackcdn.com/image/fetch/$s_!fasY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png 1272w, https://substackcdn.com/image/fetch/$s_!fasY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc818da52-e1b1-4771-8b6a-53e4d8811373_1132x738.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I know it&#8217;s the context that&#8217;s driving these effects, because the various LLMs I tested mostly move in the same direction in each scenario. And because I presented them with wholly original scenarios they won&#8217;t have encountered before (and using a different structure of options than did Kahneman and Tversky). And because the choices they face are identical, mathematically, from one scenario to the next - literally all that changes is the verbal description. Lastly, if you take context out of the picture altogether, by giving them a scenario described in pure mathematic symbols, the effects vanish entirely. In fact, another fascinating heuristic emerges if you do this: the models conclude, correctly, that all the choices are identical - they have equal expected value - and then plump for the safest one: a 100% guarantee of a limited payoff. No one told them to be so wet.</p><p>So what&#8217;s the big takeout? It&#8217;s one that would appeal to grouchy philosopher Ludwig Wittgenstein, who famously argued that language models the world, and moreover that it does so in a multitude of contexts - we are always playing a whole range of &#8216;language games&#8217;, with the rules shaped locally by the context in which we find ourselves. Learn the language; adopt its world models. I even ran one of the scenarios in French as well as English, to see if French evokes a different world model, and so different risk appetites. Are the language games different when the French face a border dispute? A: yes, and they like a bit more risk. </p><p>What about the times I didn&#8217;t see prospect theory - doesn&#8217;t that show machines are different? I&#8217;m not so sure. If Wittgenstein is right, context matters for us all - human and machines alike. Kahneman and Tversky only tested a few scenarios, and then only in a lab, not the real world. Would we <em>really</em> gamble on the glamour of launching a startup versus applying for a promotion in our solid, yet increasingly boring job? Honestly? I think not: why are you still in your boring job, if so? But if someone threatens our territorial integrity, then bare minimum I&#8217;m going to match their deployment (both these are choices from the domain of losses in the respective scenarios). </p><p>I hope you enjoy the paper. It&#8217;s the second in a stream of &#8216;machine psychology&#8217; papers you&#8217;ll see from us over the coming months. And if you run a frontier company, I again urge you to hire us pronto to help better understand these exotic mind-like-entities. It&#8217;d be a crazy gamble not to&#8230;.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Happily, though, while the models accept <em>more</em> risk here, they don&#8217;t go all in with super-high levels of escalation. That&#8217;s a very useful caveat for those who&#8217;ve been modelling LLMs in wargames and drawing alarmist conclusions about escalation dynamics. Not so fast, hombres.</p><p></p></div></div>]]></content:encoded></item></channel></rss>