<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>gum</title><link>https://notgum.com/posts/</link><description>short essays on AI, LLMs, math, and things that won't fit in a tweet. for confused minds, from a confused mind.</description><language>en</language><lastBuildDate>Sun, 14 Jun 2026 09:44:22 +0000</lastBuildDate><atom:link href="https://notgum.com/posts/" rel="self" type="application/rss+xml"/><item><title>Mythos is good but not infallible</title><link>https://notgum.com/post/mythos-is-good-but-not-infallible/</link><pubDate>Sun, 07 Jun 2026 00:00:00 +0000</pubDate><guid>https://notgum.com/post/mythos-is-good-but-not-infallible/</guid><description>A note on a mistake in Proposition 7.1 of Mythos&amp;rsquo;s Erdős unit-distance writeup.</description></item><item><title>Thoughts on the Mythos system card with a specific eye on cyber capabilities</title><link>https://notgum.com/post/thoughts-on-the-mythos-red-team-system-card/</link><pubDate>Fri, 10 Apr 2026 00:00:00 +0000</pubDate><guid>https://notgum.com/post/thoughts-on-the-mythos-red-team-system-card/</guid><description>Crosspost: thoughts on Mythos&amp;rsquo;s claimed cybersecurity implications.</description></item><item><title>A mildly cursed 3.5× triton `tl.reduce` optimization</title><link>https://notgum.com/post/manual-unrolling-beats-tl.reduce-for-small-k/</link><pubDate>Fri, 05 Sep 2025 00:00:00 +0000</pubDate><guid>https://notgum.com/post/manual-unrolling-beats-tl.reduce-for-small-k/</guid><description>&lt;p&gt;&lt;strong&gt;TL;DR:&lt;/strong&gt; For small, compile-time K, manually unrolling a 3D→2D bitwise-OR reduction can beat &lt;code&gt;tl.reduce&lt;/code&gt; by ~3.5×.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;ll take you on a small adventure of some weird Triton compiler behaviour.&lt;/p&gt;
&lt;p&gt;We&amp;rsquo;ll look at a reduction used inside an attention variant that reduces a 3D tensor along the last axis with bitwise OR to produce a 2D tensor and the weird stuff I encountered while doing that.&lt;/p&gt;
&lt;p&gt;Concretely given an integer tensor&lt;/p&gt;</description></item><item><title>llama2.c running on original iPhone (240k)</title><link>https://notgum.com/post/llama2.c-running-on-original-iphone-240k/</link><pubDate>Thu, 10 Jul 2025 22:00:00 +0000</pubDate><guid>https://notgum.com/post/llama2.c-running-on-original-iphone-240k/</guid><description>240k LLM runs on the very first iPhone.</description></item><item><title>How I made PrimeIntellect's toploc 70x faster WIP</title><link>https://notgum.com/post/how-i-made-primeintellects-toploc-70x-faster-wip/</link><pubDate>Sun, 23 Feb 2025 19:38:00 +0000</pubDate><guid>https://notgum.com/post/how-i-made-primeintellects-toploc-70x-faster-wip/</guid><description>&lt;p&gt;tbd. U can read up much in the comments of the code and pull-request here &lt;a href="https://github.com/PrimeIntellect-ai/toploc/pull/3"&gt;https://github.com/PrimeIntellect-ai/toploc/pull/3&lt;/a&gt;&lt;/p&gt;</description></item></channel></rss>