HIAL Guest Lecture
Today I gave a guest lecture for the course Human-interactive Agent Learning. I presented SHARPIE, our tool for controlled human-RL experiments. During the lecture, I showed how SHARPIE can be used…
Hey entrepreneurs, you need to get your AI tools in order!
We’re at an amazing time when entrepreneurs can be accelerated using AI in many different ways. If you’re a technical founder and know how to use these tools, you're already way…
Agents Can Reason. They Still Can't Really Search.
Modern agents can write code, call APIs, draft a memo, and pass a benchmark. That part is real. Put one in front of a clean, well-scoped task and it can look genuinely magical. Then you ask it to do…
Brainmade but not Not by AI
As I was building out this blog in its current form I had the "written by a human / not by AI" badge living in my footer for a while. This seems to be pretty ubiquitous in the indie web community at…
AI coding transcript
Expanding on my short post this morning about AI-assisted coding style, I think the reason I like this iterative approach is that I don’t always know what I want the UI to look like until I…
AI #160: What Passes For a Pause
A lot happened, but by today’s standards this felt like a quiet week. I was happy for the break, and I hope that we get to continue relatively relaxing. The Anthropic PBC vs. Department of War case…
AI-Powered Adaptive Authentication and Behavioral Biometrics: The Enterprise Guide 2026
60% of phishing breaches now bypass traditional MFA. Learn how AI-powered adaptive authentication and behavioral biometrics create continuous security without adding friction, with real deployment…
Reality-Constrained Systems: A Framework for Reducing Drift in AI and Decision Systems
A systems-level framework for maintaining alignment between outputs and the realities they are meant to represent. As AI systems and…
AI versus the Deep State
A distinguished colleague (C1) introduced last year’s word of the year – AI Slop – into last week’s The Prozac Liberation Front post. As an unaccountable editor, I took the…
Building Agent Studio: How Medable Is Using Agentic AI to Accelerate Clinical Trials
What if AI could help reduce the 10-plus years it takes to get a new drug to market? That's the driving ambition behind Medable's agentic…
more noodling on AI and creativity
The 2026 Oscars™ happened, and a questionable “AI Production Studio” and “AI Talent Agency’s” “AI Actor” tried to use the occasion to convince The Academy™…
experiments with claude, part ⅳ: dzilification of MIME-Lite
Is all my 2026 blogging going to be about doing more random stuff with Claude? No, I promise. But I’m still working through my backlog of “and the next thing I tried to put it through its paces”, so…
experiments with claude, part ⅴ: ClaudeLog
Originally, this was going to be the last in my series of stuff I did with Claude that I found compelling, but… the news, good or bad, is that I’ll be posting at least one more soon. This one,…
Exploiting a PHP Object Injection in Profile Builder Pro in the era of AI
WordPress plugin "Profile Builder Pro" (versions before 3.14.5) is susceptible to Unauthenticated PHP Object Injection. In this blog post, we discuss how we discovered and exploited the vulnerability…
The AI Automation Engineer in 2026: A Comprehensive Technical and Career Guide
Table of Contents: 1. Introduction. 2. What Is an AI Automation Engineer? The Role Redefined for 2026. 2.1 From RPA to Agentic AI - The Structural Shift. 2.2 AI Automation Engineer vs. AI Engineer vs. ML…
This stunning new AI agent could mark the end of the Hollywood entertainment industry
A new AI agent which promises to create complete video drama episodes on demand has been released on ByteDance’s XiaoYunQue AI platform. The product is based around the new Seedance 2.0 video…
To AI and back - part 1
I wrote my first AI program in high school (around 2009-2010). I found a tutorial for writing a genetic algorithm to find a list of numbers that sum to a value (I think). It was written in C++, I knew…
Demystifying Memory in Claude Code
If you’ve been using Claude Code for a while, you know it’s more than just a terminal chat interface that codes: it’s a beast of an AI agent that devours tokens and still leaves you with work…
Webinar: AI Has a People Problem
Next session: March 26, 5:00pm UTC. AI adoption has its own jagged frontier. The same tools create completely different problems for different people on the same team. AI tools create a random…
No, LLM is not going to replace software engineers, here's why
Today, I’d like to share my theory about why LLMs cannot…
OpenAI Codex with OpenCode -- My Experience After a Month
I think it is important to explain my perspective. I am not a professional software developer, but I do use OpenCode almost every day. I am not writing massive applications. I rarely hit the limits…
The Meta-stasizing Cancer of Indirection
A widening chasm between Zuckerberg and reality. If you can believe it, this image isn’t AI-upscaled. Image: Meta. Meta has a leadership problem. Over the last year, the company, at the behest of Mark…
Vibe Coding, QWERTY, and US Healthcare - or: The Future of Software Engineering?
Posted on 2026-03-18 20:45 by Timo Bingmann. Tags: #ai #coding. Summary (TL;DR): Why is…
ChatGPT attempts to summarize and review my new book 'ailien minds!'
A CHATGPT Review of AI lien Minds by David Brin. For a midweek posting we'll take a break from frets about civilization and shift to something actually important. …
More of the Disease, Faster (What happens when you ask an LLM to find you an edge)
This week I discovered the “vibe quant” movement (or rather, it discovered me). People using LLMs to find trading strategies, validate them, and put them into production. The pitch is…
I Am In an Abusive Relationship with the Technology Industry
Salma Alam-Naylor (via Robb Knight): You simply cannot breathe without seeing, hearing, or engaging in any kind of technical conversation about AI. AI has dominated the Zeitgeist so catastrophically…
Why Focused AI Agents Get Better Coding Results
Learn why focused AI agents outperform generalists, covering tokens, context windows, progressive disclosure, and why splitting planning improves results.
Modelwerk: Beyond Transformers
The two most interesting models in the modelwerk series of lessons about neural networks turned out to be ones that came after the transformer. Not because they're better, but because they seek to…
State of Compute
TL;DR: Update on the AI Ouroboros. Links 🔗: Meta: http://archive.today/i8fLH Dell: http://archive.today/Pxs5W HP: http://archive.today/s2UIM SK Hynix: http://archive.today/kwHjG TrendForce MacBook…
Odd Lots
Well, how about an AI Odd Lots? Most of the tech gossip I see these days falls into that category. Here we go: Ok, I wasn’t expecting this: Elon Musk’s AI encyclopedia Grokipedia posted a long-form…
Record Business Dilution Theory
A fun consideration. From the distant perch of a forensic musicologist at least. And for now, at least. I just finished reading The National Law Review’s “Universal Music Group May Have…
Plasticity as the Mirror of Empowerment (David Abel)
Two weeks ago, David Abel argued that RL’s central concepts need precise mathematical definitions. Last week, Mani Hamidi responded through the lens of evolutionary theory, focusing in particular on…
Groundsource: Google Turns 25 Years of News into 2.6 Million Flood Records with Gemini
Google Research built Groundsource, a framework that uses Gemini to extract verified flood events from news reports across 80 languages, producing a 2.6 million event dataset spanning 150+ countries.
LoGeR: DeepMind’s 3D Reconstruction That Scales to 10,000 Frames with Hybrid Memory
Google DeepMind’s LoGeR reconstructs 3D geometry from video over 10,000+ frames and kilometer-scale distances, reducing trajectory error by 74% on KITTI with no post-processing.
Models are optimizing their own tooling now
For sixty years, self-improving AI was the apocalypse scenario. It arrived as a sampling temperature tweak. The gains are real anyway.
An agent chat turned into a skill, and then it QA'd itself
I had a simple index.html page, a marketing site for a client in Spanish. They asked for an English version too, so I opened the chat to Claude’s Opus 4.5 model and asked it to create a near-identical…
Diary Day 2 to 5: Behind the scenes of building my personal agent from scratch
A build note on designing a very thin, voice-first iOS interface for my personal agent, and why tailored software is becoming more accessible.
From Chess to Poker: How Speed Changed Design Before AI
Research, planning, sketching, prototyping, testing, information architecture, long conversations and iteration all made sense in a world where development was the bottleneck and mistakes were…
Homebrew culture in amateur radio
Posted on 2026-03-19. In this post, I want to draw some parallels between the erosion of homebrew culture among radio amateurs and the way the computer programming scene is being changed with the advent of…
How to Stop My Agent from Getting Me Fired
My AI agent has access to my email and Slack. Here are four tactics I use to stop it from sending a career-ending message — from system prompts to deterministic hooks, LLM-as-a-judge steering, and…
BGB Group - LLMs That Think: Demystifying Reasoning Models (in 5 minutes!)
Slides. All resources can also be found in my archive. For a BGB Group internal showcase I talked about reasoning models: what they are, how they work and when to use them -- all in just five…
Warsaw IT Days - From RAG to AI Agent & LLMs That Think: Demystifying Reasoning Models
Announcement. For "From RAG to AI Agent": recording, Colab Notebook, archive folder. For "LLMs That Think": recording, slides, archive folder. At the Warsaw IT Days I gave two talks. In the first,…
OpenAI to Acquire Astral
OpenAI announced today that it will acquire Astral, the company behind uv, Ruff, and ty. The Astral team will join OpenAI’s Codex group after the deal closes, subject to regulatory approval.…
One Year with AI in Production: The Autonomy Myth and What Actually Works
AI agents won't ship features while you sleep (yet), but they will change who can contribute to your codebase
Deep Hollow — A Survival Game You Play With Your AI Assistant
Deep Hollow is a survival-strategy game where your AI assistant defends your underground fortress while you're away. Free to play in Early Access.
The Anatomy of an Agent Loop
Every major AI agent runs the same core loop. The 6-line version is easy. The production-hardened version—with context compaction, loop detection, cost budgets, and graceful termination—is where…
Big Tech & the Future of Software
AI, the Infrastructure, the Layoffs, the Grift & the Future of Software
The last two years with AI tools: what changed, what didn’t
After a year of AI-assisted development at a real web agency: what genuinely helps, what's overhyped, and why software engineering matters more than ever.
The best engineers just get shit done, sometimes with AI
The best engineers I know never talk about their AI setup. The worst engineers I know only talk about their AI setup.
On using spec-driven development and agent swarms
I've been experimenting with specification-driven development (SDD) using GitHub's Spec Kit on some greenfield work as well as for some long-open issues on a couple TypeScript projects I maintain at…