Signs of AI writing
One has to be aware that human speech and writing are being influenced by LLMs, and thus they are becoming more similar. This was already evident in 2024, as shown by a study that detected a…
The Thing We All Obviously Want
Generated by AI—notice the perspective. Over the past year, we have seen the rapid development of AI-assisted programming to an astounding degree. Even five years ago, fully-automated program…
The forklift and stapler
Most developers have become pretty good at using AI, some reluctantly. We paste in an error and ask for an explanation. We ask for unit tests, a regex we will immediately regret, or a slightly less…
AI Observability Review for LLM, RAG, and Agent Systems
A focused review for teams shipping LLM, RAG, and agent systems: trace coverage, evaluation gaps, token cost visibility, failure modes, and OpenTelemetry instrumentation plan.
Formal proofs for distributed protocols with AI may be closer than you think
This text is artisanally typed using Das Keyboard, with occasional suggestions by Copilot (most of them ignored anyways). The figures are generated with ChatGPT 5.5. In November 2024, I wrote a blog…
Inside the Git Hooks: Tagging Every AI Agent Commit (Part 3)
Part 3 — the build for the lighter solution. How a set of git hooks stamps a session id on every commit an AI Agent makes, survives squash and rebase, and captures each push — all best-effort, all…
What I'm working on: Building a Workflow
This is a working summary of what I’ve been doing to build a workflow for agentic development over the last 6 months. Agentic development will scale up all software development. I see two…
What I Learned Rolling Out AI to 900+ Engineers
I was hired to lead the AI rollout for 900+ engineers at a $2bn+ sports betting company. Sports betting is an older industry, and older industries tend to have a larger gap to close when technology…
When Is DeepSeek V5 Coming Out? The Honest 2026 Answer
DeepSeek V5 has no announced release date. Here's what the July 24 deprecation actually means, plus V4 Pro pricing, vision API status, and Claude vs GPT-5.
TurboQuant on Windows and LM Studio 2026: Complete Setup Guide
Use TurboQuant-compatible GGUF models on Windows with LM Studio, Ollama for Windows, and llama.cpp. Covers hardware requirements, which models support TQ4_K_M, and the best current approximation…
Evals: a plain-English map of the types worth knowing
Everyone says 'evals' and means ten different things. Here's a quick tour of the main types — what each one checks, and when it's worth the cost.
Blink if you’re human
Or human-ish
Do NOT Hallucinate!
It is common knowledge amongst AI-enhanced superhuman programmers that the best way to prevent your AI coding agents from hallucinating is to tell them not to hallucinate. The 10x programmers also…
Evals aren't a step at the end. They run the whole way through
There’s a version of building an AI app that goes like this. You build the thing, you get it mostly working, and then someone says “should we evaluate it?” and you bolt some evals…
What One Year in AI Security and Governance Changed About How I See AI
After one year working around AI security and governance, I trust flashy AI demos less and pay more attention to data, permissions, discovery, and the boring systems around AI.
AI and greener choices
The earth is heating up and AI isn't helping. It drives major increases in electricity use, water use and CO2 emissions. Yet, industry and governments alike seem keen to leverage the latest tech. Can…
Frontier AI Models Evaluation Benchmarks
A guide to frontier AI model benchmarks in 2026, covering MMLU, GPQA Diamond, HLE, SWE-bench, ARC-AGI-2, MMMU, Arena Elo, etc. What each benchmark measures, which models lead, why scores saturate.
Why extreme risk cannot be measured
Can we measure extreme financial risk? Is financial stability only a question of technology and data? Many seem to think so. I disagree.
Cut the Token Bill on Both Ends
Two small tools that compound: Caveman shrinks what the agent says back, RTK shrinks what your terminal pipes in. Same context window, twice the room.
Agents Write My Code. Agents Review It. I Referee.
AI agents review my pull requests now, not me. But there is one thing I will never be able to hand them: everything my company knows that was never written down.
The AI Layoff Trap
Picture a town with 1,000 companies and 100,000 workers. Each company employs 100 people, pays them a wage, and sells to them. The money leaves through the front door as salary and comes back through…
AI inference is obviously profitable
Many people claim that AI inference is unprofitable to serve, and thus must be subsidized by an ocean of dumb money from investors who believe that some future AI model will come to dominate the…
Cascade Chat Has Moved to an ADLC
Cascade Chat now runs on an Agentic Development Lifecycle. Agents write the code, Playwright verifies it, and the only human decisions left are ideation, architecture, and the release gate.
The ten-crore Malayalam corpus
The recently presented Kerala budget has set aside ten crore rupees to build an AI corpus in Malayalam. Against the backdrop of the major progress artificial intelligence has made in recent times…
Unsupervised Learning NO. 534
Serious Ubiquiti Vulnerabilities, My Advice for Hosting Public Services, Reversing Binaries with Ghidra MCP, My Meta-Prompt Recommendation, New Dario and Sam Websites, and more...
Manage AI-driven docs contributions
AI tools make it easier than ever to generate content. With that ease, nearly anyone can become a contributor to the documentation. When people are using AI tools to create documentation, how do you…
AI Output
Had a really good conversation with a colleague the other day about why I don’t use AI a whole lot except when work mandates it. If I come across a long blog post, I rarely read…
Let’s Learn Together: Financial Tools Startup Ambrook Spent Six Weeks Helping Their Entire Team Adopt AI. And Now They’ve Open Sourced the Materials
“Here’s how we did it” is one of my favorite phrases to hear in a healthy startup ecosystem. Collaborative learning benefits everyone and the startup which initiates the discussion…
Dan Reed’s AI does Hemingway
Dan’s spot on, as always, here. I remember one time, the bunch of us had fun making ChatGPT (early version) write NSF non-technical summaries as Sonnets and Haiku. Those were the days. Now,…
Please don't use an LLM to communicate with other human beings.
Communication is an important skill, stop delegating it to a machine
The Daily View 6/25/2026: Bitcoin @$58k, MicroStrategy collapse, Reflecting pool, AI…
Bitcoin has finally plunged to $58,000. This will go down as one of my greatest forecasts ever, if not the greatest. I was the only person, back in 2025 when it was at >$100,000, who predicted there…
Give me eggs 🥚
As an AI trainer, there is a fun game to test AI image models and see how far we still are from having an “intuitive human” model. This is not a test of image generation, but of cultural human…
Howls of Derisive Laughter, Goose
Looking over a shareholder presentation explaining an AI valuation using the parable of the golden goose (via Mefi), I kept […]
I wrote to my favorite jazz band in Turin after 10 years of “Non-Commercial”
The whole planet is adopting generative AI at random, crapping on the environment with exponential intensity, in search of ever-better hallucinations. Thanks to this, the organic hallucinations of…
FR#168 – LLMs Join The Fediverse
How to build a fediverse community when bots are indistinguishable from humans on applications to join?
Meta Attempts to Unf*ck Its Engineering Culture (Good Luck With That)
’Tis But Some Quick News for June 25, 2026. It is not an overstatement to say that Mark Zuckerberg’s particular Facebookian flavor of AI psychosis has been wreaking havoc on Meta’s engineering…
A Lever Made of Agents
AI as a new source of operating leverage where the durable advantage comes from process power, not prompts. Editorial note: I’m taking on a few additional companies looking to find the…
Do societal promises influence patent value?
Do societal promises influence patent value? An analysis of inventions in artificial intelligence
AI and the Declining Cost of Trying
AI agents can lower the cost of trying something. That doesn't automatically make it worth doing.
We Rented the Mainframe Back
Two AI assistants went dark in forty-eight hours last week. In neither case did the model break. The thing that broke was the wire in front of it, and we built that fragility back on purpose.
Former Apple Executive Launches PersonaShield to Fight Deepfakes
Someone is using your face to sell weight loss pills. Someone else pasted your likeness into a political ad you’d never endorse. A third person built an entire social media channel around…
Help me, but don't touch the paper!
As I’m discussing paper writing with more folks in the age of Claude Code and other coding assistants, we’re having more discussions about how to use these tools in writing. Per my earlier post, I…
The people who use AI the most are also the most worried about it
Pew's 2026 data shows the youngest U.S. adults use AI chatbots the most, yet they are also the most likely to expect AI to harm society.
the cozy catastrophists of cosplay doomerism
Asterisk Magazine, which aspires to be The New Yorker of effective altruism, has a piece profiling “AI doomers” which, even after reviewing the author’s other sincere engagements…
Is This Okay? How Override Labs Built a Safety-First AI Consent Coach for Teen Boys
What if AI could help prevent sexual assault before it happens, without tracking users, judging them, or handing them a verdict? In this…
The Exhaustion of Talking to a Tool
LLMs are exhausting because they require spending precious social energy to operate them. Energy that might be better spent on people. When you use a good…
Computer-Use and TOCTOU: What You Click Is Not What You Get!
Last year, Jun Kokatsu disclosed an interesting vulnerability with ChatGPT Operator by exploiting a race condition. I was wondering if I could reproduce this attack chain, and this post describes the…