Field notes

Writing about agents, retrieval, and the things in between.

Design notes, engineering decisions, and the occasional postmortem from building RunaxAI.

How a Mac mini under my desk serves runaxai.com
May 18, 2026
The deployment stack behind RunaxAI: K3s on a Mac mini, a Helm chart that covers everything, self-hosted GitHub Actions runners via ARC, and Cloudflare Tunnel doing what port forwarding used to.
11 min read·#deployment#kubernetes#helm#cloudflare#infra#engineering
The Agentic RAG Pipeline: Adaptive Retrieval at Scale
May 17, 2026
How RunaxAI dynamically adjusts retrieval strategies based on corpus size, combining Hybrid Search, HyDE, and Cross-Encoder Reranking to deliver high-fidelity context.
3 min read·#rag#architecture#pinecone#embeddings
Memory, the way we wish chat apps did it
May 17, 2026
Why we store user memory as atomic facts with supersession, how the extraction pipeline avoids reprocessing the entire conversation every turn, and the three-store split between Redis, Postgres, and pgvector.
8 min read·#memory#architecture#rag#engineering
Agent Orchestration: Taming the LLM Tool Loop
May 17, 2026
Building deterministic constraints around non-deterministic LLMs: Tool policies, duplicate suppression, budget limits, and dynamic summarization in RunaxAI.
3 min read·#llm#orchestration#agents#engineering
Redis as the Backbone: Caching, Sessions, and State
May 17, 2026
How RunaxAI leverages a single Redis instance to manage active SSE chat sessions, semantic tool caching, and background worker queues.
3 min read·#redis#caching#architecture#state-management
Behind the chat interface: orchestration, memory, caching, eval — the full picture
May 16, 2026
A deep dive into Runax — a production document intelligence platform built without LangChain or LangGraph.
10 min read·#rag#architecture#production#engineering
Introducing RunaxAI
May 16, 2026
An agentic RAG system with two chat modes, six tools, four specialised agents, hybrid retrieval, and an observability stack — what it does today, and how it actually works under the hood.
10 min read·#product#launch#rag#architecture

Writing about agents, retrieval, and the things in between.

How a Mac mini under my desk serves runaxai.com

The Agentic RAG Pipeline: Adaptive Retrieval at Scale

Memory, the way we wish chat apps did it

Agent Orchestration: Taming the LLM Tool Loop

Redis as the Backbone: Caching, Sessions, and State

Behind the chat interface: orchestration, memory, caching, eval — the full picture

Introducing RunaxAI