What we publish when it stops being load-bearing.
The Observatory is what remains when an engagement closes. Briefings, insights and instruments — released the moment they are no longer holding a live file together. No promotion, no schedule, no editorial calendar.
Four channels. One discipline.
- § 01
Convenings
Briefings, roundtables and closed sessions. Held under the Chatham House Rule by default.
- § 02
Insights
Working notes from active files. Released when the underlying matter is closed.
- § 03
Mandates
Public records of engagements where the client has chosen disclosure.
- § 04
Tools
Calculators, frameworks and instruments built for our own files first.
Briefings & closed sessions
Insights
The Next AI Bottleneck Isn’t the Model: It’s the Inference System
The Counterintuitive Networking Decisions Behind OpenAI’s 131,000-GPU Training Fabric
I Let CodeSpeak Take Over My Repository
How to Write Robust Code with Claude Code
I Built the Same B2B Document Extractor Twice: Rules vs. LLM
Exploring Patterns of Survival from the Titanic Dataset
What’s the Best Way to Brainwash an LLM?
Building an Evaluation Harness for Production AI Agents: A 12-Metric Framework From 100+ Deployments
From Vibe Coding to Spec-Driven Development
Hybrid Search and Re-Ranking in Production RAG
Proxy-Pointer Framework for Structure-Aware Enterprise Document Intelligence
Your First WebAssembly Program and Web App (Written, Tested, and Deployed Entirely in the Web Browser)
Learning Word Vectors for Sentiment Analysis: A Python Reproduction
Using Transformers to Forecast Incredibly Rare Solar Flares
How to Build a Claude Code-Powered Knowledge Base
PySpark for Beginners: Mastering the Basics
Batch or Stream? The Eternal Data Processing Dilemma
LLM Summarizers Skip the Identification Step
The Must-Know Topics for an LLM Engineer
RAG Is Blind to Time — I Built a Temporal Layer to Fix It in Production
The AI Agent Security Surface: What Gets Exposed When You Add Tools and Memory
From Data Scientist to AI Architect
When Customers Churn at Renewal: Was It the Price or the Project?
Unified Agentic Memory Across Harnesses Using Hooks
The Joy of Typing
Give Your AI Unlimited Updated Context
How Major Reasoning Models Converge to the Same “Brain” as They Model Reality Increasingly Better
I Rewrote a Real Data Workflow in Polars. Pandas Didn’t Stand a Chance.
Beyond Lists: Using Python Deque for Real-Time Sliding Windows
When the Uncertainty Is Bigger Than the Shock: Scenario Modelling for English Local Elections
Timer-XL: A Long-Context Foundation Model for Time-Series Forecasting
Why I Don’t Trust LLMs to Decide When the Weather Changed
Deconstruct Any Metric with a Few Simple ‘What’ Questions
Discrete Time-To-Event Modeling – Predicting When Something Will Happen
How to Make Claude Code Validate its own Work
RAG Hallucinates — I Built a Self-Healing Layer That Fixes It in Real Time
Surviving High Uncertainty in Logistics with MARL
Single Agent vs Multi-Agent: When to Build a Multi-Agent System
How to Build an Efficient Knowledge Base for AI Models
Playing Connect Four with Deep Q-Learning
How AI Tools Generate Technical Debt in IoT Systems — and What to Do About It
CSPNet Paper Walkthrough: Just Better, No Tradeoffs
Inference Scaling (Test-Time Compute): Why Reasoning Models Raise Your Compute Bill
Which Regularizer Should You Actually Use? Lessons from 134,400 Simulations
How a 2021 Quantization Algorithm Quietly Outperforms Its 2026 Successor
How to Get Hired in the AI Era
Churn Without Fragmentation: How a Party-Label Bug Reversed My Headline Finding
Ghost: A Database for Our Times?
Why Powerful Machine Learning Is Deceptively Easy
A Gentle Introduction to Stochastic Programming
Proxy-Pointer RAG: Multimodal Answers Without Multimodal Embeddings
How to Study the Monotonicity and Stability of Variables in a Scoring Model using Python
Why AI Engineers Are Moving Beyond LangChain to Native Agent Architectures
4 YAML Files Instead of PySpark: How We Let Analysts Build Data Pipelines Without Engineers
Ensembles of Ensembles of Ensembles: A Guide to Stacking
Agentic AI: How to Save on Tokens
System Design Series: Apache Flink from 10,000 Feet, and Building a Flink-powered Recommendation Engine
Let the AI Do the Experimenting
Correlation Doesn’t Mean Causation! But What Does It Mean?
PyTorch NaNs Are Silent Killers — So I Built a 3ms Hook to Catch Them at the Exact Layer
The Next Frontier of AI in Production Is Chaos Engineering
A Career in Data Is Not Always a Straight Line, and That’s Okay
How Spreadsheets Quietly Cost Supply Chains Millions
Comparing Explicit Measures to Calculation Groups in Tabular Models
Bytes Speak All Languages: Cross-Script Name Retrieval via Contrastive Learning
I Reduced My Pandas Runtime by 95% — Here’s What I Was Doing Wrong
Causal Inference Is Different in Business
The Essential Guide to Effectively Summarizing Massive Documents, Part 2
Introduction to Approximate Solution Methods for Reinforcement Learning
I Built an AI Pipeline for Kindle Highlights
How to Improve Claude Code Performance with Automated Testing
How to Select Variables Robustly in a Scoring Model
Using a Local LLM as a Zero-Shot Classifier
I Simulated an International Supply Chain and Let OpenClaw Monitor It
Your Synthetic Data Passed Every Test and Still Broke Your Model
Lasso Regression: Why the Solution Lives on a Diamond
Using Causal Inference to Estimate the Impact of Tube Strikes on Cycling Usage in London
Correlation vs. Causation: Measuring True Impact with Propensity Score Matching
From Ad Hoc Prompting to Repeatable AI Workflows with Claude Code Skills
Ivory Tower Notes: The Methodology
How to Run OpenClaw with Open-Source Models
DIY AI & ML: Solving The Multi-Armed Bandit Problem with Thompson Sampling
Git UNDO : How to Rewrite Git History with Confidence
How to Call Rust from Python
Your RAG Gets Confidently Wrong as Memory Grows – I Built the Memory Layer That Stops It
I Replaced GPT-4 with a Local SLM and My CI/CD Pipeline Stopped Failing
What Does the p-value Even Mean?
Context Payload Optimization for ICL-Based Tabular Foundation Models
The LLM Gamble
From Risk to Asset: Designing a Practical Data Strategy That Actually Works
Proxy-Pointer RAG: Structure Meets Scale at 100% Accuracy with Smarter Retrieval
Dreaming in Cubes
KV Cache Is Eating Your VRAM. Here’s How Google Fixed It With TurboQuant.
Your RAG System Retrieves the Right Data — But Still Produces Wrong Answers. Here’s Why (and How to Fix It).
AI Agents Need Their Own Desk, and Git Worktrees Give Them One
Introduction to Deep Evidential Regression for Uncertainty Quantification
memweave: Zero-Infra AI Agent Memory with Markdown and SQLite — No Vector Database Required
Building My Own Personal AI Assistant: A Chronicle, Part 2
Your Chunks Failed Your RAG in Production
What It Actually Takes to Run Code on 200M€ Supercomputer
For journalists, editors and analysts.
We respond to media on the record only when the underlying file is closed. For background, attributable commentary, or expert introductions, write to the press desk directly.
Receive the file the day it closes.
A signal cut from active files. Insights, mandates and tools — published the day they stop being load-bearing for an engagement. No promotion, no tracking pixels, no schedule.