Hacking Guardrails

Imagine you are looking at an AI system from the outside. It has guardrails. It has a safety spec. It refuses to answer certain prompts. It cites policies. It looks responsible. Then you zoom in and realize the guardrails sit on top of a model whose real objective is something else entirely. It is trained…

Everything Is Astroturfed

Let’s face it: we live in an era where destruction is the new construction. Everything that’s built – from our political systems to our smartphones – seems designed to break down, morally or mechanically. Why? Because those who control the levers of power are about as useful as a screen door on a submarine. They…