Holding Back the Model Was the Easy Part
A single company can pause their own model. Slowing an industry that rewards whoever refuses to stop takes enforceable rules, […]
Holding Back the Model Was the Easy Part Read Post »
A single company can pause their own model. Slowing an industry that rewards whoever refuses to stop takes enforceable rules, […]
Holding Back the Model Was the Easy Part Read Post »
Today the New York Times reported that researchers in Italy bypassed the safety controls on 31 large language models using poetry. Wrap
Guardrails Are Behavior – Orchestration Is Control Read Post »
Connecticut lawmakers deserve credit. On May 1, 2026, the House voted 131–17 to pass Senate Bill 5, the Artificial Intelligence
Connecticut’s New AI Law: The Good, the Bad, and What Still Needs to Be Done Read Post »
I’ve viewed Anthropic as the most structurally serious of the leading AI companies when it comes to safety. Their original
Anthropic’s Responsible Scaling Update – A Necessary Adjustment, But a Dangerous Signal Read Post »
The newly released update in early 2024 of the NIST Cybersecurity Framework (CSF) from 1.1 to 2.0 represents a significant step forward
Guide to updating from NIST CSF 1.1 to 2.0 Read Post »