Engineering Blog. From the team.

Infrastructure deep dives, post-mortems, product launches, and open-source updates — written by the engineers who do the work, not marketing rewriting their notes.

Monthly
New posts
7
Engineers writing
Open
Post-mortems
Latest Post
Security

How We Secure 150,000 Servers Without a CISO

Security isn't a department — it's an engineering discipline. Four-layer defense architecture, from edge filtering to disk encryption, and why we skip the compliance theater.

NJ
Nik Johal
Recent Posts
Hardware

Why We Chose AMD EPYC 9654 Over Intel Xeon w9-3495X for Our Gen-4 Fleet

We tested both processors under real customer workloads for 90 days. The EPYC won on performance-per-dollar, power efficiency, and memory bandwidth.

KV
Kira Vasiliev
Network

How We Built a Redundant BGP Network Across 28 Regions

Our network carries 4.2 Tbps through four transit providers and seven IXPs. Here's how we designed it to survive any single failure.

TA
Tunde Adebayo
Incident

Post-Mortem: Kubernetes Control Plane Outage, December 16, 2025

A configuration change caused 23 minutes of control plane unavailability affecting 1,847 clusters. Running workloads were not affected.

RF
Rémi Fournier
Product

Migrating 2.1 Million DNS Zones to Our New Anycast Infrastructure

We replaced our entire DNS serving infrastructure over a single weekend with zero customer-visible downtime.

ZL
Zhen-Wei Lin
Engineering

Provisioning a Bare Metal Server in 47 Seconds: Our Boot Pipeline

From API call to a fully configured server with your SSH keys, network configuration, and operating system — in under a minute.

ID
Ingrid Dahl
Open Source

Building the Sigilhosting Terraform Provider: Lessons From 14 Months of Development

28 resource types across compute, networking, DNS, and load balancing. Here's what we learned building a production-quality provider.

TH
Tomás Herrera
What We Write About

Our blog covers four areas

Infrastructure deep dives — how we design networks, select hardware, manage firmware, and handle capacity planning. The unglamorous operational details that make hosting reliable.

Product updates — new features, regions, and service changes with full technical context. Not just what changed but why we built it that way and what alternatives we considered.

Incident post-mortems — published within 72 hours. Root cause, timeline, impact, and specific prevention measures with owners and deadlines. We believe transparency about failures builds more trust than pretending they don't happen.

Open-source — our CLI, SDKs, and Terraform provider are all open source. When we make significant changes, we explain the motivation, migration path, and any breaking changes.