My n8n crashed overnight and took 89GB of disk with it.
Updated n8n. Went to bed. Woke up to:
- Server unreachable
- Disk: 28GB → 89GB
- 3.8 million executions in 24 hours
- 43 workflows dead
One routine update. Complete disaster.
Recovery: Diagnosed via SSH, cleaned the database, reclaimed 60GB — back online without losing a single workflow.
But the scariest part wasn't the crash.
It was that I had no visibility of running workflows
So I built a workflow that monitors all my other workflows.
Every night at 5 PM — one email:
- n8n online or down?
- Disk usage
- Executions + errors
- Status of all 43 workflows
30 minutes of daily manual checking → 0
4-hour crisis → 15-minute early fix
Three things to do if you self-host n8n:
- Enable execution pruning — unchecked history will fill your disk silently
- Check logs after every update
- Build a monitoring workflow before you need one
Drop a comment if you want the workflow JSON. 👇