Operations
Disaster Recovery Guide
Disaster Recovery Guide
Objective
Restore read availability and consistency from S3/MinIO manifests and chunks with minimal time-to-first-read.
Recovery Workflow
- Provision replacement Astra nodes.
- Configure object-tier settings (
ASTRAD_S3_ENDPOINT,ASTRAD_S3_BUCKET,ASTRAD_S3_PREFIX). - Start nodes with clean local data directories.
- Verify manifest/chunk retrieval and revision restoration.
- Re-enable write traffic after quorum and key-parity checks.
Verification Commands
etcdctl --endpoints=http://127.0.0.1:2379 endpoint status -w table
etcdctl --endpoints=http://127.0.0.1:2379 get /__tenant/ --prefix --keys-only
Failure Modes to Watch
- Missing object chunks.
- Checksum mismatch.
- Partial node restore with divergent revisions.