Node Name

Node Section

Description

Description goes here

Notes

Node notes go here

Associated Journeys

  • Journey title goes here
DEMO 90.0
Accepted unreliability of 36.5 days/year
DETERMINISTIC 99.0
Accepted unreliability of 3.65 days/year
REACTIVE 99.9 - WELL ENG SOFTWARE
Accepted unreliability of 8.77 hours/year
PROACTIVE 99.99 WELL ENG'D OPS
Accepted unreliability of 52.6 minutes/year
AUTONOMIC 99.999 WELL END'D BIZ
Accepted unreliability of 5.26 minutes/year
LOCAL DEVELOPMENT
MONOLITH
CODE REVIEW
PRE MERGE HOOKS
ACTIVE PASSIVE CLUSTERS
MICROSERVICES
LEFTSHIFT RELIABILITY DESIGN
GRACEFUL SERVICE DEGRADATION (INDIVIDUAL CUJS)
LEFT SHIFT PERFORMANCE TESTING
GRACEFUL SERVICE DEGRADATION (UNIVERSAL)
BOUNDED CONTEXT
PROTOBUFS
SMOKE TESTS
AUTOMATED UNIT TESTING
MULTI SERVICE DEVELOPMENT
DISTRIBUTED SYSTEMS AWARENESS
DEPLOYMENTS IN PLACE
FEATURE FLAGS
ACTIVE ACTIVE MULTI CLUSTER
BASIC CHAOS TESTING
SERIOUS DESIGN/DOMAIN DRIVEN DESIGN
DESIGN AROUND UNIVERSAL FAILURE DOMAINS
SHARDED DATA
MANUAL TESTS
CODE VERSION CONTROL
FUNCTIONAL TESTS
SEMI AUTOMATED INTEGRATION
DATA VERSIONING
TRAFFIC SHIFTING
INSTRUMENTATION FOR IN PROCESS TRACES
BACKWARDS VERSION COMPATIBILITY BY DEFAULT
CANARY DEPLOYMENTS
LEFT SHIFT QA TESTING (SDET)
E2E TESTING
MULTI CLUSTER ROLLOUT POLICY
UNIVERSAL SMART RETRIES
SHARDED SERVING
MANUAL INTEGRATION TESTS
REGULAR RELEASE CADENCE
CONTAINERS
BLUE GREEN DEPLOYMENTS
FUZZ TESTING
DISTRIBUTED SYSTEMS (NO ACTIVE/PASSIVE)
AUTOMATIC ASSURED CAPACITY AND PERFORMANCE TESTING
ANDON CORD/BIG RED BUTTON
CODE QUALITY THRESHOLD (CODE REUSE PREFERRED)
LOW CONTEXT ARCHITECTURE, DESIGN, CODING, OPERATIONS
LANGUAGE READABILITY
ONLY CUSTOMIZE COMPONENTS NEEDING CUSTOMIZATION
DESIGN FOR CHAOS
FORMAL METHODS (E.G. TLA+)
LOCAL DATA STORAGE
SINGLE ZONE
DNS / SIMPLE LB
BASIC LINEAR CAPACITY PROJECTION
ADVANCED LOADBALANCING
IAC
UNDERSTAND INFRASTRUCTURE FAILURE DOMAINS
AUTO FAILOVER
FAILURE TESTING IN PROD
N+1 AS STANDARD
N+2 THINKING
N+2 GLOBAL PLANNING
PET HOST
>1 COMPUTER
DISTRIBUTED STORAGE
ALTERNATE SITE REPLICATION
CATTLE INFRASTRUCTURE
CONTAINER ORCHESTRATOR
AUTO SCALING
ELIMINATE SPOFS (HARDWARE & SOFTWARE)
SERVICE DISCOVERY
DRAIN/SPILL (N/S & E/W)
BASIC LOADTESTING
MULTI ZONE
HOLT-WINTER CAPACITY PROJECTIONS
FAILURE INJECTION
N+1 REGIONAL PLANNING
L7 GLOBAL LB
HIGH WATER MARK PREDICTION
IMMUTABLE INFRASTRUCTURE
ASSURED CAPACITY LOAD TESTING
REAL WORLD TRAFFIC LOAD TESTING
L4 REGIONAL LOAD BALANCING
PRODUCTION LAUNCH PLATFORM
MULTI REGION
OFF-HOST BACKUP
RPO/RTO DEFINED
DR PLAN
RPO/RTO REFINED
DR PLAN SIMULATED/TABLETOP
DR PLAN TESTED PERIODICALLY
CONTINUOUS INTEGRATION
CONTINUOUS DELIVERY
REGULAR BCP TESTING (RUN FROM ALTERNATE SITE)
% BASED TRAFFIC STEERING
ACTIVE ACTIVE DATASTORES
INTERNAL RATE LIMITING
AUTONOMOUS RESPONSE SYSTEMS
AUTOMATIC ROLLBACKS
MANUALLY CREATED MACHINES
MANUAL VM IMAGES
CUSTOM VMS VIA SEMI-AUTOMATION
ITIL STYLE NOC
DR SITE EXISTS
MANUAL REMEDIATION PLAYBOOKS
FORMAL INCIDENT RESPONSE ROLES
FORMAL INCIDENT RESPONSE PROCESSES
ROLLBACKS/ROLLFORWARDS TESTED
CONTINUOUS DEPLOYMENT
EXTERNAL RATE LIMITING
CENTRALIZED PRODUCTION CHANGELOG
PROACTIVE DDOS COUNTERMEASURES
LOAD PREDICTION
MANUAL REMEDIATION
SCHEDULED DOWNTIME
BASIC INCIDENT MANAGEMENT
REPEATABLE DEPLOYMENTS
AUTOMATION OF TOIL
PROBLEM MANAGEMENT FUNCTION
DEDICATED OPERATIONS TOOLING
AUTOMATED SERVICE DISCOVERY
DATA COLLECTION AUTOMATION
MOSTLY AUTOMATED REMEDIATION
PATCHING WINDOWS
GOLD IMAGE AUTOMATION
CENTRAL CERTIFICATE ROTATION
BREAKGLASS SECRET ACCESS
GLOBAL POLICY ENFORCEMENT
VANILLA DDOS PROTECTION
DIRT TESTING
PRODUCT SPECIFIC DDOS PROTECTION (E.G. WAF)
HOST METRICS AND LOGGING
PER HOST ALARMS
HOST PING TESTS
SYNTHETIC MONITORING
APM METRICS AND TRACES
INTERNAL SLAS
ERROR BUDGETS
CUSTOM IN PROCESS TRACING
CROSS SERVICE TRANSACTION TESTING
MULTI MACHINE DEBUGGING, HOTSPOTS ETC
ANOMALY DETECTION
OBSERVABILITY INTEGRATION ACROSS TOOLS
ON HOST LOG GREP
SSH TO GREP LOGS
CENTRALIZED LOG COLLECTION
REALTIME CENTRALIZED LOG ANALYTICS
AUTOMATED TOPOLOGY VIEW
SERVICE LEVEL INDICATORS (SLI)
RECORD AND REPLAY TRAFFIC
ADVANCED VIZUALIZATIONS (HEATMAPS, FLAMEGRAPHS)
NEAR MISS DETECTION
SERVICE LEVEL OBJECTIVES (SLO)
EVENT CORRELATION
HIGH CONTEXT BEHAVIOURS
RCA/5 WHYS
INCENTIVISE TRUST/SAFETY
UNDERSTAND BUSINESS IMPACTS
BLAMELESS POSTMORTEMS
POSTMORTEM REVIEWS/ACTIONS
SINGLE CENTRAL CAB
HOLISTIC VIEW OF R9Y AS HIGH VALUE
RELIABILITY EXECUTIVE/SPONSOR EXISTS
RELIABILITY HAS A SEAT AT THE TABLE
R9Y IS A PRODUCT DIFFERENTIATOR
R9Y CAN STOP FEATURE LAUNCH
PROACTIVE RISK AND SCALING ANALYSIS
MANAGING PET CONFIGURATION DRIFT
MEASURE EVERYTHING
DATA DRIVEN DECISIONS
SERVICE OWNERSHIP
INCENTIVISE CROSS SILO COLLABORATION
DEDICATED R9Y STAFFING
CHANGE FREEZES
VERTICAL SCALE IS AN ANTIPATTERN
SRE SWE ROLES INTRODUCED
EMPOWERED R9Y STAFF
R9Y EMBEDDED IN HIGH LEVEL STRATEGY AND OPERATIONS
ADVANCED COST OPTIMIZATION
FOCUS ON PREVENTION AND NEAR MISSES INSTEAD OF OUTAGES
TODO LISTS
WATERFALL PROJECTS/PMO
SMART GOALS
GOALS -> OBJECTIVES (OKRS)
ARCHITECTURE REVIEWS
HIGH PERFORMING STAFF (PROMOTION AND HIRING)
REACTIVE RISK ANALYSIS
BASIC COST OPTIMISATION
INTRODUCING DEDICATED SRES
TOIL BUDGETS
DECREASED RELIANCE ON 3RD PARTY SAAS
SELF DRIVEN CHECKLIST LAUNCHES
ERA
DEVELOPMENT-DEMO
DEVELOPMENT-DETERMINISTIC
DEVELOPMENT-REACTIVE
DEVELOPMENT-PROACTIVE
DEVELOPMENT-AUTONOMIC
INFR...
INFR...
INFRASTRUCTURE-REACTIVE
INFRASTRUCTURE-PROACTIVE
INFR...
OPERATIONS-DEMO
OPERATIONS-DETERMINISTIC
OPERATIONS-REACTIVE
OPERATIONS-PROACTIVE
OPER...
OBSE...
OBSE...
OBSE...
OBSE...
OBSE...
PEOPLE-DEMO
PEOP...
PEOPLE-REACTIVE
PEOPLE-PROACTIVE
PEOP...