r9y-map
Project maintained by r9y-dev
Local_Development
Monolith
Code_Review
Pre_Merge_Hooks
Active_Passive_Clusters
Microservices
Leftshift_Reliability_Design
Graceful_Service_Degradation_Individual_CUJs_
Left_Shift_Performance_Testing
Graceful_Service_Degradation_Universal_
Bounded_Context
Protobufs
Smoke_Tests
Automated_Unit_Testing
Multi_Service_Development
Distributed_Systems_Awareness
Deployments_in_Place
Feature_Flags
Active_Active_Multi_Cluster
Basic_Chaos_Testing
Serious_Design_Domain_Driven_Design
Design_Around_Universal_Failure_Domains
Sharded_Data
Manual_Tests
Code_Version_Control
Functional_tests
Semi_automated_integration
Data_versioning
Traffic_shifting
Instrumentation_for_in_process_traces
Backwards_Version_Compatibility_by_default
Canary_Deployments
Left_Shift_QA_testing_SDET_
E2E_testing
Multi_Cluster_Rollout_Policy
Universal_Smart_Retries
Sharded_Serving
Manual_integration_tests
Regular_release_cadence
Containers
Blue_Green_Deployments
Fuzz_Testing
Distributed_systems_no_active_passive_
Automatic_assured_capacity_and_performance_testing
Andon_cord_big_red_button
Code_Quality_Threshold_code_reuse_preferred_
Low_Context_Architecture
Language_Readability
Only_customize_components_needing_customization
Design_for_Chaos
Formal_methods_e.g._TLA
Local_data_storage
Single_Zone
DNS__Simple_LB
Basic_linear_capacity_projection
Advanced_Loadbalancing
IaC
Understand_Infrastructure_Failure_Domains
Auto_Failover
Failure_Testing_in_Prod
N_1_as_standard
N_2_Thinking
N_2_Global_Planning
Pet_Host
A_single_computer
Distributed_storage
Alternate_site_replication
Cattle_Infrastructure
Container_Orchestrator
Auto_Scaling
Eliminate_SPOFs_hardware__software_
Service_Discovery
Drain_Spill_N_S__E_W_
Basic_Loadtesting
Multi_Zone
Holt-Winters_capacity_projections
Failure_Injection
N_1_Regional_Planning
L7_Global_LB
High_Water_Mark_Prediction
Assured_Capacity_Load_Testing
Real_World_Traffic_Load_Testing
L4_Regional_Load_Balancing
Multi_Region
Off-host_backup
RPO_RTO_defined
DR_Plan
RPO_RTO_refined
DR_plan_simulated_tabletop
DR_plan_tested_periodically
Continuous_Integration
Continuous_Delivery
Regular_BCP_Testing_run_from_alternate_site_
Percent_Based_Traffic_Steering
Active_Active_Datastores
Internal_Rate_Limiting
Autonomous_Response_Systems
Automatic_Rollbacks
Manually_created_machines
Manual_VM_Images
Custom_VMs_via_semi-automation
ITIL_style_NOC
DR_Site_Exists
Manual_remediation_playbooks
Formal_Incident_Response_Roles
Formal_Incident_Response_Processes
Rollbacks_Rollforwards_tested
Continuous_Deployment
External_Rate_Limiting
Centralized_Production_Changelog
Proactive_DDoS_Countermeasures
Load_Prediction
Manual_Remediation
Scheduled_Downtime
Basic_Incident_Management
Repeatable_Deployments
Automation_of_Toil
Problem_Management_Function
Dedicated_Operations_Tooling
Automated_Service_Discovery
Data_Collection_Automation
Mostly_Automated_Remediation
Patching_Windows
Gold_Image_Automation
Central_Certificate_Rotation
Breakglass_Secret_Access
Global_Policy_Enforcement
Vanilla_DDoS_Protection
DiRT_Testing
Product_Specific_DDoS_Protection_e.g._WAF
Host_Metrics_and_Logging
Per_Host_Alarms
Host_Ping_Tests
Synthetic_Monitoring
APM_Metrics_and_Traces
Internal_SLAs
Error_Budgets
Custom_In_Process_Tracing
Cross_Service_Transaction_Testing
Multi_Machine_Debugging
Anomaly_Detection
Observability_Integration_Across_Tools
On_host_log_grep
SSH_to_Grep_Logs
Centralized_Log_Collection
Realtime_Centralized_Log_Analytics
Automated_Topology_View
Service_Level_Indicators_SLI_
Record_and_Replay_Traffic
Advanced_Visualizations_heatmaps_
Near_Miss_Detection
Service_Level_Objectives_SLO_
Event_Correlation
High_Context_Behaviours
RCA_5_Whys
Incentivise_trust_safety
Understand_Business_Impacts
Blameless_Postmortems
Postmortem_reviews_actions
Single_Central_CAB
Holistic_View_of_R9y_as_high_value
Reliability_Executive_Sponsor_exists
Reliability_has_a_seat_at_the_table
R9y_is_a_product_differentiator
R9y_can_stop_feature_launch
Proactive_Risk_and_Scaling_Analysis
Managing_pet_configuration_drift
Measure_Everything
Data_Driven_Decisions
Service_Ownership
Incentivise_cross_silo_collaboration
Dedicated_R9y_staffing
Change_Freezes
Vertical_Scale_is_an_Antipattern
SRE_SWE_roles_introduced
Empowered_R9y_staff
R9y_Embedded_in_High_Level_Strategy_and_Operations
Advanced_Cost_Optimization
Focus_on_prevention_and_near_misses_instead_of_outages
TODO_Lists
Waterfall_Projects_PMO
SMART_Goals
Goals_->_Objectives_OKRs
Architecture_Reviews
High_Performing_Staff_Promotion_and_Hiring_
Reactive_Risk_Analysis
Basic_Cost_Optimisation
Introducing_Dedicated_SREs
Toil_Budgets
Decreased_Reliance_on_3rd_party_SaaS