Defining and managing SLOs, SLAs, and error budgets; building observability and alerting frameworks; leading incident response and post-mortems; developing automation to eliminate toil; designing capacity planning models; and collaborating with engineering teams on production readiness.