- 5+ years in SRE, production engineering, platform operations, or security automation with strong coding ability.
- Hands-on scripting and coding experience, especially in Python, and comfort working with APIs, log pipelines, and automation workflows.
- Experience building pragmatic observability and alerting systems in AWS or comparable cloud environments.
- Ability to reduce operational toil through automation while keeping signal quality high and false positives manageable.
- Comfortable with incident handling, rollback thinking, SLA/SLO discussions, and evidence-driven postmortems.
- Interest in AI systems, agent runtimes, and MCP-style integration risks is highly valuable.
- Site reliability engineers or backend engineers with strong automation skills.
- Platform, DevSecOps, or observability engineers who build tooling, not just dashboards.
- Cloud automation engineers with strong logging, tracing, and incident-response instincts.
- Detection or security automation engineers who prefer code, pipelines, and remediation over ticket operations.