Site reliability Engineer Salary Guide

SALARY BY STATE
  • Entry Level $110,000 AUD
  • Mid Level $140,000 AUD
  • Senior Level $170,000 AUD

Job Summary:

  • Design and implement scalable and highly reliable software systems to enhance the overall system reliability.
  • Collaborate with the software development team to integrate new features, manage code releases, and ensure system uptime.
  • Develop automation tools to monitor system health, performance, and reliability.
  • Proactively identify and address potential system issues before they become critical.
  • Analyse system outages and develop strategies to prevent future failures.

Key Skills

  • Proficiency in cloud platforms like AWS, Azure, and Google Cloud.
  • Strong programming and scripting skills, preferably in Python, Go, or Bash.
  • Expertise in monitoring and logging tools like Grafana, Prometheus, and ELK Stack.
  • Familiarity with container orchestration tools like Kubernetes.
  • Deep understanding of networking, system architecture, and distributed systems.

Standard Industry Training

  • Google Cloud Professional SRE certification
  • Certified Kubernetes Administrator (CKA)
  • Advanced training in system monitoring and log analysis

Interview Questions

  1. How do you approach on-call rotations, and what strategies do you use to minimise disruptions?
  2. Describe a time when you had to diagnose a production issue under pressure. What steps did you take and what did you learn?
  3. How do you balance the SRE principles of reliability and innovation in a rapidly evolving system?
  4. What strategies do you recommend for managing state in a distributed system?
  5. How do you handle post-mortem reviews after a system outage or incident?
DOWNLOAD PD TEMPLATE Register My Interest in this Position