Senior Site Reliability Engineer
StraumannDescripción del puesto
Senior Site Reliability Engineer About Straumann Group:
At Straumann Group we’re on an exciting journey of growth, innovation, and impact - driven by our mission to improve oral health and transform millions of lives worldwide. United by purpose, we bring our best selves to work every day, embracing a high-performance, player-learner culture that inspires collaboration, curiosity, and ambition. Here, you’ll have the opportunity to take charge of your own career, harnessing your skills, passion, and enthusiasm for learning to continually grow and progress.
Together, we’re not just shaping brighter smiles, we’re unlocking the potential of people everywhere, including our own. About the role:
We are looking for a highly motivated and experienced Senior Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a strong background in software engineering and IT operations and will be responsible for ensuring the reliability, scalability, and performance of our production systems.
Key responsibilities: Design, implement, and maintain monitoring, alerting, and observability practices that support fast issue detection and response.
Manage capacity, load balancing, and system performance to identify and address risks early.
Define and track service level agreements (SLAs), service level indicators (SLIs), service level objectives (SLOs), and error budgets.
Partner with software engineering teams to improve application reliability, scalability, and operational readiness.
Support and promote DevOps ways of working across engineering teams.
Lead or coordinate incident response activities, contribute to on-call practices, and drive effective post-incident reviews with clear follow-up actions.
Contribute to the planning and execution of software and infrastructure deployments.
Automate repetitive operational tasks using scripts and engineering tools.
Help keep the technology stack current and support teams in adopting sustainable improvements.
Our tech stack: Docker | Kubernetes | AWS | Grafana | Prometheus | GitHub & GitHub Actions | TypeScript | Node.JS | Express | PostgreSQL | Metabase | OpenAPI | Python | PyTorch | TensorFlow | ArgoCD & Workflows | ClearML | Packer Our whole stack runs on AWS using EKS, and we deploy our infrastructure changes in a GitOps pipeline using CDK. Our applications are deployed in a GitOps fashion using ArgoCD. Our backend is mostly written in TypeScript and Python, and all our machine-learning applications are in Python.
We have an efficient and effective design and development process around RFCs, PR reviews, and pair programming.
Requirements: Strong software engineering background, including experience with at least one programming language such as Python, Go, Java, or TypeScript.
Hands-on experience with containerization and orchestration technologies such as Docker and Kubernetes.
Experience working with cloud platforms, ideally AWS.
Experience with infrastructure as code tools such as AWS CDK, Terraform, or Pulumi.
Familiarity with monitoring and observability tools such as Prometheus, Grafana, ELK, or similar platforms.
Good understanding of Linux or Unix systems administration, networking fundamentals, and system performance.
Experience working with databases such as PostgreSQL, MySQL, MongoDB, or Cassandra.
Ability to collaborate effectively across teams and communicate clearly with technical stakeholders.
Ability to prioritize work, manage competing demands, and approach problems in a structured way.
Bonus points: Experience in medical AI or similar regulated fields We welcome applications from people of all backgrounds, identities, and experiences. You do not need to meet every requirement to apply — especially if you believe you would be strong in the role. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or disability.
Employment Type: Full Time Alternative Locations: Spain : Madrid Travel Percentage: 0 - 10% Requisition ID: 20072
¿Te interesa este puesto?