Senior Software Engineer - DevOps / Platform at Atlassian (2019-05 – Present)
Contributed to platform infrastructure and DevOps practices used by multiple Atlassian product teams. Delivered reliability, security, automation, and cost-optimization initiatives across production environments.
- Led multi-region Kubernetes platforms (EKS) serving millions of users, consistently meeting 99.99% availability SLOs across 15+ production clusters.
- Developed an Internal Developer Platform (IDP) enabling 200+ engineers to self-service infrastructure through reusable Terraform modules and GitOps workflows, reducing reliance on central DevOps support.
- Defined and maintained Infrastructure-as-Code standards using Terraform, including modular design, remote state management, provider versioning, and automated validation, reducing provisioning lead time from days to minutes.
- Used Python to automate infrastructure checks, deployment support tasks, and internal platform operations, reducing manual effort and improving engineering efficiency.
- Managed AWS cloud infrastructure for production platform environments, including EKS, VPC networking, IAM access controls, and compute resources, and delivered reliability improvements across them.
- Implemented CI/CD pipelines using GitHub Actions and ArgoCD, reducing deployment-related incidents by 85%.
- Championed GitOps as the single source of truth for infrastructure and application delivery, improving auditability, rollback safety, and environment consistency across teams.
- Built and improved deployment workflows for infrastructure and application releases across non-production and production environments, reducing manual effort and improving release reliability.
- Mentored engineering teams on CI/CD best practices, peer review, and automation standards, improving developer experience and reducing cross-team deployment errors.
- Built monitoring and alerting dashboards using Prometheus, Grafana, Loki, and ELK to track service health, infrastructure performance, logs, and production incidents.
- Defined SLIs and SLOs for key platform services, helping teams measure availability, latency, error rates, and reliability more consistently.
- Improved incident response by tuning noisy alerts, creating operational runbooks, and adding targeted alerts for critical production failures.
- Supported production incidents and led post-incident reviews, resolving Kubernetes, infrastructure, deployment, and networking issues while driving long-term reliability improvements.
- Implemented DevSecOps controls within CI/CD pipelines, incorporating container scanning, SAST/DAST, and dependency checks, contributing to a successful SOC 2 Type II audit.
- Developed centralized secrets management with HashiCorp Vault, enabling automated rotation and audit logging and eliminating manual credential handling.
- Designed cloud security guardrails using OPA and AWS IAM best practices, improving compliance posture and minimizing incident blast radius.
- Executed FinOps strategies using reserved instances, spot capacity, and rightsizing, achieving 30% annual cloud cost reduction without service degradation.
- Built cost allocation and chargeback models to improve visibility, accountability, and ownership of infrastructure spend across engineering teams.
- Established tagging standards and usage reporting for shared platform services, supporting ongoing AWS cost optimization through rightsizing and reserved capacity.
DevOps Engineer at NearForm (2014-08 – 2019-04)
Supported enterprise client modernization projects, helping improve the scalability, reliability, and operational consistency of development, staging, and production environments.
- Built and maintained cloud infrastructure across multiple client engagements, supporting application delivery, environment stability, and platform operations in Linux-based environments.
- Migrated containerized applications from Docker Compose to AWS container platforms, supporting ECS adoption and, in later projects, EKS as part of broader cloud-native modernization efforts.
- Standardized infrastructure provisioning and lifecycle management using Terraform and Terragrunt, improving repeatability, governance, and environment control.
- Improved deployment automation for application and infrastructure changes, reducing manual operational effort and increasing release consistency across teams.
- Packaged and deployed Kubernetes workloads with Helm and, in later cloud-native engagements, supported Istio-based traffic management and Kafka-backed distributed application environments.
- Collaborated with developers and delivery teams on troubleshooting, platform operations, and infrastructure governance, including Sentinel-based policy controls where applicable.