Publicis Sapient is a digital transformation partner helping established organizations get to their future, digitally enabled state, both in the way they work and the way they serve their customers. We help unlock value through a start-up mindset and modern methods, fusing strategy, consulting, and customer experience with agile engineering and problem-solving creativity. United by our core values and our purpose of helping people thrive in the brave pursuit of next, our 20,000+ people in 53 offices around the world combine experience across technology, data sciences, consulting, and customer obsession to accelerate our clients’ businesses through designing the products and services their customers truly value.
We are looking for a
- *Senior Backend Engineer**
with strong
and a
- *Site Reliability Engineering (SRE) mindset**
to own, maintain, and evolve critical production systems. This role combines hands-on backend development with deep responsibility for system reliability, performance, and operational excellence across customer-facing applications.
You will work on
- *legacy and modern Java systems**
, actively modernizing components while ensuring high availability and stability in production environments.
- *Your Impact
- Design, develop, and maintain backend services using
- *Java, Struts, and Hibernate/JPA
.
- Refactor and modernize legacy Java modules to improve
- *performance, scalability, and maintainability**
.
- Own production reliability by ensuring
- *high availability, resilience, and fast recovery**
of critical systems.
- Troubleshoot complex production issues, perform
- *root cause analysis**
, and implement long-term fixes.
- Define, track, and improve
- *SLIs, SLOs, and error budgets**
for key services.
- Collaborate closely with
- *QA, DevOps, and infrastructure teams**
to ensure safe, predictable releases.
- Drive improvements in
- *observability, automation, and operational readiness**
.
- *Backend Engineering (Java)
- Develop and enhance backend components using
- *Core Java, J2EE, Struts (1.x/2.x), and Hibernate
.
- Design and optimize
- *SQL queries, ORM mappings, and data access layers**
.
- Apply best practices for
- *code quality, performance tuning, and secure coding**
.
- Identify architectural and code-level bottlenecks and propose pragmatic improvements.
- *SRE & Production Reliability
- Ensure
- *stability, reliability, and performance
of large-scale, customer-facing applications.
- Monitor application health using
- *logs, metrics, traces, and alerts**
.
- Participate in
- *on-call rotations**
, actively reducing incident frequency and MTTR.
- Lead or contribute to
- *incident response, postmortems, and preventive actions**
.
- Improve system resilience through better
- *capacity planning, alerting, and failure handling**
.
- *Operational Excellence & Automation
- Automate repetitive operational tasks using scripts and tooling.
- Enhance
- *CI/CD pipelines
to support safer deployments and rollbacks.
- Improve
- *logging, dashboards, and alerting rules**
to strengthen observability.
- Create and maintain
- *runbooks, operational documentation, and knowledge bases**
.
- *Required Skills & Experience
- 5–8 years**
of experience in backend engineering and production support roles.
- Strong hands-on experience with:
- Java (Core Java, J2EE)
- Struts (1.x or 2.x)
- Hibernate / JPA
- Solid knowledge of
- *relational databases**
(Oracle, MySQL, PostgreSQL).
- Proven experience supporting
- *production systems**
with real on-call responsibility.
- Strong debugging,
- *log analysis**
, and
skills.
- Experience with
- *Linux/Unix environments**
and shell scripting.
- Familiarity with
- *monitoring and observability tools**
(Splunk, ELK, Prometheus, Grafana).
- Experience working with
- *CI/CD tools**
(Jenkins, GitLab CI, GitHub Actions).
- *Preferred Qualifications
- Exposure to
- *cloud platforms
(AWS, Azure, or GCP).
and
.
- Practical understanding of
- *SRE concepts**
: SLIs, SLOs, error budgets, incident management.
- Experience modernizing
- *legacy Java applications**
.
- Familiarity with
- *messaging systems**
(Kafka, RabbitMQ) or
(Redis, Memcached).
- Additional Information:
- \*\This is a Contractor Position open for candidates based in LATAM as long as they can work on CST.\\*