← Back to jobs

Senior SRE, Site Reliability Engineer

Klaviyo · Dublin · Dublin, IE

Salary not disclosed
Full-time Senior
Response score is building for Klaviyo. Be the first to apply and set the benchmark.
<div class="content-intro"><p><em>At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair shot at success and appreciate the experiences each person brings beyond the traditional job requirements. If you’re a close but not exact match with the description, we hope you’ll still consider applying. Want to learn more about life at Klaviyo? Visit <a class="_ymio1r31 _ypr0glyw _zcxs1o36 _mizu194a _1ah3dkaa _ra3xnqa1 _128mdkaa _1cvmnqa1 _4davt94y _4bfu18uv _1hms8stv _ajmmnqa1 _vchhusvi _kqswh2mm _ect4ttxp _syaz13af _1a3b18uv _4fpr8stv _5goinqa1 _f8pj13af _9oik18uv _1bnxglyw _jf4cnqa1 _30l313af _1nrm18uv _c2waglyw _1iohnqa1 _9h8h12zz _10531ra0 _1ien1ra0 _n0fx1ra0 _1vhv17z1" href="http://klaviyo.com/careers" data-renderer-mark="true">klaviyo.com/careers</a>&nbsp;to see how we empower creators to own their own destiny.</em></p></div><h2><strong>Senior Site Reliability Engineer – Site Reliability Engineering</strong><strong> </strong><strong>(Dublin)</strong></h2> <h3><strong>Team Overview</strong></h3> <p>As a senior Site Reliability engineer, you’ll ensure Klaviyo’s critical platforms are reliable, scalable, and sustainable while enabling rapid product development. We treat reliability as a core product feature and use software engineering to solve complex systems and operational challenges.</p> <p>Our work spans security, infrastructure, and software development, requiring us to understand systems and engineering.&nbsp; We build complex, foundational solutions that must be extremely reliable, secure, and performant at global scale.</p> <p>Our charter is to build and operate foundational services and infrastructure, define clear reliability objectives, reduce operational toil through automation, and continuously improve systems based on real production learnings. The work is highly visible and directly impacts how Klaviyos build software and how customers experience Klaviyo every day.</p> <h3><strong>How you’ll make an impact</strong></h3> <p>As a Senior Site Reliability Engineer, you will build and operate the platforms, systems, and services that underpin Klaviyo’s reliability and operational excellence. You will:</p> <ul> <li>Build and operate foundational, security-critical services with a strong emphasis on availability, scalability, latency, and fault tolerance</li> <li>Apply software engineering principles to automate infrastructure, reduce operational toil, and improve system reliability at scale</li> <li>Design, implement, and evolve systems using SRE best practices</li> <li>Define and refine SLIs, SLOs, and error budgets to guide engineering decisions</li> <li>Improve observability, alerting, and incident response to reduce mean time to detection and recovery</li> <li>Participate in on-call rotations with a focus on sustainable operations and automatic remediations&nbsp;</li> <li>Perform quantitative analysis to understand system behavior, capacity constraints, and scaling limits</li> <li>Identify systemic risks and reliability bottlenecks and drive long-term, preventative solutions</li> <li>Collaborate closely with product, platform, and security engineers to influence architecture early and ship reliable systems</li> <li>Mentor and pair with other engineers, helping raise the bar for reliability, operational maturity, and engineering excellence</li> </ul> <h3><strong>Who you are</strong></h3> <p>You are a cloud-native, platform-focused SRE who uses software to build and operate reliable production systems at scale.</p> <ul> <li>You write and maintain production-quality code (e.g. Python, Go, or similar) to build internal platforms, automate operations, and improve system reliability</li> <li>You have built, deployed, and operated distributed, cloud-native systems and understand failure modes such as partial outages, dependency failures, resource saturation, and cascading impact</li> <li>You have experience operating containerized workloads and platforms (e.g. Kubernetes) in production, including deployment strategies, scaling behavior, and service networking</li> <li>You are comfortable participating in on-call rotations and diagnosing production issues</li> <li>You have designed and operated observability systems and know how to build actionable alerts that reflect real user and service impact</li> <li>You apply SRE concepts such as SLIs, SLOs, error budgets, and burn-rate–based alerting to guide engineering decisions and operational response</li> <li>You have hands-on experience with infrastructure as code and declarative configuration (e.g. Terraform, Kubernetes manifests, policy-as-code)</li> <li>You have performed capacity planning, load testing, and performance analysis for distributed services and platforms</li> <li>You routinely contribute to post-incident reviews and drive concrete, code-focused follow-up actions that prevent recurrence</li> <li>You are comfortable reviewing and contributing to technical designs, platform APIs, operational runbooks, and system documentation</li> <li>You’ve already experimented with AI in work or personal projects, and you’re excited to dive in and learn fast. You’re hungry to responsibly explore new AI tools and workflows, finding ways to make your work smarter and more efficient.<br><br></li> </ul> <h3><strong>Nice to have</strong></h3> <ul> <li>Experience supporting security-critical platforms or building internal security tooling</li> <li>Familiarity with identity, access management, secrets management, or policy enforcement systems</li> <li>Experience operating systems at scale in cloud environments (AWS preferred)</li> <li>Background in resilience testing, fault injection, or chaos engineering</li> <li>A strong comprehension of algorithms and data structures at scale</li> </ul> <h3><strong>Tech Stack</strong></h3> <p>Klaviyo’s platform is primarily built with Python and React and runs on AWS. Engineers join us from a wide range of technical backgrounds and are supported in learning our stack.</p> <p>Core technologies include:</p> <ul> <li>Python / Django / FastAPI</li> <li>MySQL / Redis / Memcached</li> <li>RabbitMQ / Celery / Apache Kafka / Apache Pulsar</li> <li>AWS / Terraform / Kubernetes<br><br></li> </ul> <h3><strong>Location &amp; Work Model</strong></h3> <p>This role is based in Dublin, Ireland and follows a hybrid working model. Klaviyo supports work authorization and relocation for this position.</p> <p>At Klaviyo, we enjoy tackling meaningful engineering challenges and value people who take ownership, learn continuously, and collaborate openly. We are committed to building inclusive teams and encourage applications from candidates of all backgrounds.</p> <p>Klaviyo is growing fast and we have openings for all skill levels across all of our teams. Learn more about our engineering culture at<a href="https://klaviyo.tech"> https://klaviyo.tech</a></p> <p>&nbsp;</p><br><p>We use Covey as part of our hiring and / or promotional process. For jobs or candidates in NYC, certain features may qualify it as an AEDT. As part of the evaluation process we provide Covey with job requirements and candidate submitted applications. We began using <a href="https://getcovey.com/product/covey-scout-inbound">Covey Scout for Inbound</a> on April 3, 2025.</p><p>Please see the independent bias audit report covering our use of Covey <a href="https://getcovey.com/nyc-local-law-144" style="text-decoration:underline">here</a></p><div class="content-pay-transparency"><div class="pay-input"><div class="description"><p>Our salary range reflects the cost of labour in the country where the job post is advertised. The base salary offered for this position is determined by several factors, including the applicant’s job-related skills, relevant experience, education or training, and work location.</p> <p>In addition to base salary, our total compensation package may include participation in the company’s annual cash bonus plan, variable compensation (OTE) for sales and customer success roles, equity, sign-on payments, and a comprehensive range of health, welfare, and wellbeing benefits based on eligibility.&nbsp;</p> <p>Your recruiter can provide more details about the specific salary/OTE&nbsp; range for your preferred location during the hiring process.</p></div><div class="title">Base Pay Range in Local Currency:</div><div class="pay-range"><span>€92.000</span><span class="divider">&mdash;</span><span>€138.000 EUR</span></div></div></div><div class="content-conclusion"><p><strong>Get to Know Klaviyo</strong></p> <p>We’re Klaviyo (pronounced clay-vee-oh). We empower creators to own their destiny by making first-party data accessible and actionable l
Apply on Klaviyo ↗

This role is listed directly from Klaviyo's careers page. · Posted 14 hours ago