Zeta Global Logo

Zeta Global

Senior Site Reliability Engineer

Posted 2 Hours Ago
Easy Apply
Remote
Hiring Remotely in United States
Senior level
Easy Apply
Remote
Hiring Remotely in United States
Senior level
The Senior Site Reliability Engineer will implement and manage service level objectives, lead postmortems, enhance system reliability, automate operations, and analyze key metrics to drive improvements. They will collaborate with product teams and lead initiatives on capacity and reliability while utilizing technology like OpenTelemetry and AWS.
The summary above was generated by AI


WHO WE ARE 

Zeta Global (NYSE: ZETA) is the Data-Powered Marketing Cloud that leverages advanced artificial intelligence (AI) and trillions of consumer signals to make it easier for marketers to acquire, grow, and retain customers more efficiently. Through the Zeta Marketing Platform (ZMP), our vision is to make sophisticated marketing simple by unifying identity, intelligence, and omnichannel activation into a single platform – powered by one of the industry’s largest proprietary databases and AI. Our enterprise customers across multiple verticals are empowered to personalize experiences with consumers at an individual level across every channel, delivering better results for marketing programs. Zeta was founded in 2007 by David A. Steinberg and John Sculley and is headquartered in New York City with offices around the world.

Job Overview:
We are looking for a dynamic and highly skilled Senior SRE Engineer to join the team. 

 

Key Responsibilities:

• Implement and manage SLOs, SLIs, and error budgets.

• Lead and promote postmortems, ensuring robust root cause analysis to drive continuous system improvement. Analyze historical data to identify improvement areas.

• Implement full observability across systems using OpenTelemetry.

• Reduce toil through runbook automation.

• Record and track key MTTx metrics (MTTA, MTTR, MTTF, etc.).

• Lead design sessions on capacity planning, reliability by design, automation, and alerting.

• Collaborate with product teams to enhance system reliability.

• Engage in strategic initiatives for capacity, reliability, and automation, ensuring alignment with business goals.

What We're Looking For:

• 3+ years of experience as an SRE.

• 2+ years of software development experience, with a strong emphasis on automation.

• 3+ years of experience in AWS

• Experience managing and designing high-throughput systems processing millions of transactions daily.

• Deep understanding of observability with hands-on experience implementing SLIs, SLOs, and error budgets.

• Proficiency in Kubernetes, Terraform, and cloud platforms (AWS) with a focus on scalability and reliability.

• Hands-on experience with Infrastructure as Code (IaC) tools.

• Experience with distributed systems and microservices architecture (MSA).

• Production experience with distributed tracing.

• Strong software development background in Python, Golang, and Bash scripting.

• Experience with OTEL Collectors, collection, sampling, and customizations.

• Solid understanding of SLIs, SLOs, and error budgets.

• Hands-on experience with CI/CD platforms (GitOps, GitLab, Jenkins, ArgoCD, etc.).

• Expertise in incident management and root cause analysis.

• Knowledge of modern deployment strategies (Canary, Blue-Green, etc.).

• Familiarity with resiliency patterns (circuit breakers, retry mechanisms, load balancing, etc.).

• Experience with SQL and NoSQL databases and understanding of distributed systems.

• Proficiency in statistical analysis applied to metrics.

• Experience with high-performance, low-latency systems.

• Proven experience in cloud cost optimization strategies.

• Containerization experience (docker, k8s) on-prem and cloud.

• Experience with Kafka or other distributed messaging systems.

• Strong understanding of security and compliance standards within DevOps/SRE environments.

BENEFITS & PERKS

  • Unlimited PTO
  • Excellent medical, dental, and vision coverage
  • Employee Equity and Stock Purchase Plan
  • Employee Discounts, Virtual Wellness Classes, and Pet Insurance And more!!


    COMPENSATION RANGE
     

    The compensation range for this role is $130,000.00 - $155,000.00, depending on location and experience.


PEOPLE & CULTURE AT ZETA

Zeta considers applicants for employment without regard to, and does not discriminate on the basis of an individual’s sex, race, color, religion, age, disability, status as a veteran, or national or ethnic origin; nor does Zeta discriminate on the basis of sexual orientation, gender identity or expression.

We’re committed to building a workplace culture of trust and belonging, so everyone feels invited to bring their whole selves to work. We provide a forum for employees to celebrate, support and advocate for one another. Learn more about our commitment to diversity, equity and inclusion here: https://zetaglobal.com/blog/a-look-into-zetas-ergs/

 

ZETA IN THE NEWS!

https://zetaglobal.com/press/?cat=press-release

 

#LI-DD1
#LI-Remote

Top Skills

Bash
Go
Python

Similar Jobs at Zeta Global

9 Days Ago
United States
Remote
2,194 Employees
Expert/Leader
2,194 Employees
Expert/Leader
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Principal Software Engineer will lead technical design and implementation of new product features while mentoring team members, optimizing infrastructure, and collaborating with product managers. Responsibilities include evaluating technologies, engaging in architectural discussions, and improving system performance and observability through DevOps initiatives.
9 Days Ago
United States
Remote
2,194 Employees
Senior level
2,194 Employees
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Big Data DevOps Lead will enhance Big Data infrastructure efficiency, coordinate inter-team initiatives, and lead architecture efforts. Responsibilities include maintaining system reliability, deploying monitoring tools, and implementing disaster recovery practices. On-call support for production systems is also required.
10 Days Ago
United States
Remote
2,194 Employees
Senior level
2,194 Employees
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
As a Lead AI Engineer, you will develop and implement scalable machine learning systems, focusing on deep learning and generative AI. Collaborate with cross-functional teams, apply software engineering principles, and refine model performance while integrating advanced ML capabilities into products.

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account