Ensure performance, reliability, and scalability of platforms. Responsibilities include systems design, incident management, automation, and mentoring junior engineers.
Senior Live-Ops Site Reliability Engineer
Location: Remote (Anywhere in Canada)
Company Overview
eDynamic Learning is celebrating 16 years of serving educators. Founded by a classroom teacher, we're on a mission to empower educators with accessible and equitable resources, guiding students on their journey to life after graduation. We are dedicated to supporting both teachers and programs that facilitate student exploration of interests, career options, and skill acquisition through Career and Technical Education (CTE). We prioritize quality and the development of vital life readiness skills, including interpersonal communication and financial literacy.
Our commitment to fostering exploration starts early, with resources tailored to middle school students. Our rich courseware catalog and Learning Blade resource have a proven track record of expanding STEM, computer science, and career interest and awareness.
As the largest publisher of CTE and elective digital curriculum in North America, we offer a vast catalog of over 250 courses spanning grades 6-12. Our CTE pathway curriculum aligns to 14 career clusters, preparing students for nearly 100 industry certifications. To help bring our curriculum to learners, we provide professional development as well as virtual instructional services, supported by certified teachers, that facilitate personalized learning.
eDynamic Learning doesn't stop at coursework alone. We are passionate about helping students grow their skills through experiential learning through our Knowledge Matters virtual simulation instructional materials and projects. Our simulations are true hands-on learning in a virtual environment.
We take pride in the fact that our solutions and services are designed to empower educators and students alike, enabling them to take a transformative journey of exploration, engage in learning, and participate in real-world experiences.
In July 2025, eDynamic Learning was acquired by Pearson.
Role Overview
We are seeking a Senior Live-Ops Site Reliability Engineer (SRE) to ensure the performance, reliability, and scalability of eDynamic Learning’s platforms and services.
In this role, you will be a key member of the engineering operations team, responsible for maintaining uptime, optimizing production systems, and building automation that scales. You’ll work closely with software engineering, DevOps, and infrastructure teams to deliver seamless and reliable experiences for students and educators across North America.
This position combines hands-on engineering, systems design, and incident management in a mission-driven, fast-paced environment.
Responsibilities
Location: Remote (Anywhere in Canada)
Company Overview
eDynamic Learning is celebrating 16 years of serving educators. Founded by a classroom teacher, we're on a mission to empower educators with accessible and equitable resources, guiding students on their journey to life after graduation. We are dedicated to supporting both teachers and programs that facilitate student exploration of interests, career options, and skill acquisition through Career and Technical Education (CTE). We prioritize quality and the development of vital life readiness skills, including interpersonal communication and financial literacy.
Our commitment to fostering exploration starts early, with resources tailored to middle school students. Our rich courseware catalog and Learning Blade resource have a proven track record of expanding STEM, computer science, and career interest and awareness.
As the largest publisher of CTE and elective digital curriculum in North America, we offer a vast catalog of over 250 courses spanning grades 6-12. Our CTE pathway curriculum aligns to 14 career clusters, preparing students for nearly 100 industry certifications. To help bring our curriculum to learners, we provide professional development as well as virtual instructional services, supported by certified teachers, that facilitate personalized learning.
eDynamic Learning doesn't stop at coursework alone. We are passionate about helping students grow their skills through experiential learning through our Knowledge Matters virtual simulation instructional materials and projects. Our simulations are true hands-on learning in a virtual environment.
We take pride in the fact that our solutions and services are designed to empower educators and students alike, enabling them to take a transformative journey of exploration, engage in learning, and participate in real-world experiences.
In July 2025, eDynamic Learning was acquired by Pearson.
Role Overview
We are seeking a Senior Live-Ops Site Reliability Engineer (SRE) to ensure the performance, reliability, and scalability of eDynamic Learning’s platforms and services.
In this role, you will be a key member of the engineering operations team, responsible for maintaining uptime, optimizing production systems, and building automation that scales. You’ll work closely with software engineering, DevOps, and infrastructure teams to deliver seamless and reliable experiences for students and educators across North America.
This position combines hands-on engineering, systems design, and incident management in a mission-driven, fast-paced environment.
Responsibilities
- Own the availability, reliability, and performance of production systems and services
- Design and maintain scalable infrastructure to support high-traffic educational applications
- Build monitoring, alerting, and observability systems to proactively detect and resolve issues
- Lead incident response and postmortem processes to improve resilience and reduce downtime
- Develop automation tools and scripts to streamline deployments, operations, and recovery
- Collaborate closely with engineering and DevOps teams to design and implement fault-tolerant systems
- Continuously refine CI/CD pipelines and deployment processes for speed and safety
- Champion best practices in infrastructure-as-code (IaC), security, and configuration management
- Partner with development teams to ensure reliable service releases and smooth rollouts
- Analyze capacity trends and system performance to plan for future growth
- Mentor junior engineers and contribute to an operational culture of transparency, ownership, and continuous learning
- Bachelor’s Degree in Computer Science or equivalent experience
- 8+ years of experience in systems engineering, DevOps, or Site Reliability Engineering roles
- Proven experience managing mission-critical, high-availability production environments
- Strong background in Linux systems administration and performance tuning
- Expertise with AWS infrastructure and related services
- Proficiency with Docker, Kubernetes, and infrastructure-as-code tools such as Terraform or CloudFormation
- Solid programming/scripting skills in Python, Bash, or similar
- Experience with CI/CD pipelines, deployment automation, and Git-based workflows
- Deep understanding of networking, HTTP, and distributed systems principles
- Familiarity with monitoring and observability tools (Datadog, Prometheus, Grafana, etc.)
- Legally eligible to work in Canada and/or the U.S.
- Self-starter who thrives in a remote, fast-paced environment
- Strong problem-solving and debugging skills
- Excellent communication and collaboration abilities
- Strong incident management, root cause analysis, and troubleshooting skills
Top Skills
AWS
Bash
CloudFormation
Datadog
Docker
Grafana
Kubernetes
Prometheus
Python
Terraform
Similar Jobs
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Dealer Digital Solution Manager manages the Dealer Digital Solutions program, ensuring effective vendor operations, product oversight, and strategy execution for digital products in automotive.
Top Skills:
AdobeAnalyticsCRMDigital AdvertisingExcelOfficePowerPointSemSeo
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The role involves designing and executing software tests for automated driving software, analyzing test data, collaborating in Agile teams, and contributing to continuous improvement initiatives.
Top Skills:
C/C++JenkinsMatlabPythonRobot FrameworkSimulink
Artificial Intelligence • Cloud • Information Technology • Machine Learning • Software • Big Data Analytics • Automation
The Product Manager will lead the App Platform team to enhance user experiences, collaborating with UX design, engineering, and product teams and ensuring compliance with accessibility and localization standards.
Top Skills:
Analytics ToolsMobile Application DesignSaaSWeb Application Design
What you need to know about the Vancouver Tech Scene
Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.


