Lemurian Labs Logo

Lemurian Labs

Runtime Engineer

Reposted 5 Days Ago
Be an Early Applicant
Hybrid
8 Locations
Mid level
Hybrid
8 Locations
Mid level
Design and develop a multi-target runtime, optimize kernels, analyze compiler outputs, and improve architecture based on ML engineers' needs.
The summary above was generated by AI
At Lemurian Labs, we’re on a mission to bring the power of AI to everyone—without leaving a massive environmental footprint. We care deeply about the impact AI has on our society and planet, and we’re building a rock-solid foundation for its future, ensuring AI grows sustainably and responsibly. Because let’s face it, what good is innovation if it doesn’t help the world?

We are building a high-performance, portable compiler that lets developers “build once, deploy anywhere.” Yes, anywhere. We’re talking about seamless cross-platform compatibility, so you can train your models in the cloud, deploy them to the edge, and everything in between—all while optimizing for resource efficiency and scalability.

If the idea of sustainably scaling AI motivates you and you’re excited about making AI development both powerful and accessible, then we’d love to have you. Join us at Lemurian Labs, where you can have fun building the future—without leaving a mess behind.

Key Duties

  • Design, develop, maintain and improve our multi-target runtime
  • Use the latest techniques in parallelization and partitioning to automate generation and exploit highly optimized kernels
  • Rapid prototyping and data driven exploration of new ideas
  • Benchmark and analyze the outputs produced by our optimizing compiler on target hardware
  • Work closely with our product team to understand the evolving needs of ML engineers and drive improvements in architecture
  • Build tools to collect and analyze performance bottlenecks

Essential Skills and Experience

  • A deep understanding of asynchronous, concurrent programming.
  • 4+ years of experience with C/C++ (C++14 or newer).
  • An understanding of HW architecture (vector vs scalar registers and instructions, memory hierarchies).
  • Knowledge of operating system kernel development or hypervisor development.

Preferred Skills and Experience

  • Experience developing or maintaining libraries like CUDA or ROCm.
  • Experience with GPU programming.
  • Experience with high performance computing (HPC).
  • Masters or PhD degree in computer science, or equivalent practical experience.
  • Knowledge of DL frameworks such as PyTorch, JAX or Triton.
  • Experience with programming large compute clusters.

Salary depends on experience and geographical location. 

This salary range may be inclusive of several career levels and will be narrowed during the interview process based on a number of factors, such as candidate’s experience, knowledge, skills and abilities, as well as internal equity among our team. 

Additional benefits for this role may include: equity, company bonus opportunities; medical, dental, and vision benefits; retirement savings plan; and supplemental wellness benefits.

Lemurian Labs ensures equal employment opportunity without discrimination or harassment based on race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity or expression, age, disability, national origin, marital or domestic/civil partnership status, genetic information, citizenship status, veteran status, or any other characteristic protected by law.

EOE

Top Skills

C/C++
Cuda
Gpu
High Performance Computing
Jax
PyTorch
Rocm
Triton

Similar Jobs

7 Days Ago
In-Office
2 Locations
Senior level
Senior level
Artificial Intelligence
Design and develop high-performance distributed software for scalable AI training systems, focusing on data pipelines and system efficiency.
Top Skills: C/C++PythonPyTorch
Yesterday
In-Office
7 Locations
Mid level
Mid level
Artificial Intelligence • Software • Generative AI
Build and operate core runtime services for Glean's AI agents, focusing on performance, reliability, and integration with leading LLM providers.
Top Skills: AWSAzureC++GCPGoJavaKafkaKubernetesOpentelemetryPub/SubPythonRedis
6 Days Ago
In-Office
Toronto, ON, CAN
Mid level
Mid level
Artificial Intelligence
Develop high-performance code for distributed systems, optimizing data structures and algorithms to leverage hardware resources within the Cerebras architecture.
Top Skills: AssemblyC/C++Python

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account