Fastino Logo

Fastino

AI Platform Engineer

Posted 10 Days Ago
Be an Early Applicant
Remote or Hybrid
7 Locations
Senior level
Remote or Hybrid
7 Locations
Senior level
Design, build, and own the end-to-end model platform: training/fine-tuning pipelines, RL workflows, data ingestion/curation, reproducible experiments, scalable inference services, and production deployment.
The summary above was generated by AI

AI Platform Engineer

Full-time | Hybrid or Remote

Introduction:

  • Join us at Fastino as we build the next generation of LLMs. Our team, boasting alumni from Google Research, Apple, Stanford, and Cambridge is on a mission to develop specialized, efficient AI.

  • Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb

  • Fastino has raised $25M (as featured in TechCrunch) through our seed round and is backed by leading investors including Microsoft, Khosla Ventures, Insight Partners, Github CEO Thomas Dohmke, Docker CEO Scott Johnston, and others.

The Role:

We are looking for a systems-level engineer to own Fastino’s model platform end-to-end.

This is not a feature role.

You will design and build:

  • Training pipelines

  • Fine-tuning workflows

  • RL infrastructure

  • Data ingestion and curation systems

  • Inference services

  • Scalability and backend architecture

You will own the platform that turns models into production systems.

What You’ll Work On:

  • Architect distributed fine-tuning pipelines for small encoder and decoder models

  • Implement LoRA, adapters, distillation, and compression workflows

  • Design experiment tracking, reproducibility, and dataset versioning systems

  • Optimize training efficiency (GPU utilization, memory, throughput, cost)

  • Design scalable RL training workflows (policy optimization, reward modeling)

  • Integrate RL with supervised fine-tuning and distillation

  • Build evaluation loops and automated regression detection

  • Build scalable ingestion pipelines for structured and unstructured data

  • Design dataset curation, filtering, and quality enforcement systems

  • Implement reproducible data workflows tied to training runs

  • Architect low-latency inference services

  • Design safe production deployment workflows

Strong candidates will have:

  • Deep experience with PyTorch and transformer architectures

  • Experience building production ML systems end-to-end

  • Experience with distributed training and inference

  • Experience optimizing GPU workloads

  • Strong backend and systems engineering fundamentals

  • Experience with containerization and orchestration

  • Cloud infrastructure experience (AWS/GCP/Modal/Together.ai etc)

Bonus:

  • Experience with RL or RLHF

  • Experience with distillation and compression

  • Experience building internal ML platforms

Top Skills

Pytorch,Transformers,Lora,Adapters,Distillation,Rl,Rlhf,Gpus,Containerization,Orchestration,Aws,Gcp,Modal,Together.Ai

Similar Jobs

11 Days Ago
Easy Apply
Remote or Hybrid
4 Locations
Easy Apply
Mid level
Mid level
Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI
Build and own observability, lifecycle management, and tooling for LLM-powered features. Design closed-loop evaluation pipelines, dashboards for cost/usage/latency, tie user feedback to prompts, and establish prompt development, testing, deployment, and monitoring best practices while partnering across engineering teams.
Top Skills: AnthropicGoGCPGoogle VertexKubernetesOpenaiPostgres
12 Days Ago
Easy Apply
Remote or Hybrid
CA
Easy Apply
Mid level
Mid level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Build and evolve core AI platform capabilities: design shared execution patterns, APIs, and backend services to support multi-step GenAI workflows, model integrations, and scalable production systems. Collaborate with AI engineers, data scientists, and product partners to turn emerging AI use cases into reliable, extensible platform features.
Top Skills: GoJavaLangchainLlmsMcpOpenai SdkPython
4 Days Ago
Easy Apply
Remote or Hybrid
CA
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
As a Senior Software Engineer on the AI Platform, you'll develop scalable AI systems, focusing on backend services, workflows, and integrations, partnering with cross-functional teams.
Top Skills: GoJavaLangchainOpenai SdkPython

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account