The role involves post-training of LLMs, model alignment, server operation for checkpoint routing, and building evaluation pipelines.
Job Responsibilities:
1. Advanced post-training of large language models (e.g. SFT, RLHF/RLAIF, continual pretraining).
2. Aligning models for reliable JSON-schema function calls and external tool usage.
3. Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates.
4. Experience in distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, mixed precision, and performance tuning on vLLM or Triton clusters.
5. Build offline and live eval pipelines for alignment, factuality, grounding, and hallucinations.
Qualifications
1. Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
2. 3+ years of experience in developing and optimizing large language models.
3. Proven track record in implementing advanced post-training techniques (SFT, RLHF, RLAIF, continual pretraining).
4. Hands-on experience with distributed training frameworks (DeepSpeed, FSDP) and optimization techniques (LoRA, QLoRA, mixed precision).
5. Familiarity with model alignment, JSON-schema function calls, and external tool integration.
6. Experience in building and maintaining evaluation pipelines for model performance assessment.
7. Proficiency in Python and relevant machine learning frameworks (e.g., PyTorch, TensorFlow).
8. Strong understanding of distributed systems and high-performance computing.
9. Experience with model deployment and inference optimization on vLLM or Triton clusters.
10. Knowledge of JSON-schema and API development.
Top Skills
Deepspeed
Fsdp
Lora
Python
PyTorch
Qlora
TensorFlow
Triton
Vllm
Similar Jobs
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Sr. Ethics & Compliance Manager oversees compliance in Canadian government contracting and security, advises on legal frameworks, and develops internal policies.
Top Skills:
Ai-Enhanced TechnologyCompliance RegulationsDocument Safeguarding ProgramsPublic Procurement Law
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
The Sales Strategy & Operations Manager focuses on optimizing sales processes, managing sales technology, developing performance metrics, and collaborating on strategic initiatives to improve sales operations.
Top Skills:
CRMSpreadsheetsSQL
Cloud • Security • Software • Cybersecurity • Automation
The Principal Product Marketing Manager will lead the go-to-market strategy for GitLab's security solutions, influencing positioning and collaborating across teams to drive customer engagement and revenue growth.
Top Skills:
AIApplication SecurityComplianceDevsecopsSoftware Security
What you need to know about the Vancouver Tech Scene
Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.



