Optimize, compress, and distill large language and vision models for on-device inference. Build pipelines for distillation and hardware-specific compilation, and benchmark performance across NPU/GPU architectures.
We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.
Responsibilities:
- Compress and optimize large language and vision models for on-device inference.
- Develop pipelines for model distillation and hardware-specific compilation.
- Benchmark performance across various NPU/GPU architectures.
Qualifications:
- Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
- Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
- Strong C++ and Python skills.
Similar Jobs
Agency • Artificial Intelligence • Blockchain • Web3
Run adversarial tests on language and multimodal models, build guardrails and real-time filters for autonomous tool use, and support RLHF alignment and constitutional AI development to ensure safe AI deployment.
Top Skills:
Adversarial MlGuardrailsJailbreak TaxonomiesLlmsMultimodal AgentsPrompt EngineeringReal-Time FilteringRed-Teaming FrameworksRlhf
Cloud • Fintech • Food • Information Technology • Software • Hospitality
As a Staff Software Engineer, you'll lead a team in developing and delivering scalable software solutions for employee management in the restaurant industry, focusing on enhancing customer and employee experiences.
Top Skills:
GraphQLJavaKotlinReactRestTypescript
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Cybersecurity • Data Privacy
As a Strategic Sales Engineer, you'll provide technical direction, support sales efforts, and develop integrated solutions addressing customer needs to exceed sales quotas.
Top Skills:
Backup And Disaster RecoveryCloud Data ManagementData AnalyticsSan (Storage Area Network) Systems
What you need to know about the Vancouver Tech Scene
Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.


