Cognition

Research, Post-Training Data

Posted 3 Days Ago

In-Office or Remote

Hiring Remotely in CA

Mid level

The role involves designing data strategies, developing scalable pipelines for human labeling, and researching human preferences for data-driven AI model enhancements. It includes publishing research and improving model alignment and efficacy through human and synthetic data techniques.

The summary above was generated by AI

Who We Are

We are an applied AI lab building end-to-end software agents. We're the team behind Devin, the first AI software engineer, and Windsurf, an AI-native IDE. These products represent our vision for AI that doesn't just assist engineers, but works alongside them as a genuine teammate.

Our team is small and talent-dense: world-class competitive programmers, former founders, and researchers from organizations at the frontier of AI, including Scale AI, Palantir, Cursor, and Google DeepMind.

Role Mission

Post-training data research sits at the core of our roadmap and at the intersection of human insight and machine learning. This is the critical bridge between raw model intelligence and a system that is actually useful, safe, and collaborative for humans.

Our work combines human and synthetic data techniques to capture the nuances of human behavior and use them to steer models. We research the mechanisms that create value for people so that we can explain, predict, and optimize for human preferences, behaviors, and satisfaction. We also explore new paradigms for human-AI interaction and scalable oversight. This role blends fundamental research and practical engineering; we don't distinguish between the two.

What You'll Accomplish

Data Strategy: Design and execute data collection and synthesis strategies for post-training by combining human feedback, preference data, and synthetic examples to guide model behavior.
Scalable Pipelines: Develop pipelines and frameworks for scalable, high-quality human labeling, model-assisted labeling, and synthetic data generation.
Human Preference Modeling: Research and model human preferences and behavior, creating data-driven methods to improve reasoning, truthfulness, and helpfulness.
Evaluation Design and Integrity: Iterate on evals through a continuous loop of defining evaluations, optimizing them, and identifying gaps. You'll be responsible for making numbers go up and making sure the numbers are meaningful.
Metrics and Benchmarks: Design and evaluate metrics that measure data quality, alignment, and the real-world impact of post-training interventions.
Scaling and Exploration: Scale existing methodologies and develop new ones when current approaches hit ceilings. We expect both rigor and invention.
Research Publication: Publish and present work that moves the community forward. Share code, datasets, and insights that accelerate progress across industry and academia.

Exceptional Candidates Have Demonstrated

Strong engineering skills with the ability to contribute code and debug in complex codebases
Experience with data curation, human feedback, or synthetic data generation for large language models or similar systems
Ability to design, run, and interpret experiments with scientific rigor and clarity
Proficiency in Python and at least one deep learning framework (PyTorch, TensorFlow, etc.); comfortable with distributed training and code that scales
Strong grasp of probability, statistics, and ML fundamentals; can distinguish real effects from noise and bugs
Prior experience with RLHF, RLAIF, preference modeling, or reward learning for large models
Experience managing or analyzing human data collection campaigns or large-scale annotation workflows
Research or engineering contributions in alignment, data-centric AI, or human-AI collaboration
Familiarity with synthetic data pipelines, active learning, or model-assisted labeling

We care more about demonstrated capability than credentials. A PhD is one signal among many.

Resources & Environment

Small, highly selective team where research and product move together; prototypes reach real deployment quickly
You'll have access to the data, tooling, and compute needed to run experiments and collection campaigns at frontier scale from day one
The environment rewards speed, autonomy, and technical depth with minimal process overhead; this is one of the most competitive and fast-moving problems in AI

Equal Opportunity

Cognition is an equal opportunity employer. We do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, veteran status, or any other protected characteristic under applicable law. We are committed to providing reasonable accommodations for candidates with disabilities throughout the hiring process. Please let us know if you need any.

Top Skills

Python

PyTorch

TensorFlow
