About the role:
You will be responsible for pretraining data research. You may be working on understanding pretraining data trends and scaling laws, optimizing pretraining data mixes, investigating potential new sources of data, building research tools to better understand experimental results, or figuring out how to process and use pretraining data most effectively.
You may be a good fit if you:
- Have significant software engineering experience
- Are results-oriented, with a bias towards flexibility and impact
- Pick up slack, even if it goes outside your job description
- Are comfortable with a very empirical research environment
- Care about the societal impacts of your work
Strong candidates may also have experience with:
- High-performance, large-scale ML systems
- Language modeling with transformers
- Large-scale ETL
- Designing ML experiments and researching ML fundamentals
- Inspecting and iterating on data (e.g. ML competitions, quantitative finance)
Representative projects:
- Comparing the compute efficiency of different datasets
- Making a multimodal dataset in a format models can easily consume
- Scaling a data processing job to thousands of machines
- Designing a research tool to analyze and manage data ablation experiments
- Creating an interactive visualization of semantic clusters in our training data