Featherless AI — AI Researcher — Distillation
About the role
Featherless AI is hiring an AI Researcher to advance model distillation techniques that compress large open-source LLMs into smaller, faster variants without sacrificing quality. Your work will directly improve the efficiency and reach of Featherless's hosted model fleet, which is used by developers globally.
What you'll do
- Design and run knowledge distillation experiments at scale across diverse model families
- Evaluate trade-offs between model size, output quality, and inference cost
- Collaborate with engineering to integrate distilled models into the Featherless serving platform
- Curate and synthesize training datasets optimized for distillation objectives
- Share findings internally and contribute to the broader open-source distillation ecosystem
Requirements
- Research background in model compression, knowledge distillation, or model efficiency
- Experience training transformer models at scale (pre-training, fine-tuning, or distillation)
- Proficiency in Python and in either PyTorch or JAX; familiarity with distributed training
- Strong experimental design skills and comfort analyzing large training runs
About Featherless AI
Featherless AI is a serverless inference platform hosting 3,000+ open-source LLMs, letting developers call any model via a simple API without managing GPU infrastructure.