AI Research Engineer (Pre-training - LLM & Multi-Modal)
About the role
Join Tether's AI model team to drive foundational research in large-scale LLM and multi-modal pre-training on distributed GPU infrastructure. You will design and scale novel architectures, tokenizers, and cross-modal alignment layers, curate massive multi-modal datasets, and optimize pre-training pipelines integrating text, vision, and audio modalities.
What you'll do
- Conduct large-scale pre-training for LLMs and multi-modal models (text, vision, audio) on distributed servers equipped with thousands of NVIDIA GPUs
- Design and prototype innovative architectures, tokenizers, and cross-modal alignment layers to enhance model intelligence and multi-modal understanding
- Source, filter, and curate massive-scale textual and multi-modal datasets; build robust data pipelines for efficient pre-training
- Execute experiments independently and collaboratively, analyze results, and refine training methodologies for optimal performance and token efficiency
- Debug and eliminate bottlenecks in model efficiency, computational performance, and cross-modal alignment stability during long training runs
- Contribute to the advancement of distributed training systems for seamless scalability and hardware efficiency
Requirements
- PhD in NLP, Machine Learning, or a related field with a strong AI R&D track record and publications in A* conferences (ICLR, CVPR, NeurIPS, ICML, ACL, ICCV, IJCAI)
- Hands-on experience contributing to large-scale LLM or multi-modal pre-training runs on distributed servers with thousands of NVIDIA GPUs
- Familiarity with large-scale distributed training frameworks, libraries, and tools
- Deep knowledge of state-of-the-art transformer and non-transformer modifications aimed at improving intelligence, efficiency, and scalability
- Strong expertise in PyTorch and Hugging Face with practical experience in model development, continual pre-training, and deployment
About Tether Operations Limited
Tether is a global fintech company behind USDT, the world's most widely used stablecoin, and operates divisions in AI, communications, energy, and education through its Tether Data, Tether Evo, Tether Power, and Tether Education subsidiaries.
Visit Tether Operations Limited→
AI Alerts shares third-party job opportunities for informational purposes only. We are not the employer and are not involved in the hiring process. Always verify the company and role through official channels before applying, and never pay to apply, train, onboard, process documents, or secure a job offer. Legitimate employers do not ask applicants for money. Read our Terms to learn more.