Senior Machine Learning Engineer – LLMs

Netherlands - Amsterdam
PDT – Data Science & AI /
1. Role: Permanent /
Hybrid
Join our AI team at Prosus, the largest consumer internet company in Europe and one of the biggest tech investors in the world. You'll be working on the team that drives growth and innovation across the company, with your work directly impacting how millions of people shop online.

Who we’re looking for

We're seeking a Senior Machine Learning Engineer to train domain-specific language models and provide technical leadership to the team. You'll own critical parts of our training infrastructure, mentor engineers, and drive technical decisions from data preparation through production deployment. You have deep hands-on experience training language models at scale, lead by example through rigorous experimentation and high-quality code, and are motivated by seeing your work deployed to millions of users. You thrive in fast-paced environments where you balance technical depth with practical business impact.

What you’ll do

    • Analyze model performance and training data, formulate hypotheses, design and execute rigorous experiments to systematically improve model quality, training and inference efficiency, and downstream task performance
    • Drive technical decision-making for model architecture, training strategies, and infrastructure choices
    • Provide technical leadership and mentorship to ML engineers and interns, conducting code reviews, sharing best practices, and accelerating team growth
    • Train large language models through continued pre-training and full parameter fine-tuning on proprietary datasets
    • Build and optimize distributed training infrastructure across multi-node GPU clusters using frameworks like DeepSpeed, FSDP, Megatron-LM, or Axolotl
    • Own large-scale data preparation: filtering, quality assessment, deduplication, and data mixture strategies for training corpora at 100B+ token scale
    • Generate and curate high-quality synthetic data for instruction fine-tuning and capability enhancement
    • Debug training stability issues, optimize training and inference throughput (quantization, distillation, serving optimization), and monitor model performance throughout long-running distributed jobs
    • Build robust evaluation frameworks and establish metrics to measure model quality and guide decisions
    • Write production-grade, well-tested code and set engineering standards for the team

Minimum qualifications

    • 7+ years of ML engineering experience
    • Technical leadership experience: mentoring engineers, conducting code reviews, making architecture decisions, and delivering projects with measurable business impact
    • Proven experience training and deploying language models to production (embedding models, encoder models, or large language models) including pre-training, continued pre-training, or fine-tuning with rigorous evaluation and inference optimization
    • Experience preparing large-scale training datasets: data filtering, quality assessment, deduplication strategies, and data mixture design
    • Hands-on experience with distributed training frameworks (DeepSpeed, FSDP, Megatron-LM, or Axolotl) including orchestrating multi-node jobs, debugging failures, and optimizing throughput
    • Strong understanding of training dynamics at scale: debugging loss instabilities, tuning learning rate schedules, managing training stability across long-running multi-node jobs
    • Expert Python and PyTorch with production experience using training libraries (Transformers, DeepSpeed, Accelerate)

Preferred qualifications

    • Published research at ML conferences (NeurIPS, ICML, ICLR, ACL, EMNLP), released models on Hugging Face, created public benchmarks, or contributed to open-source projects
    • Experience with post-training methods: RLHF, DPO, GRPO, or other reinforcement learning approaches for alignment and instruction-following
    • Experience optimizing models for production inference including quantization, model compression, distillation, and serving frameworks (vLLM, TensorRT-LLM)
    • Understanding of memory optimization: gradient checkpointing, mixed precision training (FP16, BF16, FP8), ZeRO optimization
    • Deep knowledge of GPU architectures (A100, H100, H200) and their implications for training and inference optimization
    • Track record of building synthetic data generation pipelines for instruction tuning or domain adaptation

What we offer

    • High-impact AI projects that are strategically vital to the company, with direct engagement from senior leadership including the CEO
    • State-of-the-art infrastructure: H200 GPU fleet, massive proprietary datasets, access to frontier models (OpenAI, Anthropic, Google, Together.ai) for evaluation and baselines
    • Expert colleagues who have released top Hugging Face models, authored papers at NeurIPS, created well-known benchmarks, and built multiple production AI systems
    • Significant autonomy and freedom to test ideas, experiment with new approaches, and drive technical decisions
    • Modern tooling: Latest ML frameworks, coding assistants, best-in-class development environment
    • Hybrid work model with our Amsterdam office - home to the AI House, bringing together 200+ AI professionals through events, meetups, and startup collaborations
    •  Competitive compensation, top-spec MacBook Pro, and an environment genuinely built for professional growth and learning

    • If you're excited to apply your LLM training expertise to high-impact applications at scale, lead technical initiatives, and grow the next generation of ML engineers, let's talk.
Our Diversity & Inclusion Commitment

We respect the dignity and human rights of individuals and communities wherever we operate in the world. Building an inclusive workplace where everyone feels welcome and can thrive is critical for us. We provide access to education, which helps everyone understand the important role they play and the positive impact they can have.

For a deeper look at our journey and future plans, explore our latest Annual Report. Stay up to date with our latest news to see what makes Prosus stand out. Learn more at www.prosus.com.

If you’re excited to push the boundaries of AI applications — and to see your work make a tangible difference on a global scale—let’s talk.