AI Engineer (Audio)
White Circle
Job Description
TLDR: Audio / Multimodal ML Engineer to train and ship speech, audio and multimodal models for an AI safety platform that operates at 100M+ API calls/month.
About us
White Circle is an AI Safety company building the safety, reliability, and optimization layer for AI systems. At the core of our platform are policies β simple natural-language rules that define what an AI model should and shouldnβt do. We automatically test, enforce, and continuously improve these policies at scale.
- Weβve raised $11M from top funds, founders, and senior leaders at OpenAI, Anthropic, HuggingFace, Mistral, DeepMind, Datadog, Sentry, and others
- We process over one hundred million API calls every month
- We fine-tune and train our own LLMs so they run faster and cheaper than any open or proprietary model
Weβre a small, highly focused team. If you want to work deeply on hard problems, see your work ship to production quickly, and influence how AI safety is actually built β youβre the one we need.
You will:
- Train and fine-tune large-scale audio and multimodal models from scratch and from pretrained checkpoints
- Design and run experiments: architecture changes, data mixes, training recipes
- Build and maintain audio data pipelines β from raw recordings to training-ready datasets
- Optimize models for production: quantization, distillation, streaming inference
- Deploy models end-to-end: from research checkpoint to low-latency serving
- Collaborate with research to turn experimental ideas into shippable features
- Define evaluation metrics and benchmarks that actually matter for the product
Youβll fit right in if you:
- 3+ years of experience training large-scale deep learning models in audio, speech, or acoustic domains
- Strong hands-on experience with PyTorch, distributed training (DeepSpeed, FSDP, or similar)
- Familiarity with audio/speech architectures (Audio Qwen, Whisper, HuBERT, Conformer, or similar)
- Experience with vision-language and multimodal architectures (Audio Flamingo, Omni Qwen, or similar)
- Track record of shipping models to production: you've hit latency targets, not just accuracy benchmarks
- Comfortable working with large-scale audio data pipelines: preprocessing, augmentation, dataset curation
- Understanding of audio signal processing fundamentals: spectrograms, mel features, noise reduction
- Experience with SFT, DPO, GRPO or other alignment techniques β ideally in multimodal setting
- Strong engineering fundamentals: clean code, version control, testing, documentation
Why White Circle
- Salary of $100,000 to $250,000 + equity
- 20 days of paid vacation
- Work from Paris (hybrid) + relocation package
- Best medical insurance in France
- All the hardware, tools, and services you need
- Covered subscriptions for AI agents and IDEs
- Team off-sites twice a year: weβve recently been to the Alps and to Saint-Tropez
How we hire
- Intro call with one of our colleagues
- Π‘omplete the take-home assignment
- Show your best during the technical interview
- Final call with our CEO and CTO
Please submit your application in English - itβs our company language so youβll be speaking lots of it if you join