Unity logo

Senior Machine Learning Engineer, On-Device & Mobile AI Optimization

Unity
San Francisco, CA, USAMountain View, CA, USA
Posted 3 days ago
Last seen 23 hours ago
Active
full-time
AI & Machine Learning
$188,200 - $282,200 USD

Job Summary

Join Unity as a Senior Machine Learning Engineer to revolutionize AI-driven game experiences by optimizing state-of-the-art generative models for lightning-fast, on-device performance across billions of mobile and constrained devices. This hands-on role offers the chance to deeply impact player experience by making cutting-edge AI run efficiently where it matters most.

About this job

As a Senior Machine Learning Engineer for On-Device & Mobile AI, you will optimize and deploy state-of-the-art multi-modal models like transformers and diffusion networks to run fast, small, and reliably on mobile and constrained hardware. This deeply hands-on role involves owning the inference stack from research checkpoints to shipped features, directly impacting the latency, quality, memory, and battery profile of AI experiences for billions of players.

Requirements

- 5+ years in software/ML engineering, with meaningful time focused on on-device / edge inference or real-time, performance-critical systems. - Production deployment of transformer- and/or diffusion-based models (e.g., ViT, Stable Diffusion, CLIP/SigLIP-style encoders) on mobile, desktop, or embedded hardware — shipped, not just prototyped. - Hands-on experience with at least one major inference runtime (ONNX Runtime / ORT Web, CoreML, TFLite, ExecuTorch) and a working understanding of operator fusion, memory layout, and runtime scheduling. - Low-level performance engineering: solid command of at least one GPU/compute API — WebGPU/WGSL, Metal, Vulkan, D3D12, or CUDA — and the profiling tools to go with it. You can read a frame capture and a kernel trace and reason about where the time and memory go. - Working knowledge of model-optimization techniques — quantization (INT4/INT8/FP16), weight sharing, pruning, and distillation — and the judgment to apply them to hit latency and memory budgets. You use them effectively as engineering tools. - Understanding of target hardware: mobile SoCs (Apple Neural Engine, Qualcomm Hexagon/Adreno, ARM Mali) and/or desktop/laptop GPUs (Apple Silicon, NVIDIA, AMD, Intel). - Strong Python for export pipelines and training-side tooling; familiarity with the core languages of a browser-native runtime (TypeScript/JavaScript, WGSL) is a plus. - Working fluency with the models you deploy — enough to read an architecture, modify it for deployment, and reason about accuracy trade-offs. - A collaborative working style: clear communication, reliable delivery, and a willingness to support and learn from teammates.

Benefits & Perks

- Comprehensive health, life, and disability insurance - Commute subsidy - Employee stock ownership - Competitive retirement/pension plans - Generous vacation and personal days - Support for new parents through leave and family-care programs - Office food snacks - Mental Health and Wellbeing programs and support - Employee Resource Groups - Global Employee Assistance Program - Training and development programs - Volunteering and donation matching program

Apply for this position

Apply Now

You'll be redirected to the company's application page to complete your application.

ManaBoard LogoManaBoard.io

The #1 platform for finding high-quality job postings in the gaming industry. Connect with top studios and talent.

Stay Updated

Get gaming job alerts and industry insights delivered to your inbox.

By subscribing, you agree to receive our newsletter and occasional updates. You can unsubscribe at any time.

Disclaimer: ManaBoard is an independent platform. Job listings and logos are sourced from public career pages and remain the property of their respective owners.

© 2026 ManaBoard. All rights reserved.