For Employers

Machine Learning Engineer - AI & ML Evaluation Frameworks

Apple
Cupertino, California, United StatesPosted 4 days ago
Location
Cupertino, California, United States

About the role

The Health Sensing Machine Learning Interpretability & Analytics (MLIA) team ensures clinical rigor and contextual trust are at the foundation of Apple's health sensing features. We are looking for an exceptional ML Engineer to help us build the next generation of scalable evaluation infrastructure and lead rigorous investigations into model performance.

Responsibilities

  • Architect and build large-scale evaluation frameworks to interrogate unimodal ML systems and multi-modal foundation models
  • Lead deep-dive ML evaluations, performing failure analysis to uncover performance gaps, reasoning flaws, and edge cases
  • Translate findings into actionable insights and work directly with algorithm teams to improve the safety and reliability of health features
  • Develop cutting-edge tools, synthetic data pipelines, and automated frameworks that ensure health features are mathematically sound, demographically equitable, and clinically safe
  • Empower teams across Apple to rapidly evaluate multi-modal sensor fusion while upholding Apple's privacy standards

Minimum qualifications

  • BS in Computer Science, Machine Learning, Statistics, or related field
  • 3+ years of experience in ML Engineering or Applied ML
  • Strong experience in evaluating supervised, unsupervised, LLMs and deep learning models
  • Proficiency in Python with the ability to write production-grade code (OOP, CI/CD, Git)
  • Hands-on experience in failure analysis, evaluating LLMs and driving subsequent model improvements
  • Experience building data pipelines, inference frameworks, and automated evaluation systems
  • Strong communication skills to articulate complex technical concepts across technical and non-technical audiences

Preferred qualifications

  • MS/PhD in Computer Science, Machine Learning, Statistics, or related field
  • Experience evaluating LLMs or agentic systems (e.g., LLM-as-a-judge, RAG evaluation)
  • Experience with synthetic data generation and prompt engineering
  • Experience in parallel data processing (Spark, Kubernetes, Airflow) or privacy-preserving ML (Federated Learning)
  • Background in AI Safety, model interpretability, or adversarial testing
  • Interest in digital health and clinical rigor

About Apple

Apple Inc. is a technology company that designs and sells consumer electronics, software, and services. Its core product lines are the iPhone line of smartphones, the iPad line of tablet computers, and the Mac line of personal computers, and it offers its products online and through a chain of retail stores known as Apple Stores. Other products include Apple Watch, Apple TV, and AirPods, along with services and platforms such as iOS, macOS, the App Store, and Apple TV.

Industry
Technology / Consumer electronics and software
Head office
Cupertino, California, United States
Company size
Approximately 164,000 full-time employees worldwide (as of late September 2024)
Founded
1976
iPhone smartphonesMac personal computersiPad tabletsApple WatchApple TVSoftware and services (iOS, macOS, App Store, Apple TV)Apple-designed siliconSpeech, audio, and conversational AI / machine learning
View Apple’s profile →

Interested in this role?

Apply now to join Apple.

Apply for this position

Similar roles

Machine Learning Engineer - AI & ML Evaluation Frameworks

Apply