AIML - Sr Engineering Program Manager, Evaluation
Job Description
Summary
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It’s the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you’ll do more than join something — you’ll add something.
We are seeking a highly skilled Engineering Program Manager (EPM) to lead user studies, data collection efforts, and evaluations for our AI models across multiple modalities, including text, images, and audio. The successful candidate will play a critical role in designing, implementing, and refining methodologies to assess LLM performance, while also driving user studies and large-scale data collection initiatives to support model development and improvement.
We are seeking a highly skilled Engineering Program Manager (EPM) to lead user studies, data collection efforts, and evaluations for our AI models across multiple modalities, including text, images, and audio. The successful candidate will play a critical role in designing, implementing, and refining methodologies to assess LLM performance, while also driving user studies and large-scale data collection initiatives to support model development and improvement.
Description
This role offers the opportunity to have a significant impact on the quality and effectiveness of our AI models by leading efforts in both evaluation and user research, ensuring we meet the highest standards of performance and user satisfaction.
* Develop, implement, and refine robust evaluation frameworks that align AI model performance with user needs and goals. Collaborate closely with data scientists, engineers, and cross-functional teams to ensure comprehensive assessment across various modalities
* Lead and manage user research programs, leveraging diverse methodologies to gather actionable user insights. Work closely with UX researchers to ensure that user feedback is systematically integrated into model training and evaluation processes
* Design and execute large-scale data collection efforts, ensuring the acquisition of high-quality, diverse datasets critical for AI model development. Coordinate with engineering, operations, and data science teams to streamline data collection workflows
* Partner with key stakeholders, including designers, data scientists, and engineers to ensure that evaluation and data collection efforts are well-integrated into the overall AI model development lifecycle
* Stay at the forefront of AI evaluation and data collection techniques, continuously innovating and refining methodologies to maintain state-of-the-art model performance and user experience
* Develop, implement, and refine robust evaluation frameworks that align AI model performance with user needs and goals. Collaborate closely with data scientists, engineers, and cross-functional teams to ensure comprehensive assessment across various modalities
* Lead and manage user research programs, leveraging diverse methodologies to gather actionable user insights. Work closely with UX researchers to ensure that user feedback is systematically integrated into model training and evaluation processes
* Design and execute large-scale data collection efforts, ensuring the acquisition of high-quality, diverse datasets critical for AI model development. Coordinate with engineering, operations, and data science teams to streamline data collection workflows
* Partner with key stakeholders, including designers, data scientists, and engineers to ensure that evaluation and data collection efforts are well-integrated into the overall AI model development lifecycle
* Stay at the forefront of AI evaluation and data collection techniques, continuously innovating and refining methodologies to maintain state-of-the-art model performance and user experience
Minimum Qualifications
- Proven track record in managing technical programs or projects related to AI/ML or software development
- Demonstrated ability to work with designers, data scientists, engineers, and operations teams
- Skilled in establishing priorities, developing plans, and driving complex projects from concept to completion
- Strong written and verbal communication skills with the ability to present complex technical concepts to all levels of the organization, and to influence stakeholders
Preferred Qualifications
- 7+ years of experience managing projects and programs with demonstrated leadership on larger projects
- Master’s or PhD degree in Computer Science, Data Science, Statistics, or related quantitative field
- Proven experience managing AI-powered solutions, particularly in the evaluation and user study domains