AIML - Machine Learning Engineer, Data & Machine Learning Innovation
Job Description
Summary
As part of Apple's AI and Machine Learning org, we inspire and create groundbreaking technology for large language models, multi-modal models, with strong agent and reasoning capabilities. The Data and Machine Learning Innovation (DMLI) team is looking for a passionate Machine Learning Engineer to explore new methods, challenge existing metrics or protocols, and develop new insightful practices that will change how we understand data and overcome real-world ML challenges.
As a team member, you will work on some of the most ambitious technical challenges in the field. Your role will involve collaborating closely with our team of machine learning researchers, engineers, and data scientists. Together, you will spearhead groundbreaking research initiatives and develop transformative products designed to create a significant impact for billions of users worldwide.
As a team member, you will work on some of the most ambitious technical challenges in the field. Your role will involve collaborating closely with our team of machine learning researchers, engineers, and data scientists. Together, you will spearhead groundbreaking research initiatives and develop transformative products designed to create a significant impact for billions of users worldwide.
Description
As a Machine Learning (ML) Engineer, you will be entrusted with the critical role of innovating and applying state-of-the-art research in foundation models to tackle complex data problems. The solutions you develop will significantly impact future Apple products and the broader ML development ecosystem.
You will work with a multidisciplinary team to actively participate in the data-model co-design and co-development practice. Your responsibilities will extend to the design and development of a comprehensive data generation and curation framework for foundation models at Apple. You will also be responsible to create robust model evaluation pipelines, integral to the continuous improvement and assessment of foundation models. Additionally, your role will entail an in-depth analysis of multi-modal data to underscore its influence on model performance.
Furthermore, you will have the opportunity to showcase your groundbreaking research work by publishing and presenting at premier academic venues.
Your work may span a variety of applications, including but not limited to:
- Enhancing current products and future hardware platforms with multi-modal perception data
- Designing and implementing semi-supervised, self-supervised representation learning techniques for maximizing the power of both limited labeled data and large-scale unlabeled data.
- Developing on-device intelligence and learning with strong privacy protections
- Employing data selection techniques such as novelty detection, active learning, and core-set selection for diverse data types like images, 3D models, natural language, and audio.
- Uncovering patterns in data, setting performance targets, and leveraging modern statistical and ML-based methods to model data distributions. This will aid in reducing redundancy and addressing out-of-distribution samples.
You will work with a multidisciplinary team to actively participate in the data-model co-design and co-development practice. Your responsibilities will extend to the design and development of a comprehensive data generation and curation framework for foundation models at Apple. You will also be responsible to create robust model evaluation pipelines, integral to the continuous improvement and assessment of foundation models. Additionally, your role will entail an in-depth analysis of multi-modal data to underscore its influence on model performance.
Furthermore, you will have the opportunity to showcase your groundbreaking research work by publishing and presenting at premier academic venues.
Your work may span a variety of applications, including but not limited to:
- Enhancing current products and future hardware platforms with multi-modal perception data
- Designing and implementing semi-supervised, self-supervised representation learning techniques for maximizing the power of both limited labeled data and large-scale unlabeled data.
- Developing on-device intelligence and learning with strong privacy protections
- Employing data selection techniques such as novelty detection, active learning, and core-set selection for diverse data types like images, 3D models, natural language, and audio.
- Uncovering patterns in data, setting performance targets, and leveraging modern statistical and ML-based methods to model data distributions. This will aid in reducing redundancy and addressing out-of-distribution samples.
Minimum Qualifications
- Demonstrated expertise in computer vision, natural language processing, and machine learning with a passion for data-centric machine learning.
- Strong programming skills and hands-on experience using the following languages or deep learning frameworks: Python, PyTorch, or Jax.
- 5+ years of experience with developing and evaluating ML applications, and demonstrated experience in understanding and improving data quality.
Preferred Qualifications
- Deep understanding in multi-modal foundation models.
- Staying on top of emerging trends in generative AI and multi-modal LLM.
- Demonstrated publication record in relevant conferences (e.g. CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, , etc) is a plus.
- Strong problem-solving and communication skills
- Apple is an Equal Opportunity Employer that is committed to inclusion and diversity. We also take affirmative action to offer employment and advancement opportunities to all applicants, including minorities, women, protected veterans, and individuals with disabilities. Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicants.