Software Data Engineer - Apple Media Products
Job Description
Summary
Apple is seeking a highly skilled data engineer to join the Data Engineering team within Apple Media Products: Apple Music, App Store, Podcasts, iTunes and more. The team is made up of data-driven engineers who collaborate with product teams to build user-facing features that primarily impact customers worldwide. We are looking for a dedicated engineer who is motivated by challenging problems and enthusiastic about data. This is a unique opportunity to join a focused team and work collaboratively with other groups to create high-impact features.
Description
We are seeking a hardworking, responsible, talented engineer to build features and services that power many Apple products. As a core member of the Data Engineering team, you will be responsible for designing and implementing features that rely on processing and serving very large datasets with an awareness of scalability. This will include crafting systems to model, ingest, process and compute large-scale, critically important data, where high throughput and reliability are essential. You should also understand data well, be able to analyze it, and help product teams come up with creative features. This team has created different types of charts for Apple Music, App Store, Podcasts, Movies, etc. It also generates internal datasets for Siri, search, recommendations, etc. to help those teams deliver better Apple products.
Minimum Qualifications
- Bachelor's degree or equivalent, or higher, in Computer Science or a related field
- Experience with distributed computing technologies such as Hadoop and Spark
- Proficiency in Scala, Java, SQL
- Expertise in designing, implementing and supporting highly scalable data systems and services
- Expertise in building and running large-scale data pipelines, including distributed messaging such as Kafka and data ingestion from various sources to feed batch, near-real-time, or streaming compute components
- Experience with data analysis, including the ability to analyze large-scale datasets
- Solid understanding of data modeling and data architecture optimized for big data patterns, such as efficient storage and querying on HDFS
- Experience with distributed storage and network resources at the level of hosts, clusters and data centers, to troubleshoot and prevent performance issues
Preferred Qualifications
- Experience with Apache Iceberg tables
- Familiarity with Docker and Kubernetes
- Familiarity with Apache Airflow
- Familiarity with Tableau
- Familiarity with data modeling