DISNEY 的 Senior Data Engineer (Machine Learning) 职位 Skip Navigation
EN
The Walt Disney Company: Be Part of the Story

Be Part of the Story

Senior Data Engineer (Machine Learning)

立即申请 稍后申请 职位ID 769974BR 工作地点-国家/地区 华沙, 马索维亚, 波兰 工作发布公司 Disney Streaming Services; 发布日期 Jan. 18, 2021

职位介绍

Machine Learning Engineers in the Disney+ Personalization team specialize in applying machine learning
methods to meet strategic product personalization goals, explore innovative techniques that can be
applied to recommendations and constantly seek ways to optimize operational processes. As a member
of this team, you will collaborate across Data, Product, and Engineering teams to deeply understand
challenges and develop automated solutions to be built into our products.

Responsibilities:

● Process, cleanse, and verify the integrity of data used for model building, experimentation
evaluation and algorithm performance
● Build data pipelines as needed for feature engineering and the use of downstream machine
learning training
● Build and maintain automated machine learning workflows utilizing existing frameworks and
custom in-house designed solutions
● Build, maintain, and optimize machine learning infrastructure (e.g. compute resources,
deployment and performance monitoring tools) for algorithm development, testing, training,
and deployment
● Build experimentation tools for fast model evaluation and hyperparameter optimization
● Instrument tools for continuous integration and deployment of personalization,
recommendation, and other machine learning systems
● Extend existing libraries and frameworks or develop custom ones to facilitate model
development and promote code reusability
● Implement processes and resources (e.g. dashboards, slackbots) for monitoring of offline and
online model performance
● Establish and maintain algorithm development, testing, and deployment standards
● Educate team members and provide guidance on machine learning infrastructure and algorithm
development
● Perform deep-dive analysis on app interactions and events to better understand user
consumption behavior as it relates to personalization and recommendation.
● Collaborate with business stakeholders to help identify and define personalization
opportunities. Work with other data teams to improve our data collection, experimentation and
analysis.

Requirements:

● 5+ years writing production-level, scalable code (Python/Java/Scala)
● 5+ years of work experience using data processing and manipulation libraries and frameworks
(SQL, Spark, Pandas)

● Extensive experience with Amazon Web Services (specifically S3, EC2, RDS, Lambda, Batch, SNS,
SQS, IAM)
● Extensive experience with machine learning libraries (like scikit-learn) and frameworks (like
PyTorch, Keras)
● Experience building and deploying full-stack ML pipelines: data extraction, data mining, model
training, feature development, regression testing, testing, and deployment
● Experience engineering big-data solutions using technologies like EMR, S3, Spark, Databricks
● Experience with automated deployment and continuous integration (GitHub, CI/CD tools,
Docker)
● Very good understanding of data structures, data modeling, and software architecture
● In-depth understanding of modern machine learning concepts, models, and their mathematical
underpinnings
● Familiarity with data exploration and data visualization tools like Looker, Tableau, Chartio, etc.
● Understanding of statistics concepts (e.g., hypothesis testing, regression analysis)
● Ability to gauge the complexity of machine learning problems and a willingness to execute
simple approaches for quick effective solutions as appropriate
● Strong communication skills, as well as written and verbal presentation skills

Preferred Additional Skills:

● Experience with deep learning, NLP, and/or Bayesian modeling
● Experience with hyperparameter tuning methods and frameworks
● Experience with graph databases
● Experience with data workflow managers such as Apache Airflow
● Experience with web UI design and development (SPA, Javascript, backend web frameworks like
Django)
● Familiarity with statistical languages, libraries, and frameworks (R, Shiny)
● Familiarity with metadata management, data lineage, and principles of data governance
● Experience loading and querying cloud-hosted databases such as Snowflake
● Building streaming data pipelines using Kinesis, Kafka, Spark, or Flink


关于 Disney Streaming Services:

迪士尼流媒体服务部负责开发和运营华特迪士尼公司的全球直面消费者视频业务,包括 ESPN+ 优质体育流媒体服务;即将推出的迪士尼视频订阅服务;以及 BAMTECH Media,即直面消费者视频流媒体产品和解决方案的全球领导者。我们的核心使命是让全球观众在任何连接设备、时间或地点都可以自由访问内容。通过卓越的直面消费者视频服务,我们将备受全球观众欢迎的角色、经典故事、传奇运动员和盛大体育赛事呈现给世界各地的观众。我们每天都致力于发挥想象力,以创新技术挑战传统,让消费者能够自由访问内容,摆脱任何连接设备、时间或地点的限制。

关于 The Walt Disney Company:

华特迪士尼公司连同其子公司和关联公司,是一家领先的多元化国际家庭娱乐和媒体公司,拥有以下业务板块:媒体网络、乐园和度假区、影视娱乐、消费产品和互动媒体。它从 20 世纪 20 年代的一家小型卡通工作室,一跃发展成为当今娱乐业中的翘楚,可谓是家喻户晓。迪士尼公司非常荣幸地继续秉承其传统,为每位家庭成员打造世界一流的故事和体验。迪士尼的故事、人物和经历吸引了世界各地的消费者和游客。我们在 40 多个国家及地区经营业务,我们的员工和演职人员携手努力,打造在全世界和当地都备受钟爱的娱乐体验。

此职位隶属于 The Walt Disney Company (Polska ) sp. z o.o.,即我们称为 Disney Streaming Services 的业务部门的一部分。

立即申请 稍后申请