Boston Red Sox
POSITION OVERVIEW: The Data Engineer, Baseball Systems position will be a member of the baseball operations software development team, and is responsible for integrating, collecting, processing, and storing many sources of baseball data, as well as designing and building new data solutions. This position must be comfortable with on-premises and cloud solutions, and take the initiative to explore new optimizations and cutting-edge data technologies. This individual will work closely with our data architect, analysts, developers, and other members of baseball operations.
Build leading-edge baseball solutions together with the software development team, analysts, and others on new and existing baseball systems Build and maintain integration pipelines, often via an API or file-based, while also identifying areas of improvement and spending time to re-architect when required Build and maintain infrastructure to optimize extraction, transformation, and the loading of data from various sources Design, build, and maintain data warehousing solutions for the software development and analytics teams Build and maintain tools for the analysts to enable more efficient and extensive data modeling and simulation efforts Participate in key phases of the software development process of critical baseball applications, including requirements gathering, analysis, effort estimation, technical investigation, software design and implementation, testing, bug fixing, and quality assurance Actively participate with software developers and data architects in design reviews, code reviews, and other best practices Work closely at times with baseball analysts to design and implement data solutions Respond to and resolve technical problems and issues in a timely manner
TECHNICAL SKILLS & QUALIFICATIONS:
Bachelor’s degree in Computer Science, Software Engineering, Computer Engineering, Statistics, Information Systems, or a related field 2-3 years of experience in a Data Engineer role Strong SQL skills, including query optimization and database design Experience building custom API integrations, interfacing with JSON, XML, and custom data structures Experience with AWS, GCP, or Azure cloud services, such as Cloud SQL, RDS, Redshift, Azure SQL, Azure SQL DW, or others Experience building data solutions using Python, C#, C++, Ruby, or other languages Experience with scheduling and workflow management platforms, such as Airflow Experience with big data frameworks such as Hadoop or Spark is a plus Experience with R and RStudio is a plus Experience with engineering and productionizing statistical/ML models a plus Professional experience as an analyst/data scientist (or extensive coursework) a plus
GENERAL SKILLS: Ability to work autonomously and as a team in a fast paced environment High level of attention to detail with the ability to multi-task effectively Comfortable working remotely using Zoom, Teams, Slack, Trello, and other tools to communicate with all team members High degree of professionalism and ability to maintain confidential information Excellent organizational and time management skills An understanding of baseball is a plus