Data Scientist
- Full-time
- Career Site Team: Product Leadership
Job Description
About the job
NielsenIQ Advanced Analytics team develops and promotes a portfolio of global products. Leveraging NielsenIQ’s unparalleled data and global platforms, the Advanced Analytics team creates solutions to enable our clients to make better business decisions every day. We partner with Product Owners and Core Technology Teams to build scalable, always on, analytic products using relevant modeling/ML techniques.
We’re looking for a passionate and talented Sr Data Engineer to join our growing team. In this role, you’ll have the chance to be part of a global analytic team of data scientists and data engineers that are focused on designing and implementing analytic products in a Microsoft Azure environment using Python, Spark and cloud based technologies.
Responsibilities
Design, develop and optimize data pipelines to support analytics products - using AzureML, PySpark and other Python libraries.
Work with data scientists to optimise and scale statistical analysis and model implementations.
Collaborate with product owners, data scientists, and operations personnel to clarify requirements and objectives.
Create and implement QC testing routines and production monitoring processes.
Provide level 2 support for analytics products.
Investigate and implement new software components and techniques as needed.
Become familiar with NielsenIQ data assets
Coach and mentor junior data engineers on the team
About you
You’ve worked with large datasets, complex schemas, and have an emphasis on automation. Ideally you are someone who has a strong interest in implementing data models and data pipelines to build scalable products.
Qualifications:
Degree in computer science or related discipline and 5+ years of experience in big data processing.
Working knowledge of Python, SQL, and ETL
Solid knowledge of SQL databases and development experience, SQL performance monitoring and optimization techniques
Familiarity with the Hadoop ecosystem
A minimum of a year of professional Apache Spark data processing application design and development experience, PySpark preferred, but Java/Scala are also acceptable.
Experience using Pandas, NumPy and Spark ML is desirable
Experience using collaborative development tools such as Git
Experience working in a unix/linux environment
Experience in working on distributed or cloud computing platforms such as Microsoft Azure
Experience with Airflow or equivalent orchestration tool is desirable
Experience implementing machine learning (statistical modelling) models
Working using agile methodology (Scrum) is desired
Familiarity with Azure Storage (ADLS Gen2), Azure Databricks, MLFlow, Presto (Trino) and Snowflake is a plus
Certifications are preferred: Data Engineering on Microsoft Azure, Databricks Certified Associate Developer
Additional Information
About NielsenIQ
NielsenIQ is a global measurement and data analytics company that provides the most complete and trusted view available of consumers and markets worldwide. We provide consumer packaged goods manufacturers/fast-moving consumer goods and retailers with accurate, actionable information and insights and a complete picture of the complex and changing marketplace that companies need to innovate and grow. Our approach marries proprietary NielsenIQ data with other data sources to help clients around the world understand what’s happening now, what’s happening next, and how to best act on this knowledge. We like to be in the middle of the action. That’s why you can find us at work in over 90 countries, covering more than 90% of the world’s population. For more information, visit www.niq.com.
NielsenIQ is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class.