Junior Data Engineer

  • New York, NY

Job Description


The Junior Data Engineer will build our distributed processing and big data infrastructure, as well as the applications and tools that run on top of it. The right candidate has a passion for solving difficult problems, enjoys working with a team of really smart people, is constantly learning new things, and considers datasets of a few billion rows and a couple of terabytes to be “small”. You will work extensively with Scala, the Hadoop and Spark APIs, and SQL.


Responsibilities


Assist with the development and testing of data applications that use HDFS, Hadoop, and Spark


Work with senior team members to deploy and test applications at scale on 100+ node computing clusters


Write and optimize SQL queries


Construct data pipeline workflows and schedules


Qualifications & Experience


1-3 years of software engineering experience


SQL


Databases


Preferred Qualifications


Java


Hadoop


Git/GitHub


Testing


Scala


Akka


Spark


Soft Skills


Strong teamwork and collaboration skills


Excellent written and oral communication skills


Strong desire to learn

Additional Information

All your information will be kept confidential according to EEO guidelines.