Data Scientist

  • 375 Hudson Street, New York, NY
  • Employees can work remotely
  • Full-time

Job Description

We are looking for a talented Data Scientist for an exciting opportunity on the Decisioning team. You would be involved with designing a new workflow for data science and analytics. Candidates considered based on their ability to productionize highly parallelized scripts related solutions anticipating behavior and deriving insights from big data. As well as their ability to manage research projects resulting in real world project and data pipelines that support data products. 

Responsibilities: 

  • Ability to prototype and deploy production worthy supervised and unsupervised machine learning frameworks
  • Design, implement and maintain high performance data infrastructure/systems, data processing pipelines (ETL management, Luigi, Airflow) - PySpark or Scala 
  • Work with Ops for AWS infrastructure optimization/tuning 
  • Work with team to design and develop APIs 
  • Productionize and maintain PySpark code base (PySpark) 
  • Work with DS on bug fixes, client enhancement requests and code optimization 
  • Write PySpark/Scala applications for data processing and engineering 
  • Productionize scalable and distributed scripts techniques across a variety of big data  
  • Processing, cleansing, and verifying the integrity of data used for analysis 

Qualifications

  • Enjoys being challenged and solve complex problems on a daily basis 
  • Excellent oral and written communication skills 
  • Ability to work in teams and collaborate across multiple groups/disciplines 
  • Communicate concisely and persuasively with engineers and product managers 

Additional Information

All your information will be kept confidential according to EEO guidelines.

Privacy Policy