PySpark Developer

  • Contract

Company Description

Centraprise is providing traditional staffing services, professional and technical staffing and management services to some of the country's leading companies with highest quality of service.
 

Due to our expertise across multi-platform technologies and skill-sets, Centraprise provides services to a wide spectrum of customers across verticals such as Banking, Financial Services, Healthcare, Human Resources, Telecom, Insurance, Hospitality, Retail & Distribution and Manufacturing. Serving multinational customers, Centraprise Inc has gained vast experience and competence to deliver quality services at competitive prices.

Job Description

Primary Skillset: Python PySpark with ETL AWS background.

 

·         6 years working experience in data integration and pipeline development.

·         BS degree in CS CE or EE

·         2 years of Experience with AWS Cloud on data integration with Apache Spark EMR Glue Kafka Kinesis and Lambda in S3 Redshift RDS MongoDB Dynamo DB ecosystems.

·         Strong real life experience in python development especially in pySpark in AWS Cloud environment.

·         Design develop test deploy maintain and improve data integration pipeline.

·         Experience in Python and common python libraries  Strong analytical experience with database in writing complex queries query optimization debugging user defined functions views indexes etc.

Strong experience with source control systems such as Git Bitbucket and Jenkins build and continuous integration tools Databricks or Apache Spark Experience is a plus

Qualifications

PySpark Developer

Additional Information

All your information will be kept confidential according to EEO guidelines.