(Permanent Job) Senior Data Engineer with Big Data Eco-Systems exp

  • Campus Dr, Irvine, CA, USA
  • Full-time

Job Description

This is our direct-client opening for an Sr. Data Engineer located in Temecula, CA. This is a full-time Permanent position 

The Sr. Data Engineer must be able to design, build and maintain Enterprise Level Data Pipe-Lines utilizing the tools available within Big Data Eco-System. 

KEY RESPONSIBILITIES 

Design, build, and deploy new data pipelines within Big Data Eco-Systems
Improve existing data pipelines by simplifying and increasing performance
Follow best practices on design and implementation to aid in company-wide data governance
Work closely with the data analysts and scientists, and database and systems administrators to create data solutions.
Documents new/existing pipelines, Data Sets and Data Sets lineage.
Abides by department development standards and SOP's.

EXPERIENCE/TRAINING/EDUCATION:

Bachelor's degree (B. A.) from four-year college or university from related field is required. 
COMMUNICATION SKILLS: This position requires the ability to read and interpret documents such as safety rules, operating and maintenance instructions, and procedure manuals. This position also requires the ability to write routine reports and correspondence. The ability to speak effectively before groups of customers or employees of the organization is required as well. 
MATHEMATICAL SKILLS: This position requires the ability to work with mathematical concepts such as probability and statistical inference, and fundamentals of plane and solid geometry and trigonometry. This position also requires the ability to apply concepts such as fractions, percentages, ratios, and proportions to practical situations. 
REASONING ABILITY: This position requires the ability to define problems, collect data, establish facts, and draw valid conclusions. This position also requires the ability to interpret an extensive variety of technical instructions in mathematical or diagram form and deal with several abstract and concrete variables. 

SKILLS/ABILITIES: 
Linux 
Hadoop (hdfs, hive, spark, sqoop) 
SQL and HiveQL 
noSQL (Mongodb, Cassandra, or Couchbase) 
Python or Scala 
Bonus: Kafka, Kinetica, Streamsets

Additional Information

All your information will be kept confidential according to EEO guidelines.