Lead Data Engineer

  • Full-time

Company Description

Blend is a premier AI services provider, committed to co-creating meaningful impact for its clients through the power of data science, AI, technology, and people. With a mission to fuel bold visions, Blend tackles significant challenges by seamlessly aligning human expertise with artificial intelligence. The company is dedicated to unlocking value and fostering innovation for its clients by harnessing world-class people and data-driven strategy. We believe that the power of people and AI can have a meaningful impact on your world, creating more fulfilling work and projects for our people and clients. For more information, visit www.blend360.com

Job Description

We are seeking a highly skilled Lead Data Engineer to join our data engineering team for an on-premise environment. A large portion of your time will be spent in the weeds, working alongside your team on architecting, designing, implementing, and optimizing data solutions. The ideal candidate will have extensive experience in building and optimizing data pipelines, architectures, and data sets, with a strong focus on Python, SQL, Hadoop, HDFS, and Apache NiFi.

What will you be doing?

  • Design, develop, and maintain robust, scalable, and high-performance data pipelines and data integration solutions.
  • Manage and optimize data storage in Hadoop Distributed File System (HDFS).
  • Design and implement data workflows using Apache NiFi for data ingestion, transformation, and distribution.
  • Collaborate with cross-functional teams to understand data requirements and deliver efficient solutions.
  • Ensure data quality, governance, and security standards are met within the on-premise infrastructure.
  • Monitor and troubleshoot data pipelines to ensure optimal performance and reliability.
  • Automate data workflows and processes to enhance system efficiency.

Qualifications

  • Bachelor’s degree in Computer Science, Software Engineering, or a related field.
  • 6+ years of experience in data engineering or a related field.
  • Strong programming skills in Python and SQL.
  • Hands-on experience with the Hadoop ecosystem (HDFS, Hive, etc.).
  • Proficiency in Apache NiFi for data ingestion and flow orchestration.
  • Experience in data modeling, ETL development, and data warehousing concepts.
  • Strong problem-solving skills and ability to work independently in a fast-paced environment.
  • Good understanding of data governance, data security, and best practices in on-premise environments.

Good to Have:

  • Experience with other big data tools such as Spark and Kafka.

Additional Information

What do you get in return?

  • Competitive Salary: Your skills and contributions are highly valued here, and we make sure your salary reflects that, rewarding you fairly for the knowledge and experience you bring to the table.
  • Dynamic Career Growth: Our vibrant environment offers you the opportunity to grow rapidly, providing the right tools, mentorship, and experiences to fast-track your career.
  • Idea Tanks: Innovation lives here. Our "Idea Tanks" are your playground to pitch, experiment, and collaborate on ideas that can shape the future.
  • Growth Chats: Dive into our casual "Growth Chats," where you can learn from the best. Whether it's over lunch or during a laid-back session with peers, it's the perfect space to grow your skills.
  • Snack Zone: Stay fueled and inspired! In our Snack Zone, you'll find a variety of snacks to keep your energy high and ideas flowing.
  • Recognition & Rewards: We believe great work deserves to be recognized. Expect regular Hive-Fives, shoutouts, and the chance to see your ideas come to life as part of our reward program.
  • Fuel Your Growth Journey with Certifications: We’re all about your growth groove! Level up your skills with our support as we cover the cost of your certifications.