Senior Data Engineer

  • Full-time

Company Description

About Yassir

 

Yassir is the leading super App for on demand, ride-hailing, last-mile delivery, payment services and more, set to change the way daily services are provided. It currently operates in 45 cities across multiple countries. It has raised $150 million in Series B funding, five times what it raised in its previous priced round last November with world class investors such as BOND and Y Combinator, which is the precursor of the likes of Airbnb, Stripe, Dropbox, Doordash, among others. 

 

We’re not just about serving people - we’re about creating a marketplace to bring people what they need while infusing social values.

 

Job Description

Yassir is seeking an experienced Data Engineer to join our growing team. As a Data Engineer,

you will be responsible for designing and maintaining our data infrastructure, ensuring that data

is stored, processed, and analyzed efficiently and accurately. You will work closely with our data

scientists and software engineers to implement data-driven solutions that improve our operations

and help us achieve our business objectives.

Responsibilities:

  • Design and develop data pipelines, ETL processes, and data warehousing solutions
  • Implement and maintain data integration between multiple systems
  • Collaborate with data scientists and software engineers to implement data-driven solutions
  • Develop and maintain data processing and monitoring tools to ensure data accuracy and integrity
  • Optimize data infrastructure for performance, scalability, and cost-effectiveness
  • Troubleshoot and resolve data-related issues
  • Stay up-to-date with the latest technologies and industry trends in data engineering

 

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • Minimum of 3 years of experience in data engineering or a related field
  • Strong programming skills in Python, Java, or Scala
  • Experience with big data technologies such as Hadoop, Spark, and Kafka
  • Experience with cloud-based data storage and processing (e.g. AWS, Azure, GCP)
  • Proficient in SQL and data modeling
  • Strong problem-solving and analytical skills
  • Excellent communication and collaboration skills
  • Ability to work in a fast-paced environment and handle multiple tasks simultaneously
  • A set of certifications in GCP: Cloud Architect, Data Engineer, Cloud Engineer, ML Engineer (or similar in another cloud vendor).
  • Hands-on experience in MLOps, ML model deployment, governance, and workflow optimization.
  • Hands-on experience in one or various: LightGBM/XGBoost/Sklearn, Numpy/SciPy, Keras/PyTorch/Tabnet, Rust/C++, TensorFlow/Caffe/MXNet, CUDA/Ray, Node.js, REST/GraphQL, Neo4J, Grafana/Datadog, Kafka/Spark/Presto.
  • Proficient in Pyspark and Spark optimizations
  • Strong hands-on experience in writing complex SQL queries
  • Expertise in data modeling
  • Familiarity with data visualization tools, preferably Looker/Superset
  • Work experience in GCP data services and Databricks
  • Strong communication skills