Senior Manager, Data Engineering

  • 505 Penobscot Dr, Redwood City, CA 94063, USA
  • Full-time

Company Description

Guardant Health is a leading precision oncology company focused on helping conquer cancer globally through use of its proprietary blood tests, vast data sets and advanced analytics.  The Guardant Health Oncology Platform is designed to leverage our capabilities in technology, clinical development, regulatory and reimbursement to drive commercial adoption, improve patient clinical outcomes and lower healthcare costs. 

In pursuit of our goal to manage cancer across all stages of the disease, Guardant Health has launched two liquid biopsy-based tests, Guardant360 and GuardantOMNI, for advanced stage cancer patients, and is developing programs for recurrence and early detection, called Project LUNAR. Since its launch in 2014, Guardant360 has been used by more than 9,000 oncologists in the care of more than 120,000 patients and is working with more than 60 biopharmaceutical companies.

Job Description

Senior Manager, Data Engineering and Platform

The Data Platform team provides an enriched and valuable ecosystem of data sources and data services that drive innovation for internal and external systems. This team is dedicated to developing advanced technology (Big Data , Cloud, Machine Learning), systems and services to make data secure, rich, high quality, and fast therefore enabling Guardant the ability to leverage its data assets in an effective and timely manner to maximize technology/business development in the extraordinarily complex oncology diagnostic and therapeutic landscape.

We connect patients with clinical trials, help clinicians order our test and receive our clinical reports, and deliver valuable genomic datasets to researchers to help uncover important insights into treatment paradigms and drug discovery. Our technology stack reflects our views of using the best tools for the job, employing Scala, Java, Python along with Kubernetes, Apache Spark, Presto, Kafka, Docker, MySQL, MongoDB and a variety of AWS services to analyze and disseminate vast volumes of genomic data.

Data Acquisition: Utilize expert coding skills to build real-time distributed and reliable data pipelines that ingest and process data at scale.

Data Architecture: Expertise in designing and building big data systems, data lakes; can translate the needs of the business to productize models and data visualizations into a very functional data architecture; partners with Healthcare Intelligence.

Data Validation / Accuracy: Develop quality checks to ensure data accuracy and integrity; recommend process improvements that enhance data integrity; ensure ongoing data integrity and performs skillful data validation.

Reporting / Analysis: Work independently with senior leaders to tackle complex problems by developing sophisticated, testable hypotheses; presents findings formally to diverse stakeholders and committees; meaningfully identifies opportunities for improvement that result in change.

Display / Visualization: Proficient with data visualization tools; develop visualization concepts; deliver excellent visual storytelling; solve complex technical challenges.

Clinical Data Expertise: Strong analytic resource in clinical subject areas with good understanding of the characteristics of data in sources including the EDW and the Data Lake.

Essential Duties and Responsibilities:

·       Work with all business functions to thoroughly understand the lifecycle and role of every data element in our business

·       Develop a technical strategy and roadmap to continuously deliver data governance, master and transactional data management, and analytics capabilities

·       Play a key role in shaping Guardant’s technical stack, technology choices and investment areas

·       Build the data processing stack and visualization toolset to process and disseminate very interesting human genomics data

·       Hire, coach, and drive an exceptional team of engineers to deliver on the roadmap

·       Own and manage company’s data stores. That is taking the data scalability and data availability to the next level considering the massive amount of genomics data we generate and process

 Qualifications:

  • 10+ years of experience in designing, implementing and operating data warehouses and data lakes, distributed data pipelines and integration architecture on AWS
  • Solid knowledge of data storage technologies (relational, NoSQL), their capabilities and applications
  • Solid hands on experience in building & operating real-time and batch data pipelines and APIs
  • Experience in implementing multiple data aggregation strategy (nightly/ intraday)
  • Hands-on experience with technologies like Snowflake, Redshift, Spark, Hive, Presto, Kafka and Sqoop
  • Understanding of various Visualization platform (Tableau, D3JS, others)
  • Good understanding of building reporting datasets sourcing from Oracle ERP, CRM and Hyperion(fp&a systems) footprint is essential
  • 3+ years of managing software or data engineering teams
  • Experience with application performance monitoring and assessment desired

·        Work collaboratively with business, bioinformatics scientists and translates business requirements into enterprise information architecture

·        Drive the architecture of data integration from various clinical application and stores, research databases and external sources

·        Develop the processes for updating and maintaining terminologies, and vocabularies including mapping from local to international standards when applicable

·        Strong knowledge of statistics, data analysis and databases

  • Flair for data, schema, data model, how to bring efficiency in big data related life cycle
  • Experience with managing data in regulated healthcare environment (HIPAA compliant) is a plus
  • Strong aesthetic sensibility that supports clear visual communication of quantitative information
  • Proficiency with agile or lean development practices
  • Knowledge of healthcare including Clinical terms and concepts is a plus
  • Bachelor’s degree in software engineering, CS, or related area

Additional Qualifications:

You have strong knowledge and experience addressing a broad range of accounting matters, ensuring it is processed in compliance with established internal controls.  You possess analytical skills needed to correctly grasp and communicate, analyze and reconcile accounts; ability to handle confidential and sensitive information with the appropriate discretion; and handle multiple deadlines.

 You are a self-starter, work well as a team player, but can work independently when appropriate. You possess the ability to analyze problems and actively strategize to resolve them, pay attention to detail, and have excellent organization and communication skills. You are results oriented. You can juggle multiple tasks, work cross-functionally and at all levels of the organization, whether internally or externally. You are flexible and comfortable in a dynamic, fast-paced environment and can prioritize to focus on the important, not just the urgent.

Qualifications

Qualifications:

  • 10+ years of experience in designing, implementing and operating data warehouses and data lakes, distributed data pipelines and integration architecture on AWS
  • Solid knowledge of data storage technologies (relational, NoSQL), their capabilities and applications
  • Solid hands on experience in building & operating real-time and batch data pipelines and APIs
  • Experience in implementing multiple data aggregation strategy (nightly/ intraday)
  • Hands-on experience with technologies like Snowflake, Redshift, Spark, Hive, Presto, Kafka and Sqoop
  • Understanding of various Visualization platform (Tableau, D3JS, others)
  • Good understanding of building reporting datasets sourcing from Oracle ERP, CRM and Hyperion(fp&a systems) footprint is essential
  • 3+ years of managing software or data engineering teams
  • Experience with application performance monitoring and assessment desired

·        Work collaboratively with business, bioinformatics scientists and translates business requirements into enterprise information architecture

·        Drive the architecture of data integration from various clinical application and stores, research databases and external sources

·        Develop the processes for updating and maintaining terminologies, and vocabularies including mapping from local to international standards when applicable

·        Strong knowledge of statistics, data analysis and databases

  • Flair for data, schema, data model, how to bring efficiency in big data related life cycle
  • Experience with managing data in regulated healthcare environment (HIPAA compliant) is a plus
  • Strong aesthetic sensibility that supports clear visual communication of quantitative information
  • Proficiency with agile or lean development practices
  • Knowledge of healthcare including Clinical terms and concepts is a plus
  • Bachelor’s degree in software engineering, CS, or related area

Additional Information

Additional Qualifications:

You have strong knowledge and experience addressing a broad range of accounting matters, ensuring it is processed in compliance with established internal controls.  You possess analytical skills needed to correctly grasp and communicate, analyze and reconcile accounts; ability to handle confidential and sensitive information with the appropriate discretion; and handle multiple deadlines.

 You are a self-starter, work well as a team player, but can work independently when appropriate. You possess the ability to analyze problems and actively strategize to resolve them, pay attention to detail, and have excellent organization and communication skills. You are results oriented. You can juggle multiple tasks, work cross-functionally and at all levels of the organization, whether internally or externally. You are flexible and comfortable in a dynamic, fast-paced environment and can prioritize to focus on the important, not just the urgent.

#LI-KH1

Privacy Policy