Function Tech Lead – Data Engineering

Job Description


The Position

We advance science so that we all have more time with the people we love.

Genentech Research & Early Development (gRED) Computational Sciences (gCS) is on a mission to partner across the organization to realize the potential of data, technology and computational approaches that will revolutionize how targets and therapeutics are discovered and developed, ultimately enabling novel treatments for patients across the world. We stand at the beginning of an exciting journey.

The Computational Catalysts group within gCS is a diverse, curious and action-driven team at the intersection of computation, engineering and science, with the ambition to advance our technical excellence. The team focuses on partnering with the informatics and scientific communities to create a computational and data ecosystem that powers scientific discovery and accelerates decision making. We aim to modernize our ability to acquire, store, link, share, find and analyze data across the organization through scalable and integrated solutions that truly make every data point count. Reporting directly to the Executive Director, Data Solutions and Engineering, this Function Tech Lead will play a key role in defining and executing the strategy for the Data Fabric for this newly created group.

The Data Solutions and Engineering group within Computational Catalysts is accountable for establishing a common Data Fabric that connects our systems, specifically our data pipelines and applications for data acquisition, collection, storage, transformation, linkage and sharing. This team strives to build delightful applications and systems for our stakeholders with a strategic mindset. The team is responsible for end-to-end product lifecycle management, and its work is leveraged downstream to build key scientific insights and enable our ML/AI workflows and models. The Function Tech Lead for Data Engineering will work closely with Software Engineers, Product and Tech/ML Ops.

The Function Tech Lead for Data Engineering is an experienced and highly credible visionary and technical leader with the proven ability to deliver, form collaborative partnerships and foster an engaging team culture built on trust. You will be responsible and accountable for designing and helping implement the data fabric that underpins the flow of data within the Computational Catalysts function. You will contribute substantially to building common data solutions and frameworks that can be leveraged across multiple initiatives. You will be responsible for setting standards that will be adopted across Catalysts and the broader organization. The Function Tech Lead will have influence over a number of outstanding data and software engineers, product managers, DevOps and MLOps who are building data solutions.

You will have deep expertise and hands-on experience in data and software engineering and be familiar with modern and cutting-edge approaches, with experience in managing data flows in high performance computing environments. You will have an understanding of how to build flexible, robust and extensible data pipelines that exemplify industry best practices, minimizing manual interventions, ensuring robustness and scalability, and avoiding technical debt. You will be responsible for driving solutions that ensure data align with FAIR Principles and can be collected effectively and flow seamlessly into a variety of different downstream applications, such as large scale machine learning models, including foundational models. You will collaborate with leaders in other parts of the Roche Group (e.g. pRED and Product Development) to identify enterprise-wide opportunities and best practices. You will help break down silos between efforts and foster collaboration to accelerate common solutions.

The Function Tech Lead will have familiarity with a variety of database and data analytics systems and can analyze pros and cons in depth and propose future-proof, scalable, reliable, performant and robust solutions. You are passionate about learning in general and about newer technologies in particular. You are skilled at and passionate about mentoring and coaching on diverse topics, especially technical ones. You will make decisions related to technical roadmaps, solutions and capability development and will be expected to make key contributions to the overall success of gCS.

The Opportunity:
  • Provide technical leadership around Data Engineering for the Data Solutions and Engineering group and more broadly as appropriate
  • Build the Data Fabric strategy for the Catalysts organization in close collaboration with partners and stakeholders
  • Drive alignment and own completion of a common API layer for the Data Fabric
  • Drive consolidation and deprecation of legacy tools, solutions and systems and reduction of tech debt
  • Identify key trends, technologies and methodologies and influence their adoption by taking an open-source-focused, cloud-first, API-first and AI-first approach
  • Identify common patterns to break data silos while maintaining FAIR practices
  • Learn, deeply understand and ultimately improve our Data Ecosystem across structured and unstructured data which powers our systems
  • Ensure our technical choices and solutions are innovative, best-in-class and integrated by delivering data flows and pipelines within and across gCS, Research Biology, Drug Discovery, Translational Medicine, Development Sciences and beyond
  • Understand and influence technical decisions around data acquisition, collection, storage, transformation, linkage and sharing while working collaboratively with our key partners
  • Deliver on the goal of bringing diverse sets of data together to support a wide range of activities such as AI/ML, search, reporting, and analytics
  • Build a strong and collaborative community of Data Engineers with a strong focus on mentoring, standardization and best practices (CI/CD, coding standards, code reviews, testing and more)
  • Facilitate the implementation of large-scale models that take advantage of advances in machine learning and artificial intelligence
  • Establish and foster strong internal and external partnerships and relationships with leaders and stakeholders in Computational Catalysts, gCS and beyond
  • Lead by example to establish and demonstrate the culture and working environment of this new organization aligned with our gCS values: impact, collaboration, diversity, scientific excellence and curiosity

Who You Are:
  • Bachelor’s degree in Computer Science or a similar technical field (Master’s degree or higher preferred) and 12 years of experience in software engineering
  • Experience in data solutions architecture and with building complex data fabric/data mesh and data engineering solutions
  • Experience building solutions leveraging industry leading OLTP and/or OLAP systems
  • Familiarity and experience with concepts and technologies such as SQL, NoSQL, ETL, ELT, Data Lakes, Event Streaming, Data Fabric/Data Mesh, Elasticsearch, GraphQL, DevOps and MLOps
  • Hands-on experience with programming languages such as Python or Java
  • Experience designing data products that are highly reliable, scalable, performant, secure and robust, ideally on a public cloud platform
  • Takes an Open Source, Cloud First, API First, AI First approach towards problem solving
  • Experience technically leading large and complex projects, involving multiple teams and stakeholders and achieving outstanding results in a timely and efficient manner
  • Experience influencing, motivating and aligning others towards common technical decisions, leveraging shared ownership as appropriate
  • Highly collaborative, with the ability to build trusted partnerships with internal and external stakeholders
  • Ability to think strategically and optimize for the long term while acting with a sense of urgency
  • Cares about product excellence and building highly usable solutions
  • Experience building software standards and best practices that can be leveraged by the broader organization, and experience reducing tech debt and consolidating and deprecating legacy solutions
  • Acts as a mentor and coach for others and has strong oral and written communication skills

Relocation benefits are available for this job posting.

The expected salary range for this position, based on the primary location of California, is $181,900 – $337,700. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided below.

Genentech is an equal opportunity employer, and we embrace the increasingly diverse world around us. Genentech prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin or ancestry, age, disability, marital status and veteran status.