Data Engineer II

Cambridge, MA
$130,050 - $175,950 / Year 
Contract, C2H (contract-to-hire)
Senior level

Job Description

Site Name: USA - California - San Francisco, Cambridge 300 Technology Square, London The Stanley Building, Seattle Sixth Ave
Posted Date: Jan 16 2024

The Onyx Research Data Platform organization represents a major investment by GSK RD and Digital Tech, designed to deliver a step- change in our ability to leverage data, knowledge, and prediction to find new medicines. We are a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure ,and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward:

  • Building a next-generation data experience for GSK’s scientists, engineers, and decision-makers, increasing productivity and reducing

  • time spent on “data mechanics”

  • Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent

  • Aggressively engineering our data at scale to unlock the value of our combined data assets and predictions in real-time

Data Engineering is responsible for the design, delivery, support, and maintenance of industrialized automated end-to-end data services and pipelines. They apply standardized data models and mapping to ensure data is accessible for end users in end-to-end user tools through the use of APIs. They define and embed best practices and ensure compliance with Quality Management practices and alignment to automated data governance. They also acquire and process internal and external, structured and unstructured data in line with Product requirements.

A Data Engineer II is a technical contributor who can take a well-defined specification for a function, pipeline, service, or other sort of component, devise a technical solution, and deliver it at a high level. They have a strong focus on the operability of their tools and services, and develop, measure, and monitor key metrics for their work to seek opportunities to improve those metrics. They are aware of, and adhere to, best practices for software development in general (and data engineering in particular), including code quality, documentation, DevOps practices, and testing. They ensure the robustness of our services and serve as an escalation point in the operation of existing services, pipelines, and workflows.

A Data Engineer II should be deeply familiar with the most common tools (languages, libraries, etc) in the data space, such as Spark, Kafka, Storm, etc., and aware of the open-source communities that revolve around these tools. They should be constantly seeking feedback and guidance to further develop their technical skills and expertise and should take feedback well from all sources in the name of development.

Key responsibilities for the Senior Data Engineer include:

  • Builds modular code / libraries / services / etc using modern data engineering tools (Python/Spark, Kafka, Storm, …) and orchestration tools (e.g. Google Workflow, Airflow Composer)

  • Produces well-engineered software, including appropriate automated test suites and technical documentation

  • Develop, measure, and monitor key metrics for all tools and services and consistently seek to iterate on and improve them

  • Ensure consistent application of platform abstractions to ensure quality and consistency with respect to logging and lineage

  • Fully versed in coding best practices and ways of working, and participates in code reviews and partnering to improve the team’s standards

  • Adhere to QMS framework and CI/CD best practices

  • Provide L3 support to existing tools / pipelines / services

Why you?

Basic Qualifications:

We are looking for professionals with these required skills to achieve our goals:

  • 4+ years of data engineering experience with a Bachelors degree.

  • 2+ years of data engineering experience with a PhD or a Masters degree.

  • Cloud experience (e.g., AWS, Google Cloud, Azure, Kubernetes)

  • Experience in automated testing and design

  • Experience with DevOps-forward ways of working

Preferred Qualifications:

If you have the following characteristics, it would be a plus:

  • Software engineering experience

  • Demonstratable experience overcoming high volume, high compute challenges

  • Familiarity with orchestrating tooling

  • Knowledge and use of at least one common programming language: e.g., Python (preferred), Scala, Java, including toolchains for documentation, testing, and operations / observability

  • Strong experience with modern software development tools / ways of working (e.g. git/GitHub, DevOps tools, metrics / monitoring, …)

  • Cloud experience (e.g., AWS, Google Cloud, Azure, Kubernetes)

  • Application experience of CI/CD implementations using git and a common CI/CD stack (e.g. Jenkins, CircleCI, GitLab, Azure DevOps)

  • Experience with agile software development environments using Jira and Confluence

  • Demonstrated experience with common tools and techniques for data engineering (e.g. Spark, Kafka, Storm, …)

  • Knowledge of data modeling, database concepts, and SQL

Why GSK?

Our values and expectations are at the heart of everything we do and form an important part of our culture.

These include Patient focus, Transparency, Respect, Integrity along with Courage, Accountability, Development, and Teamwork. As GSK focuses on our values and expectations and a culture of innovation, performance, and trust, the successful candidate will demonstrate the following capabilities:

  • Agile and distributed decision-making – using evidence and applying judgement to balance pace, rigour and risk
  • Managing individual and team performance. Committed to delivering high quality results, overcoming challenges, focusing on what matters, execution.
  • Implementing change initiatives and leading change.
  • Sustaining energy and well-being, building resilience in teams.
  • Continuously looking for opportunities to learn, build skills and share learning both internally and externally.
  • Developing people and building a talent pipeline.
  • Translating strategy into action - a compelling narrative, motivating others, setting objectives and delegation.
  • Building strong relationships and collaboration, managing trusted stakeholder relationships internally and externally.
  • Budgeting and forecasting, commercial and financial acumen.



The annual base salary for new hires in this position ranges from $130,050 to $175,950 taking into account a number of factors including work location, the candidate’s skills, experience, education level and the market rate for the role. In addition, this position offers an annual bonus and eligibility to participate in our share based long term incentive program which is dependent on the level of the role. Available benefits include health care and other insurance benefits (for employee and family), retirement benefits, paid holidays, vacation, and paid caregiver/parental and medical leave.

Please visit GSK US Benefits Summary to learn more about the comprehensive benefits program GSK offers US employees.

Why Us?

GSK is a global biopharma company with a special purpose – to unite science, technology and talent to get ahead of disease together – so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns – as an organization where people can thrive. Getting ahead means preventing disease as well as treating it, and we aim to positively impact the health of 2.5 billion people by the end of 2030.

Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it’s also about making GSK a place where people can thrive. We want GSK to be a workplace where everyone can feel a sense of belonging and thrive as set out in our Equal and Inclusive Treatment of Employees policy. We’re committed to being more proactive at all levels so that our workforce reflects the communities we work and hire in, and our GSK leadership reflects our GSK workforce.

If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-877-694-7547 (US Toll Free) or +1 801 567 5155 (outside US).

GSK is an Equal Opportunity Employer and, in the US, we adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class.

Important notice to Employment businesses/ Agencies

GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.

Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit GSK’s Transparency Reporting For the Record site.

Ref #
30+ days ago
Last updated 30+ days ago

Stay Inspired!
Join other developers and designers who have already signed up for our mailing list.
Terms     Privacy     Cookies       Do Not Sell       Licensing      
Made with    in Austin, Texas.  - vsn 44.0.0
© Data & Object Factory, LLC.