EURAXESS

Data Engineer

21 Nov 2022

Job Information

Organisation/Company
Centre For Genomic Regulation
Research Field
Computer science
Researcher Profile
First Stage Researcher (R1)
Country
Spain
Application Deadline
Type of Contract
Other
Job Status
Full-time
Hours Per Week
40
Is the job funded through the EU Research Framework Programme?
Not funded by an EU programme
Is the Job related to staff position within a Research Infrastructure?
No

Offer Description

The Institute

The Centro Nacional de Análisis Genómico (CNAG-CRG) is one of the largest Genome Sequencing Centres in Europe.

With the increasing demand for genomic tests for rare diseases, cancer and other diseases, genomic data management, analysis and interpretation have become a real bottleneck in healthcare systems. The Bioinformatics Unit at CNAG-CRG designs and develops innovative platforms and solutions to analyse large amounts of genomic and clinical data, with the ultimate goal of improving the implementation of data-driven, high-quality personalised medicine in the healthcare system. Resources developed by the Unit are currently used by hundreds of clinical researchers in Europe and are part of major international genomic initiatives such as the Global Alliance for Genomics and Health (GA4GH), the European Infrastructure for life-science information (ELIXIR) and the 1+ Million Genomes Initiative (1+MG). The Unit includes bioinformaticians, engineers, software developers and biologists highly experienced in genomic data management, analysis and interpretation.

It is integrated with the Centre for Genomic Regulation (CRG), an international biomedical research institute of excellence, based in Barcelona, Spain, with more than 400 scientists from 44 countries. The CRG is composed of an interdisciplinary, motivated and creative scientific team, which is supported both by a flexible and efficient administration and by high-end, innovative technologies.

In April 2021, the Centre for Genomic Regulation (CRG) received the renewal of the 'HR Excellence in Research' logo from the European Commission. This is a recognition of the Institute's commitment to developing an HR Strategy for Researchers, designed to bring the practices and procedures in line with the principles of the European Charter for Researchers and the Code of Conduct for the Recruitment of Researchers (Charter and Code).

Please check out our Recruitment Policy.

 

The role

We have an opening for a Data Engineer to play a key role in Instand-NGS4P (https://www.instandngs4p.eu/). The aim is to develop a standardised Next Generation Sequencing (NGS) workflow, from NGS data analysis to medical decision-making, for common and rare adult and paediatric cancers. The workflow will leverage, among others, the current RD-Connect Genome Phenome Analysis Platform (https://platform.rd-connect.eu/), with the aim of covering data management, clinical and genome data integration, genome analysis pipelines, variant annotation, interpretation and reporting. Under the supervision of the lead of the Data Platforms and Tools Development team, and in collaboration with cancer specialists, bioinformaticians and software engineers, the successful candidate will implement the data infrastructure and back-end of the cancer platform.

The successful candidate's responsibilities include:

  1. Implement pipelines in Apache Spark
  2. Integrate pipelines into workflow management systems such as Jenkins Pipeline or Nextflow
  3. Collaborate with back-end developers and bioinformaticians to integrate data into the cancer platform
  4. Implement and improve queries in SQL (Postgres) and NoSQL databases (Elasticsearch, MongoDB, etc.)
  5. Gather and address technical and design requirements
  6. Follow emerging technologies

 

About the team

The successful candidate will join the Data Platforms and Tools Development team, coordinated by Dr. Davide Piscia (https://www.cnag.crg.eu/teams/bioinformatics-unit/data-platforms-and-too...). The team is part of the CNAG-CRG Bioinformatics Unit (led by Dr. Sergi Beltran), which has over 30 members and offers continuous growth and support on a professional level. The team works in a stimulating scientific environment, applying state-of-the-art technologies to breakthrough research projects in Genomics that have an impact on people’s health.

Requirements

Research Field
Computer science
Education Level
Undergraduate
Skills/Qualifications

Whom would we like to hire?  

Must Have 

  • At least 1 year of experience in a software-related position, preferably as a Data Engineer.

  • Hands-on experience with programming languages such as Python, Scala, Java or similar

  • Understanding of pipeline orchestration  

  • Knowledge of distributed computing (Apache Spark, Apache Flink or similar)  

  • Experience with source control systems such as Git

Nice to have 

  • Experience with genomics and clinical data 

  • Experience with workflow orchestrators (Jenkins Pipeline, Nextflow, Airflow, Prefect, Snakemake, etc.)

  • Experience with databases (Postgres, Elasticsearch, Cassandra, etc.) 

  • Experience with data pipeline testing  

Education and training 

  • Bachelor's or Master's degree in Computer Science or a related field

Languages 

  • Good spoken and written English  

Competences 

  • Good organisational, prioritising, communication and interpersonal skills. 

Languages
ENGLISH
Level
Good

Additional Information

Benefits

We provide a highly stimulating environment with state-of-the-art infrastructure and unique professional career development opportunities. To check out our training and development portfolio, please visit the training section of our website.

We offer and promote a diverse and inclusive environment and welcome applicants regardless of age, disability, gender, nationality, ethnicity, religion, sexual orientation or gender identity.

The CRG is committed to reconciling the work and family life of its employees and offers an extended vacation period and the possibility of flexible working hours.

Eligibility criteria

All applications must include: 

  1. A complete CV including contact details.  

  2. A motivation letter addressed to Dr Davide Piscia will be highly valued. 

 

All applications must be addressed to Human Resources and be submitted online on the CRG Career site - https://recruitment.crg.eu/content/jobs/position/data-engineer-2 

 

Selection process
  • Pre-selection: The pre-selection process will be merit-based, drawing on the qualifications and expertise reflected in the candidates' CVs.
  • Interview: Preselected candidates will be interviewed by the Hiring Manager for the position and, if required, a selection panel.
  • Offer Letter: Once the successful candidate is identified, the Human Resources department will send a Job Offer specifying the start date, salary and working conditions, among other important details.

 

Additional comments

Suggestions: The CRG believes in ongoing improvement and promotes a culture of feedback. For this reason, we have put in place a mechanism for candidates to submit suggestions or complaints about their experience of our recruitment processes. Your feedback really matters to us in our aim of creating a positive candidate journey. You can make a difference and help us improve by letting us know your suggestions through the following form.

Work Location(s)

Number of offers available
1
Company/Institute
CNAG-CRG
Country
Spain
State/Province
Barcelona
City
Barcelona
Postal Code
08028
Street
c/ Baldiri Reixac, 4 Barcelona Science Park - Tower I 08028 Barcelona, Spain

Contact

State/Province
Barcelona
City
Barcelona
Website
Street
Dr. Aiguader, 88
Postal Code
08003
E-Mail
rrhh@crg.eu