Data Engineer


Amsterdam, The Netherlands

Posted on: 2nd July 2018

The world’s most cutting-edge scientific breakthroughs are powered by data. And we’re using the latest technologies to build advanced capabilities into the world’s largest repository of scientific, medical and technological data. It’s complex, fascinating work. And it will be of tremendous use to the world’s academic community as they pursue the research that will benefit all of humanity.

We need a crack Data Engineer, who can prepare, integrate and normalize data, in order to design and create high quality data sets. These data sets will form the foundation for new applications, powered by machine learning, which will make the world’s scientific knowledge more accessible, more searchable and more actionable.

Stack Overview

The most advanced data repository in the world demands the most advanced technologies in the world – and we’re delivering, with tech like:

  • Scala, Java
  • Curly brace languages/Python
  • A reactive AWS cloud platform
  • Spark environments (Databrick, Zeppelin)
  • SQL

More about the role

As a Data Engineer, you’ll work on developing our content and information services. These services rely on existing text and data mining as well as content structuring and meta-data generation processes – many of which rely on human interaction and creation. But we’re keen to introduce more automation by capturing human inputs and backing the submission and annotation systems using machine learning engines.

To really succeed in this role, you’ll need a curiosity for algorithm development, and to be open to learning and working with new technologies. The ability to take a wider view, to understand the product cycle – from front-end functionality to back-end requirements – will make you irreplaceable to the Content & Innovation team. Good communication and documentation skills, with a natural ability to convey complex technical concepts to non-technical professionals, will enable you to implement process changes and disruptive technologies across our wider organization.

More about you

As a Data Engineer, you’ll need excellent grounding in software engineering, process optimization, coding, data standards and cloud infrastructure. We’re really looking for a Masters degree in Computer Science (or related area), and working experience of: scripting for task automation, writing queries, handling data (ETL), using *nix systems/open source software/libraries, and working in Big Data environments. You’ll also need to be familiar with agile software development, cloud technologies like AWS, software version control, web development/services and APIs. Good knowledge of state-of-the-art tooling in capturing content and translating human annotations to machine models will really set you apart.

Oh, and you’ve got a positive, constructive approach with an emphasis on collaboration and flexibility, right? We need someone with confidence, but who can admit when someone else has a better idea too.

More about Elsevier

At Elsevier, pride, passion and the pursuit of knowledge meet innovation, opportunity and growth. We are an organization that runs on intelligence. We supply it, develop it, and rely on it – and it requires intelligent people to make the intelligent career choice that will see us improve further. If you join us in Amsterdam, you will be part of shaping science’s next steps.

Apply for this job