R&D Data Engineer in AI and Computer Vision

Job title:

R&D Data Engineer in AI and Computer Vision

Company:

Atos

Job description

Eviden, part of the Atos Group, with an annual revenue of circa € 5 billion, is a global leader in data-driven, trusted and sustainable digital transformation. As a next-generation digital business with worldwide leading positions in digital, cloud, data, advanced computing and security, it brings deep expertise for all industries in more than 47 countries. By uniting unique high-end technologies across the full digital continuum with 47,000 world-class talents, Eviden expands the possibilities of data and technology, now and for generations to come.

Our team is building the Eviden Computer Vision Platform, a real-time video analytics solution for different verticals. We use AI technologies to design and develop the product, as well as Big Data software components to manage the related data.

We are seeking a skilled and motivated Data Engineer to complete our team, working on end-to-end data pipeline implementation and data lake operation.

Responsibilities:

  • Build and maintain robust data pipelines to ingest, transform, and load data from various sources. Ensure data quality, consistency, and reliability throughout the pipeline.
  • Implement data transformation logic to clean, enrich, and transform raw data into structured formats suitable for analysis and reporting. Leverage ETL/ELT processes as needed.
  • Manage the data platform infrastructure, including choosing appropriate storage technologies, optimizing storage utilization, and ensuring data accessibility.
  • Implement and enforce data security measures, access controls, and compliance standards to maintain the integrity and privacy of the data.
  • Develop mechanisms for efficient data search and retrieval, considering relevancy, query performance, and user experience.
  • Monitor and optimize the performance of data pipelines and storage systems to ensure efficient data processing and retrieval.
  • Maintain clear and comprehensive documentation of data pipeline designs, processes, and configurations to facilitate knowledge sharing and future maintenance.
  • Implement workflows to automate the building, testing and deployment of data lake components following DevOps practices.
  • Implement unit and integration tests following guidelines and propagate knowledge across the team.
  • Manage AI assets (i.e., datasets, models) properly and securely.
  • Integrate metadata extraction components that leverage AI models and third-party tools such as labelling frameworks.
  • Collaborate effectively with cross-functional teams including data scientists, data engineers, frontend and backend developers, and product owners to align on project goals and requirements.

Education

  • Bachelor’s, Master’s, or PhD in Computer Science, Electrical Engineering, or a related field.

Essential knowledge and professional experience

  • Proven experience (3+ years) in designing, building, and maintaining large-scale data pipelines and data lake infrastructure.
  • Strong proficiency in programming languages such as Python.
  • Hands-on experience in REST API development.
  • Hands-on experience with Elasticsearch, including data ingestion, indexing, and search capabilities. Familiarity with Elasticsearch Query DSL and search relevancy concepts.
  • Knowledge of data modelling, schema design, and ETL/ELT processes.
  • Experience preparing software applications for deployment using Docker and Kubernetes.
  • Proficiency in the usage of Git and GitHub Actions.
  • Experience working with agile methodologies.
  • Proficient user of Linux environments (Bash or other shells).
  • English level: B2.

Additional knowledge

  • Experience with MLOps tools, e.g., MLflow, Kubeflow.
  • Experience with Google Cloud Platform (GCP).
  • Familiarity with CPU vs. GPU programming.
  • General knowledge of clusters.

Competences

  • Autonomy: capacity to seek and read documentation.
  • Ability to collaborate: provide constructive comments and embrace best practices and guidelines.
  • Fluency in English.
  • Good writing and presentation skills.
  • Strong soft skills: communicative, enthusiastic, highly collaborative, proactive, and self-driven.
  • Ability and enthusiasm to learn new technologies quickly.

Benefits

  • Half-day Fridays.
  • Intensive (compressed) workday in summer.
  • Personalised training and upskilling programme.
  • Flexible working hours and ways of working.
  • R&D environment. New ideas and technologies are welcome.

Let’s grow together.

Location

Madrid

Job date

Sun, 23 Jun 2024 05:34:08 GMT

Published by
yonnetim