AI Engineer

Location:
(5656) Netherlands
Salary:
market rate
Type:
Contract
Start Date:
asap
Contract Period:
6 months
Main Industry:
Search Information Technology Jobs
Advertiser:
microTECH Global Ltd
Job ID:
132640278
Posted On:
19 January 2026

Job Title: AI Engineer

Location: Munich/Hamburg or Eindhoven

Type: Contract

Duration: 12 Months

Brief:

We are looking for an AI Engineer passionate about Generative AI and Agentic AI systems, someone who thrives on optimizing models for efficient on-device deployment. You will work on large language models (LLMs), large multimodal models (LMMs), and Vision-Language-Action (VLA) models, ensuring they run reliably and efficiently on our NPU-based platforms.

Responsibilities:

Optimize LLMs and multimodal models for on-device deployment

Investigate, develop and apply advanced quantization (8-bit, 4-bit, mixed precision), pruning, and distillation techniques for deriving optimized models for NXP NPU targets.

Accelerate inference performance

Investigate, develop and implement system optimizations such as speculative decoding and other efficient decoding algorithms tailored for edge environments.

Engineer agentic AI capabilities towards tiny agents

Investigate methodologies for enhancing the performance of small language models towards enabling tiny agents at the edge, while ensuring these follow safety principles.

Work with inference engines and deployment frameworks

Deploy optimized models using Ollama, llama.cpp, ONNX Runtime, and TFLite for efficient NPU inference.

Benchmark LLMs and agentic systems

Design benchmarking pipelines for assessing the performance of Generative and Agentic AI systems on-device

Requirements:

MSc, PhD or EngD in a technical specialism, like Computer Science or equally relevant.

5+ years of experience in software/AI engineering with deep exposure to LLMs, VLMs, and systems performance.

Experience with LLM quantization techniques (e.g., SmoothQuant, SpinQuant, QuaRoT), pruning (Wanda, SparseGPT, etc.) and other system optimizations like speculative decoding.

Track-record experience in working with AI frameworks (PyTorch, TensorFlow, etc.), required.

Experience with Agentic AI technologies and familiarity with existing frameworks (e.g., LangChain, Google ADK, SmolAgents, etc.)

Understanding of AI toolchains, deployment, portability and inference engines (CUDA, TensorRT, TFLite, ONNX, Ollama, etc.) preferred.

Affinity and experience with embedded systems, and NPU accelerators required.

Broad experience with Operating systems GNU/Linux, embedded systems, development boards, and processors, and SW competencies required.

Familiarity with setting up and maintaining related ML-Ops development environments (MLFlow, ClearML, etc.) required.

Knowledge of build systems (YOCTO, OpenEmbedded, etc.) beneficial, working with cross-compilation toolchains for ARM preferred.

Solid programming experience of C, C++, Python and Bash programming languages on Linux systems required.

If this sounds of interest,

To help us track our recruitment effort, please indicate in your email/cover letter where (vacanciesineu.com) you saw this job posting.

Share
yonnetim

Published by
yonnetim

Recent Posts

Technician – Construction (Electrical)

Location: Birmingham (B12) - West Midlands, West Midlands, United Kingdom Salary: £28031 - £30378 per…

13 minutes ago

2i/c of Science

Location: Faversham (ME13) - Kent, South East, United Kingdom Salary: £32,916 - 51,048 per year…

42 minutes ago

Account Executive

Location: Cosham (PO6) - Hampshire, South East, United Kingdom Salary: £24000 - £31000 per annum…

1 hour ago

Paediatric Physiotherapist

Location: Galway, Connaught, Ireland Salary: £20 - £23 per hour Type: Contract Start Date:  ASAP…

2 hours ago

Aftersales Support Engineer – Commercial Heating & Cooling

Location: West London, London, United Kingdom Salary: £50000 - £55000 per annum Type: Permanent Main…

2 hours ago

Architectural Technician

Location: West Sussex, South East, United Kingdom Salary: £32000 - £36000 per annum + Company…

2 hours ago
If you dont see Apply Button. Please use Non-Amp Version