Data Engineer GenAI Job Description
Mediahuis is seeking an experienced Data Engineer to strengthen its Data & Insights Team. This role focuses on productizing and maintaining GenAI solutions by bridging infrastructure and machine learning.
Your Key Responsibilities:
* CLOUD INFRASTRUCTURE MANAGEMENT
o Design and maintain CI/CD pipelines and cloud infrastructure using Terraform, emphasizing reliability, security, and scalability.
* CONTAINERIZED ML APPLICATIONS
o Manage containerized ML applications with Docker; familiarity with Kubernetes is a plus.
* MACHINE LEARNING LIFECYCLE SUPPORT
o Monitor model performance, optimize infrastructure, and share best practices across the team for continuous improvement.
We're looking for a specialist with a can-do mentality, a passion for data, and strong problem-solving skills. A proactive and pragmatic attitude, along with a collaborative mindset, is essential. Affinity with machine learning workflows is also important: you don't need to be an ML expert, but you should understand how to take a model from notebook to production.
Requirements:
* 3-5 years of experience as a data engineer, DevOps engineer or in a similar role.
* Programming expertise in Python in a machine learning or data context.
* Strong knowledge of Docker, Terraform, and cloud infrastructure; experience with Azure or Kubernetes (K8s) is a plus.
* Familiarity with cloud environments (Azure and GCP) and open-source tools like Airflow. Experience in a data engineering context is a plus, though heavy ETL is not the focus.
* English fluency is essential; knowledge of Dutch is a plus.
Bonus Points:
Experience with MLOps tools (such as MLflow, Weights & Biases, or Azure ML), knowledge of security best practices in cloud-based ML environments, and interest in or experience with GenAI, LLMs, or prompt engineering are all valuable assets.