Codit Luxembourg - Connecting is everything
In Luxembourg, Codit is a brand of Proximus Luxembourg dedicated to the corporate market.
The Codit team leverages scalable Microsoft Azure technologies backed by seamless integration across platforms, applications and data sources to convert data into actionable business insights.
Codit provides next-level consultancy, technology, and managed services to organisations worldwide and successfully helps them reduce operational costs, improve efficiency and enhance communication by integrating people, applications, and things.
Both Codit and Proximus NXT are key players within their respective domains: by combining these skills, Proximus Luxembourg is able to provide integrated 360° digital solutions, helping companies to drive their business through their entire journey of digital transformation– www.proximusnxt.lu
Your mission :
We are seeking an experienced Data Scientist with specialized expertise in Generative AI. The ideal candidate will join our team to work on diverse customer projects, collaborating with businesses to solve unique challenges through data. The ideal candidate will also contribute to our core product, a cloud-agnostic advanced chatbot solution built on Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), embeddings, and various API components.
Your primary responsibilities will include using your expertise in statistics, machine learning, and programming to transform complex data into actionable insights and strategic recommendations that deliver tangible business value. You will be instrumental in both implementation projects and consulting missions. Additionally, you will be participating in the development of Back-End components for GenAI solutions, rigorously test and troubleshoot the solution through iterative releases based on defined functional requirements, and selecting appropriate technologies, libraries, and open-source tools.
Key Responsibilities
* Work closely with customers to understand their data needs and deliver tailored solutions through data analysis, data cleaning, consultancy, and implementation missions.
* Perform exploratory data analysis (EDA) and build machine learning models to solve customer-specific data challenges.
* Develop data pipelines and ETL processes to prepare and clean data for analysis and model training.
* Participate in the development of GenAI applications in Python, focusing on back-end components and APIs.
* Integrate Python components with LLM frameworks, RAG pipelines, embeddings, OCR libraries and databases
* Evaluate and contribute in architectural decisions and technology selection
* Package and deploy containerized components within Kubernetes environments
* Troubleshoot and resolve issues related to code, configuration, performance, and infrastructure
Profile Specify
* Proven experience in data science, demonstrating a solid understanding of the full project lifecycle, from problem framing and data collection to model deployment and monitoring
* Experience working with large and complex datasets from various sources
* Proficient in Python and API development, with the ability to write clean, scalable, and modular code using modern software design patterns
* Deep theoretical and practical knowledge of Generative AI
* Familiarity with Cloud Platform, containerized environments, CI/CD pipelines, and agile development workflows
* Capable of translating complex technical concepts into clear, understandable language for non-technical stakeholders
* Strong commitment to project deadlines with the ability to deliver tested and stable solutions.
* Effective in collaborative environments, contributing to code reviews and shared responsibilities.
Your profile :
Experience
* 5+ years of professional experience as a Data Scientist or similar role
* Experience in a client-facing role, including reporting and managing project outcomes for customers
* Proven track record of developing and deploying machine learning models in a production setting
* Practical experience with LLMs, embeddings, APIs, and GenAI libraries in production environments.
* Solid experience in Python and developing RESTful APIs.
* Familiarity developing in public cloud environments (Azure, Google Cloud Platform)
* Familiarity with containerization and deployment using Docker, Kubernetes, and Harbor.
Technical Skills
* Data & Machine Learning: NumPy, TensorFlow, PyTorch, Apache Airflow, and a strong understanding of MLOps principles
* Databases: Vector, SQL and NoSQL databases, specifically PostgreSQL, SQL Server.
* Machine learning operations (MLOps) principles and tools for model deployment and management.
* Languages/Programming: Python (including asynchronous programming), Structured programming, Bash, SQL, Rest APIs
* Version Control: Git platforms such as GitLab, GitHub, or Azure DevOps.
* Frameworks/APIs: LLM Frameworks (VLLM, Ollama...), FastAPI/Flask, vectorDBs (ChromaDB, FAISS…), LangChain, LangGraph...
* Infrastructure: Docker, Kubernetes, Harbor, Cloud Storage Buckets
Soft Skills
* Collaborative team player, open to feedback, with a growth mindset
* Excellent communication skills, able to clearly document and explain complex systems and code
* Curious and open to experimenting with emerging AI tools and technologies
* Strong problem-solving skills with a meticulous and detail-oriented development approach
* Proactive and self-driven, with a strong focus on continuous improvement and quality
* Able to perform under pressure and manage tight effectively
* Committed to delivering high-quality, stable, and well-tested solutions
* Fluent in spoken and written in English, French is a plus
Our offer :
A professional and stimulating work environment in the IT & telecom sector. Multiple career opportunities within the Proximus Group at national and international level, cutting-edge training in new technologies, a wealth of recognized expertise. We also offer an attractive salary package and many other benefits.
Our company is an equal opportunity employer, valuing diversity in all its forms. We firmly believe that each individual brings a unique richness to our teams, and we are committed to creating an inclusive environment where every voice counts.
If at the end of the process your application is successful, you will be asked to provide an extract from your criminal record. Your personal information will be handled in compliance with applicable data protection laws.