At Scope Merge, we connect top Tunisian engineers with leading European companies. We offer long-term roles, international exposure, and above-market compensation. You’ll work on exciting international projects while we handle employment, payroll, and benefits.
For this position, you’ll join an innovation-driven R&D company that uses emerging technologies to solve complex problems and build future-ready products. The team combines deep technical expertise with a dynamic, interdisciplinary culture focused on creating long-lasting value.
We are hiring a Senior AI Engineer with deep expertise in Large Language Models (LLMs) to join a fast-growing European startup applying cutting-edge AI to solve real-world problems. You’ll work on the development, fine-tuning, and deployment of LLMs in production.
Tasks
- Design, train, and fine-tune LLMs for specific business use cases.
- Build and optimize inference pipelines for LLM-based applications, ensuring low-latency and scalability.
- Evaluate and integrate open-source LLMs (e.g., LLaMA, Mistral, Falcon) or APIs (e.g., OpenAI, Anthropic) depending on use case and cost constraints.
- Collaborate with backend engineers to deploy models efficiently using tools like Triton, vLLM, or ONNX Runtime.
- Design and run evaluation frameworks (e.g., prompt quality, hallucination detection, latency).
- Monitor models in production and implement mechanisms for feedback loops and continuous improvement.
- Stay up to date with advances in generative AI, open-source LLM tooling, and fine-tuning strategies.
Requirements
- 4+ years of experience in applied ML or NLP, with at least 1–2 years focused on LLMs.
- Strong knowledge of transformer architectures and experience working with model libraries like Hugging Face Transformers, LangChain, or LLM orchestration tools.
- Proven experience deploying LLMs into production (custom or API-based) and optimizing them for inference.
- Familiarity with techniques such as LoRA, QLoRA, PEFT, RAG, or prompt engineering.
- Solid Python skills, especially in ML stack (e.g., PyTorch, TensorFlow, FastAPI for serving).
- Experience working with cloud infrastructure (AWS/GCP) and containerized deployments (Docker, Kubernetes).
- Bonus: Experience with data pipelines, vector databases (e.g., Weaviate, Pinecone, FAISS), or hybrid search.
Benefits
- Work on international projects with top startups and tech companies.
- Collaborate with global teams and gain cross-border experience.
- Grow your skills through hands-on challenges and real-world impact.Modern offices in Lac 2, Tunis
- Supportive sick leave policy that respects your health and well-being
- Receive above-market salary and financial stability