About Mistral
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.
We are a dynamic, collaborative team passionate about AI and its potential to transform society.
Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
Role Summary
We are seeking a talented and experienced software engineer to join our Engineering team. You will be responsible for identifying and solving complex system problems with the aim to optimize the overall performance in different layers (API, micro-services, databases, infrastructure), monitoring and profiling service usage, developing systems that optimize the latency, throughput and robustness of large distributed systems. The role is based in Paris, France and reports to the Head of Engineering.
🧢 Reporting line : Head of Engineering
📍 Location : Paris, France
What you will do
Key Responsibilities
Executing, deploying and productizing LLM at our scale often involves discovering and solving new and complex systems problems. As a core member of our engineering team, your will be responsible for identifying and solving these problems with the aim to optimize the overall performance of our systems.
Your scope will include (but will not be limited to):
• Identifying performance issues and areas of improvement in different layers (API, micro-services, databases, infrastructure, build systems...)
• Monitoring and profiling how each services is used
• Developing systems that optimize the latency, throughput and robustness of our large-scale distributed systems
• Spreading a performance culture and best practices in the engineering team
Representative projects
• Optimizing API Latency (caching, CDNs, db queries, load balancing, threading..)
• Implement caching strategies to store frequently requested data / precomputing data
• Setting up performance monitoring tools to track response times
• Implement low-latency high-throughput sampling for large language models
• Write custom load-balancing algorithms to optimize serving efficiency
About you
• 8+ years of relevant professional work experience
• Master’s degree in Computer Science, Information Technology or a related field
• Experience building and scaling complex distributed systems
• Excellent proficiency in backend software development (Python is a strong plus)
• Solid proficiency in infrastructure management (Docker, CI/CD, K8s, Helm, Terraform...)
• Good knowledge of cloud ecosystems
• Autonomous and self-starter
• Ability to communicate with influence
What We Offer
• Ability to shape the exciting journey of AI and be part of the very early days of one of the hottest startup in AI
• A fun, dynamic multicultural team and collaborative work environment
• Opportunities for professional growth and development
💰 Package: competitive salary and equity structure
🧑⚕️ Health: full health insurance for you and your family
🥕 Food: daily lunch vouchers
🥎 Sport: monthly contribution to a Gympass subscription
🚴 Transportation: monthly contribution to a mobility pass
🍼 Parental: generous parental leave policy
🌎 Visa sponsorship if required