Staff Software Engineer

Palo Alto, California

Mistral AI

About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society.

Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on

Role Summary

We are seeking a talented and experienced software engineer to join our Engineering team. You will be responsible for identifying and solving complex system problems with the aim to optimize the overall performance in different layers (API, micro-services, databases, infrastructure), monitoring and profiling service usage, developing systems that optimize the latency, throughput and robustness of large distributed systems. The role is based in Paris, France and reports to the Head of Engineering.

Reporting line : Head of Engineering

Location : Paris, France

What you will do

Key Responsibilities

Executing, deploying and productizing LLM at our scale often involves discovering and solving new and complex systems problems. As a core member of our engineering team, your will be responsible for identifying and solving these problems with the aim to optimize the overall performance of our systems.

Your scope will include (but will not be limited to):

• Identifying performance issues and areas of improvement in different layers (API, micro-services, databases, infrastructure, build systems )

• Monitoring and profiling how each services is used

• Developing systems that optimize the latency, throughput and robustness of our large-scale distributed systems

• Spreading a performance culture and best practices in the engineering team

Representative projects

• Optimizing API Latency (caching, CDNs, db queries, load balancing, threading )

• Implement caching strategies to store frequently requested data / precomputing data

• Setting up performance monitoring tools to track response times

• Implement low-latency high-throughput sampling for large language models

• Write custom load-balancing algorithms to optimize serving efficiency

About you

• 8+ years of relevant professional work experience

• Master's degree in Computer Science, Information Technology or a related field

• Experience building and scaling complex distributed systems

• Excellent proficiency in backend software development (Python is a strong plus)

• Solid proficiency in infrastructure management (Docker, CI/CD, K8s, Helm, Terraform )

• Good knowledge of cloud ecosystems

• Autonomous and self-starter

• Ability to communicate with influence

What We Offer

• Ability to shape the exciting journey of AI and be part of the very early days of one of the hottest startup in AI

• A fun, dynamic multicultural team and collaborative work environment

• Opportunities for professional growth and development

Package: competitive salary and equity structure

Health: full health insurance for you and your family

Food: daily lunch vouchers

Sport: monthly contribution to a Gympass subscription

Transportation: monthly contribution to a mobility pass

Parental: generous parental leave policy

Visa sponsorship if required

Date Posted: 26 March 2025

Apply for this Job

Show me similar jobs

Send me jobs by email