mixmode
Sr. Software Engineer - AI Reliability
At a Glance
- Location: Remote
- Work Regime: Remote
- Experience: 7+ years
- Compensation: targeting $150,000-$210,000 for this position, though we can adjust based on …
- Posted: 2026-03-03T14:53:58-05:00
Key Requirements
Required Skills
Domain Knowledge
- Engineering
Requirements
- 7+ years of professional software engineering experience
- Proven experience designing, building, and operating distributed systems in production
- Strong understanding of service architecture, concurrency, resource management, and distributed failure modes
- Experience operating Kubernetes deployments
- Strong experience with relational databases, including query performance analysis, indexing, and connection management
- Demonstrated ability to diagnose and resolve performance, scalability, and reliability issues across system layers
Responsibilities
We are hiring a Senior Software Engineer to enhance the reliability, performance, and scalability of our production AI systems.
This role focuses on understanding, refining, and strengthening existing distributed services across application, database, and container orchestration layers.
You will collaborate with ML researchers to make our systems more robust, maintainable, flexible, and scalable.
- Own the reliability, performance, and operational health of production AI services
- Refactor and harden existing systems to improve resilience, clarity, and maintainability
- Diagnose and resolve issues across distributed services, data pipelines, and storage layers