Senior Infrastructure Engineer
Quandri
Other Engineering
Vancouver, BC, Canada
CAD 130k-170k / year + Equity
Posted 6+ months ago
Who we are
Our mission at Quandri is to transform insurance into a trusted and delightful experience using AI. We do this by delivering an AI operating system for North America’s best insurance agencies and brokerages.
What we do
Quandri is transforming the insurance experience for policyholders across North America by providing AI to the industry’s critical stakeholders: insurance agencies and brokerages. Insurance agents are the lifeblood of the insurance industry, and are where most Property & Casualty insurance is sold and serviced. Yet several forces are making it harder than ever for them to operate: more consolidation, a rising talent crisis, the increasing complexity of insurance, and the AI revolution all challenge how agencies have typically operated.
This challenge is a massive opportunity to rethink how insurance agencies operate. Quandri is at the cutting edge of this change, delivering real transformation to the entire insurance experience one process at a time through a deep understanding of insurance workflows, systems, and data paired with deep technical capabilities and proprietary models.
We’re making insurance better for policyholders, while helping insurance brokerages deliver better client service, grow faster, and harness AI so that they control the future of insurance distribution.
Why join us?
We are moving fast and are focused on customer impact. You will enjoy working at Quandri and be successful here if you:
- Have a deeply customer-focused mindset
- Can operate in a fast-paced environment where change and innovation are constant
- Take high levels of ownership in everything you do
- Think AI-first and want to apply AI to solve the world’s problems of tomorrow
- Operate with urgency, and aren’t afraid to move fast and make some mistakes along the way
- Are low-ego and care most about achieving the best outcome for our customers and big goals as a team
- Want to have a big impact, and are not content with making changes at the edges
Our head office is in Vancouver, BC, with three-quarters of our team working here and the rest distributed across North America. We’re backed by leading US and Canadian investors, are growing fast, and have a few awards to prove it.
Most importantly, if you want to do the best work of your life changing an industry with technology alongside talented people who are both high-performing and kind, Quandri is the place for you.
About the Role
As a Senior Infrastructure Engineer, you’ll help lead the design and evolution of Quandri’s core infrastructure and platform capabilities. You’ll play a critical role in how our software is deployed, run, and scaled — partnering across engineering to build robust systems that support our fast-growing customer base.
This team is responsible for our cloud infrastructure, developer experience, security and compliance posture, and platform reliability. You’ll work closely with engineering and product peers to ensure that our infrastructure is performant, scalable, cost-effective, and secure. This is a high-impact role for someone who thrives on ownership, is motivated by operational excellence, and wants to help shape the foundational platform of a company growing rapidly in a dynamic, customer-centric space.
What you’ll do:
- Build our AI Bot Control Plane — Design and build the critical orchestration layer between the Quandri application and the high-scale AI bots that power it. You’ll own the infrastructure that handles bot scheduling, lifecycle management, deployment pipelines, autoscaling, resource allocation, and real-time health monitoring — ensuring hundreds of concurrent bots run reliably, efficiently, and at scale.
- Shape our Observability Strategy — Architect and implement a full-stack observability platform built on OpenTelemetry, spanning logs, metrics, and distributed traces. You’ll work across the Grafana ecosystem (Prometheus, Loki, Tempo, Grafana) to give engineering deep visibility into system behavior, define and enforce SLOs/SLAs, build dashboards that drive decisions, and establish incident management workflows that keep us ahead of issues — not reacting to them.
- Lay the Foundations of our Security Posture — Implement and mature foundational security practices including SOC 2 compliance, incident response procedures, data loss prevention (DLP), Identity and Access Management (IAM), secrets management, and security monitoring. You’ll be instrumental in building a security-first culture across the engineering organization.
- Pioneer our MLOps Infrastructure — Partner with our Intelligence team to build and operate the MLOps infrastructure that powers our AI capabilities. This includes standing up and scaling our LLM service layer, model training pipelines, model versioning and lineage tracking, experiment management, and production model serving — enabling the team to iterate on models rapidly and deploy them with confidence.
- Scale our Data Infrastructure — Build and maintain the data backbone that captures and processes the high-volume event streams generated by our bots. You’ll design and evolve our data lake and data warehouse architecture on Databricks, enabling reliable ingestion, transformation, and access to the data that fuels our product, analytics, and machine learning initiatives.
- Champion an AI-First Developer Experience — Help define the strategy and build autonomous AI agents that proactively identify infrastructure issues, diagnose root causes, and remediate them — pushing us toward a self-healing platform. You’ll shape how we leverage AI to supercharge developer productivity and operational excellence across the engineering organization.
What We’re Looking For:
- 5+ years of experience in infrastructure, DevOps, platform, or SRE roles in a modern cloud environment
- Deep knowledge of AWS, Terraform (or equivalent IaC tools), and container orchestration with Kubernetes or Amazon ECS (EKS or ECS preferred)
- Experience with observability platforms and the Grafana ecosystem (Prometheus, Loki, Tempo, Grafana) or equivalent tools; familiarity with OpenTelemetry is a strong plus
- Experience improving CI/CD systems using GitHub Actions and Argo CD
- Understanding of IAM, secrets management, and compliance frameworks (SOC 2 experience a plus)
- Familiarity with MLOps concepts — model serving, training pipelines, LLM infrastructure, or experiment tracking
- Strong written and verbal communication skills; you can document, share, and teach effectively
- A thoughtful, collaborative problem-solver who’s excited to shape foundational systems
Nice to Have:
- Exposure to data infrastructure concepts — event streaming, data lakes, data warehouses, or platforms like Databricks
- Experience building or working with autonomous AI developer agents, AI-powered DevEx tooling, or self-healing infrastructure systems
Our guiding principles:
- Customers at the core. We put the customer at the center of all we do. At a basic level, we believe business success comes down to talking to customers and building something they want. We don’t take what customers say at face value; we think critically about it and build what they need. Customers are the core of everything we do, and our business exists to serve them. We prioritize their needs over all else within the company.
- Move with urgency. There are times when we need to move slowly and deliberately, but we default to acting fast and with urgency. We slow down when necessary, but this should be a deliberate choice. Businesses become more lethargic as they grow; this principle is designed to fight that tendency.
- Be curious. We understand the world by being curious and asking why. We aren’t satisfied with a surface-level understanding, and seek a deeper understanding of why things are the way they are. Don’t take someone’s word for it or accept the answer “because that’s how we do it.” Understand why and dig deep.
- Excellence in execution. We know that what separates good from great is a high level of execution. We commit ourselves to excellence in everything that we do, from delivering an amazing product to writing a great email.
- Act like an owner. We’re all owners of the business and act like it. We follow through on commitments, own our results and think long-term.
- Fight for simplicity. The law of increasing functional information states that systems evolve to become more complex over time. At Quandri, we believe there is sophistication in simplicity; as such, we intentionally fight for streamlined solutions and are committed to the uncomplicated.
Compensation and Benefits:
- Base pay ranges from $130,000 to $170,000 CAD, depending on level of experience, performance, and choice of stock option compensation
- Employee stock options based on experience level
- Comprehensive health benefits, including a $500 Lifestyle Spending Account
- Four weeks of paid vacation per year
- Work anywhere in the world for 60 calendar days of the year
- Parental leave top-ups: 6 months for birthing parents, 8 weeks for non-birthing parents (up to $100,000 annual salary)
Quandri is dedicated to fostering a diverse and inclusive workplace. As an equal opportunity employer, Quandri adheres to Canadian labour laws and does not engage in discrimination based on race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or any other status protected under Canadian law.
Don’t let imposter syndrome stop you from applying. Great people sometimes don’t have the “right” experience. If you think you’ll be amazing in this role, we encourage you to apply.