[Remote] Staff Site Reliability Engineer

Work from home, full-time role

Note: This is a remote position open to candidates in the USA. Domino Data Lab builds software that helps AI-driven organizations improve their data science and AI solutions. The company is looking for a Staff Site Reliability Engineer to lead the development of AI-assisted reliability tooling, improve observability, mentor engineers, and strengthen operational practices.

Responsibilities

  • Lead the development of Domino's internal AI-assisted reliability tooling, including systems that analyze tickets, logs, traces, and documentation to help teams resolve outages faster with less recurring toil
  • Improve the observability coverage and signal quality for our most critical customer-facing systems, so engineers have more to work with throughout the development and support lifecycle
  • Own incident response end-to-end, from detection to remediation, and leave each problem space better documented, better understood, and less likely to recur
  • Guide the development of customer and user-facing observability tools within our products
  • Define and mature SLO/SLI frameworks for priority services, turning abstract reliability goals into measurable, actionable standards
  • Scale cloud operations practices for Domino’s single-tenant SaaS offering, and work with engineering teams to improve the reliability and repeatability of customer deployments and upgrades
  • Mentor other engineers and shape how SRE is practiced at Domino, including incident response workflows, operational readiness expectations, and post-incident learning culture

Skills

  • Deep experience in Site Reliability Engineering, platform engineering, or a software engineering role with genuine, hands-on operational ownership
  • Fluency with Kubernetes, Linux, cloud platforms, and observability tooling, and the ability to use them to investigate complex, real-world production problems
  • A strong ability to perceive and close reliability gaps in technical products, tools and processes
  • Strong software engineering skills in Python or Go, with a track record of building internal tools or services that people actually rely on
  • Comfort leading technically ambiguous work and influencing direction across teams without needing direct authority to get things done
  • A history of improving reliability through engineering and automation, not just putting out fires manually
  • Strong communication skills and real experience mentoring engineers or shaping technical decision-making on your team
  • Sound judgment about AI/LLM tooling: you know where it genuinely helps in operational workflows and where it adds noise instead of signal
  • Experience with LLM-based systems, retrieval workflows, SaaS platform operations, or building tooling for support or developer teams

Benefits

  • Equity
  • Company bonus or sales commissions/bonuses
  • 401(k) plan
  • Medical, dental, and vision benefits
  • Wellness stipends

Company Overview

  • Domino Data Lab provides an enterprise platform designed to help organizations build, deploy, and manage AI and machine learning models. It was founded in 2013, and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees. Its website is https://domino.ai.
Company H1B Sponsorship

  • Domino Data Lab has a track record of offering H1B sponsorships, with 8 in 2025, 6 in 2024, 10 in 2023, 7 in 2022, 11 in 2021, and 6 in 2020. Please note that this does not guarantee sponsorship for this specific role.