-

Rootly Announces 2025’s Top 50 People Making the World More Reliable

SAN FRANCISCO--(BUSINESS WIRE)--Rootly, the leading AI-native on-call and incident response platform, today unveiled the Reliability Top 50, an annual list recognizing the people who keep the world’s most ambitious technologies resilient at scale.

When a new AI frontier model drops or an EV company pushes an autonomy update, the headlines focus on breakthroughs and valuations. But the real test is quieter: does it work when millions log in at once, when GPUs fail mid‑inference, when a global outage ripples across the internet? The magic only lasts because reliability leaders make sure it does.

“Reliability is now a leadership competency, not just a systems property,” said JJ Tang, Co‑founder of Rootly. “The people on this list embody a new standard: they design for failure, operationalize learning, and make resilience a strategic advantage.”

The Reliability Top 50 spans SREs, infrastructure leaders, and incident commanders across AI companies like Anthropic and Mistral, hardware innovators like NVIDIA and Cerebras, cloud platforms including Google and Microsoft, and enterprise leaders at Okta, Twilio, and Salesforce. Some build the pipelines that make inference possible; others lead response teams when things inevitably go wrong. Together, they ensure scale never comes at the expense of stability.

What This Year’s List Shows

  • Reliability is cross‑disciplinary. This year’s honorees include not just SREs but also incident commanders, quality leaders, and customer support directors, reflecting how resilience now cuts across the entire org chart.

  • AI is mainstreaming reliability. Frontier labs, voice‑AI startups, and GPU providers treat SRE as a first‑class discipline, not an afterthought once research is “done.”

  • Incidents define culture. From Twilio to Tesla to theScore, leaders on the list run the playbooks that keep global services steady under stress and write the lessons the rest of the industry will copy.

  • Hardware meets software. Wafer‑scale chips, DGX clusters, and GPU clouds demand reliability practices once reserved for telcos and hyperscalers.

Modern tech is a tower of dependencies: AI models on orchestration frameworks on GPU clouds on global networks. Remove one link and the illusion crumbles. The Reliability Top 50 honors those who keep the tower upright, turning SLOs into uptime, transforming postmortems into industry standards, and teaching us all how to fail more gracefully. These are the people who make “always‑on” possible.

View the 2025 Reliability Top 50 on Rootly’s blog.

About Rootly

Rootly is the AI-native on-call and incident management platform that provides proactive support to help teams resolve incidents faster, improve system resilience, and streamline on-call operations. It’s your always-on SRE copilot that automates root cause analysis and identifies patterns that drive continuous improvement—trusted by leading companies like LinkedIn, NVIDIA, Replit, Elastic, Canva, Figma, Tripadvisor, Glean, Okta, and Grammarly.

Contacts

Press Inquiries: sylvain@rootly.com

Rootly


Release Versions

Contacts

Press Inquiries: sylvain@rootly.com

More News From Rootly

Rootly Announced as One of Deloitte's Technology Fast 50 Program Winners for 2025

TORONTO--(BUSINESS WIRE)--Rootly has been recognized by Deloitte's 2025 Technology Fast 50™ awards program with a top-10 placement for its rapid growth, entrepreneurial spirit, and bold innovation. The program recognizes Canada's 50 fastest-growing technology companies based on the highest revenue growth percentage over the past four years. Rootly CEO JJ Tang credits the company’s growth to its obsession with customer success and continued investment in AI-native features. “This recognition fro...

Rootly Launches AI Labs to Advance Reliability Engineering Through Community Innovation

SAN FRANCISCO--(BUSINESS WIRE)--Rootly, the leading AI-native incident response platform, today announced the launch of Rootly AI Labs, a fellow-led community dedicated to redefining the future of reliability engineering. Rootly AI Labs aims to accelerate operational excellence by developing innovative prototypes, creating open-source tools, and producing groundbreaking research. The official launch will occur at GitHub HQ in San Francisco. The event will feature keynote presentations and discu...

Rootly Makes Its API AI-Agent-First to Elevate Incident Management

SAN FRANCISCO--(BUSINESS WIRE)--Rootly, the AI-native on-call and incident response platform, is taking another step forward in its AI-first approach by making its API AI-Agent-First. As AI agents become integral to modern engineering workflows, they need to interface seamlessly with APIs to execute complex tasks. However, APIs have been designed for developers – not for AI, limiting their ability to take full advantage of them. It's time for an API redesign focusing on AX (Agent Experience). R...
Back to Newsroom