Causely Blog

In 2025, I resolve to be smarter about observability data

In 2025, I resolve to be smarter about observability data

Collecting “more data” has been the defining characteristic of observability practices and tools for the last few decades. But over-collection creates inefficiencies, noise, and cost without adding meaningful value. This trajectory must and can be changed.

Shmuel kliger

January 17, 2025

Read more
In 2025, I resolve to be smarter about observability data

In 2025, I resolve to be smarter about observability data

Collecting “more data” has been the defining characteristic of observability practices and tools for the last few decades. But over-collection creates inefficiencies, noise, and cost without adding meaningful value. This trajectory must and can be changed.

Shmuel kliger

January 17, 2025

Read more
In 2025, I resolve to make my application environment more resilient

In 2025, I resolve to make my application environment more resilient

By identifying potential risks in real time, predicting future demand, and adapting resources dynamically, teams can maintain reliability even under extreme conditions. This isn’t about eliminating unpredictability; it’s about building systems that respond intelligently to it.

Endre Sara

January 16, 2025

Read more
In 2025, I resolve to be proactive about reliability

In 2025, I resolve to be proactive about reliability

Making changes to production environments is one of the riskiest parts of managing complex systems. In 2025, let's transform how changes are made, empowering teams to anticipate risks, validate decisions, and protect system stability—all before the first line of code is deployed.

Enlin Xu

January 15, 2025

Read more
Causely's Capabilities That Power Causal Analysis

Causely's Capabilities That Power Causal Analysis

There are 13 tenets of Causely that are required to continuously assure applications reliability and performance. With Causely, teams automate root cause analysis, prevent SLO violations, and gain transparency and organizational alignment.

Shmuel kliger

January 15, 2025

Read more
In 2025, I resolve to eliminate escalations and finger pointing

In 2025, I resolve to eliminate escalations and finger pointing

Explore the challenges of multi-team escalations, and the capabilities needed to address them. We’ll show how observability can be transformed to make escalations less contentious and more productive.

Steffen Geißinger

January 14, 2025

Read more
In 2025, I resolve to spend less time troubleshooting

In 2025, I resolve to spend less time troubleshooting

SREs and developers can make troubleshooting more manageable in 2025 by adopting systems that solve the root cause analysis problem.

Christine Miller

January 13, 2025

Read more
The O11ys 2024 – The Winners!

The O11ys 2024 – The Winners!

Read the Observability 360 announcement of all The O11ys 2024 winners.  Best Use of AI Winner: Causely Many observability systems now claim to support Root Cause Analysis. At the same time though, most of these systems use algorithms – admittedly, advanced…

Karina Babcock

Karina Babcock

January 2, 2025

Read more
Tackling CPU Throttling in Kubernetes for Better Application Performance

Tackling CPU Throttling in Kubernetes for Better Application Performance

CPU throttling is a frequent challenge in containerized environments, particularly for resource-intensive applications. It happens when a container surpasses its allocated CPU limits, prompting the scheduler to restrict CPU usage. While this mechanism ensures fair resource sharing, it can significan

Will Searle

Will Searle

November 27, 2024

Read more
What’s a “Service Owner” and how can they improve application reliability?

What’s a “Service Owner” and how can they improve application reliability?

Assuring application reliability is a persistent challenge faced by every IT organization, complicated by rapid technology evolution and the increased emphasis on lean engineering.  One trend among progressive companies is to designate a “Service Owner” who is responsible for making…

Will Searle

Will Searle

November 18, 2024

Read more
Watch out! Sharks at KubeCon

Watch out! Sharks at KubeCon

Based on my LinkedIn news feed, it must be that time of year when thousands of open source enthusiasts congregate to talk tech at various parties, dinners, and other networking events surrounding KubeCon. In fact, we’re hosting a couple of…

Prashant Sridharan

November 5, 2024

Read more
Observability talks sure to make waves at KubeCon

Observability talks sure to make waves at KubeCon

KubeCon North America 2024 is around the corner! This year I’m especially excited, as it’s my first KubeCon since we launched Causely. The energy at KubeCon is unmatched, and it’s a great opportunity to catch up with familiar faces and make new…

Causely

Causely

October 31, 2024

Read more
The use of eBPF – in Netflix, GPU infrastructure, Windows programs and more

The use of eBPF – in Netflix, GPU infrastructure, Windows programs and more

Takeaways from eBPF Summit 2024 How are organizations applying eBPF to solve real problems in observability, security, profiling, and networking? It’s a question I’ve found myself asking as I work in and around the observability space – and I was pleasantly…

Causely

Causely

September 25, 2024

Read more
The “R” in MTTR: Repair or Recover? What’s the difference?

The “R” in MTTR: Repair or Recover? What’s the difference?

Finding meaning in a world of acronyms There are so many ways to measure application reliability today, with hundreds of key performance indicators (KPIs) to measure availability, error rates, user experiences, and quality of service (QoS). Yet every organization I…

Causely

Causely

September 17, 2024

Read more
Intelligence Augmentation: An Important Step in the Journey to Continuous Application Reliability

Intelligence Augmentation: An Important Step in the Journey to Continuous Application Reliability

In an article that I published nearly two years ago titled Are Humans Actually Underrated, I talked about how technology can be used to augment human intelligence to empower humans to work better, smarter and faster. The notion that technology…

Causely

Causely

September 11, 2024

Read more
Preventing Out-of-Memory (OOM) Kills in Kubernetes: Tips for Optimizing Container Memory Management

Preventing Out-of-Memory (OOM) Kills in Kubernetes: Tips for Optimizing Container Memory Management

Running containerized applications at scale with Kubernetes demands careful resource management. One very complicated but common challenge is preventing Out-of-Memory (OOM) kills, which occur when a container’s memory consumption surpasses its allocated limit. This brutal termination by the Kubernet

Causely

Causely

August 28, 2024

Read more
Causely brings on a new CEO to accelerate growth

Causely brings on a new CEO to accelerate growth

Yotam Yemini joins Causely as CEO after departing Cisco and previously leading go-to-market efforts at Oort, Quantum Metric, and IBM Turbonomic   Thursday, August 22, 2024 – Today, Causely is excited to welcome Yotam Yemini as the company’s Chief Executive…

Causely

Causely

August 22, 2024

Read more
The Rising Cost of Digital Incidents: Understanding and Mitigating Outage Impact

The Rising Cost of Digital Incidents: Understanding and Mitigating Outage Impact

Digital disruptions have reached alarming levels. Incident response in modern application environments is frequent, time-consuming and labor intensive. Our team has first-hand experience dealing with the far-reaching impacts of these disruptions and outages, having spent decades in IT Ops….

Causely

Causely

August 8, 2024

Read more
Explainability: The Black Box Dilemma in the Real World

Explainability: The Black Box Dilemma in the Real World

The software industry is at a crossroads. I believe those who embrace explainability as a key part of their strategy will emerge as leaders. Those who resist will risk losing customer confidence and market share. The time for obfuscation is…

Causely

Causely

August 7, 2024

Read more
Understanding the Kubernetes Readiness Probe: A Tool for Application Health

Understanding the Kubernetes Readiness Probe: A Tool for Application Health

Application reliability is a dynamic challenge, especially in cloud-native environments. Ensuring that your applications are running smoothly is make-or-break when it comes to user experience. One essential tool for this is the Kubernetes readiness probe. This blog will explore the…

Causely

Causely

July 23, 2024

Read more
Beyond the Blast Radius: Demystifying and Mitigating Cascading Microservice Issues

Beyond the Blast Radius: Demystifying and Mitigating Cascading Microservice Issues

Microservices architectures offer many benefits, but they also introduce new challenges. One such challenge is the cascading effect of simple failures. A seemingly minor issue in one microservice can quickly snowball, impacting other services and ultimately disrupting user experience. The…

Causely

Causely

July 15, 2024

Read more
Using OpenTelemetry and the OTel Collector for Logs, Metrics, and Traces

Using OpenTelemetry and the OTel Collector for Logs, Metrics, and Traces

OpenTelemetry (fondly known as OTel) is an open-source project that provides a unified set of APIs, libraries, agents, and instrumentation to capture and export logs, metrics, and traces from applications. The project’s goal is to standardize observability across various services…

Causely

Causely

July 10, 2024

Read more
Causely Overview

Causely Overview

Causely assures continuous reliability of cloud applications. Causely for Cloud-Native Applications, built on our Causal Reasoning Platform, automatically captures cause and effect relationships based on real-time, dynamic data across the entire application environment. This means that we can detect

Causely

Causely

June 13, 2024

Read more
Real-time Data & Modern UXs: The Power and the Peril When Things Go Wrong

Real-time Data & Modern UXs: The Power and the Peril When Things Go Wrong

Imagine a world where user experiences adapt to you in real time. Personalized recommendations appear before you even think of them, updates happen instantaneously, and interactions flow seamlessly. This captivating world is powered by real-time data, the lifeblood of modern…

Causely

Causely

June 7, 2024

Read more
Crossing the Chasm, Revisited

Crossing the Chasm, Revisited

Sometimes there’s a single book (or movie, podcast or Broadway show) that seems to define a particular time in your life. In my professional life, Geoffrey Moore’s Crossing the Chasm has always been that book. When I started my career…

Causely

Causely

May 30, 2024

Read more
Bridging the Gap Between Observability and Automation with Causal Reasoning

Bridging the Gap Between Observability and Automation with Causal Reasoning

Observability has become a growing ecosystem and a common buzzword. Increasing visibility with observability and monitoring tools is helpful, but stopping at visibility isn’t enough. Observability lacks causal reasoning and relies mostly on people to connect application issues with potential…

Causely

Causely

May 22, 2024

Read more
What is Causal AI & why do DevOps teams need it?

What is Causal AI & why do DevOps teams need it?

Causal AI can help IT and DevOps professionals be more productive, freeing hours of time spent troubleshooting so they can instead focus on building new applications. But when applying Causal AI to IT use cases, there are several domain-specific intricacies…

Causely

Causely

May 1, 2024

Read more
Building Startup Culture Isn’t Like It Used To Be

Building Startup Culture Isn’t Like It Used To Be

When does culture get established in a startup? I’d say the company’s DNA is set during the first year or two, and the founding team should do everything possible to make this culture intentional vs a series of disconnected decisions….

Causely

Causely

April 24, 2024

Read more
Assure application reliability with Causely

Assure application reliability with Causely

In this video, we’ll show how easy it is to continuously assure application reliability using Causely’s causal AI platform.   In a modern production microservices environment, the number of alerts from observability tooling can quickly amount to hundreds or even…

Causely

Causely

April 22, 2024

Read more
Cause and Effect: Solving the Observability Conundrum

Cause and Effect: Solving the Observability Conundrum

The pressure on application teams has never been greater. Whether for Cloud-Native Apps, Hybrid Cloud, IoT, or other critical business services, these teams are accountable for solving problems quickly and effectively, regardless of growing complexity. The good news? There’s a…

Causely

Causely

April 18, 2024

Read more
Fools Gold or Future Fixer: Can AI-powered Causality Crack the RCA Code for Cloud Native Applications?

Fools Gold or Future Fixer: Can AI-powered Causality Crack the RCA Code for Cloud Native Applications?

The idea of applying AI to determine causality in an automated Root Cause Analysis solution sounds like the Holy Grail, but it’s easier said than done. There’s a lot of misinformation surrounding RCA solutions. This article cuts the confusion and…

Causely

Causely

April 8, 2024

Read more

On security platforms

🎧 This Tech Tuesday Podcast features Endre Sara, Founding Engineer at Causely! Causely is bridging observability with automated orchestration for self-managed, resilient applications at scale. In this episode, Amir and Endre discuss leadership, how to make people’s lives easier by…

Causely

Causely

April 5, 2024

Read more
Dr. Shmuel Kliger on Causely, Causal AI, and the Challenging Journey to Application Health

Dr. Shmuel Kliger on Causely, Causal AI, and the Challenging Journey to Application Health

Dr. Shmuel Kliger, the founder of Causely.io, discusses his journey in the IT industry and the development of Causely. With a strong focus on reducing labor associated with IT operations, Dr. Kliger emphasizes the importance of understanding causality and building…

Causely

Causely

April 3, 2024

Read more
Are you ready to eat your own dogfood?

Are you ready to eat your own dogfood?

It’s a truism of all cloud SaaS companies that we should run our businesses on our own technology. After all, if this technology is so valuable and innovative that customers with dozens of existing vendors, tools and processes need to…

Causely

Causely

March 27, 2024

Read more
The Fast Track to Fixes: How to Turbo Charge Application Instrumentation & Root Cause Analysis

The Fast Track to Fixes: How to Turbo Charge Application Instrumentation & Root Cause Analysis

In the fast-paced world of cloud-native development, ensuring application health and performance is critical. The application of Causal AI, with its ability to understand cause and effect relationships in complex distributed systems, offers the potential to streamline this process. A…

Causely

Causely

March 19, 2024

Read more
Time to Rethink DevOps Economics? The Path to Sustainable Success

Time to Rethink DevOps Economics? The Path to Sustainable Success

As organizations transform their IT applications and adopt cloud-native architectures, scaling seamlessly while minimizing resource overheads becomes critical. DevOps teams can play a pivotal role in achieving this by embracing automation across various facets of the service delivery process. Automa

Causely

Causely

March 5, 2024

Read more
The Fellowship of the Uptime Ring: A Quest for Site Reliability

The Fellowship of the Uptime Ring: A Quest for Site Reliability

Reposted with permission from its original source on LinkedIn.  A digital chill swept through King Reginald as he materialized back in his royal chambers, having returned from the Cloud Economic World Forum. The summit had cast a long shadow, its…

Causely

Causely

February 28, 2024

Read more
Don’t Forget These 3 Things When Starting a Cloud Venture

Don’t Forget These 3 Things When Starting a Cloud Venture

I’ve been in the cloud since 2008. These were the early days of enterprise cloud adoption and the very beginning of hybrid cloud, which to this day remains the dominant form of enterprise cloud usage. Startups that deliver breakthrough infrastructure…

Causely

Causely

February 27, 2024

Read more
Mission Impossible? Cracking the Code of Complex Tracing Data

Mission Impossible? Cracking the Code of Complex Tracing Data

In this video, we’ll show how Causely leverages OpenTelemetry. (For more on how and why we use OpenTelemetry in our causal AI platform, read the blog from Endre Sara.)     Distributed tracing gives you a bird’s eye view of…

Causely

Causely

February 19, 2024

Read more
Cloud Cuckoo Calamity: The eBPF-Wielding Hobbit and the Root Cause Griffin Save the Realm!

Cloud Cuckoo Calamity: The eBPF-Wielding Hobbit and the Root Cause Griffin Save the Realm!

Reposted with permission from its source on LinkedIn.  The fate of the realm hangs in the balance. Join the mayhem in Cloud Cuckoo Calamity, the thrilling sequel to Data, Dragons & Digital Dreams. A mournful dirge echoed through the digital…

Causely

Causely

February 13, 2024

Read more
Eating Our Own Dog Food: Causely’s Journey with OpenTelemetry & Causal AI

Eating Our Own Dog Food: Causely’s Journey with OpenTelemetry & Causal AI

Implementing OpenTelemetry at the core of our observability strategy for Causely’s SaaS product was a natural decision. In this article I would like to share some background on our rationale and how the combination of OpenTelemetry and Causal AI addresses…

Causely

Causely

February 7, 2024

Read more
Data, Dragons & Digital Dreams: The Saga Of Microservice Manor

Data, Dragons & Digital Dreams: The Saga Of Microservice Manor

Reposted with permission from its source on LinkedIn.  In the bustling kingdom of Microservice Manor, where code flowed like rivers and servers hummed like contented bees, all was not well. Glitches lurked like mischievous sprites, transactions hiccuped like startled unicorns,…

Causely

Causely

January 30, 2024

Read more
Causely for asynchronous communication

Causely for asynchronous communication

Managing microservices-based applications at scale is challenging, especially when it comes to troubleshooting and pinpointing root causes. In a microservices-based environment, when a failure occurs, it causes a flood of anomalies across the entire system. Pinpointing the root cause can…

Causely

Causely

January 16, 2024

Read more
Moving Beyond Traditional RCA In DevOps

Moving Beyond Traditional RCA In DevOps

Reposted with permission from LinkedIn.  Modernization Of The RCA Process Over the past month, I have spent a significant amount of time researching what vendors and customers are doing in the devops space to streamline the process of root cause…

Causely

Causely

January 3, 2024

Read more
Root Cause Chronicles: Connection Collapse

Root Cause Chronicles: Connection Collapse

The below post is reposted with permission from its original source on the InfraCloud Technologies blog. This MySQL connection draining issue highlights the complexity of troubleshooting today’s complex environments, and provides a great illustration of the many rabbit holes SREs…

Causely

Causely

December 14, 2023

Read more
Understanding failure scenarios when architecting cloud-native applications

Understanding failure scenarios when architecting cloud-native applications

Developing and architecting complex, large cloud-native applications is hard. In this short demo, we’ll show how Causely helps to understand failure scenarios before something actually fails in the environment. In the demo environment we have a dozen applications with database…

Causely

Causely

December 8, 2023

Read more
Troubleshooting cloud-native applications with Causely

Troubleshooting cloud-native applications with Causely

Running large, complex, distributed cloud-native applications is hard. This short demo shows how Causely can help. In this environment, we are running a number of applications with database servers, caches, in a cluster, multiple services, pods, and containers. At any…

Causely

Causely

December 8, 2023

Read more
Unveiling the Causal Revolution in Observability

Unveiling the Causal Revolution in Observability

Reposted with permission from LinkedIn. OpenTelemetry and the Path to Understanding Complex Systems Decades ago, the IETF’s (Internet Engineering Task Force) developed an innovative protocol, SNMP, revolutionizing network management. This standardization spurred a surge of innovation, fostering a ne

Causely

Causely

December 6, 2023

Read more
All Sides of the Table

All Sides of the Table

Reflecting on the boardroom dynamics that truly matter This past month has been an eventful one. Like everyone in the tech world, I’m riveted by the drama unfolding at OpenAI, wondering how the board and CEO created such an extreme…

Causely

Causely

November 21, 2023

Read more
Navigating Kafka and the Challenges of Asynchronous Communication

Navigating Kafka and the Challenges of Asynchronous Communication

Welcome back to our series, “One Million Ways to Slow Down Your Application.” Having previously delved into the nuances of Postgres configurations, we now journey into the world of Kafka and asynchronous communication, another critical component of scalable applications. Kafka…

Causely

Causely

September 27, 2023

Read more
Why do this startup thing all over again? Our reasons for creating Causely

Why do this startup thing all over again? Our reasons for creating Causely

Why be a serial entrepreneur? It’s a question that my co-founder, Shmuel, and I are asked many times. Both of us have been to this rodeo twice before – Shmuel, with SMARTS and Turbonomic, myself with ClearSky Data and CloudSwitch….

Causely

Causely

June 29, 2023

Read more
Causely raises $8.8M in Seed funding to deliver IT industry’s first causal AI platform

Causely raises $8.8M in Seed funding to deliver IT industry’s first causal AI platform

Automation of causality will eliminate human troubleshooting and enable faster, more resilient cloud application management Boston, June 29, 2023 – Causely, Inc., the causal AI company, today announced it has raised $8.8M in Seed funding, led by 645 Ventures with participation…

Causely

Causely

June 29, 2023

Read more
One million ways to slow down your application response time and throughput

One million ways to slow down your application response time and throughput

This blog was originally posted on LinkedIn. Navigating the Perilous Waters of Misconfigured MaxOpenConnection in Postgres Applications Welcome to the inaugural post in our series, “One Million Ways to Slow Down Your Application Response Time and Throughput”. In this series,…

Causely

Causely

June 12, 2023

Read more
DevOps may have cheated death, but do we all need to work for the king of the underworld?

DevOps may have cheated death, but do we all need to work for the king of the underworld?

This blog was originally posted on LinkedIn. How causality can eliminate human troubleshooting Tasks that are both laborious and futile are described as Sisyphean. In Greek mythology, Sisyphus was the founder and king of Ephyra (now known as Corinth). Hades…

Causely

Causely

May 25, 2023

Read more