Kubernetes Active Remediation empowers users to detect issues, gain contextualized insights and make changes to repair Kubernetes environments—all within one integrated workflow
Datadog, Inc. Â the monitoring and security platform for cloud applications, today announced the launch of Kubernetes Active Remediation, which builds on Datadog’s automated troubleshooting capabilities to provide curated remediation guidance, best practices and end-to-end issue management for Kubernetes organizations.
Traditional monitoring tools track health and performance based on metrics, but teams need deeper insights and guidance to navigate the complexities of Kubernetes environments. Teams are also often slowed down by the limited availability of their organization’s internal Kubernetes experts. Without actionable recommendations, diagnosing the root cause of issues can be time consuming and resource intensive, leading to prolonged downtime and decreased operational efficiency.
Kubernetes Active Remediation offers the deeper insights at scale that teams need by providing a comprehensive overview of cluster-level resource problems ranked by their importance and relevance. After a problematic cluster or workload has been identified, teams can view consolidated troubleshooting information, including root cause analysis and recommended fixes. They can then directly trigger deployment patches for the key issues from within the Datadog platform.
“Today’s announcement builds on our launch of Kubernetes Autoscaling at DASH to empower users to detect issues, gain contextual insights and make changes to their Kubernetes resources directly from Datadog’s unified platform,” said Yrieix Garnier, VP of Product at Datadog. “Customers are looking for help to identify an issue, curate data about it and take steps to solve it. Kubernetes Active Remediation will help customers do just that so they can resolve issues faster.”
Kubernetes Active Remediation helps organizations:
- Automate root-cause analysis and detection: Recommendations are automatically issued with full contextual data and triaged to the correct owner. Curated remediation actions can also be pre-approved by the DevOps or security teams to automate the process downstream.
- Directly repair Kubernetes environments: Explanations and suggestions are provided based on troubleshooting patterns that are commonly seen in Kubernetes environments. Once the user has the full context and recommended next steps, they can directly trigger deployment patches from within Datadog to remediate the issue.
- Improve troubleshooting and remediation speed: By accelerating a user’s time to detect and resolve an issue, Datadog helps application development teams be more efficient, understand root causes and automate their remediation processes.