A challenge for many teams continues to be managing cost, governance, and reliability across an ever-larger footprint. This year’s DASH announcements help teams operate efficiently at scale, with new tools to cut cloud and AI spend, eliminate waste automatically, maintain observability during outages, and manage many organizations and agents as a single unit.
Whether you’re attributing AI spend across providers, automating cost optimization within guardrails you define, keeping observability online through a cloud outage with Disaster Recovery, or storing and searching logs at petabyte scale in your own infrastructure, these features help you control complexity and cost without slowing your teams down. Review everything new for operating at scale below, and read our other roundup posts for the latest in AI, observability, and security.
Run Datadog reliably at scale
Maintain observability during cloud outages with Datadog Disaster Recovery
Cloud provider outages can leave teams without visibility into production systems during active incidents. Datadog Disaster Recovery (DDR) lets you configure a secondary Datadog site ahead of time, automatically replicates more than 30 resource types, including dashboards, monitors, and users on a regular schedule, and activates on demand when your primary site is impacted. Failover can be triggered via Fleet Automation and Remote Configuration for Agent-based cutover, or via a dedicated DNS intake endpoint that routes traffic without changes to your Agent fleet. DDR is now generally available. To enable DDR for your organization, contact your Datadog account manager, or read the blog post to learn more.











