Edit

Share via


Business continuity and disaster recovery

Note

As of September 3, 2025, the self-service disaster recovery feature has been expanded to include failover support for Dynamics 365 Contact Center. With this enhancement, organizations can now seamlessly initiate failover for their contact center environments, ensuring smooth execution of disaster recovery drills or continued operations from an alternate region, when needed.

Businesses expect their applications and customer data to be protected and resilient during unavoidable outages and disruptions. It's important to document a business continuity plan that minimizes the effects of outages. Make sure the plan lists stakeholders, processes, and specific steps to recover and resume operations.

Microsoft provides business continuity and disaster recovery capabilities to all production type environments in Dynamics 365 and Power Platform software as a service (SAAS) applications. This article describes how Microsoft keeps your production data resilient during outages.

The diagram shows a typical architecture of a geography that serves one or more countries or regions. Power Platform admins only need to know the geography ___location, but within each geography, Microsoft deploys more infrastructure to provide scale and extra protection for your data.

The diagram shows a typical architecture of a geography that serves a single or multiple countries/regions.

A geography has at least one Azure region, which usually includes three availability zones, but never fewer than two availability zones.

Built-in disaster recovery in-region with Azure availability zones

Infrastructure components like network, power, or cooling can fail unexpectedly, for example, because of a lightning strike, and can affect one or more datacenters. To ensure resilience, Microsoft deploys availability zones, so your environment is replicated across at least two distinct zones.

Microsoft automatically detects availability zone-level failures and switches to other availability zones in the region almost instantly to protect you from data loss, while keeping downtime near zero in most cases. This in-region capability is for production environments that host business-critical app processes and data. To avoid disruption, don't deploy production processes and data in nonproduction types like sandbox, developer, or trial environments.

Availability zones provide built-in resilience for seamless disaster recovery without manual intervention. Zone-redundant data services replicate data across multiple zones, so a failure in one zone doesn't affect data availability. The recovery point objective is near zero, and the recovery time objective is less than five minutes. If one zone fails, traffic is automatically rerouted to the remaining zones with minimal service disruption.

Backup of production environments

The transition to availability zones is a significant improvement over the earlier backup and failover process for Dynamics 365 and Power Platform workloads. Those workloads typically require contacting customer support for manual intervention. Your data and services stay highly available within the primary region, with built-in, real-time redundancy across multiple zones.

Key improvements include:

  • Always-on resilience: Your environments automatically replicate across multiple availability zones, so you don't need separate geo-secondary backups.
  • Faster recovery: Synchronous replication across zones lets failover within a region happen almost instantly, minimizing disruptions and data loss.
  • Seamless experience: Unlike traditional backups that require restoration, availability zones keep your environment continuously active.
  • Reduced support dependency: Automated failover within the primary region means you don't need to contact Microsoft support for most disaster recovery scenarios.

A limited number of customers in certain regions are transitioning to the improved architecture. Whether the region has transitioned or is transitioning, the service always keeps a backup of environment data in more than one data center.

Availability zones are far enough apart to reduce the chance of an outage affecting more than one zone, but close enough to keep low-latency connections to other availability zones. Availability zones are typically separated by several kilometers, and are usually within 100 kilometers.

Customers who need greater distance within a geography can use self-service disaster recovery to keep a copy in a secondary region. With this feature, customers control failover operations and run disaster recovery drills as described in the following section.

Cross-region self-service disaster recovery

Most geographies usually have region pairs separated by at least 300 miles when possible, to help protect your data in large-scale disasters.

Self-service disaster recovery is a Power Platform infrastructure capability that lets you replicate your environment across long distances and start environment failover between regions on your own.

You usually have multiple environments of different types in your tenant. This capability is available only for production environments.

To turn on self-service disaster recovery, make sure your environment is managed and linked to a pay-as-you-go billing plan. For more information about managed environments, go to Managed Environments.

Allow Virtual Network pairing for self-service disaster recovery in Dynamics 365

If your Dynamics 365 environment is deployed within a Virtual Network and you plan to use self-service disaster recovery, you must configure a Virtual Network pair. This pairing ensures that your primary and secondary environments can communicate securely during failover and failback operations. Without a Virtual Network pair, disaster recovery operations fail because network connectivity between regions can't be established.

For setup instructions, go to Set up virtual network support for Power Platform.

Turn on self-service disaster recovery

This action sets up resources and starts replicating data between the primary and secondary locations. The process can take up to 48 hours to finish. Admins get a notification when the process is done.

Turning on disaster recovery in an environment doesn't affect the environment or its data.

To turn on disaster recovery, follow these steps.

  1. Sign in to the Power Platform admin center as a system administrator.
  2. In the navigation pane, select Manage.
  3. In the Manage pane, select Environments. The Environments page shows.
  4. Select the production environment where you want to turn on self-service disaster recovery.
  5. Select Disaster Recovery in the command bar at the top of the page. The Disaster Recovery pane appears.
  6. Select the checkbox to turn on Disaster Recovery.
  7. Select Save.
  8. The environment briefly shows the Edit details page.
  9. The Environment details page shows that the process of turning on the feature has started.

You might also want to turn on disaster recovery for other events, like:

  • Disaster recovery drill
  • Emergency response for a major regional outage

Disaster recovery drills

Your company might have disaster recovery drills documented as a requirement in your internal business continuity plans. Some industries and companies might be required by government regulations to perform audits on their business continuity disaster recovery capabilities. In these cases, you can run a disaster recovery drill on an environment. A disaster recovery drill lets you do self-service disaster recovery without losing any data. The duration of the failover action can be slightly longer while all remaining data is replicated to the secondary region.

We recommend doing drills on a copy of a production environment, since this involves downtime that can last for minutes. For example, you might want to copy a production environment to a sandbox environment and then change the type from sandbox to production.

Emergency response failover

Choose this option during an emergency, when the primary region has an outage and you can't use environments or data. If you select this option, the environment fails and doesn't copy any more data except what's already replicated before the outage.

When you start an emergency response, you see the amount of data loss shown in time. Compare this to your recovery point objective to check if it's acceptable before you continue. The environment stays in a Running state until disaster recovery finishes and normal operation resumes from the secondary region.

Note

Database backups are not replicated to secondary regions for scenarios supported by self-service disaster recovery, unless the customer has explicitly allowed self-service disaster recovery. Without self-service disaster recovery, backups remain in the primary region only, which means cross-region failover can't be guaranteed. To ensure business continuity and compliance with your disaster recovery strategy, configure self-service disaster recovery for your environment.

Switch back to primary region

After you fix an outage or finish your drill, switch the environment back to its primary region. The environment can operate with limited resources in the paired region. You don't lose data during this action.

Environment disaster recovery status

Admins check the current disaster recovery state and ___location of an environment in the Environment details page. Admins also select Disaster Recovery in the command bar to open the Disaster Recovery pane.

To check data replication latency at any time, select Disaster Recovery, and then select Emergency response as the disaster recovery reason. This action opens a confirmation dialog that shows the last replication time between regions for that environment. You can select Cancel if your only purpose is to check the potential loss of data if there is a failover operation. Remember, the last sync time always changes because data is replicated continuously.

Document your business continuity plan

We recommend that you perform disaster recovery drills or an emergency response before a real disaster strikes, so you can document all steps required for any integration points that are external to Power Platform. Your company is then more prepared for recovery if there's a real disaster.

Frequently Asked Questions

What are the costs associated with using self-service disaster recovery?

  • Capacity charges are based on the consumption of the environment's paired region for Database, File, and Log storage types.
  • Capacity consumption is reflected in the familiar licensing experience within the Power Platform Admin Center. Learn more in View usage and billing information.

How does billing work for self-service disaster recovery?

  • If your environment is configured to draw capacity from your tenant's Dataverse capacity entitlement, then entitled capacity is consumed first and a pay-as-you-go billing plan is still required so you can avoid capacity overages.
  • Admins can allocate capacity to the environment, after which the pay-as-you-go plan will be billed.
  • Customers can't turn off the pay-as-you-go plan in the billing experience if self-service disaster recovery is turned on.

Can I switch regions during a regional outage?

In the case of a regional outage, the system supports failover to the designated secondary region as part of self-service disaster recovery, but it doesn't allow switching to any other arbitrary regions.

What should I know about the capacity experience?

  • When you allow self-service disaster recovery, you notice that more storage consumption is displayed in the Dataverse capacity graph, clearly indicating the extra capacity used by the cross-region backup.
  • When you don't allow self-service disaster recovery, the capacity graph shows standard usage without the extra storage for replication.
  • When the self-service disaster recovery is active, the capacity graph displays the extra consumption from cross-region replication, with a Disaster recovery active tag in the Dataverse capacity summary.

How do I disable self-service disaster recovery?

To disable self-service disaster recovery, go to the disaster recovery pane in Power platform admin center and uncheck the Turn on disaster recovery checkbox.

What happens when I disable self-service disaster recovery?

Disabling self-service disaster recovery deletes all replicated environment data in the paired region. You are prompted to confirm the environment's name before proceeding.

Can I disable self-service disaster recovery while in a paired region?

No, self-service disaster recovery can't be disabled while the environment is in a paired region. You must switch to the primary region first.

Are there any known limitations during a region-wide outage that self-service disaster recovery can't mitigate?

In the rare event of a region-wide outage, the following scenarios may experience temporary degradation, depending on the severity of the outage.

  • Power Automate flows impacted by the regional outage may not recover through self-service disaster recovery and remain unavailable until the primary region is restored.
  • Copilot Studio conversation requests may fail until Microsoft restores the service in the primary region.
  • In Dynamics 365, analytics and automation in Sales, real-time updates in Customer Insights, and case or knowledge base access in customer service may be unavailable.
  • Under field service, Resource Scheduling Optimization (RSO), may be impacted during a regional outage. For AI workloads, training and predictive analytics may fail in the secondary region.
  • Connectors may have recovery issues, when dependent on external systems like SharePoint or SQL.
  • Dynamics 365 Sales: Analytics, reporting, and functions dependent on automation (such as sales forecasting) are unavailable.
  • Finance and operations products aren't currently supported for self-serve disaster recovery during regional outages.