Share via


Set up data quality for Fabric shortcut databases

Shortcuts are objects in Microsoft OneLake that point to other storage locations. The ___location can be internal or external to OneLake. The ___location that a shortcut points to is known as the target path of the shortcut.
The ___location where the shortcut appears is known as the shortcut path. Shortcuts appear as folders in OneLake and any workload or service that has access to OneLake can use them.

Shortcuts in OneLake allow you to unify your data across domains, clouds, and accounts by creating a single virtual data lake for your entire enterprise. All Microsoft Fabric experiences and analytical engines can directly connect to your existing data sources such as Azure, Amazon Web Services (AWS), and OneLake through a unified namespace. OneLake manages all permissions and credentials so you don't need to separately configure each Fabric workload to connect to each data source.

For more information about Fabric shortcuts, see the OneLake shortcuts documentation.

Configure data quality for Fabric shortcut databases

Sign in to your Microsoft Fabric workspace. Select the ellipsis button under Tables, and select New Shortcut. From here, you can create:

Screenshot of the Fabric workspace, with the new shortcut button highlighted.

Azure Data Lake Gen2 shortcut

  1. Select Azure Data Lake Storage Gen2 on the Fabric workspace New shortcut page.

    Screenshot of the Fabric  new shortcut page with ADLS Gen2 highlighted.

  2. Select ADLS Gen2 SAS authentication.

    Screenshot of  the new shortcut window with the SAS token authentication selected.

  3. Generate a SAS and connection string for your ADLS Gen2 resource in the Azure portal.

  4. Copy the endpoint of the data lake.

    Screenshot of copying the data lake end point in the Azure portal.

  5. Add storage details for the shortcut storage.

    Screenshot to add storage details to the Fabric shortcut in the new shortcut window.

  6. Navigate to and choose the correct delta folder.

    Screenshot to choose correct delta folder in the new shortcut window.

  7. Preview the shortcut delta table in your Fabric workspace.

    Screenshot of the OneLake delta table preview.

  8. Start a scan of your Azure Data Lake Gen2 resource in Microsoft Purview Data Map using service principal authentication.

    Screenshot of the data map scan for ADLS Gen2.

  9. When the scan finishes, your data asset appears in Microsoft Purview Unified Catalog as a Microsoft Fabric Lakehouse table.

  10. Associate the asset with a data product for curation and data quality assessment.

  11. In Unified Catalog, run a data quality scan or profile your data as usual.

Amazon S3 shortcut

  1. Select New shortcut in the Microsoft Fabric workspace.

  2. Select AWS S3 and add the URL, access key ID, and access key shortcut.

    Screenshot of the Amazon S3 new shortcut page with added details.

  3. Add the connection URL and storage details.

    Screenshot of the Amazon S3 new shortcut page with added connection URL and storage details.

  4. Preview the shortcut in your Fabric workspace.

  5. Start a scan of your Amazon S3 resource in Data Map using service principal authentication.

  6. When the scan finishes, your data asset appears in Unified Catalog.

  7. Associate the asset with a data product for curation and data quality assessment.

  8. In Unified Catalog, run a data quality scan or profile your data as usual.

Google Cloud Storage (GCS) shortcut

  1. Select New shortcut in the Fabric workspace.

  2. Select Google Cloud Storage and add the URL, access key ID, and access key shortcut.

    Screenshot of GCS shortcut HMAC key.

  3. Add the connection URL and storage details.

    Screenshot of GCS connection url.

  4. Preview the shortcut in your Fabric workspace.

  5. Start a scan of your Amazon S3 resource in Data Map using service principal authentication.

  6. When the scan finishes, your data asset appears in Unified Catalog.

  7. Associate the asset with a data product for curation and data quality assessment.

  8. In Unified Catalog, run a data quality scan or profile your data as usual.

Important

  • Use a service principal for Data Map scans and managed identity for data quality scans.
  • Any data sourced through a shortcut is processed in the same region.
  • The Fabric team needs to differentiate shortcut items from native items in the Microsoft OneLake SDK for Lakehouse subartifacts. For now, all shortcut items (tables and files) are considered as native items in scanning.