Azure Data Factory Linked Service Data Lake Storage Gen2

This page shows how to write Terraform and Azure Resource Manager for Data Factory Linked Service Data Lake Storage Gen2 and write them securely.

azurerm_data_factory_linked_service_data_lake_storage_gen2 (Terraform)

The Linked Service Data Lake Storage Gen2 in Data Factory can be configured in Terraform with the resource name azurerm_data_factory_linked_service_data_lake_storage_gen2. The following sections describe 10 examples of how to use the resource and its parameters.

Example Usage from GitHub

adf_adls_gen2_linkedservice.tf#L4
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "adf_adlsg2_linkedservice" {
  name                  = "adlsgen2_linked_service"
  resource_group_name   = azurerm_resource_group.synapse-experiments-rg.name
  data_factory_name     = azurerm_data_factory.azure-data-factory.name
  service_principal_id  = data.azurerm_client_config.current.client_id
  service_principal_key = "exampleKey"
module.tf#L1
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "linked_service_data_lake_storage_gen2" {
  name                     = var.name
  resource_group_name      = var.resource_group_name
  data_factory_name        = var.data_factory_name
  description              = try(var.description, null)
  integration_runtime_name = try(var.integration_runtime_name, null)
data-factory.tf#L22
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "gen2" {
  name                  = "storage"
  resource_group_name   = azurerm_resource_group.final_task.name
  data_factory_name     = azurerm_data_factory.pipeline.name
  url                   = "https://finalterraform.dfs.core.windows.net/"
  storage_account_key   = ""
module.tf#L1
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "linked_service_data_lake_storage_gen2" {
  name                     = var.name
  resource_group_name      = var.resource_group_name
  data_factory_name        = var.data_factory_name
  description              = try(var.description, null)
  integration_runtime_name = try(var.integration_runtime_name, null)
main.tf#L7
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "this" {
  additional_properties    = var.additional_properties
  annotations              = var.annotations
  data_factory_name        = var.data_factory_name
  description              = var.description
  integration_runtime_name = var.integration_runtime_name
main.tf#L7
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "this" {
  additional_properties    = var.additional_properties
  annotations              = var.annotations
  data_factory_name        = var.data_factory_name
  description              = var.description
  integration_runtime_name = var.integration_runtime_name
module.tf#L10
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "linked_service_data_lake_storage_gen2" {
  name                     = azurecaf_name.dataset.name
  resource_group_name      = var.resource_group_name
  data_factory_name        = var.data_factory_name
  description              = try(var.description, null)
  integration_runtime_name = try(var.integration_runtime_name, null)
data_factory.tf#L25
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "storage" {
  name                  = "sharedstorage"
  resource_group_name   = azurerm_resource_group.deploy.name
  data_factory_name     = local.azurerm_data_factory_name
  service_principal_id  = data.azurerm_client_config.current.client_id
  service_principal_key = "exampleKey"
module.tf#L1
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "linked_service_data_lake_storage_gen2" {
  name                     = var.name
  resource_group_name      = var.resource_group_name
  data_factory_name        = var.data_factory_name
  description              = try(var.description, null)
  integration_runtime_name = try(var.integration_runtime_name, null)
adfarchitect.tf#L81
resource "azurerm_data_factory_linked_service_data_lake_storage_gen2" "df-to-dl" {
  name                  = "datalakestorageacct"
  resource_group_name   = azurerm_resource_group.rg.name
  data_factory_name     = azurerm_data_factory.df.name
  service_principal_id  = data.azurerm_client_config.current.client_id
  service_principal_key = "exampleKey"

Review your Terraform file for Azure best practices

Shisho Cloud, our free checker to make sure your Terraform configuration follows best practices, is available (beta).

Parameters

Explanation in Terraform Registry

Manages a Linked Service (connection) between Data Lake Storage Gen2 and Azure Data Factory.

Note: All arguments including the service_principal_key will be stored in the raw state as plain-text. Read more about sensitive data in state.

Tips: Best Practices for The Other Azure Data Factory Resources

In addition to the azurerm_data_factory, Azure Data Factory has the other resources that should be configured for security reasons. Please check some examples of those resources and precautions.

risk-label

azurerm_data_factory

Ensure to disable public access

It is better to disable public access for Data Factory, which is enabled as default.

Review your Azure Data Factory settings

In addition to the above, there are other security points you should be aware of making sure that your .tf files are protected in Shisho Cloud.

Microsoft.DataFactory/factories/linkedservices (Azure Resource Manager)

The factories/linkedservices in Microsoft.DataFactory can be configured in Azure Resource Manager with the resource name Microsoft.DataFactory/factories/linkedservices. The following sections describe how to use the resource and its parameters.

Example Usage from GitHub

An example could not be found in GitHub.

Parameters

  • apiVersion required - string
  • name required - string

    The linked service name.

  • properties required
      • additionalProperties optional - object

        Unmatched properties from the message are deserialized this collection

      • annotations optional - array

        List of tags that can be used for describing the linked service.

      • connectVia optional
          • parameters optional - object

            An object mapping parameter names to argument values.

          • referenceName required - string

            Reference integration runtime name.

          • type required - string

            Type of integration runtime.

      • description optional - string

        Linked service description.

      • parameters optional - undefined

        Definition of all parameters for an entity.

  • type required - string

Frequently asked questions

What is Azure Data Factory Linked Service Data Lake Storage Gen2?

Azure Data Factory Linked Service Data Lake Storage Gen2 is a resource for Data Factory of Microsoft Azure. Settings can be wrote in Terraform.

Where can I find the example code for the Azure Data Factory Linked Service Data Lake Storage Gen2?

For Terraform, the ajith-ramanath/SynapseExperiments, anmoltoppo/Terraform and AbraMyrk/ITechArt_lab source code examples are useful. See the Terraform Example section for further details.