Cloud

How to Connect Azure Databricks to Data Lake Using SAS Token

Azure Databricks is a powerful platform for big data analytics and machine learning, and Azure Data Lake Storage (ADLS) provides a highly scalable and secure data lake for big data analytics. In this blog post, we will explore how to connect Azure Databricks to Azure Data Lake using a Shared Access Signature (SAS) token and how to list the contents of an Azure Data Lake container, specifically focusing on CSV file data.

Prerequisites

  • An active Azure subscription.
  • Azure Databricks workspace.
  • Azure Data Lake Storage Gen2 account with a container.

Step 1: Generate a SAS Token for Azure Data Lake

  • Navigate to your Azure Data Lake Storage account in the Azure portal.
  • Select the container you want to connect to.
  • Click on ‘Generate SAS’ using 3 dot.

  • Set the required permissions, expiry time, and other parameters.

A screenshot of a computer

Description automatically generated

  • Click ‘Generate SAS token and URL’.
  • Copy the SAS token (the part after? in the generated URL).

Step 2: Configure Azure Databricks

  1. Launch your Azure Databricks workspace.
  2. Create a new cluster or use an existing one.

Step 3: Connect to Azure Data Lake Storage Using SAS Token

To connect Azure Databricks to your Azure Data Lake Storage using a SAS token, you need to configure the SAS token and use it in your Databricks notebook.

  1. Open a new Databricks notebook.
  2. Set the SAS token and the storage account details as follows

spark.conf.set("fs.azure.account.auth.type..dfs.core.windows.net", "SAS")
spark.conf.set("fs.azure.sas.token.provider.type..dfs.core.windows.net", "org.apache.hadoop.fs.azurebfs.sas.FixedSASTokenProvider")
spark.conf.set("fs.azure.sas.fixed.token..dfs.core.windows.net", "SAS_TOKEN")

  • with the Azure Storage account name.
  • with the Azure Databricks secret scope name.
  • with the name of the key containing the Azure storage SAS token.

Below is the example

Step 4: List Contents of the Azure Data Lake Container

2 thoughts on “How to Connect Azure Databricks to Data Lake Using SAS Token

  1. porn2021.biz says:

    you’re really a golod webmaster. Thee wweb sie loadingg speedd iis incredible.
    It kind of feelos tuat yyou are doin anny unique
    trick. Also, Thee contentss arre masterpiece.
    you hav performedd a wonderful joob on thijs topic!

    My web-site … porn2021.biz

  2. Rosetta says:

    Heyy there! Do you know iff they make anyy lugins tto hekp
    with Searcdh Engune Optimization? I’m tryig too geet mmy bloog tto rannk foor some targetged keyhwords buut I’mnot seeding very goo gains.
    If yyou know oof anny plesase share. Appreciste it!

    Feell freee too suf to myy site: xmxxtube.com (Rosetta)

Leave a Reply

Your email address will not be published. Required fields are marked *