Get started with Azure Blob Storage and Python
This article shows you how to connect to Azure Blob Storage by using the Azure Blob Storage client library for Python. Once connected, use the developer guides to learn how your code can operate on containers, blobs, and features of the Blob Storage service.
If you're looking to start with a complete example, see Quickstart: Azure Blob Storage client library for Python.
API reference | Package (PyPi) | Library source code | Samples
Prerequisites
- Azure subscription - create one for trial
- Azure storage account - create a storage account
- Python 3.8+
Set up your project
This section walks you through preparing a project to work with the Azure Blob Storage client library for Python.
From your project directory, install packages for the Azure Blob Storage and Azure Identity client libraries using the pip install
command. The azure-identity package is needed for passwordless connections to Azure services.
pip install azure-storage-blob azure-identity
Then open your code file and add the necessary import statements. In this example, we add the following to our .py file:
from azure.identity import DefaultAzureCredential
from azure.storage.blob import BlobServiceClient, BlobClient, ContainerClient
Blob client library information:
- azure.storage.blob: Contains the primary classes (client objects) that you can use to operate on the service, containers, and blobs.
Asynchronous programming
The Azure Blob Storage client library for Python supports both synchronous and asynchronous APIs. The asynchronous APIs are based on Python's asyncio library.
Follow these steps to use the asynchronous APIs in your project:
Install an async transport, such as aiohttp. You can install
aiohttp
along withazure-storage-blob
by using an optional dependency install command. In this example, we use the followingpip install
command:pip install azure-storage-blob[aio]
Open your code file and add the necessary import statements. In this example, we add the following to our .py file:
import asyncio from azure.identity.aio import DefaultAzureCredential from azure.storage.blob.aio import BlobServiceClient, BlobClient, ContainerClient
The
import asyncio
statement is only required if you're using the library in your code. It's added here for clarity, as the examples in the developer guide articles use theasyncio
library.Create a client object using
async with
to begin working with data resources. Only the top level client needs to useasync with
, as other clients created from it share the same connection pool. In this example, we create aBlobServiceClient
object usingasync with
, and then create aContainerClient
object:async with BlobServiceClient(account_url, credential=credential) as blob_service_client: container_client = blob_service_client.get_container_client(container="sample-container")
To learn more, see the async examples in Authorize access and connect to Blob Storage.
Blob async client library information:
- azure.storage.blob.aio: Contains the primary classes that you can use to operate on the service, containers, and blobs asynchronously.
Authorize access and connect to Blob Storage
To connect an app to Blob Storage, create an instance of the BlobServiceClient class. This object is your starting point to interact with data resources at the storage account level. You can use it to operate on the storage account and its containers. You can also use the service client to create container clients or blob clients, depending on the resource you need to work with.
To learn more about creating and managing client objects, including best practices, see Create and manage client objects that interact with data resources.
You can authorize a BlobServiceClient
object by using a Microsoft Entra authorization token, an account access key, or a shared access signature (SAS). For optimal security, Azure recommends using Microsoft Entra ID with managed identities to authorize requests against blob data. For more information, see Authorize access to blobs using Microsoft Entra ID.
To authorize with Microsoft Entra ID, you need to use a security principal. Which type of security principal you need depends on where your app runs. Use the following table as a guide:
Where the app runs | Security principal | Guidance |
---|---|---|
Local machine (developing and testing) | Service principal | To learn how to register the app, set up a Microsoft Entra group, assign roles, and configure environment variables, see Authorize access using developer service principals |
Local machine (developing and testing) | User identity | To learn how to set up a Microsoft Entra group, assign roles, and sign in to Azure, see Authorize access using developer credentials |
Hosted in Azure | Managed identity | To learn how to enable managed identity and assign roles, see Authorize access from Azure-hosted apps using a managed identity |
Hosted outside of Azure (for example, on-premises apps) | Service principal | To learn how to register the app, assign roles, and configure environment variables, see Authorize access from on-premises apps using an application service principal |
Authorize access using DefaultAzureCredential
An easy and secure way to authorize access and connect to Blob Storage is to obtain an OAuth token by creating a DefaultAzureCredential instance. You can then use that credential to create a BlobServiceClient object.
The following example creates a BlobServiceClient
object using DefaultAzureCredential
:
def get_blob_service_client_token_credential(self):
# TODO: Replace <storage-account-name> with your actual storage account name
account_url = "https://<storage-account-name>.blob.core.chinacloudapi.cn"
credential = DefaultAzureCredential()
# Create the BlobServiceClient object
blob_service_client = BlobServiceClient(account_url, credential=credential)
return blob_service_client
If your project uses asynchronous APIs, instantiate BlobServiceClient
using async with
:
# TODO: Replace <storage-account-name> with your actual storage account name
account_url = "https://<storage-account-name>.blob.core.chinacloudapi.cn"
credential = DefaultAzureCredential()
async with BlobServiceClient(account_url, credential=credential) as blob_service_client:
# Work with data resources in the storage account
Build your app
As you build apps to work with data resources in Azure Blob Storage, your code primarily interacts with three resource types: storage accounts, containers, and blobs. To learn more about these resource types, how they relate to one another, and how apps interact with resources, see Understand how apps interact with Blob Storage data resources.
The following guides show you how to access data and perform specific actions using the Azure Storage client library for Python:
Guide | Description |
---|---|
Configure a retry policy | Implement retry policies for client operations. |
Copy blobs | Copy a blob from one location to another. |
Create a container | Create blob containers. |
Create a user delegation SAS | Create a user delegation SAS for a container or blob. |
Create and manage blob leases | Establish and manage a lock on a blob. |
Create and manage container leases | Establish and manage a lock on a container. |
Delete and restore blobs | Delete blobs and restore soft-deleted blobs. |
Delete and restore containers | Delete containers and restore soft-deleted containers. |
Download blobs | Download blobs by using strings, streams, and file paths. |
Find blobs using tags | Set and retrieve tags, and use tags to find blobs. |
List blobs | List blobs in different ways. |
List containers | List containers in an account and the various options available to customize a listing. |
Manage properties and metadata (blobs) | Get and set properties and metadata for blobs. |
Manage properties and metadata (containers) | Get and set properties and metadata for containers. |
Performance tuning for data transfers | Optimize performance for data transfer operations. |
Set or change a blob's access tier | Set or change the access tier for a block blob. |
Upload blobs | Learn how to upload blobs by using strings, streams, file paths, and other methods. |