Quickstart: Traverse vertices and edges with the Gremlin console and Azure Cosmos DB for Apache Gremlin
APPLIES TO: Gremlin
Azure Cosmos DB for Apache Gremlin is a fully managed graph database service implementing the popular Apache Tinkerpop
, a graph computing framework using the Gremlin query language. The API for Gremlin gives you a low-friction way to get started using Gremlin with a service that can grow and scale out as much as you need with minimal management.
In this quickstart, you use the Gremlin console to connect to a newly created Azure Cosmos DB for Gremlin account.
Prerequisites
- An Azure account with an active subscription.
- No Azure subscription? Sign up for a Azure account.
- Docker host
- Don't have Docker installed? Try this quickstart in GitHub Codespaces.
- Azure Command-Line Interface (CLI)
Create an API for Gremlin account and relevant resources
The API for Gremlin account should be created prior to using the Gremlin console. Additionally, it helps to also have the database and graph in place.
Create shell variables for accountName, resourceGroupName, and location.
# Variable for resource group name resourceGroupName="msdocs-cosmos-gremlin-quickstart" location="westus" # Variable for account name with a randomly generated suffix let suffix=$RANDOM*$RANDOM accountName="msdocs-gremlin-$suffix"
If you haven't already, sign in to the Azure CLI using
az login
.Use
az group create
to create a new resource group in your subscription.az group create \ --name $resourceGroupName \ --location $location
Use
az cosmosdb create
to create a new API for Gremlin account with default settings.az cosmosdb create \ --resource-group $resourceGroupName \ --name $accountName \ --capabilities "EnableGremlin" \ --locations regionName=$location \ --enable-free-tier true
Get the API for Gremlin endpoint NAME for the account using
az cosmosdb show
.az cosmosdb show \ --resource-group $resourceGroupName \ --name $accountName \ --query "name"
Find the KEY from the list of keys for the account with
az-cosmosdb-keys-list
.az cosmosdb keys list \ --resource-group $resourceGroupName \ --name $accountName \ --type "keys" \ --query "primaryMasterKey"
Record the NAME and KEY values. You use these credentials later.
Create a database named
cosmicworks
usingaz cosmosdb gremlin database create
.az cosmosdb gremlin database create \ --resource-group $resourceGroupName \ --account-name $accountName \ --name "cosmicworks"
Create a graph using
az cosmosdb gremlin graph create
. Name the graphproducts
, then set the throughput to400
, and finally set the partition key path to/category
.az cosmosdb gremlin graph create \ --resource-group $resourceGroupName \ --account-name $accountName \ --database-name "cosmicworks" \ --name "products" \ --partition-key-path "/category" \ --throughput 400
Start and configure the Gremlin console using Docker
For the gremlin console, this quickstart uses the tinkerpop/gremlin-console
container image from Docker Hub. This image ensures that you're using the appropriate version of the console (3.4
) for connection with the API for Gremlin. Once the console is running, connect from your local Docker host to the remote API for Gremlin account.
Pull the
3.4
version of thetinkerpop/gremlin-console
container image.docker pull tinkerpop/gremlin-console:3.4
Create an empty working folder. In the empty folder, create a remote-secure.yaml file. Add this YAML configuration to the file.
hosts: [<account-name>.gremlin.cosmos.azure.cn] port: 443 username: /dbs/cosmicworks/colls/products password: <account-key> connectionPool: { enableSsl: true, sslEnabledProtocols: [TLSv1.2] } serializer: { className: org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerV2d0, config: { serializeResultToString: true } }
Note
Replace the
<account-name>
and<account-key>
placeholders with the NAME and KEY values obtained earlier in this quickstart.Open a new terminal in the context of your working folder that includes the remote-secure.yaml file.
Run the Docker container image in interactive (
--interactive --tty
) mode. Ensure that you mount the current working folder to the/opt/gremlin-console/conf/
path within the container.docker run -it --mount type=bind,source=.,target=/opt/gremlin-console/conf/ tinkerpop/gremlin-console:3.4
Within the Gremlin console container, connect to the remote (API for Gremlin) account using the remote-secure.yaml configuration file.
:remote connect tinkerpop.server conf/remote-secure.yaml
Create and traverse vertices and edges
Now that the console is connected to the account, use the standard Gremlin syntax to create and traverse both vertices and edges.
Add a vertex for a product with the following properties:
Value label product
id 68719518371
name
Kiama classic surfboard
price
285.55
category
surfboards
:> g.addV('product').property('id', '68719518371').property('name', 'Kiama classic surfboard').property('price', 285.55).property('category', 'surfboards')
Important
Don't foget the
:>
prefix. THis prefix is required to run the command remotely.Add another product vertex with these properties:
Value label product
id 68719518403
name
Montau Turtle Surfboard
price
600
category
surfboards
:> g.addV('product').property('id', '68719518403').property('name', 'Montau Turtle Surfboard').property('price', 600).property('category', 'surfboards')
Create an edge named
replaces
to define a relationship between the two products.:> g.V(['surfboards', '68719518403']).addE('replaces').to(g.V(['surfboards', '68719518371']))
Count all vertices within the graph.
:> g.V().count()
Traverse the graph to find all vertices that replaces the
Kiama classic surfboard
.:> g.V().hasLabel('product').has('category', 'surfboards').has('name', 'Kiama classic surfboard').inE('replaces').outV()
Traverse the graph to find all vertices that
Montau Turtle Surfboard
replaces.:> g.V().hasLabel('product').has('category', 'surfboards').has('name', 'Montau Turtle Surfboard').outE('replaces').inV()
Clean up resources
When you no longer need the API for Gremlin account, delete the corresponding resource group.
Create a shell variable for resourceGroupName if it doesn't already exist.
# Variable for resource group name resourceGroupName="msdocs-cosmos-gremlin-quickstart"
Use
az group delete
to delete the resource group.az group delete \ --name $resourceGroupName
How did we solve the problem?
Azure Cosmos DB for Apache Gremlin solved our problem by offering Gremlin as a service. With this offering, you aren't required to stand up your own Gremlin server instances or manage your own infrastructure. Even more, you can scale your solution as your needs grow over time.
To connect to the API for Gremlin account, you used the tinkerpop/gremlin-console
container image to run the gremlin console in a manner that didn't require a local installation. Then, you used the configuration stored in the remote-secure.yaml file to connect from the running container the API for Gremlin account. From there, you ran multiple common Gremlin commands.