Connect to John Snow Labs
John Snow Labs provides production-grade, scalable, and trainable versions of the latest research in natural language processing (NLP) through the following products:
- Spark NLP: state-of-the-art NLP for Python, Java, or Scala.
- Spark NLP for Healthcare: state-of-the-art clinical and biomedical NLP.
- Spark OCR: a scalable, private, and highly accurate OCR and de-identification library.
You can integrate your Azure Databricks clusters with John Snow Labs.
Note
John Snow Labs does not integrate with Databricks SQL warehouses (formerly Databricks SQL endpoints).
Connect to John Snow Labs manually
Follow these instructions to automatically install the John Snow Labs NLP and OCR libraries and notebooks on your cluster, and to activate your trial of John Snow Labs if you do not already have a John Snow Labs account.
Requirements
Before you integrate with John Snow Labs, you must have the following:
- An Azure Databricks cluster in your Azure Databricks workspace.
- An Azure Databricks personal access token.
Note
As a security best practice, when you authenticate with automated tools, systems, scripts, and apps, Databricks recommends that you use personal access tokens belonging to service principals instead of workspace users. To create tokens for service principals, see Manage tokens for a service principal.
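Before you start the procedure, it can help to confirm that the token authenticates and that the workspace has a cluster to install on. The following Python sketch shows one way to do that. It assumes the workspace serves the Databricks Clusters API at /api/2.0/clusters/list and accepts the personal access token as a bearer token. The function names are illustrative and are not part of any Databricks or John Snow Labs tooling.

```python
import json
import urllib.request
from urllib.parse import urlsplit


def workspace_base_url(url: str) -> str:
    """Reduce a workspace URL to scheme and host, dropping any path or ?o= query."""
    parts = urlsplit(url.strip())
    if parts.scheme != "https" or not parts.netloc:
        raise ValueError(f"expected an https workspace URL, got {url!r}")
    return f"https://{parts.netloc}"


def build_clusters_request(workspace_url: str, token: str) -> urllib.request.Request:
    """Build an authenticated GET request for the Clusters API list endpoint."""
    return urllib.request.Request(
        workspace_base_url(workspace_url) + "/api/2.0/clusters/list",
        headers={"Authorization": f"Bearer {token}"},
    )


def list_clusters(workspace_url: str, token: str) -> list:
    """Call the workspace and return the cluster entries it reports.

    Performs a network request; a rejected token surfaces as
    urllib.error.HTTPError raised by urlopen.
    """
    request = build_clusters_request(workspace_url, token)
    with urllib.request.urlopen(request, timeout=30) as response:
        payload = json.load(response)
    # The response omits the "clusters" key when the workspace has no clusters.
    return payload.get("clusters", [])
```

Calling list_clusters with your workspace URL and token value returns the cluster entries that the workspace reports. An empty list suggests that you still need to create a cluster, and an HTTP error suggests that the token or URL is not valid for this workspace.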
Procedure
To integrate with John Snow Labs, complete these steps:
Make sure you meet the requirements for John Snow Labs.
Go to the John Snow Labs NLP on Databricks webpage.
Click Install in my Databricks account.
In the Please tell us about yourself dialog, enter your first name, last name, and company email address.
For Databricks instance url, enter your Azure Databricks workspace URL, for example https://adb-1234567890123456.7.databricks.azure.cn/?o=1234567890123456.
For Databricks access token, enter your token value from the requirements in this article.
Click Test connection.
After the connection succeeds, for Choose a cluster to install on, select the cluster from the requirements in this article.
Click Get Trial License.
Check your email inbox for a message from John Snow Labs that contains a request to validate your email address.
In the message, click Validate my email.
After a few minutes, check your email inbox again for another message from John Snow Labs that contains instructions about how to get started. In some cases, this message can take up to half an hour to arrive.
Follow the instructions in the message.
Note
To manually install the John Snow Labs libraries and notebooks on your cluster, see the following on the John Snow Labs website:
To upgrade your trial of John Snow Labs, sign in to your John Snow Labs account at https://my.johnsnowlabs.com/login.
Continue with Next steps.
Next steps
Explore one or more of the following resources on the John Snow Labs website: