DataHub MCP Server
The DataHub MCP Server implements the Model Context Protocol (MCP), which standardizes how applications provide context to LLMs and AI agents. This enables AI agents to query DataHub metadata and use it to find relevant assets, traverse lineage, and more.
Want to learn more about the motivation, architecture, and advanced use cases? Check out our deep dive blog post.
There's two ways to use the MCP server, which vary in setup required but offer the same capabilities.
- Managed MCP Server - Available on DataHub Cloud v0.3.12+
- Self-Hosted MCP Server - Available for DataHub Core
Capabilities
Search for Data
Find the right data to use for your projects & analysis by asking questions in plain English - skip the tribal knowledge.
Dive Deeper
Separate the signal from noise with rich context about your data, including usage, ownership, documentation, tags, and quality.
Lineage & Impact Analysis
Understand the impact of upcoming changes to tables, reports, and dashboards using DataHub’s end-to-end lineage graph.
Query Analysis & Authoring
Understand how your mission-critical data is typically queried, or build custom queries for your tables.
Works Where You Work
Seamlessly integrates with AI-native tools like Cursor, Windsurf, Claude Desktop, and OpenAI to supercharge your workflows.
Managed MCP Server Usage
For folks on DataHub Cloud v0.3.12+, you can use our hosted MCP server endpoint.
The managed MCP server endpoint is only available with DataHub Cloud v0.3.12+. For DataHub Core and older versions of DataHub Cloud, you'll need to self-host the MCP server.
Prerequisites
To connect to the MCP server, you'll need the following:
- The URL of your DataHub Cloud instance e.g.
https://<tenant>.acryl.io
- A personal access token
Your hosted MCP server URL is https://<tenant>.acryl.io/integrations/ai/mcp/?token=<token>
.
Configure
Claude Desktop
- Open your
claude_desktop_config.json
file. You can find it by navigating to Claude Desktop -> Settings -> Developer -> Edit Config. - Update the file to include the following content. Be sure to replace
<tenant>
and<token>
with your own values.
{
"mcpServers": {
"datahub-cloud": {
"command": "npx",
"args": [
"-y",
"mcp-remote",
"https://<tenant>.acryl.io/integrations/ai/mcp/?token=<token>"
]
}
}
}
Cursor
- Navigate to Cursor -> Settings -> Cursor Settings -> MCP -> add a new MCP server
- Enter the following into the file:
{
"mcpServers": {
"datahub-cloud": {
"url": "https://<tenant>.acryl.io/integrations/ai/mcp/?token=<token>"
}
}
}
- Once you've saved the file, confirm that the MCP settings page shows a green dot and a couple tools associated with the server.
Other
Most AI tools support remote MCP servers. For those, you'll typically need to:
- Provide the hosted MCP server URL:
https://<tenant>.acryl.io/integrations/ai/mcp/?token=<token>
- Ensure that the authentication mode is not set to "OAuth" (if applicable)
For AI tools that don't yet support remote MCP servers, you can use the mcp-remote
tool to connect to the MCP server.
- Command:
npx
- Args:
-y mcp-remote https://<tenant>.acryl.io/integrations/ai/mcp/?token=<token>
Self-Hosted MCP Server Usage
You can run the open-source MCP server locally.
Prerequisites
Install
uv
# On macOS and Linux.
curl -LsSf https://astral.sh/uv/install.sh | shFor authentication, you'll need the following:
- The URL of your DataHub instance's GMS endpoint; e.g.
http://<localhost>:8080
orhttps://<tenant>.acryl.io
- A personal access token
- The URL of your DataHub instance's GMS endpoint; e.g.
Configure
Claude Desktop
Run
which uvx
to find the full path to theuvx
command.Open your
claude_desktop_config.json
file. You can find it by navigating to Claude Desktop -> Settings -> Developer -> Edit Config.Update the file to include the following content. Be sure to replace
<tenant>
and<token>
with your own values.
{
"mcpServers": {
"datahub": {
"command": "<full-path-to-uvx>", // e.g. /Users/hsheth/.local/bin/uvx
"args": ["mcp-server-datahub"],
"env": {
"DATAHUB_GMS_URL": "<your-datahub-url>",
"DATAHUB_GMS_TOKEN": "<your-datahub-token>"
}
}
}
}
Cursor
- Navigate to Cursor -> Settings -> Cursor Settings -> MCP -> add a new MCP server
- Enter the following into the file:
{
"mcpServers": {
"datahub": {
"command": "uvx",
"args": ["mcp-server-datahub"],
"env": {
"DATAHUB_GMS_URL": "<your-datahub-url>",
"DATAHUB_GMS_TOKEN": "<your-datahub-token>"
}
}
}
}
- Once you've saved the file, confirm that the MCP settings page shows a green dot and a couple tools associated with the server.
Other
For other AI tools, you'll typically need to provide the following configuration:
- Command:
uvx
- Args:
mcp-server-datahub
- Env:
DATAHUB_GMS_URL
:<your-datahub-url>
DATAHUB_GMS_TOKEN
:<your-datahub-token>