
Azure Databricks Deployment Documentation

Production-ready Azure Databricks workspace deployments with Unity Catalog, networking security, and enterprise features.


🚀 Quick Start

New to this repo? Start here:

  1. Quickstart Guide - Deploy your first workspace in 15 minutes
  2. Deployment Checklist - Prerequisites and preparation
  3. Choose Your Pattern - Select based on your requirements

📋 Deployment Patterns

Choose the pattern that matches your security and connectivity requirements:

Non-Private Link (Non-PL)

Best for: Most deployments, cost-effective, simpler setup

Use when: Standard security requirements, public control plane acceptable

Full Private

Best for: Highly regulated environments, complete isolation

Use when: Zero-trust network, compliance requirements, complete air-gapping needed


🔧 Module Reference

Deep-dive into each infrastructure component:

  1. Networking - VNet, subnets, NSG, NAT Gateway
  2. Workspace - Databricks workspace configuration
  3. Unity Catalog - Metastore, storage credentials, external locations
  4. Network Connectivity (NCC) - Serverless compute networking
  5. Customer-Managed Keys (CMK) - Encryption with your own keys
  6. Service Endpoint Policy (SEP) - Storage egress control

📚 How-To Guides

Step-by-step instructions for specific tasks:

Serverless Setup

Enable Databricks SQL Warehouses and Serverless Notebooks with:

Applies to: Both Non-PL and Full-Private patterns


🛠️ Additional Documentation

Core Documentation

Configuration


📖 Documentation Structure

docs/
├── 01-QUICKSTART.md                 # Start here
├── 02-DEPLOYMENT-CHECKLIST.md       # Pre-deployment prep
├── 03-TRAFFIC-FLOWS.md              # Network architecture
├── 04-TROUBLESHOOTING.md            # Problem solving
│
├── patterns/                        # Deployment patterns
│   ├── 01-NON-PL.md                 # Non-Private Link
│   └── 02-FULL-PRIVATE.md           # Full Private
│
├── modules/                         # Component deep-dives
│   ├── 01-NETWORKING.md
│   ├── 02-WORKSPACE.md
│   ├── 03-UNITY-CATALOG.md
│   ├── 04-NCC.md
│   ├── 05-CMK.md
│   └── 06-SEP.md
│
└── guides/                          # How-to guides
    └── 01-SERVERLESS-SETUP.md

🎯 Common Workflows

First-Time Deployment

# 1. Review prerequisites
Read: docs/02-DEPLOYMENT-CHECKLIST.md

# 2. Choose pattern
Non-PL: docs/patterns/01-NON-PL.md
Full-Private: docs/patterns/02-FULL-PRIVATE.md

# 3. Configure
Edit: deployments/<pattern>/terraform.tfvars

# 4. Deploy
cd deployments/<pattern>
terraform init
terraform apply

# 5. Verify
terraform output deployment_summary
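
Steps 3 to 5 above can be chained into one guarded helper. This is a minimal sketch, not part of the repo: `deploy_pattern` is a hypothetical function name, and the deployment directory is passed in by the caller rather than assumed. It adds an explicit `terraform plan -out=tfplan` step so the change set can be reviewed before it is applied.

```shell
# Sketch only: deploy_pattern is a hypothetical helper, not part of this repo.
# Usage: deploy_pattern <deployment-dir>
deploy_pattern() {
  deploy_dir="${1:-}"

  # Refuse to run without an existing deployment directory.
  if [ -z "$deploy_dir" ] || [ ! -d "$deploy_dir" ]; then
    echo "error: deployment directory not found: $deploy_dir" >&2
    return 1
  fi

  # Step 3 (Configure) must already be done.
  if [ ! -f "$deploy_dir/terraform.tfvars" ]; then
    echo "error: $deploy_dir/terraform.tfvars is missing" >&2
    return 1
  fi

  # Run in a subshell so the caller's working directory is unchanged.
  (
    cd "$deploy_dir" || exit 1
    terraform init &&
      terraform plan -out=tfplan &&
      terraform apply tfplan &&
      terraform output deployment_summary
  )
}
```

Call it as `deploy_pattern deployments/<pattern>`. Note that applying a saved plan file does not prompt for confirmation, so inspect the plan output first if you split the steps up.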

Enable Serverless

# After workspace deployment
Read: docs/guides/01-SERVERLESS-SETUP.md

# Configure storage firewall or Private Link
# Test with SQL Warehouse

Enable CMK Encryption

# Configure in terraform.tfvars
enable_cmk_managed_services = true
enable_cmk_managed_disks    = true
enable_cmk_dbfs_root        = true

# Apply changes
terraform apply

# Verify
terraform output customer_managed_keys
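
Before applying, it can help to confirm that all three flags are actually set in the variables file. The following is a minimal sketch under stated assumptions: `check_cmk_flags` is a hypothetical helper (not part of the repo), and it assumes each flag sits on its own line in `name = true` form, as in the snippet above.

```shell
# Sketch only: check_cmk_flags is a hypothetical helper, not part of this repo.
# Succeeds only if all three CMK flags are set to true in the given tfvars file.
check_cmk_flags() {
  cmk_file="$1"
  cmk_missing=0

  for cmk_flag in enable_cmk_managed_services enable_cmk_managed_disks enable_cmk_dbfs_root; do
    # Match "<flag> = true", optionally followed by whitespace or a comment.
    if ! grep -Eq "^[[:space:]]*${cmk_flag}[[:space:]]*=[[:space:]]*true([[:space:]]|#|\$)" "$cmk_file"; then
      echo "not enabled: $cmk_flag" >&2
      cmk_missing=1
    fi
  done

  return "$cmk_missing"
}
```

For example, `check_cmk_flags terraform.tfvars && terraform apply` would skip the apply if any of the three scopes is missing or set to `false`.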

Troubleshoot Issues

# Check troubleshooting guide
Read: docs/04-TROUBLESHOOTING.md

# Common issues:
# - Metastore deletion errors
# - Network connectivity
# - SEP/NCC destroy issues

πŸ” Finding What You Need

“I want to…”

“I need to understand…”


πŸ† Best Practices

✅ DO:

❌ DON’T:


💡 Tips for Success

For Beginners

  1. Start with Quickstart
  2. Use Non-PL pattern
  3. Keep default settings initially
  4. Follow the deployment checklist
  5. Consult the troubleshooting guide if stuck

For Advanced Users

  1. Review pattern comparison for requirements
  2. Customize networking via bring-your-own VNet (BYOV)
  3. Enable all security features (CMK, SEP, Private Link)
  4. Plan for serverless from the start
  5. Implement hub-spoke for multiple workspaces

For Production

  1. Use service principal authentication
  2. Enable all three CMK scopes
  3. Test destroy workflow first
  4. Document custom configurations
  5. Set up monitoring and alerting
  6. Plan for disaster recovery
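
For item 1, one common way to supply service principal credentials to Terraform is through the `ARM_*` environment variables that the `azurerm` provider reads. The sketch below assumes this repo's providers are configured to pick credentials up from the environment, which the source does not confirm; `require_sp_env` is a hypothetical helper name.

```shell
# Sketch only: require_sp_env is a hypothetical helper, not part of this repo.
# Fails if any service principal variable read by the azurerm provider is unset or empty.
require_sp_env() {
  sp_missing=0

  for sp_var in ARM_CLIENT_ID ARM_CLIENT_SECRET ARM_TENANT_ID ARM_SUBSCRIPTION_ID; do
    # Indirect variable lookup that works in POSIX sh as well as bash.
    eval "sp_value=\${$sp_var:-}"
    if [ -z "$sp_value" ]; then
      echo "missing environment variable: $sp_var" >&2
      sp_missing=1
    fi
  done

  return "$sp_missing"
}
```

Running `require_sp_env && terraform plan` in a pipeline stops early with a readable message instead of failing partway through provider authentication.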

📞 Getting Help

Documentation Resources

External Resources

Support Channels

  1. Check Troubleshooting Guide first
  2. Review checkpoint documents for similar issues
  3. Check provider documentation for recent changes
  4. Contact your platform team or Databricks support

🚀 Ready to Deploy?

Start with the Quickstart Guide and have your first workspace running in 15 minutes!