Select persistent boot disk types for cluster nodes
You can select the persistent boot disk type when you create a cluster using the
Google Cloud console, Google Cloud CLI, or Dataproc API.
Console
You can create a cluster and select a standard, SSD, balanced Persistent Disk,
or Hyperdisk Balanced boot disk for manager (master), primary worker, and
secondary worker
cluster nodes from the Configure nodes panel on the Dataproc
Create a cluster page
of the Google Cloud console.
gcloud CLI
You can create a cluster and select a standard, SSD , balanced persistent boot disk,
or Hyperdisk balanced for manager (master), primary worker, and
secondary worker cluster nodes using the
gcloud dataproc clusters create
command with the --master-boot-disk-type,
--worker-boot-disk-type, and
--secondary-worker-boot-disk-type flags.
The default persistent boot disk type for Dataproc cluster manager (master) and
primary worker nodes is pd-standard. If the VM
machine type
supports only Hyperdisk Balanced as the boot disk,
the default boot disk is hyperdisk-balanced.
The default persistent boot disk type for cluster
secondary worker nodes is the primary worker node persistent boot disk type.
You can pass a value of pd-standard, pd-ssd,
pd-balanced, or hyperdisk-balanced to the
--master-boot-disk-type, --worker-boot-disk-type, and
--secondary-worker-boot-disk-type flags. Any of the valid
disk type values can be set on any cluster node type.
The default boot disk type for Dataproc cluster manager (master) and primary worker
nodes is pd-standard. If the VM machine type
supports only Hyperdisk Balanced as the boot disk,
the default boot disk is hyperdisk-balanced. The default boot disk type for
secondary worker nodes is the
primary work node boot disk type.
You can set a value of pd-standard, pd-ssd,
pd-balanced, or hyperdisk-balanced in the
InstanceGroupConfig.DiskConfig.bootDiskType
field in the masterConfig, workerConfig, and
secondaryWorkerConfig as part of a
cluster.create
API request. Any of the valid boot disk type type values can be set on any cluster node type.
Hyperdisk settings
When you create a cluster with a Hyperdisk Balanced volume
as the boot disk for a Dataproc cluster node, you can set the
provisioned IOPS and throughput.
Console
Hyperdisk Balanced is selected as the default
primary boot disk type for manager (master) and primary worker cluster nodes from the
Configure nodes panel on the Dataproc
Create a cluster page
of the Google Cloud console. You can set IOPS and throughput, or accept the default
values.
gcloud CLI
You can set provisioned IOPS and provisioned throughput for cluster nodes with the
hyperdisk-balanced boot disks using the
gcloud dataproc clusters create
command --master-boot-disk-provisioned-iops,
--worker-boot-disk-provisioned-iops,
--master-boot-disk-provisioned-throughput, and
--worker-boot-disk-provisioned-throughput flags.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-25 UTC."],[[["\u003cp\u003eYou can select between standard, SSD, balanced persistent disks, or hyperdisk balanced as boot disks for your Dataproc cluster nodes.\u003c/p\u003e\n"],["\u003cp\u003eThe Google Cloud console, gcloud CLI, and Dataproc API offer ways to choose the persistent boot disk type for master, primary worker, and secondary worker nodes during cluster creation.\u003c/p\u003e\n"],["\u003cp\u003eThe default boot disk type is \u003ccode\u003epd-standard\u003c/code\u003e, unless the VM machine type only supports hyperdisk, in which case \u003ccode\u003ehyperdisk-balanced\u003c/code\u003e is the default, and secondary workers default to the primary worker node's type.\u003c/p\u003e\n"],["\u003cp\u003eWhen using \u003ccode\u003ehyperdisk-balanced\u003c/code\u003e boot disks, you can use the gcloud CLI or the Dataproc API to set the provisioned IOPS and provisioned throughput settings for the master and worker nodes.\u003c/p\u003e\n"],["\u003cp\u003eYou are able to configure the size of the persistent boot disk as well when creating a new cluster.\u003c/p\u003e\n"]]],[],null,["# Dataproc boot disks\n\nYou select standard, SSD, balanced [Persistent Disk](/compute/docs/persistent-disks#disk-types),\nor [Google Cloud Hyperdisk Balanced](/compute/docs/disks/hd-types/hyperdisk-balanced) as boot disks for\nDataproc cluster nodes.\n\nSelect persistent boot disk types for cluster nodes\n---------------------------------------------------\n\nYou can select the persistent boot disk type when you create a cluster using the\nGoogle Cloud console, Google Cloud CLI, or Dataproc API. \n\n### Console\n\nYou can create a cluster and select a standard, SSD, balanced Persistent Disk,\nor [Hyperdisk Balanced](/compute/docs/disks/hd-types/hyperdisk-balanced) boot disk for manager (master), primary worker, and\n[secondary worker](/dataproc/docs/concepts/compute/secondary-vms)\ncluster nodes from the **Configure nodes** panel on the Dataproc\n[**Create a cluster**](https://console.cloud.google.com/dataproc/clustersAdd) page\nof the Google Cloud console.\n\n### gcloud CLI\n\nYou can create a cluster and select a standard, SSD , balanced persistent boot disk,\nor [Hyperdisk balanced](/compute/docs/disks/hd-types/hyperdisk-balanced) for manager (master), primary worker, and\n[secondary worker](/dataproc/docs/concepts/compute/secondary-vms) cluster nodes using the\n[gcloud dataproc clusters create](/sdk/gcloud/reference/dataproc/clusters/create)\ncommand with the `--master-boot-disk-type`,\n`--worker-boot-disk-type`, and\n`--secondary-worker-boot-disk-type` flags.\n\nThe default persistent boot disk type for Dataproc cluster manager (master) and\nprimary worker nodes is `pd-standard`. If the VM\n[machine type](/dataproc/docs/concepts/compute/supported-machine-types)\nsupports only [Hyperdisk Balanced](/compute/docs/disks/hd-types/hyperdisk-balanced) as the boot disk,\nthe default boot disk is `hyperdisk-balanced`.\nThe default persistent boot disk type for cluster\nsecondary worker nodes is the primary worker node persistent boot disk type.\n\nYou can pass a value of `pd-standard`, `pd-ssd`,\n`pd-balanced`, or `hyperdisk-balanced` to the\n`--master-boot-disk-type`, `--worker-boot-disk-type`, and\n`--secondary-worker-boot-disk-type` flags. Any of the valid\ndisk type values can be set on any cluster node type.\n**Example:** \n\n```\ngcloud dataproc clusters create CLUSTER_NAME \\\n --region=REGION \\\n --master-boot-disk-type=pd-ssd \\\n --worker-boot-disk-type=hyperdisk-balanced \\\n --secondary-worker-boot-disk-type=pd-standard \\\n other args ...\n```\np You can set the size of persistent boot disks using the [`--master-boot-disk-size`](/sdk/gcloud/reference/dataproc/clusters/create#--master-boot-disk-size), [`--worker-boot-disk-size`, and](/sdk/gcloud/reference/dataproc/clusters/create#--worker-boot-disk-size)[`--secondary-worker-boot-disk-size`](/sdk/gcloud/reference/dataproc/clusters/create#--secondary-worker-boot-disk-size) flags.\n\n### REST API\n\nThe default boot disk type for Dataproc cluster manager (master) and primary worker\nnodes is `pd-standard`. If the VM [machine type](/dataproc/docs/concepts/compute/supported-machine-types)\nsupports only [Hyperdisk Balanced](/compute/docs/disks/hd-types/hyperdisk-balanced) as the boot disk,\nthe default boot disk is `hyperdisk-balanced`. The default boot disk type for\n[secondary worker](/dataproc/docs/concepts/compute/secondary-vms) nodes is the\nprimary work node boot disk type.\n\nYou can set a value of `pd-standard`, `pd-ssd`,\n`pd-balanced`, or `hyperdisk-balanced` in the\n[`InstanceGroupConfig.DiskConfig.bootDiskType`](/dataproc/docs/reference/rest/v1/InstanceGroupConfig#DiskConfig)\nfield in the `masterConfig`, `workerConfig`, and\n`secondaryWorkerConfig` as part of a\n[`cluster.create`](/dataproc/docs/reference/rest/v1/projects.regions.clusters/create)\nAPI request. Any of the valid boot disk type type values can be set on any cluster node type.\n| Note: Set the per-node boot disk **size** using the [`InstanceGroupConfig.DiskConfig.bootDiskSizeGb`](/dataproc/docs/reference/rest/v1/InstanceGroupConfig#DiskConfig) field in the master, worker, or secondary worker config.\n\n### Hyperdisk settings\n\nWhen you create a cluster with a Hyperdisk Balanced volume\nas the boot disk for a Dataproc cluster node, you can set the\n[provisioned IOPS and throughput](/compute/docs/disks/hd-types/hyperdisk-balanced#about-provisioned-performance-for-hyperdisk-balanced). \n\n### Console\n\n[Hyperdisk Balanced](/compute/docs/disks/hd-types/hyperdisk-balanced) is selected as the default\nprimary boot disk type for manager (master) and primary worker cluster nodes from the\n**Configure nodes** panel on the Dataproc\n[**Create a cluster**](https://console.cloud.google.com/dataproc/clustersAdd) page\nof the Google Cloud console. You can set IOPS and throughput, or accept the default\nvalues.\n\n### gcloud CLI\n\nYou can set provisioned IOPS and provisioned throughput for cluster nodes with the\nhyperdisk-balanced boot disks using the\n[gcloud dataproc clusters create](/sdk/gcloud/reference/dataproc/clusters/create)\ncommand `--master-boot-disk-provisioned-iops`,\n`--worker-boot-disk-provisioned-iops`,\n`--master-boot-disk-provisioned-throughput`, and\n`--worker-boot-disk-provisioned-throughput` flags.\n**Example:** \n\n```\n gcloud dataproc clusters create CLUSTER_NAME \\\n --region=REGION \\\n --master-boot-disk-type=hyperdisk-balanced \\\n --master-boot-disk-provisioned-iops=MASTER_BOOT_DISK_IOPS \\\n --master-boot-disk-provisioned-throughput=MASTER_BOOT_DISK_THROUGHPUT \\\n --worker-boot-disk-type=hyperdisk-balanced \\\n --worker-boot-disk-provisioned-iops=WORKER_BOOT_DISK_IOPS \\\n --worker-boot-disk-provisioned-throughput=WORKER_BOOT_DISK_THROUGHPUT \\\n other args ...\n \n```\n| Secondary workers are configured with primary worker boot disk settings.\n\n### REST API\n\nYou can set provisioned IOPS and provisioned throughput for cluster nodes with\nHyperdisk boot disks using the [`InstanceGroupConfig.DiskConfig.bootDiskProvisionedIops`](/dataproc/docs/reference/rest/v1/InstanceGroupConfig#DiskConfig.FIELDS.boot_disk_provisioned_iops) and [`InstanceGroupConfig.DiskConfig.bootDiskProvisionedThroughput`](/dataproc/docs/reference/rest/v1/InstanceGroupConfig#DiskConfig.FIELDS.boot_disk_provisioned_throughput) fields for\nthe manager (master) and worker configs."]]