Dataplex Universal Catalog is a unified, intelligent governance solution for data and AI
assets in Google Cloud. Through Dataplex Universal Catalog,
you can use AI to simplify data queries, quality assurance, and business
insights.
Dataplex Universal Catalog performs governance at scale. For example, consider a
global retail company that generates large amounts of sales, inventory, and
customer data that's stored in Cloud Storage, Spanner, and
Pub/Sub. With data distributed across systems, it can be complex and
time-consuming to manage governance, ensure quality, and maintain compliance.
Dataplex Universal Catalog simplifies this process by providing a central view to
discover, profile, validate, track the lineage of, and control access to
organizational data assets.
Why use Dataplex Universal Catalog?
Dataplex Universal Catalog governs data through the following features:
Metadata cataloging. Retrieve metadata
for Google Cloud resources (in BigQuery, Cloud SQL,
Spanner, Vertex AI, Pub/Sub,
Dataform, and Dataproc Metastore) and for third-party resources that you
bring into Dataplex Universal Catalog, giving you a snapshot of your data assets.
Data discovery. Scan for structured
and unstructured data in Cloud Storage buckets to extract and catalog
their metadata.
Data insights. Use AI to generate natural
language questions about your data that help you uncover patterns, assess
data quality, and perform statistical analyses.
Data profiling. Identify common
characteristics of the column data in your BigQuery tables, for
example, typical data values, data distribution, and null counts, which can
inform data classification and quality assurance.
Data quality. Define and
measure the quality of the data in your BigQuery tables, by
validating data against organizational policies and logging alerts if data
doesn't meet quality criteria.
Business glossary. Manage
business-related terminology and definitions across your organization, and
attach terms to table columns to promote a consistent understanding of data
usage.
Data lineage. Track how data moves
through your systems: where it comes from, where it is passed to, and what
transformations are applied to it.
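To make the data lineage idea above concrete, here is a minimal sketch in plain Python. It is not the Dataplex Universal Catalog API: the `LineageGraph` class, the asset names, and the job names are all hypothetical and exist only to illustrate how "where data comes from" and "where it is passed to" can be modeled as a directed graph whose edges carry the transformation.

```python
from collections import defaultdict, deque


class LineageGraph:
    """Toy lineage graph (hypothetical, not a Dataplex API).

    Nodes are asset names; each edge records the process that
    transformed the source asset into the target asset.
    """

    def __init__(self):
        self._downstream = defaultdict(list)  # source -> [(target, process)]
        self._upstream = defaultdict(list)    # target -> [(source, process)]

    def add_link(self, source, target, process):
        """Record that `process` read `source` and wrote `target`."""
        self._downstream[source].append((target, process))
        self._upstream[target].append((source, process))

    @staticmethod
    def _walk(start, edges):
        """Breadth-first traversal that returns every reachable asset once."""
        seen, queue, order = {start}, deque([start]), []
        while queue:
            node = queue.popleft()
            for neighbor, _process in edges.get(node, []):
                if neighbor not in seen:
                    seen.add(neighbor)
                    order.append(neighbor)
                    queue.append(neighbor)
        return order

    def upstream(self, asset):
        """All assets that `asset` is derived from, nearest first."""
        return self._walk(asset, self._upstream)

    def downstream(self, asset):
        """All assets that are derived from `asset`, nearest first."""
        return self._walk(asset, self._downstream)


# Hypothetical assets and jobs for a retail pipeline.
graph = LineageGraph()
graph.add_link("raw_sales", "clean_sales", "dedupe_job")
graph.add_link("clean_sales", "sales_report", "aggregate_job")
graph.add_link("inventory", "sales_report", "aggregate_job")

print(graph.upstream("sales_report"))
print(graph.downstream("raw_sales"))
```

Walking upstream answers "which sources feed this report?", and walking downstream answers "what is affected if this table changes?".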
Dataplex Universal Catalog supports an end-to-end data lifecycle, from distributed
discovery to business insights. Governance features are also available through
BigQuery.
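The data profiling and data quality features listed above can be pictured with a small, self-contained Python sketch. It only illustrates the kind of statistics a profile contains (null counts, distinct values, typical values) and how a quality rule might turn a profile into an alert. The function names, the sample column, and the 10% null threshold are assumptions made for this example, not the product's behavior or API.

```python
from collections import Counter


def profile_column(values):
    """Summarize one column: row count, nulls, distinct values, top values."""
    non_null = [v for v in values if v is not None]
    counts = Counter(non_null)
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "distinct_count": len(counts),
        "top_values": counts.most_common(3),
    }


def check_quality(profile, max_null_ratio):
    """Return alert messages for each (hypothetical) rule the column fails."""
    alerts = []
    rows = profile["row_count"]
    null_ratio = profile["null_count"] / rows if rows else 0.0
    if null_ratio > max_null_ratio:
        alerts.append(
            f"null ratio {null_ratio:.2f} exceeds {max_null_ratio:.2f}"
        )
    return alerts


# Hypothetical sample of a "country" column; None stands for NULL.
country = ["US", "DE", None, "US", "FR", None, "US", "DE"]

profile = profile_column(country)
alerts = check_quality(profile, max_null_ratio=0.10)

print(profile)
print(alerts)
```

In this sketch the profile informs the rule: once you know that a quarter of the values are null, you can decide whether a 10% limit is a sensible quality criterion for that column.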
Use cases
You can use Dataplex Universal Catalog to do the following:
Discover and understand your data. Dataplex Universal Catalog
provides visibility over your data resources across the organization. It lets
you find resources that are relevant to your data consumption needs, and it
provides context that helps you judge whether a resource suits your data
consumers' needs.
Enable data governance and data management. Dataplex Universal Catalog
supplies metadata that can inform and power your data governance and data
management capabilities.
Maintain an extensible and comprehensive repository for your metadata.
Dataplex Universal Catalog stores and provides access to metadata that
is automatically harvested from your Google Cloud resources. You can
integrate your own metadata from non-Google Cloud systems. You can enrich all
metadata with additional business and technical metadata annotations.
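As a rough illustration of the metadata repository use case above, the following Python sketch models a catalog as a list of entries that can come from any system, can be enriched with annotations, and can be searched by keyword. Everything here is hypothetical: `CatalogEntry`, `search`, the entry names, and the annotation keys are invented for the example and do not reflect the Dataplex Universal Catalog data model or API.

```python
from dataclasses import dataclass, field


@dataclass
class CatalogEntry:
    """Hypothetical catalog record for one data resource."""
    name: str
    system: str                      # where the resource lives
    description: str = ""
    annotations: dict = field(default_factory=dict)  # business/technical tags


def search(entries, keyword):
    """Return names of entries whose name, description, or annotation
    values contain the keyword (case-insensitive)."""
    keyword = keyword.lower()
    hits = []
    for entry in entries:
        haystack = [entry.name, entry.description]
        haystack.extend(str(value) for value in entry.annotations.values())
        if any(keyword in text.lower() for text in haystack):
            hits.append(entry.name)
    return hits


entries = [
    # Imagine this entry was harvested automatically from a cloud resource.
    CatalogEntry("sales.orders", "bigquery", "Daily order facts"),
    # Imagine this entry was integrated from a system outside Google Cloud.
    CatalogEntry("crm.customers", "on-prem-postgres", "Customer master data"),
]

# Enrich an existing entry with business and technical annotations.
entries[1].annotations["owner"] = "retail-data-team"
entries[1].annotations["sensitivity"] = "pii"

print(search(entries, "pii"))
print(search(entries, "order"))
```

The point of the sketch is that annotations added after ingestion become searchable alongside the harvested metadata, which is what makes the repository extensible.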
Get started
If this is your first time working with Dataplex Universal Catalog, consider
following a quickstart:
Track data lineage for a BigQuery table (/dataplex/docs/track-lineage-quickstart)
What's next
Learn about metadata management in Dataplex Universal Catalog (/dataplex/docs/catalog-overview#catalog-model).
Learn how to search for data assets (/dataplex/docs/search-assets).
Learn how to manage entries and ingest custom sources (/dataplex/docs/ingest-custom-sources).
Learn how to import metadata into Dataplex Universal Catalog (/dataplex/docs/managed-connectivity-overview).
Learn about BigQuery governance (/bigquery/docs/data-governance).
Last updated 2025-08-25 UTC.