Note: Qwen models are not a Google product, and their availability in Vertex AI is subject to the terms for "Separate Offerings" in the AI/ML Services section of the Service Specific Terms, and to the separate terms found in the relevant model card.
Qwen models on Vertex AI are offered as fully managed, serverless APIs. To use a Qwen model on Vertex AI, send a request directly to the Vertex AI API endpoint. Because Qwen models use a managed API, there's no need to provision or manage infrastructure.
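As a concrete illustration, the following Python sketch sends a chat request to a Qwen model. It is a minimal sketch, not taken from this page: it assumes the Qwen models are served through Vertex AI's OpenAI-compatible chat completions endpoint (the pattern used for other Model-as-a-Service models) and that the model ID qwen/qwen3-coder-480b-a35b-instruct-maas matches the Model Garden card. Check the model card for the exact endpoint path, model ID, and supported regions.

```python
# Minimal sketch (not from this page): call a Qwen model through the
# OpenAI-compatible chat completions endpoint that Vertex AI exposes for
# Model-as-a-Service models. The endpoint path, region, and model ID
# "qwen/qwen3-coder-480b-a35b-instruct-maas" are assumptions inferred from
# the Model Garden card; verify them on the card before use.
import google.auth
import google.auth.transport.requests
import requests

PROJECT_ID = "your-project-id"   # replace with your project ID
LOCATION = "us-central1"         # replace with a region listed on the model card

# Get an access token from Application Default Credentials.
credentials, _ = google.auth.default(
    scopes=["https://www.googleapis.com/auth/cloud-platform"]
)
credentials.refresh(google.auth.transport.requests.Request())

url = (
    f"https://{LOCATION}-aiplatform.googleapis.com/v1/projects/{PROJECT_ID}"
    f"/locations/{LOCATION}/endpoints/openapi/chat/completions"
)
payload = {
    "model": "qwen/qwen3-coder-480b-a35b-instruct-maas",  # assumed model ID
    "messages": [
        {"role": "user", "content": "Write a Python function that reverses a string."}
    ],
}

response = requests.post(
    url, headers={"Authorization": f"Bearer {credentials.token}"}, json=payload
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```

Because the request is authenticated with an access token from Application Default Credentials, no model-specific infrastructure or API keys need to be provisioned.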
You can stream responses to reduce perceived latency for end users. A streamed response uses server-sent events (SSE) to return the response incrementally.
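To illustrate the SSE mechanism described above, here is a hedged streaming sketch that sets "stream": true and reads the event stream line by line. It relies on the same assumptions as the previous example (OpenAI-compatible endpoint, assumed model ID) plus the usual OpenAI-style SSE framing of "data: {...}" chunks terminated by "data: [DONE]", none of which is confirmed by this page.

```python
# Minimal streaming sketch (same assumptions as the previous example): request
# a streamed response and print partial content as server-sent events arrive.
import json

import google.auth
import google.auth.transport.requests
import requests

PROJECT_ID = "your-project-id"
LOCATION = "us-central1"

credentials, _ = google.auth.default(
    scopes=["https://www.googleapis.com/auth/cloud-platform"]
)
credentials.refresh(google.auth.transport.requests.Request())

url = (
    f"https://{LOCATION}-aiplatform.googleapis.com/v1/projects/{PROJECT_ID}"
    f"/locations/{LOCATION}/endpoints/openapi/chat/completions"
)
payload = {
    "model": "qwen/qwen3-coder-480b-a35b-instruct-maas",  # assumed model ID
    "messages": [
        {"role": "user", "content": "Explain server-sent events in two sentences."}
    ],
    "stream": True,  # ask the endpoint to stream the response as SSE
}

with requests.post(
    url,
    headers={"Authorization": f"Bearer {credentials.token}"},
    json=payload,
    stream=True,  # keep the HTTP connection open and read it incrementally
) as response:
    response.raise_for_status()
    for line in response.iter_lines():
        # SSE data lines look like b"data: {...}"; the stream ends with b"data: [DONE]".
        if not line or not line.startswith(b"data: "):
            continue
        chunk = line[len(b"data: "):]
        if chunk == b"[DONE]":
            break
        delta = json.loads(chunk)["choices"][0].get("delta", {})
        print(delta.get("content", "") or "", end="", flush=True)
    print()
```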
Available Qwen models
The following models are available from Qwen to use in
Vertex AI. To access a Qwen model, go to its
Model Garden model card.
Qwen3 Coder (Qwen3 Coder)
Qwen3 Coder (Qwen3 Coder) is a large-scale, open-weight model developed for advanced software development tasks. The model's key feature is its large context window, allowing it to process and understand large codebases comprehensively.

Go to the Qwen3 Coder model card: https://console.cloud.google.com/vertex-ai/publishers/qwen/model-garden/qwen3-coder-480b-a35b-instruct-maas
Qwen3 235B (Qwen3 235B)
Qwen3 235B (Qwen3 235B) is a large 235B-parameter model. The model is distinguished by its "hybrid thinking" capability, which allows users to dynamically switch between a methodical, step-by-step "thinking" mode for complex tasks like mathematical reasoning and coding, and a rapid "non-thinking" mode for general-purpose conversation. Its large context window makes it suitable for use cases requiring deep reasoning and long-form comprehension.

Go to the Qwen3 235B model card: https://console.cloud.google.com/vertex-ai/publishers/qwen/model-garden/qwen3-235b-a22b-instruct-2507-maas
Before you begin
To use Qwen models with Vertex AI, you must perform the following steps. The Vertex AI API (aiplatform.googleapis.com) must be enabled to use Vertex AI. If you already have an existing project with the Vertex AI API enabled, you can use that project instead of creating a new one.
1. Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.
2. In the Google Cloud console, on the project selector page, select or create a Google Cloud project.
3. Verify that billing is enabled for your Google Cloud project.
4. Enable the Vertex AI API (aiplatform.googleapis.com).
5. Go to the Model Garden model card for the Qwen model that you want to use, then click Enable.

After you complete these steps, you can optionally run the credential check sketched below before sending requests.
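The request examples earlier on this page authenticate with Application Default Credentials. The following optional sketch, which is not one of the documented steps, simply confirms that those credentials are configured and shows which project they resolve to.

```python
# Optional convenience check (not one of the documented steps): confirm that
# Application Default Credentials are configured and report the project they
# resolve to, since the request examples on this page rely on them.
import google.auth

credentials, project_id = google.auth.default(
    scopes=["https://www.googleapis.com/auth/cloud-platform"]
)
print(f"Application Default Credentials found; default project: {project_id}")
```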
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-27 UTC."],[],[],null,["# Qwen models\n\n| **Note:** Qwen models are not a Google product, and its availability in Vertex AI is subject to the terms for \"Separate Offerings\" in the AI/ML Services section of the [Service Specific\n| Terms](/terms/service-terms), and separate terms found in the relevant model card.\n\nQwen models on Vertex AI offer fully managed and serverless\nmodels as APIs. To use a Qwen model on Vertex AI, send\na request directly to the Vertex AI API endpoint. Because\nQwen models use a managed API, there's no need to provision or\nmanage infrastructure.\n\nYou can stream your responses to reduce the end-user latency perception. A\nstreamed response uses *server-sent events* (SSE) to incrementally stream the\nresponse.\n\nAvailable Qwen models\n---------------------\n\nThe following models are available from Qwen to use in\nVertex AI. To access a Qwen model, go to its\nModel Garden model card.\n\n### Qwen3 Coder (Qwen3 Coder)\n\nQwen3 Coder (`Qwen3 Coder`) is a large-scale, open-weight model\ndeveloped for advanced software development tasks. The model's key feature is\nits large context window, allowing it to process and understand large codebases\ncomprehensively.\n\n[Go to the Qwen3 Coder model card](https://console.cloud.google.com/vertex-ai/publishers/qwen/model-garden/qwen3-coder-480b-a35b-instruct-maas)\n\n### Qwen3 235B (Qwen3 235B)\n\nQwen3 235B (`Qwen3 235B`) is a large 235B parameter model. The model\nis distinguished by its \"hybrid thinking\" capability, which allows users to\ndynamically switch between a methodical, step-by-step \"thinking\" mode for\ncomplex tasks like mathematical reasoning and coding, and a rapid \"non-thinking\"\nmode for general-purpose conversation. Its large context window makes it\nsuitable for use cases requiring deep reasoning and long-form comprehension.\n\n[Go to the Qwen3 235B model card](https://console.cloud.google.com/vertex-ai/publishers/qwen/model-garden/qwen3-235b-a22b-instruct-2507-maas)\n\n### Before you begin\n\nTo use Qwen models with Vertex AI, you must perform the\nfollowing steps. The Vertex AI API\n(`aiplatform.googleapis.com`) must be enabled to use\nVertex AI. If you already have an existing project with the\nVertex AI API enabled, you can use that project instead of creating a\nnew project.\n\n- Sign in to your Google Cloud account. If you're new to Google Cloud, [create an account](https://console.cloud.google.com/freetrial) to evaluate how our products perform in real-world scenarios. 
New customers also get $300 in free credits to run, test, and deploy workloads.\n- In the Google Cloud console, on the project selector page,\n select or create a Google Cloud project.\n\n [Go to project selector](https://console.cloud.google.com/projectselector2/home/dashboard)\n-\n [Verify that billing is enabled for your Google Cloud project](/billing/docs/how-to/verify-billing-enabled#confirm_billing_is_enabled_on_a_project).\n\n-\n\n\n Enable the Vertex AI API.\n\n\n [Enable the API](https://console.cloud.google.com/flows/enableapi?apiid=aiplatform.googleapis.com)\n\n- In the Google Cloud console, on the project selector page,\n select or create a Google Cloud project.\n\n [Go to project selector](https://console.cloud.google.com/projectselector2/home/dashboard)\n-\n [Verify that billing is enabled for your Google Cloud project](/billing/docs/how-to/verify-billing-enabled#confirm_billing_is_enabled_on_a_project).\n\n-\n\n\n Enable the Vertex AI API.\n\n\n [Enable the API](https://console.cloud.google.com/flows/enableapi?apiid=aiplatform.googleapis.com)\n\n1. Go to one of the following Model Garden model cards, then click **Enable**."]]