Skip to main content

GLM Coding Plan Details

Q: GLM Coding Plan: Plan to Add Support for GLM-5 ? How can I switch between them? A: The Max and Pro plan currently support GLM-5. Subsequently, the Lite plan will also support GLM-5 once the iteration of new and old model resources is completed. At this stage, all plans support GLM-4.7 and legacy text models, and invoking GLM-5 will consume more plan quota than historical models.
Mapping between Claude Code internal model environment variables and GLM models, with the default configuration as follows:
  • ANTHROPIC_DEFAULT_OPUS_MODEL: GLM-4.7
  • ANTHROPIC_DEFAULT_SONNET_MODEL: GLM-4.7
  • ANTHROPIC_DEFAULT_HAIKU_MODEL: GLM-4.5-Air
Note: For users on plans that already support GLM-5, to use this model, you need to manually change the model to “GLM-5” in your custom configuration (e.g., ~/.claude/settings.json in Claude Code).

Q: Why is the quota consumed faster when using GLM-5? Is there a way to save quota? A: Since the parameter scale of GLM-5 has expanded more than twice compared to the previous generation model, while the model’s capabilities have significantly improved, it also consumes more computing resources. Therefore, during the current stage of resource iteration between new and old models, invoking GLM-5 will consume package quotas at a rate 3 times higher (compared to other historical models). Subsequently, we will adjust it downward based on resource expansion. Usage suggestions for more efficient quota utilization:
  • GLM-4.7 is comparable to Claude’s Sonnet-level model, and GLM-4.7 is sufficient for daily development and routine tasks.
  • GLM-5 is comparable to Claude’s Opus-level model, and in high-difficulty scenarios such as complex reasoning and large-scale engineering tasks, it switches to GLM-5. By reasonably combining different model capability levels, it is possible to more efficiently control quota consumption while ensuring effectiveness.
Recommended Configuration (taking Claude Code as an example):
  • Edit or add the settings.json file
  • MacOS & Linux: ~/.claude/settings.json
  • Windows user directory: /.claude/settings.json
  • Add or modify the env field inside
  • Note to replace the your_zhipu_api_key inside with the API Key you obtained in the previous step.
{
    "env": {
        "ANTHROPIC_AUTH_TOKEN": "your_Z.ai_api_key",
        "ANTHROPIC_BASE_URL": "https://api.z.ai/api/anthropic",
        "API_TIMEOUT_MS": "3000000",
        "CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC": 1,
        "ANTHROPIC_DEFAULT_OPUS_MODEL": "glm-5",
        "ANTHROPIC_DEFAULT_SONNET_MODEL": "glm-4.7",
        "ANTHROPIC_DEFAULT_HAIKU_MODEL": "glm-4.5-air"
    }
}

Q: How much usage quota does the plan provide? A: Subscribe once, unlock unmatched usage and unbeatable value.
  • Lite Plan: Up to ~80 prompts every 5 hours — about 3× the usage quota of the Claude Pro plan.
  • Pro Plan: Up to ~400 prompts every 5 hours — about 5× the usage quota of the Lite plan.
  • Max Plan: Up to ~1600 prompts every 5 hours — about 4× the usage quota of the Pro plan.
In terms of token consumption, each prompt typically allows 15–20 model calls, giving a total monthly allowance of tens of billions of tokens — all at only ~1% of standard API pricing, making it extremely cost-effective.
  • The numbers above are estimates; actual usage may vary depending on task complexity, codebase size, and whether auto-accept is enabled.
  • For users who subscribed before February 12, your token quota will remain unchanged for the duration of your current subscription cycle and will continue to follow the limits shown at the time of purchase.
  • For users who enabled auto-renewal before February 12, both the renewal price and the usage quota will remain unchanged and will continue to follow the limits shown at the time of your original subscription.

Q: Apart from the 5-hour usage limit, are there any other restrictions on usage? A: To manage resources and ensure fair access for all users, we have introduced a weekly usage quota limit. This quota starts counting from the time you place your order, and the quota is refreshed and reset on a 7-day cycle. You can view the progress of your quota consumption in the subscription. Users who subscribe and enable auto-renewal before February 12 will enjoy unlimited weekly usage for the duration of their subscription.
Q: If I am a Lite user and want to experience and call GLM-5, what should I do? A: The Lite plan currently does not include GLM-5 quota (we will gradually expand the scope and strive to enable more users to experience and use GLM-5). If you call GLM-5 under the plan endpoints, an error will be returned. If you still wish to experience GLM-5 at this stage and are willing to pay according to the Pricing, you can call it through the General API endpoint (i.e., https://api.z.ai/api/paas/v4/chat/completions), with the deduction priority being [Platform Credits - Account Balance] in sequence.
Q: Which coding tools are supported? A: The pack currently supports Claude Code, Roo Code, Kilo Code, Cline, OpenCode, Crush, Goose, OpenClawand more. Please refer to our tool guide for step-by-step setup. All supported coding tools share the same usage quota under your subscription.
Q: Can I use my GLM Coding Plan quota in non-AI coding tools? A: No. The GLM Coding Plan quota is only intended to be used within coding/IDE tools designated or recognized by Z.ai (such as Claude Code, Kilo Code, etc.) for personal code-assistance scenarios in your local IDE or terminal. If you want to integrate GLM models into your own applications, websites, bots, SaaS products or other systems via API, please use Z.ai’s enterprise-grade API / MaaS services and follow the corresponding terms and pricing.
Q: Can the plan only be used within supported coding tools? A: Yes. API calls are billed separately and do not use the Coding Plan quota.
Q: What happens when my plan quota runs out? Will the system consume my account balance? A: No. GLM-4.7 calls made in supported coding tools will only use your Coding Plan quota.
  • Once the quota is used up, you’ll need to wait until the next 5-hour cycle for it to refresh. The system will not deduct from your account balance.
  • Users subscribed to the Coding Plan can only make calls via the plan’s quota in supported tools. API calls outside the plan are not available.

MCP Call

Q:Which plan supports Vision Understanding, Web Search and Web Reader MCP tools? A: All plans support these MCP tools, but the Lite plan includes only a limited quota for trial purposes.
Q:What is the quota of Vision Understanding, Web Search and Web Reader MCP in the plan? A: The MCP quotas for the Lite, Pro and Max plans are as follows:
  • Lite Plan: Include a total of 100 web searches and web readers per month, along with the 5-hour maximum prompt resource pool of the package for vision understanding.
  • Pro Plan: Include a total of 1,000 web searches and web readers per month, along with the 5-hour maximum prompt resource pool of the package for vision understanding.
  • Max Plan: Include a total of 4,000 web searches and web readers per month, along with the 5-hour maximum prompt resource pool of the package for vision understanding.

Q:Besides using the GLM Coding Plan resource package, can I use other methods to call these three MCP tools for Vision Understanding, Web Search and Web Reader? A: Other than the resource package, we currently do not provide any other access solutions for calling these three MCP tools. If you use other similar MCP tools and incur billing issues during their use, such issues are not within the scope of this package.

Subscription Management

Q: Will my subscription renew automatically? A: Yes. Your subscription will automatically renew at the end of each billing cycle, and the fee will be charged to your saved payment method.
Q: How are subscription fees deducted? A: The system deducts fees in the following order:
  1. Credits balance will be used first.
  2. If insufficient, cash balance will be used;.
  3. If still insufficient, the remaining amount will be charged from your linked payment method (e.g., bank card or PayPal).
Please note that a small minimum applies when charging your credit card. If the remaining amount is less, we will round up the deduction to meet this minimum.

Q: How can I cancel my subscription? A: You can cancel your subscription on the subscription management page. Please make sure to cancel at least 24 hours before your next billing date to avoid auto-renewal. After cancellation, your current plan remains valid until it expires.
Q: Can I get a refund? A: Please note that subscriptions are non-refundable once purchased. Even if you haven’t used your full plan, the fees cannot be returned. We recommend choosing a subscription plan and billing cycle that best fits your usage needs.

Upgrade your Plan

Q: How can I upgrade my coding plan? A: For same-tier plan upgrades (e.g., monthly plan to annual plan), the newly purchased plan will not take effect immediately. It will only become active after the expiration of your current plan, meaning the validity periods of the plans are cumulative. Example: Upgrading from a Lite Monthly Plan to a Lite Annual Plan will grant you a total of 13 months of service validity. For cross-tier plan upgrades, the new plan will take effect immediately. The remaining value of your original plan will be converted into account balance on a pro-rated basis, which can be used to offset the price difference of the upgrade. Example: Upgrading from a Lite Monthly Plan $6 to a Pro Monthly Plan $30 on the same day of purchase will require an additional payment of $24.
Q: How can I downgrade my plan? A: On the subsciption page, select your desired plan. The change will take effect after the current billing cycle ends. You can also cancel your current subscription and re-subscribe to the desired plan after the billing cycle ends. Steps are as follows:
  1. Go to your subscription page.
  2. Click “Subscribe” and select your desired plan.
  3. Confirm the change.
  4. The new plan will take effect after the current billing cycle ends.

Q: How can I change my billing cycle? A: On the subsciption page, select your desired plan. The change will take effect after the current billing cycle ends. You can also cancel your current subscription and re-subscribe to the desired plan after the billing cycle ends. Steps are as follows:
  1. Go to your subscription page.
  2. Click “Subscribe” and select your desired billing cycle.
  3. Confirm the change.
  4. The new cycle will take effect after the current billing period ends.

‘Invite Friends, Get Credits’

Q: Where can I view the credits I earned? A: You can view your credit rewards in the [Invite Friends, Earn Credits pop-up - Your Credit Rewards] or check the details under [Billing - Transaction History].
Q: Where can I use my credits? Can I withdraw them? A: Credits can be used on [Z.ai platform] to offset various purchases, including subscriptions like GLM Coding Plan, resource packs, feature extensions, and API calls.
Credits are promotional benefits and cannot be withdrawn, transferred, or refunded.

Q: How is a successful invitation defined? What does my friend need to do? A: A successful invitation must meet all of the following:
  1. The referred friend shall access the campaign page through the inviter’s unique referral link or code.
  2. Your friend is a new user (has never paid for a subscription).
  3. They complete their first GLM Coding Plan subscription payment within 72 hours.
  4. The order is not refunded within 24 hours.

Q: When will I receive the credits after inviting a friend? A: Credits are usually issued within 24-48 hours after your friend’s payment is confirmed and the order passes review. Check [Finance - Transaction History]. If not received after 72 hours, contact support.
Q: What does “friend’s actual payment amount” mean? How are credits calculated if a coupon is used? A: “Actual payment amount” refers to the final cash amount your friend paid for their first order. For example: If a plan costs 100 USD, and your friend gets a 50% first-order discount, an extra 10% off from your referral, and uses 5 USD in credits, the final actual payment is 40 USD. Your credits will be calculated based on 40 USD (e.g., 10% of 40 USD = 4 USD). Credit calculation does not include any coupons, discounts, point deductions, or refunded amounts.
Q: Why didn’t I receive credits after referring a friend? A: Common reasons include:
  1. Your friend didn’t register using your exclusive link/code.
  2. Your friend completed payment after 72 hours.
  3. Your friend was not a new subscribing user.
  4. The order was refunded.
  5. The system detected unusual activity (e.g., bulk registrations, fraud).
If none of these apply, please contact customer support with your account and your friend’s registration information.

Technical Support


Q:Why does it still report error “1113 Insufficient Balance” after purchasing the coding package? Why is the account balance still deducted after purchasing the coding package? A: The situation of reporting insufficient balance or deducting account balance may be due to not meeting the usage conditions of the GLM Coding Plan coding package:
  1. The coding plan currently supports Claude Code、Open Code、Cline、Factory Droid、Kilo Code、Roo Code、Crush、Goose、Cursor、Gemini CLI、Grok CLI、Cherry studio.
  2. A specific baseurl address must be configured to use it:
  • API endpoint for Claude Code and Goose is:https://api.z.ai/api/anthropic
  • API endpoint for other tools is: https://api.z.ai/api/coding/paas/v4
  1. Only the following four models can be called: GLM-4.7, GLM-4.6, GLM-4.5, and GLM-4.5-Air. For detailed instructions, please refer to: https://docs.z.ai/devpack/overview