fix(gguf): Ensure Gemma2 configs have hidden_act for backward compatibility by kitaekatt · Pull Request #30411 · vllm-project/vllm

kitaekatt · 2025-12-10T17:45:58Z

Summary

Fixes AttributeError: 'Gemma2Config' object has no attribute 'hidden_act' when loading Gemma2 GGUF models.

Changes

In ModelConfig.__init__, if model_type is "gemma2" and the config has hidden_activation but not hidden_act, copy the value of hidden_activation to hidden_act.
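A minimal sketch of the copy step described above. The helper name `ensure_hidden_act` is hypothetical, and a `SimpleNamespace` stands in for the config object, so this illustrates the logic rather than the actual diff in this PR. The activation value is only an example.

```python
from types import SimpleNamespace


def ensure_hidden_act(hf_config):
    """Copy hidden_activation to hidden_act for Gemma2 configs that lack it.

    Hypothetical helper: the name and placement are assumptions, not vLLM API.
    """
    if (
        getattr(hf_config, "model_type", None) == "gemma2"
        and hasattr(hf_config, "hidden_activation")
        and not hasattr(hf_config, "hidden_act")
    ):
        hf_config.hidden_act = hf_config.hidden_activation
    return hf_config


# Stand-in for a GGUF-loaded Gemma2 config (not the real Gemma2Config class).
cfg = SimpleNamespace(model_type="gemma2", hidden_activation="gelu_pytorch_tanh")
ensure_hidden_act(cfg)

# Configs of other model types are left untouched.
other = SimpleNamespace(model_type="llama", hidden_activation="silu")
ensure_hidden_act(other)
```

The check is deliberately narrow: it only fires when hidden_act is missing, so a config that already carries hidden_act keeps its own value.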

Root Cause

Transformers' Gemma2Config only defines hidden_activation, not hidden_act
vLLM's gemma2.py directly accesses config.hidden_act without fallback
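The failure mode can be reproduced in isolation. In the sketch below, `FakeGemma2Config` and `read_activation_name` are hypothetical stand-ins for the Transformers config class and the direct attribute access in the model code; neither is the real implementation.

```python
class FakeGemma2Config:
    """Hypothetical stand-in that exposes only hidden_activation."""

    def __init__(self):
        self.hidden_activation = "gelu_pytorch_tanh"


def read_activation_name(config):
    # Direct attribute access with no fallback, as described in the root cause.
    return config.hidden_act


try:
    read_activation_name(FakeGemma2Config())
    error_message = None
except AttributeError as exc:
    # The message names the missing attribute, matching the reported error shape.
    error_message = str(exc)
```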

Testing

Tested with bartowski/gemma-2-2b-it-GGUF: the model resolves its architecture correctly.

gemini-code-assist

Code Review

This pull request introduces several fixes and improvements related to GGUF model loading. The primary fix ensures backward compatibility for Gemma2 models by setting the hidden_act attribute in the configuration, resolving a potential AttributeError. Additionally, the PR enhances dtype handling for quantized models by automatically selecting a compatible dtype when a conflict arises, which improves user experience by avoiding crashes. It also addresses hardware-specific precision issues on Blackwell GPUs for GGUF models by disabling bfloat16. Finally, it correctly handles tied word embeddings in GGUF models. The changes are well-implemented and improve the robustness of GGUF model support in vLLM.

mergify · 2025-12-15T20:48:52Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @kitaekatt.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

…bility

GGUF-loaded configs may only have hidden_activation from config.json, but Gemma2MLP model code expects hidden_act attribute. This adds a post-processing step to copy hidden_activation to hidden_act when needed.

Fixes AttributeError: 'Gemma2Config' object has no attribute 'hidden_act' when loading Gemma2 GGUF models.

Signed-off-by: Christina <truffle@gmail.com>

yewentao256

LGTM, thanks for the work!

hmellor

Model-specific edge cases in config classes should be avoided if possible.
Why can't we just update gemma2.py to access hidden_activation?
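One possible reading of this suggestion, sketched under assumptions: the model code resolves the activation name itself, preferring hidden_activation and falling back to hidden_act. The function `resolve_activation_name` is hypothetical and is not taken from gemma2.py.

```python
from types import SimpleNamespace


def resolve_activation_name(config):
    """Prefer hidden_activation; fall back to hidden_act if only that exists.

    Hypothetical helper illustrating a model-side fix, not actual vLLM code.
    """
    name = getattr(config, "hidden_activation", None)
    if name is None:
        name = getattr(config, "hidden_act", None)
    if name is None:
        raise ValueError("config defines neither hidden_activation nor hidden_act")
    return name


# Stand-ins for configs that carry only one of the two attributes.
new_style = SimpleNamespace(hidden_activation="gelu_pytorch_tanh")
old_style = SimpleNamespace(hidden_act="gelu_pytorch_tanh")
```

Keeping the lookup next to the model code means the config class needs no model-specific branch, which is the concern raised in the review comment.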

mergify bot mentioned this pull request Dec 10, 2025

fix(gguf): Ensure Gemma2 configs have hidden_act for backward compatibility #30404

Closed

gemini-code-assist bot reviewed Dec 10, 2025

This was referenced Dec 15, 2025

fix(gemma2): Skip missing parameters during GGUF weight loading #30421

Closed

fix(gguf): GGUF model support fixes for Blackwell GPUs #30497

Closed

kitaekatt force-pushed the fix/30404-gemma2-hidden-act branch from 04ceef5 to 126db87 Compare December 15, 2025 20:48

mergify bot added the needs-rebase label Dec 15, 2025

kitaekatt force-pushed the fix/30404-gemma2-hidden-act branch from 126db87 to 8108f82 Compare December 29, 2025 20:42

mergify bot removed the needs-rebase label Dec 29, 2025

kitaekatt force-pushed the fix/30404-gemma2-hidden-act branch from 8108f82 to a9fb4d7 Compare January 19, 2026 17:27

kitaekatt force-pushed the fix/30404-gemma2-hidden-act branch from a9fb4d7 to eb974b4 Compare February 4, 2026 22:19

kitaekatt marked this pull request as ready for review February 4, 2026 22:19

kitaekatt requested review from ProExpertProg, WoosukKwon, hmellor, houseroad, mgoin, robertgshaw2-redhat, tlrmchlsmth, yewentao256 and youkaichao as code owners February 4, 2026 22:19

yewentao256 approved these changes Feb 5, 2026

yewentao256 added the ready label Feb 5, 2026

hmellor requested changes Feb 17, 2026

kitaekatt wants to merge 1 commit into vllm-project:main from kitaekatt:fix/30404-gemma2-hidden-act


3 participants
