Conversation

@junhuihe-hjh (Collaborator)

No description provided.

gpu/README.md Outdated

Within each block, values are stored contiguously in memory and permuted to facilitate efficient access and processing.

See `convert_convert_checkpoint.py` for details.
Contributor

Should this be `convert_checkpoint.py`?

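The README text quoted above describes a blocked layout in which values are stored contiguously and permuted within each block. As context for the review, here is a minimal sketch of such a within-block permutation; the block size and permutation order are hypothetical illustrations, not taken from the repo's actual kernel layout:

```python
import numpy as np

BLOCK = 8  # hypothetical block size; the real kernel's layout may differ

def block_permute(w, perm):
    """Permute values within each contiguous block of BLOCK elements.

    Illustrates the "stored contiguously and permuted" layout the README
    describes; `perm` is a hypothetical per-block access order.
    """
    assert w.size % BLOCK == 0 and len(perm) == BLOCK
    return w.reshape(-1, BLOCK)[:, perm].reshape(-1)
```

Applying `block_permute` again with `np.argsort(perm)` inverts the layout, which is handy for checking a repacking step round-trips.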
@@ -0,0 +1,106 @@
# Copyright (c) Meta Platforms, Inc. and affiliates.
Contributor

This can be removed, since the conversion script was written by us.

from einops import rearrange
from safetensors.torch import save_file
import model
from pack_weight import convert_weight_int8_to_int2_adsbrain, weight_repack, convert_weight_int8_to_int2
Contributor

Consider removing `convert_weight_int8_to_int2_adsbrain` if it's not used.


return ret

def convert_weight_int8_to_int2_adsbrain(weight):
Contributor

Can this be removed?

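For context on what an int8-to-int2 conversion like `convert_weight_int8_to_int2` does, here is a minimal sketch of packing ternary int8 weights into 2-bit fields, four per byte. The value mapping and bit order are assumptions for illustration; the repo's actual implementation (and the `_adsbrain` variant under discussion) may pack differently:

```python
import numpy as np

def pack_int8_to_int2(w):
    """Pack ternary int8 weights {-1, 0, 1} into uint8 bytes, four
    2-bit fields per byte (least-significant field first).

    Hypothetical sketch; the repo's convert_weight_int8_to_int2 may
    use a different value mapping or bit order.
    """
    assert w.size % 4 == 0
    u = (w.astype(np.int8) + 1).astype(np.uint8)  # map {-1,0,1} -> {0,1,2}
    u = u.reshape(-1, 4)
    return (u[:, 0] | (u[:, 1] << 2) | (u[:, 2] << 4) | (u[:, 3] << 6)).astype(np.uint8)

def unpack_int2_to_int8(p):
    """Inverse of pack_int8_to_int2: extract each 2-bit field and shift
    the value range back to {-1, 0, 1}."""
    vals = np.stack([(p >> s) & 0b11 for s in (0, 2, 4, 6)], axis=1)
    return vals.reshape(-1).astype(np.int8) - 1
```

A round trip through both functions recovers the original weights, which is the kind of property a unit test for the conversion script could check.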
gpu/test.py Outdated
from torch.utils import benchmark
from torch import nn

from pack_weight import convert_weight_int8_to_int2, weight_repack, convert_weight_int8_to_int2_adsbrain
Contributor

Can `weight_repack` and `convert_weight_int8_to_int2_adsbrain` be removed?

@junhuihe-hjh merged commit 6c2c08f into main on May 19, 2025.
1 check was pending

3 participants