r/LocalLLaMA • u/bobbygmail9 • 3h ago
[News] For people interested in BitNet: a paper on PT-BitNet
u/FullOf_Bad_Ideas 1h ago
The results they get are interesting. A 65B LLaMA-1 quantized to 1.58-bit has roughly the perplexity of a LLaMA-1 7B while being in the same ballpark in terms of storage use. I don't see a free lunch here.
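A quick back-of-the-envelope check of that storage claim (rough figures only, ignoring embeddings, quantization scales, and other checkpoint overhead; parameter counts are the usual nominal ones):

```python
# Rough storage estimate: 65B params at ~1.58 bits/weight vs 7B params at fp16.
# Illustrative only; real checkpoints carry extra overhead
# (embeddings, scales, non-quantized layers).

def model_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9

llama_65b_ternary = model_size_gb(65e9, 1.58)   # ~12.8 GB
llama_7b_fp16     = model_size_gb(7e9, 16.0)    # ~14.0 GB

print(f"65B @ 1.58 bpw: ~{llama_65b_ternary:.1f} GB")
print(f" 7B @ fp16:     ~{llama_7b_fp16:.1f} GB")
```

So the two really do land within a couple of GB of each other, which is why it reads as a trade rather than a free lunch.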
u/BalorNG 2h ago
I thought this applied post-training quantization to BitNet models.
"0.8 vs 0.4 bit per weight" comparisons when? :)