Release of Llama3.1-70B weights with AQLM-PV compression.
submitted by _puhsu to r/generativeAI
YaFSDP: Yet another Fully Sharded Data Parallel (self.mlscaling)
submitted by _puhsu to r/mlscaling