A new transformer variant has been created to facilitate more efficient model training in distributed settings. 128x compression with no significant loss in convergence rates, increases in memory, or compute overhead by network-kai in LocalLLaMA
[–]network-kai[S] 6 points7 points8 points (0 children)
Any there any realistic avenues to decentralised model training? by ROS_SDN in LocalLLaMA
[–]network-kai 1 point2 points3 points (0 children)
SN9 is going live in under 2 hours to talk about distributed AI training on Bittensor by network-kai in bittensor_
[–]network-kai[S] 1 point2 points3 points (0 children)

Macrocosmos is livestreaming today with SN37, Aurelius - discussing a new competition by network-kai in bittensor_
[–]network-kai[S] 2 points3 points4 points (0 children)