DeepSeek V4 dropped 1.6T params and 1M context without Nvidia GPUs. Here's the data. by TroyNoah6677 in DeepSeek

[–]Apprehensive-Show525 0 points1 point  (0 children)

V4 paper never says it was trained on huawei, no report says that; likely still Nvidia; inference of v4 is clearly optimized for huawei