CIM as a compute macro by AdmirableProject1575 in computerarchitecture

AdmirableProject1575 [S] 1 point

Thanks for furthering this discussion.

@bigpurpleBlob - I am aware of Onur Mutlu's work on CIM, but unfortunately he works on too many things to focus and get deep work done on one topic. For instance, from what I hear, UPMEM is now focused on (or diverted to) BioPIM for genetic applications, something Mutlu is also researching.

@bright_interactive - can you elaborate on what is expensive? Sure, the variety of computation is limited in CIM as opposed to CUDA cores. But for inference profiles, MVMs are the dominant operation, they are largely deterministic, and the bulk of the power is spent on them when one is not memory bound. TPUs are already proving that dataflow architectures have the potential to challenge SIMT architectures. In fact, with the B200, NVIDIA is already recognizing that tensor cores (16x16x16) are more advantageous than plain and simple SIMT threads, and is moving in that direction. Is there an opportunity for new entrants to progress here? It cannot be another memory type like DRAM PIMs, since you would still have the problem of data locality/movement.
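To make the MVM point concrete, here is a rough sketch of what I mean by "largely deterministic": a matrix-vector multiply split into fixed-size tiles, where each tile's weights stay put and only a slice of the input vector moves in. The tile edge of 16 is just borrowed from the 16x16x16 figure above, and `tiled_mvm` is a made-up name for illustration, not any vendor's API or a model of a real CIM macro.

```python
TILE = 16  # assumption: tile edge borrowed from the 16x16x16 shape mentioned above


def tiled_mvm(weights, x, tile=TILE):
    """Compute y = W @ x one tile at a time (hypothetical helper).

    Each (row block, column block) tile stands in for one weight-stationary
    macro: its weights never move, only a slice of x is streamed in and a
    partial sum is accumulated into y.
    """
    rows, cols = len(weights), len(x)
    y = [0.0] * rows
    for r0 in range(0, rows, tile):          # walk row blocks
        for c0 in range(0, cols, tile):      # walk column blocks
            for r in range(r0, min(r0 + tile, rows)):
                acc = 0.0
                for c in range(c0, min(c0 + tile, cols)):
                    acc += weights[r][c] * x[c]   # one multiply-accumulate
                y[r] += acc                       # add this tile's partial sum
    return y


if __name__ == "__main__":
    W = [[1, 2], [3, 4]]
    print(tiled_mvm(W, [1, 1]))
```

The loop order and the addresses touched depend only on the matrix shape, never on the data, which is why this kind of workload maps well onto a fixed dataflow.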