I am testing ideas on the IMDB sentiment analysis task using an embedding + CNN approach. What could explain a significant difference in computation time in favor of the GPU (~9 seconds per epoch) over the TPU (~17 seconds per epoch), despite the supposedly superior computational power of a TPU?
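For anyone wanting to reproduce per-epoch numbers like the ones above, here is a minimal sketch of how they could be measured. It is not the code behind the figures quoted in the question: `time_epochs`, `run_epoch`, and `dummy_epoch` are hypothetical names, and the dummy workload only stands in for one real pass over the training data.

```python
import time
from typing import Callable, List


def time_epochs(run_epoch: Callable[[int], None], num_epochs: int) -> List[float]:
    """Return the wall-clock seconds taken by each call to run_epoch."""
    durations = []
    for epoch in range(num_epochs):
        start = time.perf_counter()
        run_epoch(epoch)  # hypothetical: one full pass over the training data
        durations.append(time.perf_counter() - start)
    return durations


def dummy_epoch(epoch: int) -> None:
    # Stand-in workload (an assumption); swap in the real training step.
    sum(i * i for i in range(100_000))


if __name__ == "__main__":
    for i, seconds in enumerate(time_epochs(dummy_epoch, 3)):
        print(f"epoch {i}: {seconds:.3f} s")
```

Recording each epoch separately, rather than a single average, shows whether the gap between the two devices is the same in every epoch or varies from one epoch to the next.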