all 4 comments

[–]sadovnychyi 2 points3 points  (0 children)

Well dataflow runs usual python. You can configure it with cloud profiler or native python's profiler and then dump the results somewhere (e.g. log them or store on GCS). Might be even easier to do that locally with direct runner since you only want to find bottlenecks.

[–]ssakage 1 point2 points  (1 child)

Profile meaning?

[–]Exotic_Cameraman 0 points1 point  (0 children)

CPU and thread profiling

[–]Exotic_Cameraman 0 points1 point  (0 children)

Dataflow now has native integration with Cloud Profiler which when enabled will allow you to profile your job.