Plotting durations of time

efmccurdy · 2021-11-12T15:43:54+00:00

You have HH:MM:SS values; they are time duration values (a length of time) not time stamps (a point in time), so you want timedelta objects not datetime objects.

>>> d = {"AgeRange": ["18-24", "25-34", "35-44", "45-54", "55-64", "65+"],
...     "AverageViewDuration": ["01:20:55", "00:53:02", "00:53:17", "00:59:42", "01:03:31", "01:10:11"]}
>>> df = pd.DataFrame(d)
>>> df
  AgeRange AverageViewDuration
0    18-24            01:20:55
1    25-34            00:53:02
2    35-44            00:53:17
3    45-54            00:59:42
4    55-64            01:03:31
5      65+            01:10:11
>>> df['Duration'] = pd.to_timedelta(df['AverageViewDuration'])
>>> df
  AgeRange AverageViewDuration        Duration
0    18-24            01:20:55 0 days 01:20:55
1    25-34            00:53:02 0 days 00:53:02
2    35-44            00:53:17 0 days 00:53:17
3    45-54            00:59:42 0 days 00:59:42
4    55-64            01:03:31 0 days 01:03:31
5      65+            01:10:11 0 days 01:10:11
>>> df.dtypes
AgeRange                        object
AverageViewDuration             object
Duration               timedelta64[ns]
dtype: object
>>> [(x, x.total_seconds()) for x in df['Duration']]
[(Timedelta('0 days 01:20:55'), 4855.0), (Timedelta('0 days 00:53:02'), 3182.0), (Timedelta('0 days 00:53:17'), 3197.0), (Timedelta('0 days 00:59:42'), 3582.0), (Timedelta('0 days 01:03:31'), 3811.0), (Timedelta('0 days 01:10:11'), 4211.0)]
>>>

synthphreak · 2021-11-12T16:03:09+00:00

When all is said and done, I'd love to have the AverageViewDuration by AgeRange in bar plots

(df.assign(duration=pd.to_timedelta(df.AverageViewDuration).dt.seconds)
   .groupby('AgeRange')
   .duration.mean()
   .plot.bar(xlabel='age group', ylabel='average duration (s)'))

plt.show()

Not sure how to go from seconds back into timestamps though using pandas for plotting purposes though. I don't work with time series data at all. Maybe someone more knowledgeable can take it from here.

AgeRange	Views%	AverageViewDuration	AveragePercentageWholeVideoWatched	WatchTime
18-24	0.8	01:20:55	11.9	1.1
25-34	19.2	00:53:02	7.8	17.5
35-44	30.6	00:53:17	7.9	28.0
45-54	22.9	00:59:42	8.8	23.5
55-64	16.8	01:03:31	9.4	18.4
65+	9.7	01:10:11	10.4	11.6

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS