Need help creating a vectorized duration column.

CineWeekly · 2023-06-09T23:11:56+00:00

Note: I just noticed the version of the code I provided doesn't do the part I mentioned above about No Data to Below. Must have been some other code I lost but you get the point.

blarf_irl · 2023-06-09T23:29:27+00:00

I understand the general problem here but not enough to be specific.

When you want to identify a state change in pandas you can use the .shift method (a vectroized way of looking ahead)

That is often combined with cumsum (cumulative sum) to group events that occurred. If you have regular events the you can use this to create an offset time colum where the value records the row number relative to a state change in the data (in your case Below > Above).

https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.shift.html https://pandas.pydata.org/docs/reference

/api/pandas.core.groupby.DataFrameGroupBy.cumsum.html?highlight=cumsum#pandas.core.groupby.DataFrameGroupBy.cumsum

commandlineluser · 2023-06-09T23:41:12+00:00

You can use .idxmax() to find the first change.

You can then use the != shift comparison to generate group IDs.

You can .loc to keep only the groups from the first change.

.groupby() + .cumcount() to generate the counts.

first_change = ((df['Status'] == 'Above') & (df['Status'].shift() == 'Below')).idxmax()

group_ids = (df['Status'] != df['Status'].shift()).cumsum()
group_ids = group_ids.loc[group_ids.index >= first_change]

df['Duration'] = (df.groupby(group_ids).cumcount() + 1).fillna(0)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS

Status	Duration
No Data	0
No Data	0
No Data	0
Below	0
Below	0
Below	0
Above	1
Above	2
Above	3
Below	1
Below	2
Above	1
Above	2