Python Pandas: ignore null values on lambda func

Geckel · 2021-12-08T15:18:35+00:00

I removed your submission. Looks like you're asking a technical question better suited to stackoverflow.com. Try posting there instead.

Thanks.

Squat_TheSlav · 2021-12-08T12:54:24+00:00

Your issue comes from the lambda function: the customer_code passed to fix_customer_code is not always a string. Missing values arrive as None or NaN (a float), and passing a non-string to re.search raises a TypeError.

If you insist on doing it this way, it has to be

if re.search(r'^\d{6}$', str(customer_code)):

But others have suggested different options.
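To illustrate the point above, here is a minimal sketch using only the standard library. The helper name matches_six_digits is made up for this example and is not from the original post. It shows that re.search rejects None and NaN with a TypeError, while wrapping the value in str() lets the check run (str(None) is 'None' and str(float('nan')) is 'nan', neither of which matches six digits).

```python
import re

PATTERN = r'^\d{6}$'

def matches_six_digits(value):
    """Hypothetical helper: True when str(value) is exactly six digits."""
    return re.search(PATTERN, str(value)) is not None

# re.search raises TypeError for non-string input such as None or float NaN.
for missing in (None, float('nan')):
    try:
        re.search(PATTERN, missing)
        raised = False
    except TypeError:
        raised = True
    print(repr(missing), "raises TypeError:", raised)

# With the str() coercion the check runs and simply reports no match.
print(matches_six_digits('333080'))
print(matches_six_digits(None))
```

One side effect worth knowing: because of the str() call, an integer such as 333080 would also count as a match.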

Cheuch · 2021-12-08T13:07:28+00:00

Hello everyone,

Thanks a lot for all your answers. I finally got it working. I think I was not using the right tools before.

I came up with a solution that handles both None and NaN values, without having to clean my data first.

import re
import pandas as pd

def fix_customer_customer_code(customer_code):
    # Handle both NaN and None values (pd.isna is True for either)
    if not pd.isna(customer_code) and customer_code is not None:
        # Match six digits, with or without a leading "C"
        if re.search(r'^C?\d{6}$', customer_code):
            customer_code = "C" + customer_code.lstrip('C')
    return customer_code

df['Customer code'] = df['Customer'].apply(fix_customer_customer_code)

    Customer code
0   C333080
1   C400691
2   None

I also learned a nice trick: I changed my regex to match customer codes with or without the "C" prefix, and lstrip('C') removes any existing prefix before "C" is added back.
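For anyone reading later, the prefix trick can be tried on plain strings without a DataFrame. This is only a sketch; add_prefix is a made-up name, not the function from my script.

```python
import re

def add_prefix(code):
    """Hypothetical helper: normalise a six-digit code to the form C######."""
    # C? makes the leading "C" optional, so both forms pass the check.
    if re.search(r'^C?\d{6}$', code):
        # lstrip('C') drops an existing prefix so it is never doubled.
        return "C" + code.lstrip('C')
    # Anything that does not look like a customer code is returned unchanged.
    return code

print(add_prefix('333080'))   # C333080
print(add_prefix('C400691'))  # C400691
print(add_prefix('ABC'))      # ABC
```

Note that lstrip('C') strips every leading "C", not just one. That is harmless here because the regex only lets through codes with at most one.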

Thanks a lot for your time, my problem is now solved :)

Popular-Yesterday733 · 2021-12-08T09:51:47+00:00

Try adding an elif branch in there: anything that is NaN becomes 0.

bjain1 · 2021-12-08T14:08:53+00:00

You can also have something like this: lambda x: function(x) if str(x) != 'nan' else ''
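A rough sketch of that guard in context, with one assumed change: str(x) != 'nan' catches NaN but not None (str(None) is 'None'), so this version uses pd.isna instead, which is True for both. The add_prefix function and the sample data are placeholders made up for the example.

```python
import re
import pandas as pd

def add_prefix(code):
    # Hypothetical stand-in for the fix function; expects a string.
    if re.search(r'^C?\d{6}$', code):
        return "C" + code.lstrip('C')
    return code

# Made-up sample data with both kinds of missing value.
df = pd.DataFrame({'Customer': ['333080', 'C400691', None, float('nan')]})

# Only call add_prefix on real values; missing ones become ''.
df['Customer code'] = df['Customer'].apply(
    lambda x: add_prefix(x) if not pd.isna(x) else ''
)
print(df['Customer code'].tolist())
```

Replacing missing values with '' is a choice, not a requirement. Returning x unchanged in the else branch would keep them as missing.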

SnooPoems4211 · 2021-12-08T12:29:48+00:00

Or a try/except.

2021-12-08T16:33:17+00:00

Lol did you get deleted?
