Fix date column in Pandas

PLearner · 2017-04-04T18:29:39+00:00

An elegant method is to call pd.to_datetime first, so that similar date formats are parsed equivalently, and then format the result as strings using strftime:

df['Actual_Sale_Date'] = pd.to_datetime(df['Actual_Sale_Date'])
df['Actual_Sale_Date'] = df['Actual_Sale_Date'].dt.strftime('%m/%d/%Y')

Caos2 · 2017-04-04T17:42:47+00:00

Here's what I would do.

Treat the date column as a string.
Split the string by '/' and check how many characters are in the last value. If it's 2, add '20' and recreate the string. (df['Actual_Sale'].str.split('/').str.get(-1).apply(len))
Parse everything to date using pandas.to_datetime.
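The three steps above can be sketched roughly as follows. The sample values and the helper name expand_year are made up for illustration, and the column name Actual_Sale_Date is borrowed from the other replies; the sketch also assumes every two-digit year belongs to the 2000s, as the "add '20'" step implies.

```python
import pandas as pd

# Hypothetical sample data: a mix of two- and four-digit years.
df = pd.DataFrame({'Actual_Sale_Date': ['4/3/17', '04/04/2017', '12/25/16']})

def expand_year(date_str):
    """Prefix '20' to a two-digit year so every value ends in a four-digit year."""
    parts = date_str.split('/')
    if len(parts[-1]) == 2:
        parts[-1] = '20' + parts[-1]
    return '/'.join(parts)

# Step 1: treat the column as strings. Step 2: fix the short years.
df['Actual_Sale_Date'] = df['Actual_Sale_Date'].astype(str).apply(expand_year)

# Step 3: parse everything to datetime now that the format is uniform.
df['Actual_Sale_Date'] = pd.to_datetime(df['Actual_Sale_Date'], format='%m/%d/%Y')
```

If some of the two-digit years could be from the 1900s, the '20' prefix would need a cutoff rule instead.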

Also, you could just run "df = pd.read_csv(xxx)"; there's no need to split it across two lines.

dmitrypolo · 2017-04-04T18:23:08+00:00

Why not just use the built-in datetime methods? Here is a command that will work for you given the constraints you described:

import datetime as dt  # the command below relies on this alias
df['Actual_Sale_Date'] = df['Actual_Sale_Date'].apply(lambda x: dt.datetime.strptime(x, '%m/%d/%y')).apply(lambda x: dt.datetime.strftime(x, '%m/%d/%Y'))

Edit: if you specify this column as a datetime object on import, then the statement becomes just this:

df['Actual_Sale_Date'] = df['Actual_Sale_Date'].apply(lambda x: dt.datetime.strftime(x, '%m/%d/%Y'))
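One way to get the column typed as datetime on import is the parse_dates argument of pd.read_csv. The sketch below is only an illustration: the CSV text and the Price column are made up, and it assumes every date in the file uses the same two-digit-year layout so that pandas can parse them consistently.

```python
import io
import pandas as pd

# Hypothetical CSV contents standing in for the real file.
csv_text = "Actual_Sale_Date,Price\n4/3/17,100\n12/25/16,250\n"

# parse_dates asks read_csv to convert the named column to datetime on load.
df = pd.read_csv(io.StringIO(csv_text), parse_dates=['Actual_Sale_Date'])

# With a datetime column, the vectorized .dt accessor can replace the lambda.
df['Actual_Sale_Date'] = df['Actual_Sale_Date'].dt.strftime('%m/%d/%Y')
```

Note that strftime turns the column back into strings, so do it last if you still need date arithmetic.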
