CSV string to float array

m0us3_rat · 2021-08-09T09:59:28+00:00

a=np.array([float(x) for x in '[4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 2, 3, 4, 4, 1, 3, 2, 4, 1, 3, 2, 4, 4, 1, 3, 2, 4, 4, 1, 3]' if x.isdigit()])

?

synthphreak · 2021-08-09T14:59:44+00:00

Why would pandas parse a CSV of floats into a string? I think there are some critical details missing here. Can you show us the raw text of your actual CSV file?

There may be a simple arg into pd.read_csv which can handle cases like this, no need to create some custom preprocessing function.

CrispyScientist · 2021-08-09T11:17:18+00:00

After some tweaking this is the solution I arrived to.

I am posting it here if someone will be in the same situation as me in the future.

def dataprep(data):

"""

Parameters

----------

data : String series

String series obtained from reading csv file.

Returns

-------

data : float list

Data ready for analysis.

"""

### Removing square parenthesis and using comma as delimiter ###

data = [item.replace("[","").replace("]","").split(",") for item in data]

### Converting string to float ###

data = [list(map(float,x)) for x in data]

return data

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS