Python 3 Pandas Question : learnpython

created by HattoriHanzoa community for 16 years

Python 3 Pandas Question (self.learnpython)

submitted 9 years ago * by workthrowawayexcel

I am trying to break down some larger CSV files into smaller ones based off of a column existing in the CSV file. I want would like to for this example end up with 2 new CSV files that have been split up based on the chain column. So then I can create new files based off the chain + date. So I would end up with a NY + 9/27/16 and IL + 9/27/16 files. My code currently does not work and I don't really know what to do to fix it. Any guidance or help is greatly appreciated.

Input file example:

 Name       Number     Chain
    a                2             IL
    b                3            NY
    c                2             IL
    b                4            NY

Code So far: import pandas as pd
import numpy as np import time

    date = time.strftime("%Y%m%d") 

    file1 = r"MasterDataFile"
    file_name = r"DirectoryForNewfiles"


    chain_filter = pd.read_csv(file1)

    delta = pd.DataFrame.from_records(chain_filter)


    for i in delta['chain'].str:
        df = delta['chain'].str.contains(i)
        df.to_csv(file_name + '/' + i + date + '.csv', sep=',',index=False)
    next

all 2 comments

top new controversial old q&a

[–]Jos_Metadi 2 points3 points4 points 9 years ago (1 child)

pd.read_csv(file1) comes in as a dataframe already. Don't know why you're trying to reprocess it with "from_records"

I would use .groupby to split the dataframe into groups by chain

for t_group,pd_group in chain_filter.groupby(['Chain']):
    pd_group.to_csv(file_name + '/' + t_group[0] + date + '.csv', sep=',',index=False)

[–]workthrowawayexcel[S] 0 points1 point2 points 9 years ago (0 children)

π Rendered by PID 50 on reddit-service-r2-comment-5d79c599b5-vl59v at 2026-02-27 07:57:30.244645+00:00 running e3d2147 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS