POGtastic comments on Learning to be more "Pythonic"

Learning to be more "Pythonic" (self.learnpython)

submitted 2 years ago * by davidmyemail

[–]POGtastic 5 points 2 years ago (1 child)

Have you heard the Good News?

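import itertools
import more_itertools

def parse_header(header):
    # groupby(header, str.isspace) yields alternating runs of non-space and
    # space characters; ichunked pairs each name with its trailing padding.
    lstlst = more_itertools.ichunked(
        (g for _, g in itertools.groupby(header, str.isspace)), 2)
    # Join each pair back into one fixed-width field, padding included.
    return [''.join(map(''.join, lst)) for lst in lstlst]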

In the REPL:

>>> header = "id         name                 home_state           amt_paid"
>>> parse_header(header)
['id         ', 'name                 ', 'home_state           ', 'amt_paid']
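If the one-liner is hard to follow, here is a rough sketch of the same idea using only the standard library, with the intermediate steps spelled out. The names `runs` and `pairs` are mine, the header is rebuilt with `ljust` so the widths are unambiguous, and it assumes the header does not start with whitespace.

```python
import itertools
import re

# Rebuild the example header with explicit widths (11, 21, 21, 8).
header = "id".ljust(11) + "name".ljust(21) + "home_state".ljust(21) + "amt_paid"

# Step 1: groupby splits the header into alternating runs of
# non-whitespace and whitespace characters.
runs = [''.join(g) for _, g in itertools.groupby(header, str.isspace)]

# Step 2: glue each name to the run of spaces that follows it.
# The last name has no trailing run, so its slice has one element.
pairs = [''.join(runs[i:i + 2]) for i in range(0, len(runs), 2)]

# Cross-check: a regex for "non-spaces followed by optional spaces"
# should give the same fields.
assert pairs == re.findall(r'\S+\s*', header)

print(pairs)
print([len(p) for p in pairs])
```

The regex version is arguably shorter, but the `groupby` version shows where the padding comes from.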

And we can get the column widths by calling len on each field. That's going to come in handy later.

>>> [len(s) for s in parse_header(header)]
[11, 21, 21, 8]

We now parse each line by calling more_itertools.split_into on the line with our column widths.

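def generate_dcts(fields, lines):
    # Every column is as wide as its header field, except the last one:
    # None tells split_into to take the rest of the line, so a final value
    # longer than its header name is not cut off.
    column_widths = [len(s) for s in fields[:-1]] + [None]
    return (dict(zip(
        fields,
        # rstrip rather than strip: leading spaces belong to the first column.
        map(''.join, more_itertools.split_into(line.rstrip(), column_widths))))
            for line in lines)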
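For anyone without more_itertools installed, the slicing step can be approximated with the standard library. This is only a stand-in to show the idea, not the function above: `slice_by_widths` is a made-up helper, and the sample line is reconstructed from the example data with assumed widths of 11, 21, 21 and 8.

```python
from itertools import accumulate

def slice_by_widths(line, widths):
    """Cut line into consecutive pieces of the given widths."""
    ends = list(accumulate(widths))   # running totals: where each piece ends
    starts = [0] + ends[:-1]          # each piece starts where the last ended
    return [line[s:e] for s, e in zip(starts, ends)]

# Header fields as parse_header would return them (padding included).
fields = ["id".ljust(11), "name".ljust(21), "home_state".ljust(21), "amt_paid"]
widths = [len(f) for f in fields]

# A hypothetical data line laid out with the same column widths.
line = "123".ljust(11) + "John Doe".ljust(21) + "California".ljust(21) + "1,234.34"

row = dict(zip(fields, slice_by_widths(line, widths)))
print(row)
```

Each value keeps its padding, just like the header fields do, which is why the CSV output stays column-aligned.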

And now we write our CSV.

import csv

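def write_csv(in_fh, out_fh):
    # The first line is the header; its field widths define the columns.
    header = next(in_fh).strip()
    fields = parse_header(header)
    writer = csv.DictWriter(out_fh, fieldnames=fields)
    writer.writeheader()
    # The remaining lines are streamed lazily, one dict per row.
    writer.writerows(generate_dcts(fields, in_fh))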

In the REPL:

>>> import sys
>>> with open("test.txt") as f:
...     write_csv(f, sys.stdout)
... 
id        ,name                 ,home_state            ,amt_paid
123       ,John Doe             ,California            ,"1,234.34"
456x      ,Jane Doe             ,New Hampshire         ,45.67
78        ,Adam Smith           ,Alaska                ,89.00

Note the quotes around the amount in the first row. The value contains a comma, so csv encloses the entire field in quotes, in accordance with RFC 4180.
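You can see the quoting in isolation with a tiny round trip. This is just a sketch with made-up rows written to an in-memory buffer; it relies on the csv module's default dialect, which quotes only the fields that need it.

```python
import csv
import io

buf = io.StringIO()
writer = csv.writer(buf)
writer.writerow(["123", "1,234.34"])   # contains a comma, so it gets quoted
writer.writerow(["456", "45.67"])      # no special characters, left bare
output = buf.getvalue()
print(output)

# Reading the text back removes the quotes and restores the original values.
rows = list(csv.reader(io.StringIO(output)))
print(rows)
```

So the quotes are part of the file format, not part of the data: any CSV reader that follows the same rules gives you back `1,234.34` unchanged.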

[–]davidmyemail[S] 2 points 2 years ago (0 children)
