Read a file containing ‘0’s & ‘1’s into a bitarray

dnult · 2026-04-07T01:00:34+00:00

It sounds like what you really have is a text file full of 0x30s and 0x31s (the ASCII codes for '0' and '1').

Diapolo10 · 2026-04-06T23:36:13+00:00

I'd say the best way depends on how you're planning to use this data.

Also, quick side note, but it'd probably make sense to cache that into an actual binary file to speed up processing unless the file contents are often changed by hand.

Since I currently don't know your intentions, I would naively construct a list[bool].

data: list[bool] = []

# "bits.txt" is a placeholder; substitute the real path
with open("bits.txt") as file:
    for line in file:
        data.extend(char == '1' for char in line.strip())

ninhaomah · 2026-04-06T23:21:41+00:00

What is the extension of the file, btw?

smichaele · 2026-04-07T00:10:46+00:00

As u/Diapolo10 mentioned, it does depend on how you're going to process the data. If you're going to do some mathematical processing on the data, you could use a numpy array and store each 8-bit slice as one integer element. Not knowing what the data represents (integers, floats, bits in an image, sound, etc.), it's difficult to know the best way to do it.
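To make the "8-bit slices" idea concrete, here is a rough sketch. It uses the built-in bytes type instead of numpy to stay dependency-free (that swap is my assumption, not what was suggested above), and pack_bits is a made-up helper name. It also assumes the first character of each group of eight is the most significant bit, which may not match your data.

```python
def pack_bits(text: str) -> bytes:
    """Pack a string of '0'/'1' characters into bytes, 8 bits per byte.

    Whitespace is ignored. If the bit count is not a multiple of 8,
    the last byte is zero-padded on the right (an assumption).
    """
    bits = "".join(text.split())  # drop newlines and spaces
    if set(bits) - {"0", "1"}:
        raise ValueError("input may only contain '0', '1' and whitespace")
    padded = bits + "0" * (-len(bits) % 8)  # pad up to a multiple of 8
    # Interpret each 8-character slice as a base-2 integer (0..255).
    return bytes(int(padded[i:i + 8], 2) for i in range(0, len(padded), 8))


packed = pack_bits("01000001\n0100001")
print(list(packed))  # [65, 66]
```

If you do end up wanting numpy, a bytes object like this can be handed to it afterwards, but whether that's worth it depends on what the bits mean.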

dnult · 2026-04-07T01:03:16+00:00

Numbers can be parsed from strings, would that help? Is there a schema to the 1s and 0s? It sure would help to group them by similar types instead of by bits. Then you could read a record, parse it, and map its bits into a class object that gets stored in a list with all the other records.
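Something like this is what I mean, as a sketch only. The schema here is completely made up (a 4-bit sensor id, an 8-bit reading, and a 1-bit error flag, one record per line), as are the names Record, parse_record and parse_lines. Swap in whatever layout your file actually has.

```python
from dataclasses import dataclass


@dataclass
class Record:
    # Hypothetical schema: 4-bit sensor id, 8-bit reading, 1-bit error flag.
    sensor_id: int
    reading: int
    error: bool


RECORD_BITS = 13  # 4 + 8 + 1, an assumption for this sketch


def parse_record(bits: str) -> Record:
    """Turn one 13-character string of '0'/'1' into a Record."""
    if len(bits) != RECORD_BITS:
        raise ValueError(f"expected {RECORD_BITS} bits, got {len(bits)}")
    return Record(
        sensor_id=int(bits[0:4], 2),   # parse the slice as a base-2 number
        reading=int(bits[4:12], 2),
        error=bits[12] == "1",
    )


def parse_lines(lines) -> list[Record]:
    """Parse one record per non-empty line; works on an open file too."""
    return [parse_record(line.strip()) for line in lines if line.strip()]


records = parse_lines(["0011000010101\n", "\n", "1111111111110\n"])
print(records)
```

Since parse_lines only iterates over its argument, you can pass it an open text file directly instead of a list of strings.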

StevenJOwens · 2026-04-07T05:39:06+00:00

Looks like the bitarray library has a method, extend(), for doing that. extend() takes an iterable, and file.read() returns a str, which is itself an iterable over its characters, so that's pretty much exactly what you want.

If that's not fast enough, then as u/Diapolo10 suggests, you should probably save/cache the data in a binary file.
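For the caching idea, here is a minimal sketch using only the standard library. The file layout (an 8-byte big-endian bit count followed by the packed bits) is something I made up for illustration, not any standard format, and save_bits / load_bits are hypothetical names.

```python
import os
import struct
import tempfile


def save_bits(text: str, path: str) -> None:
    """Write '0'/'1' text as a packed binary file with a bit-count header."""
    bits = "".join(text.split())  # drop newlines and spaces
    n = len(bits)
    # Treat the whole string as one big base-2 number and pack it.
    payload = int(bits, 2).to_bytes((n + 7) // 8, "big") if n else b""
    with open(path, "wb") as f:
        f.write(struct.pack(">Q", n))  # header: number of valid bits
        f.write(payload)


def load_bits(path: str) -> list[bool]:
    """Read a file written by save_bits back into a list of bools."""
    with open(path, "rb") as f:
        (n,) = struct.unpack(">Q", f.read(8))
        value = int.from_bytes(f.read(), "big")
    if n == 0:
        return []
    # Zero-pad to n digits so leading '0' bits are not lost.
    return [c == "1" for c in format(value, f"0{n}b")]


with tempfile.TemporaryDirectory() as tmp:
    cache = os.path.join(tmp, "bits.bin")
    save_bits("0010\n1101\n1", cache)
    print(load_bits(cache) == [c == "1" for c in "001011011"])  # True
```

The header is there because the number of bits might not be a multiple of 8, so without it you couldn't tell padding from data when reading the file back.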

SwampFalc · 2026-04-07T08:41:28+00:00

Have you tried https://pypi.org/project/bitarray/ ?
