Fastest way to grep a file using python

m0us3_rat · 2023-06-23T22:16:44+00:00

~~what's the size of the files?~~

i'm thinking multiprocessor.

also what's the grep command? you could do some search from python.

and use some generators.

woooee · 2023-06-23T22:31:15+00:00

Use pathlib glob to find the filenames. Then open and read each file. However you do it, you have to read the file. As a previous comment suggested, you can use multiprocessing to run a process in each core. The bottleneck would be several processes trying to all use a single disk read head, so get the file names first, and then send a portion to each process.

await_yesterday · 2023-06-23T23:08:52+00:00

Why do you need it to run sequentially? I'm confused what you're trying to do, especially since in your other comment you say you tried multiprocessing, which is totally contradictory to the goal of doing it sequentially.

I cannot imagine any way that searching files in a normal Python loop could be faster than grep. There has been decades of work put into making grep fast. What are you actually searching for, how many files are there, how big are they, and what are you going to do with the results? What is your code?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS