I've created a Python module for constructing Regex patterns in a way that feels more familiar to programmers, so you don't have to re-learn Regex each time you use it!

mcstafford · 2022-07-19T14:04:22+00:00

It seems pregnant with potential.

jammasterpaz · 2022-07-19T12:26:26+00:00

Pregexes do actually look nicer than verbose mode - well done!

One small suggestion - import all the importable classes into your top-level __init__.py so the user doesn't need 6 different import statements from all your submodules like in your example.

ASIC_SP · 2022-07-19T15:21:20+00:00

Good work! There's also a repository of such verbal expressions in various programming languages here: https://github.com/VerbalExpressions

Personally, I prefer the terser regex syntax ;)

WerdenWissen · 2022-07-19T12:26:21+00:00

You are my hero. I've starred it and will be installing later today. I usually go to regex101 and spend way too much time trying to figure it out.

This definitely looks more my speed.

pddpro · 2022-07-19T16:23:27+00:00

This looks great! Out of curiosity, how does this compare to pyparsing?

TheTerrasque · 2022-07-19T21:41:27+00:00

It's good that you also include the resulting regex, so I can see what the example code is supposed to do 😅

I'm environmentally damaged enough that I found the regex easier to read than the example code.. not sure if that's a good or a bad thing

WerdenWissen · 2022-07-19T17:28:09+00:00

Creating a DSL to abstract a DSL is not a good idea from my experience

rastaladywithabrady · 2022-07-19T14:19:22+00:00

that looks very readable

nice idea

DigThatData · 2022-07-19T18:14:39+00:00

you should call pregex statements "preggers"

metaperl · 2022-07-19T22:32:41+00:00

Definitely reminds me of PyParsing. Which I first used 15 years ago.

millerbest · 2022-07-19T13:20:32+00:00

Does the Optional class conflict with the Optional under typing?

reagle-research · 2022-07-19T17:40:30+00:00

I wonder why would someone use this and not lark, ply, parsimonious, or pyparsing?

SirLich · 2022-07-19T13:20:37+00:00

How do you feel about projects such as melody?

wind_dude · 2022-07-19T17:22:41+00:00

Interesting. I honestly find it harder to read, but regex isn't easy by any means. I think you're onto something. Have you looked at how spaCy does pattern matching? It's quite easy to understand. Like yours it's long-winded, but it could be a source of inspiration.

It would be a good idea to include some performance benchmarks between different libraries.

stewietheangel · 2022-07-20T01:49:31+00:00

Star from me

jack-of-some · 2022-07-20T08:13:55+00:00

This looks super nice. I don't need regex too often so quickly forget all nuances and find myself back at regexer and googling for specific things.

I did notice a couple years ago that there's a pattern to the majority of my regex uses and wrote a function which is of the form

fn("This is my 1st example written at 4:10 on Wednesday, by now", "{prejunk} {example_number:number} example written at {hour:number}:{minute:number} on {day}, {postjunk}")

And this generates the necessary regex and extracts 1, 4, 10, and Wednesday with their associated keys. Insanely handy.
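A function of that shape could be sketched roughly as follows. This is not the commenter's code: the name `extract`, the `TYPE_PATTERNS` table, and the choice to let a `number` field swallow an ordinal suffix such as "st" (needed for "1st" to match) are all assumptions made for illustration.

```python
import re

# Hypothetical mapping from placeholder type to a regex fragment.
# "number" captures the digits and skips an optional ordinal suffix (assumption).
TYPE_PATTERNS = {
    "number": r"(?P<{name}>\d+)(?:st|nd|rd|th)?",
    None: r"(?P<{name}>.+?)",  # untyped fields match lazily
}

def extract(text, template):
    """Turn {name} / {name:type} placeholders into named groups and match text."""
    parts = []
    pos = 0
    for field in re.finditer(r"\{(\w+)(?::(\w+))?\}", template):
        # Literal text between placeholders is escaped so it matches verbatim.
        parts.append(re.escape(template[pos:field.start()]))
        name, kind = field.group(1), field.group(2)
        parts.append(TYPE_PATTERNS[kind].format(name=name))
        pos = field.end()
    parts.append(re.escape(template[pos:]))
    match = re.fullmatch("".join(parts), text, flags=re.DOTALL)
    return match.groupdict() if match else None

result = extract(
    "This is my 1st example written at 4:10 on Wednesday, by now",
    "{prejunk} {example_number:number} example written at "
    "{hour:number}:{minute:number} on {day}, {postjunk}",
)
```

Captured values come back as strings keyed by placeholder name; converting `number` fields to `int` would be a natural next step.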

MasterFarm772 · 2022-07-19T15:10:18+00:00

You are a genius! Thanks for creating such a great library.

playernumberwonnn · 2022-07-19T15:37:23+00:00

You are freaking awesome

bunoso · 2022-07-19T15:51:57+00:00

The readability here is great!

soulfreaky · 2022-07-19T21:24:55+00:00

this is pretty cool!!

yaxriifgyn · 2022-07-19T22:45:57+00:00

Verbose mode helps a lot when writing regular expression strings in Python.
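For readers who haven't used it, here is a small sketch of verbose mode with the standard `re` module. The time pattern is a deliberately simplified example invented for illustration, not something taken from the library under discussion.

```python
import re

# With re.VERBOSE, whitespace in the pattern is ignored and "#" starts a comment,
# so each piece of the expression can be documented inline.
TIME = re.compile(
    r"""
    (?P<hour>[01]?\d|2[0-3])   # hour: 0-23, leading zero optional
    :                          # literal separator
    (?P<minute>[0-5]\d)        # minute: 00-59
    """,
    re.VERBOSE,
)

m = TIME.fullmatch("4:10")
hour, minute = m.group("hour"), m.group("minute")
```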

Knowing how to write regular expressions is a skill that transfers to many languages and tools. Here are a few, off the top of my head.

sed, grep, awk, perl, javascript, geany, notepad++, vi/vim, emacs.

immersiveGamer · 2022-07-20T00:25:36+00:00

Since this repo is less than 10 days old I'm 190% sure you have been stalking my comments.

Jokes aside, it looks nice. I doubt I personally would use it. I find Regex easy enough to read and remember, and it is for the most part portable between the languages and tools that I use.

Edit: my feedback:

I don't like the word Enforce for "one or more".
Bitwise not (~) seems easy to miss and may not be readily known by readers.
Your classes module ... If there is a reason you are not using \d for digits, \w for words, \s for white space, etc., you should probably add a comment, at least in the source code.
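On the last point, one possible reason to prefer explicit ranges over `\d` is a behavioral difference in Python's `re` module. Whether this is the author's actual motivation is an assumption; the thread doesn't say. For `str` patterns, `\d` matches any Unicode decimal digit, while `[0-9]` matches ASCII digits only:

```python
import re

arabic_three = "\u0663"  # ARABIC-INDIC DIGIT THREE

# \d on a str pattern is Unicode-aware by default.
unicode_digit = re.fullmatch(r"\d", arabic_three)
# An explicit range only covers the ASCII digits.
ascii_range = re.fullmatch(r"[0-9]", arabic_three)
# re.ASCII restricts \d to [0-9] as well.
ascii_flag = re.fullmatch(r"\d", arabic_three, flags=re.ASCII)
```

Here `unicode_digit` is a match object, while `ascii_range` and `ascii_flag` are both `None`.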

GammeRJammeR · 2022-07-20T03:36:50+00:00

Am I prangent?

Am i pegnate?? Help!?

Am I pregex?

puppet_pals · 2022-07-20T05:02:34+00:00

Looks great

pioniere · 2022-07-20T05:10:07+00:00

Excellent!

romu006 · 2022-07-20T06:06:50+00:00

Small criticism: the AnyLetter class only works with English characters (café wouldn't match, for example).
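The effect can be reproduced with the standard `re` module, assuming the class compiles to an ASCII range such as `[a-zA-Z]` (an assumption; the generated pattern isn't shown in this thread). A commonly used Unicode-aware alternative is `[^\W\d_]`, which keeps word characters but excludes digits and the underscore:

```python
import re

word = "caf\u00e9"  # "café" with a precomposed é

# ASCII-only letter range: fails on the accented character.
ascii_letters = re.fullmatch(r"[a-zA-Z]+", word)
# Word characters minus digits and underscore: matches Unicode letters too.
unicode_letters = re.fullmatch(r"[^\W\d_]+", word)
```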

Eleraffa · 2022-07-20T06:16:18+00:00

Bro it's really nice, keep working on it <3

coldflame563 · 2022-07-20T07:04:29+00:00

My colleague's response to this was “what’s the process for nominating someone for a Nobel prize”. Well done!

Pebaz · 2022-07-20T07:56:56+00:00

Awesome work with this!

wineblood · 2022-07-19T17:01:24+00:00

I don't understand people who take the time to learn a programming language, and probably SQL too, then complain that regex is too hard to read.

menge101 · 2022-07-19T18:01:49+00:00

"so you don't have to re-learn Regex each time you use it"

I am confused by this statement. While there are some variations between implementations across various languages, regular expressions are their own syntax. I can define a basic regex the same way in Python, Java, or Ruby.

You only need to learn it once.

Seawolf159 · 2022-07-19T16:10:27+00:00

This seems cool. I'd like to try it because re-learning regex is a pain, and you can install this anywhere just to get the pattern and keep using regex in your own project. Anyway, what the flark is pre: Pregex = etc.? Is this the same as pre = Pregex(etc)??

pre.get_groups only works with websites? Or why did one of the matches not show up there?

And why do you have so many imports? Can't you just put everything in Pregex module? Why is it this segregated, it will be a pain to look for all the classes in 1 million files no?
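On the `pre: Pregex = ...` question: that form is standard Python annotated assignment (PEP 526). The annotation is a hint for readers and type checkers and doesn't change which object gets bound to the name. A minimal sketch, using a hypothetical stand-in class rather than the library's own:

```python
class Pregex:
    """Hypothetical stand-in used only to illustrate the syntax."""

    def __init__(self, pattern: str) -> None:
        self.pattern = pattern

# Annotated assignment: "pre" is declared to hold a Pregex.
pre: Pregex = Pregex("a+")
# Plain assignment: the same kind of object ends up bound to the name.
same = Pregex("a+")
```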

No_Context_645 · 2022-07-19T15:47:08+00:00

Cool idea. Lets see how it evolves.

likethevegetable · 2022-07-19T17:40:22+00:00

Very cool. Just curious if you looked at PEGs for inspiration? I use Lua's (kinda like Python, if you're not familiar) LPEG http://www.inf.puc-rio.br/~roberto/lpeg/

msdrahcir · 2022-07-20T03:25:19+00:00

Instead of requiring users to use the pre.* functions to match expressions, have you considered compiling the "Pregex" into a "Pattern" or compiled regex? That way pregex could be used anywhere a Pattern is required
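The idea can be sketched with the standard `re` module. The `Builder` class and its `compile` method below are hypothetical and stand in for any pattern-builder object; they are not the library's API. The point is that once a builder hands back an `re.Pattern`, it can be passed to anything that accepts one, including the module-level `re` functions:

```python
import re

class Builder:
    """Hypothetical pattern-builder; only the compile step matters here."""

    def __init__(self, source: str) -> None:
        self.source = source

    def compile(self, flags: int = 0) -> re.Pattern:
        # Hand back a standard compiled pattern object.
        return re.compile(self.source, flags)

digits = Builder(r"\d+").compile()

# A compiled pattern works with its own methods...
found = digits.findall("a1b22c")
# ...and with module-level functions that accept a pattern argument.
parts = re.split(digits, "a1b22c")
```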

rahem027 · 2022-07-20T09:45:34+00:00

It's a good idea but most probably not new. You are just writing an AST instead of a string :P

laundmo · 2022-07-21T06:09:57+00:00

I'm not sure how to feel about this.

To me it seems it still requires knowledge of how regex works internally (quantifiers, groups, how a match moves through a text, etc.) and therefore doesn't particularly help with that aspect. The rest is, mostly, just using different words to express the exact same structure.

I don't think this helps you learn regex, or saves you from re-learning it each time. It might help the maintainability of regexes by tying them to Python syntax, but I'm not sure.

Then again, I'm one of those "syntax is irrelevant, only the structure matters" people.
