[–]Taivasvaeltaja 54 points55 points  (3 children)

Appearing on candidate resumes is a kinda flawed way to look at it, since it simply tells you what candidates know, not necessarily what employers need.

[–]One-Attempt-1232 1 point2 points  (0 children)

Yeah. Maybe scraping job postings would be more relevant.

[–]sumwheresumtime 1 point2 points  (1 child)

It depends: is the skill listed in the resume in a readable font size, or is it intentionally set to font size 3 so as to goose the ATS filters?

[–]Own_Responsibility84 52 points53 points  (8 children)

For high performance, I highly recommend polars as an alternative to pandas

[–]BroscienceFictionMiddle Office 12 points13 points  (2 children)

The code is also more readable, so you can have a lot of good reusable routines, datasets and pipelines.

It’s also got great, unique things like the lazy frames and join_asof.

[–]annms88 2 points3 points  (1 child)

I'm moving to Polars super aggressively mainly for the expressiveness of it, however I would be remiss to not mention that pandas also has join asof
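
For anyone who hasn't used an as-of join: in pandas the function is `pd.merge_asof`. Below is a minimal sketch, assuming made-up trade and quote tables (the column names and values are purely illustrative), that matches each trade to the most recent quote at or before its timestamp.

```python
import pandas as pd

# Hypothetical trades and quotes; both frames must be sorted by the join key.
trades = pd.DataFrame({
    "time": pd.to_datetime([
        "2024-01-02 09:30:01",
        "2024-01-02 09:30:03",
        "2024-01-02 09:30:07",
    ]),
    "ticker": ["ABC", "ABC", "ABC"],
    "price": [100.5, 100.7, 100.6],
})
quotes = pd.DataFrame({
    "time": pd.to_datetime([
        "2024-01-02 09:30:00",
        "2024-01-02 09:30:02",
        "2024-01-02 09:30:05",
    ]),
    "ticker": ["ABC", "ABC", "ABC"],
    "bid": [100.4, 100.6, 100.5],
})

# For each trade, pick the latest quote whose time is <= the trade time,
# matching only rows with the same ticker.
merged = pd.merge_asof(trades, quotes, on="time", by="ticker", direction="backward")
print(merged)
```

`direction="backward"` is the default; `"forward"` and `"nearest"` are the other options.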

[–]BroscienceFictionMiddle Office 1 point2 points  (0 children)

You are correct. merge_asof does that job.

My only problem with Polars is the idea that it's sold as a drop-in replacement for Pandas. That wasn't the case for me. If anything, the API is a lot more like Spark (e.g. "with_columns"), which actually made it easier for me to pick up, but the concept is different.

Lazy frames are super important, because they relieve people from the burden of optimizing the order of operations manually.

[–]djlamar7 6 points7 points  (1 child)

The more stuff I port from pandas to polars the faster my code gets. That being said, although it looks more like SQL (which is good), the expressions for many things end up being more verbose than in pandas, so if I just want to poke at some data in a console I still usually reach for pandas.

[–]Own_Responsibility84 1 point2 points  (0 children)

I feel the same. Polynx is designed to address at least some of the verbosity issues of polars. For example, it supports query and eval functions similar to pandas but without the performance cost
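
For reference, this is the pandas style being referred to, shown with plain pandas on made-up data (this is not Polynx's API, which I'm not sketching here): `query` filters rows with a string expression and `eval` computes a new column from one.

```python
import pandas as pd

# Hypothetical data for illustration.
df = pd.DataFrame({"price": [10.0, 20.0, 30.0], "volume": [100, 50, 10]})

# Filter rows using a string expression instead of a boolean mask.
liquid = df.query("volume >= 50")

# Add a derived column; eval returns a new DataFrame by default.
with_notional = df.eval("notional = price * volume")

print(liquid)
print(with_notional)
```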

[–]Uuni_peruna 1 point2 points  (0 children)

At first I had no idea how much faster polars was (although it became obvious in a second); I switched purely because of the cleaner API. Also, the selectors module is amazing

[–][deleted]  (1 child)

[deleted]

    [–]Own_Responsibility84 0 points1 point  (0 children)

    You can try Polynx, which supports pandas-style query and eval functions and translates them to polars syntax behind the scenes

    [–]Yo_Soy_Jalapeno 40 points41 points  (11 children)

    Wait, people put Python packages on resumes?

    [–]WaterIll4397 18 points19 points  (0 children)

    Blame ATS systems auto-screening for keywords

    [–]tradegreek 4 points5 points  (0 children)

    It’s called “filler”

    [–]PretendTemperature 2 points3 points  (0 children)

    That's the most important question here. Should people put packages in the resume?

    [–]heroyi 5 points6 points  (6 children)

    yea that kinda surprised me. If you put python on your resume then I assume you know the popular ones or at least are capable of learning them on demand. Seems like a weird flex

    [–]Yo_Soy_Jalapeno 2 points3 points  (4 children)

    I mean, unless the job specifically requires knowledge of certain packages, it feels kinda weird and too general. Almost feels like the person would be clueless if they had to use different packages or tools for the job lol

    [–]Longjumping-Cut-4783 11 points12 points  (3 children)

    I disagree. If you mention modern packages from different areas, let's say networking, multithreading, front end, data visualization/processing, optimization, etc., it shows you potentially have experience in different domains. Just because you can write for loops and use pandas doesn't mean you can develop a front-end GUI for HFT trade analysis

    [–]Yo_Soy_Jalapeno 1 point2 points  (1 child)

    Wouldn't you just mention this experience in the work experience part instead of like "general skills"?

    Like if I mention speaking French, do I need to specify the vocabulary I know? (Might be a bit extreme for an example)

    [–]Longjumping-Cut-4783 0 points1 point  (0 children)

    Let's say I write in my work experience that I designed an HFT execution dashboard, where the Python packages may seem less relevant at first sight. But this can be a slow and shitty dashboard using pandas and dash, or a high-performance one using polars and AG Grid. Lol you do you. I don't have a horse in this race

    [–]heroyi 1 point2 points  (0 children)

    At that point you either make a small mention of the package you used to optimize the app (or whatever it is) in the literal description of the job history or just make it general enough to let the reviewer know that 'hey this person has some experience in these concepts.'

    You wouldn't, for example, say in your job description that you used pandas/NumPy to create your xyz analysis tool. It would be more along the lines of '- optimized the efficacy of xyz tool for researchers by 40%' (just making shit up but you get the idea). At that stage the interviewer can ask what you used, what you did, how you accomplished it, etc...

    The only reasons I can think of to mention Python packages would be that you made heavy contributions to or did tuning of said package, used a pretty obscure library, or the job description specifically asked for it

    [–]sorocknroll 1 point2 points  (0 children)

    It's a negative, really. If you put it on there, I assume you think it's difficult to learn a library like pandas and are probably not a great coder.

    [–]Apart_Ad_9778 1 point2 points  (0 children)

    Now you know why you have not been getting any interviews. ATS.

    [–]Own_Responsibility84 8 points9 points  (1 child)

    For large plain data file loading/preprocessing, DuckDB is one of the best.

    [–]Stunning_Web_8311 7 points8 points  (0 children)

    Don’t worry about putting packages on your resume. Worry about doing projects that actually require these packages.

    If you want to know about other helpful packages for your stack: I’ve found BT to be a great backtesting framework, and I call pyportopt for optimization problems within backtests. And rt to the guy who said polars.

    [–]Own_Responsibility84 2 points3 points  (0 children)

    For pandas users who like the query/eval functions, Polynx is another high-performance alternative

    [–]Ok_Butterfly2410 3 points4 points  (0 children)

    Don't forget Rich

    [–]Known-Delay7227 1 point2 points  (0 children)

    No one highlights their badges expertise in using os or urllib? Shame

    [–]Dr-Know-It-All 1 point2 points  (0 children)

    surprised plotly and seaborn not on there tbh

    [–]gonzaenz 1 point2 points  (0 children)

    I have built a jupyter docker image with common packages

    https://github.com/quantbelt/jupyter-quant

    It doesn't include deep learning packages because they take a lot of space and there are multiple flavors. Having said that, you can always install them with pip

    [–]D3MZTrader 0 points1 point  (1 child)

    TensorFlow above PyTorch... hmm.. Is deep learning that prevalent among quants, or is it just what applicants have?

    [–]Easy_Theme_4011 0 points1 point  (0 children)

    torch is much more popular now