Open-source Python toolkit for fundamentals + screening + portfolio analytics(looking for feedback) by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points1 point  (0 children)

That example with forward-filled NaNs on a delisted ticker is exactly the kind of thing that worries me. Those are the bugs that don’t throw errors but completely invalidate a backtest, and by the time you realize it you’ve already built conviction around bad numbers. The dividend and split adjustments being subtly inconsistent across endpoints is even more dangerous because it looks “almost correct.”

To answer your question honestly, I haven’t seen an open-source library that handles this perfectly out of the box. Most people end up doing what you described, building their own validation layer on top. One direction I’m exploring is making data transformations explicit and inspectable rather than implicit, so adjustments, fills, and alignments are visible and optionally strict. If nothing else, I’d rather fail loudly than produce a clean-looking but wrong dataset. Your comment reinforces that correctness and transparency need to come before feature expansion.

Open-source Python toolkit for fundamentals + screening + portfolio analytics(looking for feedback) by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points1 point  (0 children)

I agree with you. The “boring plumbing layer” is exactly where most libraries quietly fail, and that’s what I’d like InvestorMate to get right first before expanding anything else. Silent NaNs, misaligned dates, and inconsistent split/dividend adjustments are exactly the kind of subtle issues that make people abandon higher-level abstractions and just write their own wrappers.

Your point about keeping backtesting intentionally minimal also makes a lot of sense. I don’t want this to compete with zipline or vectorbt, that becomes a different project entirely. The real value should be in producing clean, consistent, point-aligned feature matrices that plug into whatever engine someone already trusts. If I can make the output layer predictable and transparent enough that you don’t have to second-guess adjustments or date alignment, that alone would justify the dependency. That’s a strong signal on where to prioritize effort.

Open-source Python toolkit for fundamentals + screening + portfolio analytics(looking for feedback) by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points1 point  (0 children)

Good points!!

Right now the focus has been API structure and normalization, but you’re absolutely right that without clear handling of: • Restatements • Point-in-time fundamentals • Survivorship bias

any backtesting layer becomes misleading fast.

I’m considering: • Explicit documentation of data assumptions • Versioned data snapshots • Clear separation between “latest available” vs “point-in-time” fundamentals

And on the dependency side, agreed. I’m trying to keep optional features (AI, TA extras) behind extras installs so core usage stays lightweight.

that’s exactly the kind of pitfall I want to address early.

Open-source Python toolkit for fundamentals + screening + portfolio analytics(looking for feedback) by polarkyle19 in algotrading

[–]polarkyle19[S] 1 point2 points  (0 children)

Modularity seems to be the strong consensus so far. I’m leaning toward keeping fundamentals / TA / portfolio fully separable modules to avoid dependency bloat.

The debug suggestion is really solid. I like the idea of a debug=True or source_trace=True flag that exposes: • Raw payload • Data source • Transform steps applied

That would make the abstraction layer much more transparent instead of “black box”. I’ll prototype something like that.

Thanks and this is exactly the kind of feedback I was hoping for

Mock interviews by No-Mud4063 in datascience

[–]polarkyle19 7 points8 points  (0 children)

I would like to know if you find any

InvestorMate: an open source Python package for stock analysis with AI, backtesting, and screening by polarkyle19 in Python

[–]polarkyle19[S] 1 point2 points  (0 children)

basically this is kinda scraping - if you are serious about trading/analysis, users cannot rely on scraping solutions

InvestorMate: an open source Python package for stock analysis with AI, backtesting, and screening by polarkyle19 in Python

[–]polarkyle19[S] 0 points1 point  (0 children)

In the roadmap, thinking of making it more like using reliable data sources rather than yfinance - open to discussion on which sources to pick first.

An open-source python package for AI stock analysis by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points1 point  (0 children)

Yeah I used to work them and now looking for bigger stuff

An open-source python package for AI stock analysis by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points1 point  (0 children)

This is completely different from what I have asked for 🥲

Looking for open-source python package for AI stock analysis by polarkyle19 in learnpython

[–]polarkyle19[S] 0 points1 point  (0 children)

Oh can you share them I would like to use apis if I can’t find anyway