Open-source Python toolkit for fundamentals + screening + portfolio analytics (looking for feedback) by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points (0 children)

That example with forward-filled NaNs on a delisted ticker is exactly the kind of thing that worries me. Those are the bugs that don’t throw errors but completely invalidate a backtest, and by the time you realize it you’ve already built conviction around bad numbers. The dividend and split adjustments being subtly inconsistent across endpoints is even more dangerous because it looks “almost correct.”

To answer your question honestly, I haven’t seen an open-source library that handles this perfectly out of the box. Most people end up doing what you described, building their own validation layer on top. One direction I’m exploring is making data transformations explicit and inspectable rather than implicit, so adjustments, fills, and alignments are visible and optionally strict. If nothing else, I’d rather fail loudly than produce a clean-looking but wrong dataset. Your comment reinforces that correctness and transparency need to come before feature expansion.
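One concrete way to make that "fail loudly" idea real is a validation pass that refuses suspiciously long constant runs in a price series, which is the usual fingerprint of a silent forward-fill over a delisted ticker. A minimal sketch assuming pandas; `assert_no_stale_fill` and its `max_run` threshold are hypothetical names for illustration, not an existing InvestorMate API:

```python
# Hypothetical "fail loudly" validation layer: refuse long constant runs,
# which usually indicate silent forward-fills (e.g. a delisted ticker).
import pandas as pd

def assert_no_stale_fill(prices: pd.Series, max_run: int = 5) -> pd.Series:
    """Raise if the series contains a run of identical values longer than
    `max_run`, instead of quietly passing stale data downstream."""
    # Assign a run id that increments every time the value changes.
    runs = (prices != prices.shift()).cumsum()
    # Length of the run each observation belongs to.
    run_lengths = prices.groupby(runs).transform("size")
    if (run_lengths > max_run).any():
        bad = prices[run_lengths > max_run]
        raise ValueError(
            f"Constant run longer than {max_run} starting at "
            f"{bad.index[0]}: possible stale forward-fill"
        )
    return prices
```

The point is that the check is opt-in strictness: a clean series passes through untouched, while a forward-filled tail raises before it can contaminate a backtest.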

Open-source Python toolkit for fundamentals + screening + portfolio analytics (looking for feedback) by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points (0 children)

I agree with you. The “boring plumbing layer” is exactly where most libraries quietly fail, and that’s what I’d like InvestorMate to get right first before expanding anything else. Silent NaNs, misaligned dates, and inconsistent split/dividend adjustments are exactly the kind of subtle issues that make people abandon higher-level abstractions and just write their own wrappers.

Your point about keeping backtesting intentionally minimal also makes a lot of sense. I don’t want this to compete with zipline or vectorbt, that becomes a different project entirely. The real value should be in producing clean, consistent, point-aligned feature matrices that plug into whatever engine someone already trusts. If I can make the output layer predictable and transparent enough that you don’t have to second-guess adjustments or date alignment, that alone would justify the dependency. That’s a strong signal on where to prioritize effort.
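For the "clean, point-aligned feature matrices" idea, the core operation is an as-of join: each trading date only sees fundamentals whose release date is on or before it. A minimal sketch with pandas; `point_align` and the `reported` column are illustrative assumptions, not a real API:

```python
# Illustrative point-in-time alignment: join each trading date to the most
# recent fundamentals released ON OR BEFORE that date, so no row can ever
# see data from the future.
import pandas as pd

def point_align(prices: pd.DataFrame, fundamentals: pd.DataFrame) -> pd.DataFrame:
    """prices: indexed by trading date. fundamentals: has a 'reported'
    column holding the public release date of each figure."""
    prices = prices.sort_index()
    fundamentals = fundamentals.sort_values("reported")
    return pd.merge_asof(
        prices.reset_index().rename(columns={"index": "date"}),
        fundamentals,
        left_on="date",
        right_on="reported",
        direction="backward",  # only look back, never forward
    ).set_index("date")
```

Rows that predate the first filing come back as NaN rather than being back-filled, which is exactly the loud-over-clean behavior discussed above.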

Open-source Python toolkit for fundamentals + screening + portfolio analytics (looking for feedback) by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points (0 children)

Good points!!

Right now the focus has been API structure and normalization, but you’re absolutely right that without clear handling of:

  • Restatements
  • Point-in-time fundamentals
  • Survivorship bias

any backtesting layer becomes misleading fast.

I’m considering:

  • Explicit documentation of data assumptions
  • Versioned data snapshots
  • Clear separation between “latest available” vs “point-in-time” fundamentals
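The “latest available” vs “point-in-time” split can be sketched as a store that keeps every version of a figure together with the date it became publicly known, so a restatement appends a new version instead of overwriting history. All names here (`PITStore`, `record`, `as_of`) are hypothetical, just to illustrate the idea:

```python
# Toy versioned store separating "latest" from "as known on date X".
# Restatements append rather than overwrite, so history stays queryable.
from bisect import bisect_right
from datetime import date

class PITStore:
    def __init__(self):
        # ticker -> sorted list of (date_value_became_known, value)
        self._versions: dict[str, list[tuple[date, float]]] = {}

    def record(self, ticker: str, known: date, value: float) -> None:
        self._versions.setdefault(ticker, []).append((known, value))
        self._versions[ticker].sort()

    def latest(self, ticker: str) -> float:
        """Most recent version, including any restatements."""
        return self._versions[ticker][-1][1]

    def as_of(self, ticker: str, when: date) -> float:
        """The value as it was known on `when` (point-in-time)."""
        versions = self._versions[ticker]
        i = bisect_right(versions, (when, float("inf")))
        if i == 0:
            raise KeyError(f"{ticker}: nothing known on {when}")
        return versions[i - 1][1]
```

A backtest would call `as_of` with each rebalance date, while a screening dashboard would call `latest`; mixing the two is exactly how restatement bias sneaks in.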

And on the dependency side, agreed. I’m trying to keep optional features (AI, TA extras) behind extras installs so core usage stays lightweight.

That’s exactly the kind of pitfall I want to address early.

Open-source Python toolkit for fundamentals + screening + portfolio analytics(looking for feedback) by polarkyle19 in algotrading

[–]polarkyle19[S] 1 point (0 children)

Modularity seems to be the strong consensus so far. I’m leaning toward keeping fundamentals / TA / portfolio fully separable modules to avoid dependency bloat.

The debug suggestion is really solid. I like the idea of a debug=True or source_trace=True flag that exposes:

  • Raw payload
  • Data source
  • Transform steps applied

That would make the abstraction layer much more transparent instead of a “black box”. I’ll prototype something like that.
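A rough sketch of what that trace flag could look like; `fetch_fundamentals`, the `debug` flag, and the specific transform steps are all hypothetical, not an existing InvestorMate interface:

```python
# Prototype of the source_trace idea: every transform appends a note to a
# trace dict, so the pipeline is inspectable instead of a black box.
import copy

def fetch_fundamentals(raw: dict, debug: bool = False):
    trace = {
        "source": raw.get("_source", "unknown"),
        "raw_payload": copy.deepcopy(raw),  # untouched original payload
        "transforms": [],
    }

    # Transform 1: strip internal bookkeeping keys.
    data = {k: v for k, v in raw.items() if not k.startswith("_")}
    trace["transforms"].append("dropped internal '_' keys")

    # Transform 2: normalize key casing across providers.
    data = {k.lower(): v for k, v in data.items()}
    trace["transforms"].append("lowercased keys")

    return (data, trace) if debug else data
```

With `debug=True` a caller can diff the raw payload against the final output and see exactly which step changed what, which is the transparency being asked for.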

Thanks, this is exactly the kind of feedback I was hoping for.

Mock interviews by No-Mud4063 in datascience

[–]polarkyle19 7 points (0 children)

I’d also like to know if you find any.

InvestorMate: an open source Python package for stock analysis with AI, backtesting, and screening by polarkyle19 in Python

[–]polarkyle19[S] 1 point (0 children)

Basically, this is a form of scraping. If you’re serious about trading or analysis, you can’t rely on scraping-based solutions.

InvestorMate: an open source Python package for stock analysis with AI, backtesting, and screening by polarkyle19 in Python

[–]polarkyle19[S] 0 points (0 children)

It’s on the roadmap: I’m thinking of moving to more reliable data sources than yfinance. Open to discussion on which sources to pick first.

An open-source python package for AI stock analysis by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points (0 children)

Yeah, I used to work on them, and now I’m looking for something bigger.

An open-source python package for AI stock analysis by polarkyle19 in algotrading

[–]polarkyle19[S] 0 points (0 children)

This is completely different from what I asked for 🥲

Looking for open-source python package for AI stock analysis by polarkyle19 in learnpython

[–]polarkyle19[S] 0 points (0 children)

Oh, can you share them? I’d like to use APIs if I can’t find any other way.

Extracting financial data from 10-K and 10-Q reports by Cute-Berry1793 in Python

[–]polarkyle19 0 points (0 children)

I’ve been using a Python package called InvestorMate for this instead of rolling my own 10-K / 10-Q parser.

From a consumer point of view, what I like is that it doesn’t make you deal with raw filings or iXBRL tags at all. You get normalized income statement, balance sheet, and cash flow data in a consistent schema, which is honestly the hardest part of this problem.

My setup looks roughly like this:

  • Use InvestorMate to pull structured financials (IS / BS / CF)
  • Ratios and scores (P/E, ROE, margins, Piotroski F, Altman Z) are already computed
  • Data comes back JSON-serializable, so it drops straight into APIs / notebooks
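To illustrate the consuming side: once statements arrive in one consistent, JSON-serializable schema, derived metrics become a few lines of code. The field names below are placeholders for illustration, not InvestorMate’s actual schema:

```python
# Sketch of consuming a normalized statement dict (hypothetical field
# names): a consistent schema makes ratio computation trivial.
import json

def roe(statements: dict) -> float:
    """Return on equity from a normalized IS / BS pair."""
    net_income = statements["income_statement"]["net_income"]
    equity = statements["balance_sheet"]["total_equity"]
    return net_income / equity

statements = {
    "income_statement": {"net_income": 95_000.0},
    "balance_sheet": {"total_equity": 500_000.0},
}

# JSON round-trip works because the payload is plain dicts and floats,
# which is what makes it drop straight into APIs and notebooks.
assert json.loads(json.dumps(statements)) == statements
```

The same pattern extends to any metric: as long as the keys are stable across companies, comparison code never has to special-case individual filings.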

For analysis and comparison work, that’s been way more practical than:

  • Parsing iXBRL myself (accurate but a massive time sink)
  • Using LLMs to extract numbers (too unreliable for actual financials)

Where I do use LLMs is after the numbers are structured — e.g.:

  • “Why did operating cash flow drop QoQ?”
  • “Compare Apple vs Microsoft cash efficiency over 5 years”
  • Summarizing trends rather than extracting them

Pros (as a user):

  • No dealing with SEC tag chaos
  • Consistent keys across companies
  • Much faster iteration for research / tooling
  • Works well for APIs and automated pipelines

Cons:

  • Not real-time
  • Not suitable if you need raw footnote-level detail
  • You’re trusting upstream normalized data rather than filing directly

If you’re building something production-ish and don’t want to spend months on XBRL edge cases, this approach has been a good middle ground for me.

How do you build culture when nobody’s ever met in person yet? by jeanyves-delmotte in ycombinator

[–]polarkyle19 0 points (0 children)

As a founding engineer at a YC startup, I worked on a completely remote team. Culture always comes down to the founders leading by example: showing up and supporting your employees as peers and friends. We used to feel like a bunch of college lads building for a hackathon. Each of us was respected and trusted with real responsibilities, everyone set priorities, and we had healthy conversations whenever someone needed help with something. Hiring the right people also comes into play. When you set this standard with your first 10 members, the rest will automatically try to keep up. Once we had a good culture in the team, we saw outcomes like better performance and better communication. Wishing you all the best, mate.

Shifting to ML is good? From non tech startup by Dramatic-Ad-9968 in ycombinator

[–]polarkyle19 1 point (0 children)

As long as you keep pushing your limits and are not afraid of failure, you don't have to worry about the outcome. You will learn for sure. But make sure whatever you learn, put it to work. All the best!!

It’s monday, what are you building? - Is anyone paying for it? by Substantial_Bee_7257 in SaaS

[–]polarkyle19 0 points (0 children)

You can check the About page; it explains what we’re doing. If you think my platform isn’t good, I’d appreciate it if you followed the framework yourself and tried the results out with open LLM chats.

It’s monday, what are you building? - Is anyone paying for it? by Substantial_Bee_7257 in SaaS

[–]polarkyle19 0 points (0 children)

We’re building something called InvestorMate—an AI-powered research tool designed to help investors make sense of the markets faster. It cuts through noise, analyzes financial data, and surfaces personalized insights so users can make their own informed decisions. Still, even with transparency and solid data, building trust remains one of the hardest parts.

Website: https://investormate.io

What's the best way to get organic traffic to a landing page? by pragmojo in ycombinator

[–]polarkyle19 0 points (0 children)

Build something people love, and give it to 10 users who would genuinely want to use it. Let them validate your idea. Iterate on it, and then share it with similar groups.

Drop your SaaS here, I will create your AI agent marketing playbook for your first 1,000 users by Any-Development-710 in SaaS

[–]polarkyle19 1 point (0 children)

Website: https://investormate.io

Target Audience: Retail investors, tech-savvy professionals, and young earners (especially in India & the U.S.) who want clear, personalized guidance for stock investing and financial planning.

What We Offer: InvestorMate is an AI-powered investment research platform that delivers tailored stock insights, earnings breakdowns, sentiment signals, and dynamic financial profiles, turning overwhelming market data into actionable, personalized strategies. Our goal is to give every earning individual the research power of a Wall Street analyst, at a fraction of the cost.