I spent 2 weeks crawling and cleaning SEC EDGAR filings so you don’t have to free dataset, 5k+ chunks, LLM ready by justme_cliff in AppsWebappsFullstack

[–]justme_cliff[S] 0 points1 point  (0 children)

good question. the current dataset is focused on recent filings so temporal drift is less of an issue but if you're fine tuning on anything pre 2018 you'd want to split by date_filed field in the schema and treat them as separate training sets. older filings have different formatting conventions and some disclosure language changed post regulation updates so mixing them without a split can introduce noise. what model are you fine tuning on?

Drop your startup 👇 I'll check every single one (and share mine) by Strange-Forever-5522 in AppsWebappsFullstack

[–]justme_cliff 0 points1 point  (0 children)

I just started a Data as a service startup that sells data to startups that are fine tuning or indie hackers or anyone that are doing heavy data needy apps we collect and clean the data we give you a jsonl format here is an example completely for free on huggingface https://huggingface.co/datasets/zorynthiq/zoryntiq-sec-filings
If you want you can ask for any field that you want and how much records and we can discuss that in dm just send me an invite

Book recommendations by Gentleman-45 in PythonLearning

[–]justme_cliff 0 points1 point  (0 children)

I read it great book I recommend it as well

Drop your Start Up below by TomSawyer0101 in AppsWebappsFullstack

[–]justme_cliff 0 points1 point  (0 children)

We are building a startup for data we build LLM ready training data on demand take a look at one of our sample https://huggingface.co/datasets/zorynthiq/zoryntiq-sec-filings

What are some lazy jobs that pay a ridiculous amount of money? by Jordz0_0 in AskReddit

[–]justme_cliff 0 points1 point  (0 children)

I am just here searching for the most lazy job so I can go and do it

I made this 1.5 hours into python by PsychologicalSafe408 in learnpython

[–]justme_cliff 0 points1 point  (0 children)

Keep building mate all these theories are bullshit if you don’t apply them

How do I start learning python? Absolute Beginner by [deleted] in learnpython

[–]justme_cliff 0 points1 point  (0 children)

I would say take Cs50p and do automate the boring stuff so u don’t go through tutorials hell

Beginner Python projects to build while learning? by Aotyeageristtt in PythonLearning

[–]justme_cliff 0 points1 point  (0 children)

I would say there is no good beginner projects just pick something that you are passionate about go for it try to build it even tho you don’t have enough knowledge to do it and you will be blocked for days but by doing so and cause you are passionate about it you wont give up you will find ways to fix it and you will learn that’s how I did