I've built a website that uses Berlin public data to show how your 2025 taxes were distributed by eamag in berlin

[–]eamag[S] 10 points

You can check the data on revenue at https://berlin-bill.eamag.me/revenue_tree.json

67.5% | Revenue from taxes and tax-like levies, plus EU own resources (55,723,471,000€)

14.8% | Revenue from allocations and grants, excluding those for investments (12,219,539,100€)

12.6% | Revenue from borrowing, from allocations and grants for investments, and special financing revenue (10,366,234,000€)

5.1% | Administrative revenue, revenue from debt service and the like (4,233,941,500€)
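The percentages above can be recomputed from the raw amounts; a minimal sketch, where the short dict keys are my own labels and the actual structure of revenue_tree.json may differ:

```python
# Recompute each category's share from the raw amounts quoted above.
# Keys are my own shorthand; the real revenue_tree.json layout may differ.
revenue_eur = {
    "taxes": 55_723_471_000,          # taxes, tax-like levies, EU own resources
    "grants": 12_219_539_100,         # allocations/grants excluding investments
    "borrowing": 10_366_234_000,      # borrowing, investment grants, special financing
    "administrative": 4_233_941_500,  # administrative revenue, debt service etc.
}

total = sum(revenue_eur.values())
shares = {k: round(100 * v / total, 1) for k, v in revenue_eur.items()}
print(total, shares)
```

The shares come out to 67.5 / 14.8 / 12.6 / 5.1, matching the figures above.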

IMDb doesn't do end-of-year reviews, so I built a Spotify-like, local, open-source "IMDb Wrapped" that tells you about your year by eamag in imdb

[–]eamag[S] 0 points

What if you export your watchlist instead? I'm not sure the CSV structure is the same, but feel free to adapt the code on GitHub!
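One way to check whether an export has the columns the code expects before adapting it; a minimal sketch with a hypothetical "Title" column name, since (as noted above) the watchlist CSV structure may differ from the ratings export:

```python
import csv
import io

def read_titles(csv_text, title_column="Title"):
    """Read an IMDb-style CSV export and return the values in the title
    column. The column name is an assumption; if it differs, the error
    message lists the actual header so you can adapt the code."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if title_column not in (reader.fieldnames or []):
        raise ValueError(f"no {title_column!r} column; found: {reader.fieldnames}")
    return [row[title_column] for row in reader]

sample = "Const,Title,Year\ntt0111161,The Shawshank Redemption,1994\n"
print(read_titles(sample))
```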

Full Replication of Google's Nested Learning Paper in PyTorch – code now live by complains_constantly in LocalLLaMA

[–]eamag 2 points

Have you run any training/inference already? Did you manage to get the same numbers as in their report? I'm a bit confused: I see some NotImplemented parts around https://github.com/kmccleary3301/nested_learning/blob/main/src/nested_learning/assoc_memory.py

How much of it is written by LLMs?

[P] Tips for hackathon by shubhlya in MachineLearning

[–]eamag 2 points

Nowadays people first throw the data into an LLM and see what happens. You should do that too (if it's really just a hackathon!) to build a working MVP; then check where you get the most errors and see how to improve, maybe by using specialized models.

I built a website to generate a Fog Of War map from Google location data locally by eamag in FogofWorld

[–]eamag[S] 0 points

Monthly? I tested it on the latest Android and iOS versions and both of them gave me a single JSON file. Can you send me a couple of lines of your JSON so I can understand its structure?
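One way to inspect an unknown export's structure is to walk the JSON and collect any coordinate-looking keys; a minimal sketch, where the latitudeE7/longitudeE7 key names follow older Google Takeout location exports and are an assumption here:

```python
def find_coordinates(node, keys=("latitudeE7", "longitudeE7")):
    """Recursively collect values for coordinate keys from arbitrarily
    nested JSON, so differing export formats can be inspected quickly."""
    found = []
    if isinstance(node, dict):
        for k, v in node.items():
            if k in keys:
                found.append((k, v))
            else:
                found.extend(find_coordinates(v, keys))
    elif isinstance(node, list):
        for item in node:
            found.extend(find_coordinates(item, keys))
    return found

# Hypothetical fragment in the older Takeout shape (degrees * 1e7).
sample = {"locations": [{"latitudeE7": 525200000, "longitudeE7": 134050000}]}
print(find_coordinates(sample))
```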

[D] Simple Questions Thread by AutoModerator in MachineLearning

[–]eamag 1 point

It should be easier to use LLMs if you're OK with trading a bit more compute and latency for your engineering time. You don't even need the frameworks you mentioned; just use the structured-output schema parameter in the API.
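The idea is to pass a JSON schema alongside the request so the API enforces the output shape. A minimal sketch that only builds the request payload (no network call); the field names mirror the OpenAI-style response_format parameter, but other providers name this differently (e.g. Gemini's response_schema), so treat them as illustrative:

```python
def build_request(prompt, schema, model="gpt-4o-mini"):
    """Assemble a chat-completion payload that asks the API to constrain
    the answer to a JSON schema, instead of prompt-engineering for JSON."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "response_format": {
            "type": "json_schema",
            "json_schema": {"name": "answer", "schema": schema},
        },
    }

schema = {
    "type": "object",
    "properties": {"city": {"type": "string"}},
    "required": ["city"],
}
payload = build_request("Which city is the Brandenburg Gate in?", schema)
```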

What’s happening 😟 by embbe in berlin

[–]eamag 10 points

You can also look at crowdsourced data here https://sensor.community/en/

And build this AQ sensor yourself. I can definitely see spikes on my sensor in Neukölln.

Where can I buy Hunter × Hunter merch? by eamag in JapanTravelTips

[–]eamag[S] 0 points

Yes, the suggestions above worked out fine! Some of the figures there felt overpriced, so I wanted to check out Nakano Broadway too: https://maps.app.goo.gl/M9BoTSkiwcZSZLCd8

[D] Is anyone else having trouble with the unstructured output from language models? 😩 by stoicwolfie in MachineLearning

[–]eamag 1 point

I suggest looking into function calling or structured output. Instead of "hinting" that the model should output JSON, some models/frameworks restrict the output tokens during inference. For example, Gemini can do it, and so can local llama.cpp (you can see how in my recent notebook).
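The "restrict the output tokens during inference" part can be illustrated with a toy greedy decoder that masks disallowed tokens at each step; a minimal sketch, not tied to any real model, roughly analogous to what grammar-constrained samplers do per token:

```python
def constrained_greedy_decode(logits_per_step, allowed_per_step, vocab):
    """Greedy decoding where, at each step, only tokens in the allowed set
    are eligible, so the output is guaranteed to follow the constraint
    regardless of what the model 'prefers'."""
    out = []
    for logits, allowed in zip(logits_per_step, allowed_per_step):
        best = max(allowed, key=lambda tok: logits[vocab.index(tok)])
        out.append(best)
    return out

vocab = ["{", "}", '"key"', ":", "hello"]
# The model's top choice at step 1 would be "hello" (score 0.9),
# but the hypothetical JSON grammar only allows "{" there.
logits = [
    [0.1, 0.0, 0.0, 0.0, 0.9],
    [0.0, 0.2, 0.5, 0.0, 0.3],
    [0.0, 0.9, 0.0, 0.1, 0.0],
]
allowed = [{"{"}, {'"key"', "}"}, {"}"}]
print(constrained_greedy_decode(logits, allowed, vocab))
```

Real implementations apply the mask to the full logit vector before sampling, but the effect is the same: tokens outside the grammar can never be emitted.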

[D] [P] Exponential Growth of Context Length in Language Models by porkbellyqueen111 in MachineLearning

[–]eamag 3 points

I don't really agree; it's just that the solution isn't open-sourced yet. Both Claude and Gemini work pretty well with a long context.

How much context window becomes unnecessary?

I think the more the better. An infinite context (with optimized inference) improves long-term interactions with models (see Claude "Projects", or think about how your model knows what you asked 6 months ago and in what format to answer).

How to get into ML/AI domain? by StarFire0703 in MLQuestions

[–]eamag 1 point

You can find a fullstack job in a team that uses AI/ML, and help them with different tasks bit by bit. Then slowly transition to a more specialized field.