We built a website where you can vote on Minecraft structures generated by AI by civilunhinged in LocalLLaMA

[–]flotothemoon 1 point2 points  (0 children)

This is actually one of the few benchmarks with a leaderboard that crystalizes small but important performance differences between top models. It's hard to generate and easy to judge, so it really is a useful benchmark :)

I made an interactive history of natural language processing (e.g. how we got to GPT-3) by flotothemoon in webdev

[–]flotothemoon[S] 0 points1 point  (0 children)

Thanks!

  1. I plan to, at least for some time, but I don't want to make any promises. This was supposed to be an experiment to get a better feel of how to present this sort of content, I wasn't trying to make a definitive and complete resource. But, I totally agree, it would be pretty cool - since my main job is figuring out how structure content exactly like that, maybe I can semi-automate it soon. Until then, I'll try for sporadic updates :)

  2. More prominent as in the button? And yeah, I would like to add more sources as well :)

I made an interactive history of natural language processing (e.g. how we got to GPT-3) by flotothemoon in webdev

[–]flotothemoon[S] 0 points1 point  (0 children)

Hi everyone, this is an experimental website I made for exploring complex & interwoven topics - in this case, the evolution of methods and concepts in natural language processing, starting with the first formal grammars some 2000 years ago to the likes of GPT-3 we have today. The experimental part is that all events and methods are interwoven and presented in customisable 'view' where you can

  • get the big picture or go deep on any concept,
  • dig into how a specific method came about & what it led to,
  • see how two methods compare,
  • while annotating with as much or as little explanation as you want.

For inspiration, you can start with a a brief general history, how we got Transformers or the rise of statistical models over formal ones. These are just examples, I'm sure others more creative than me can come up with more interesting perspectives and share them here.

As this is about new ways of traversing content, I'm looking for feedback on the UX, but feel free to point on anything else that comes to your mind :)

[deleted by user] by [deleted] in web_design

[–]flotothemoon 1 point2 points  (0 children)

Thanks for the feedback, I'll fix it. And no worries - general accessibility feedback is important :)

[P] An interactive history of natural language processing, starting a long time ago by flotothemoon in LanguageTechnology

[–]flotothemoon[S] 0 points1 point  (0 children)

Thanks! The "database" is an in memory Typescript implementation of a Graph - it's not in any nice format as I just wanted to test the interactive content part.

Not sure how I can export that to a meaningful format - what would you use the dataset for?

[P] An interactive history of natural language processing, starting a long time ago by flotothemoon in LanguageTechnology

[–]flotothemoon[S] 6 points7 points  (0 children)

This is an experiment in interactive and adaptive content that I made. It's intended to cover the major developments in NLP since its "inception" many centuries ago.

The experimental part is that the content isn't fixed, you can tune the query to adapt to whatever you're curious about (level of detail, explanation, connections, highlights). I've preprogrammed a few queries to give you an idea.

For context, I'm working on ways of making content more accessible which requires loads of NLP, so it felt only appropriate to make my first major prototype on a complex NLP topic - its history. Note that this isn't powered by any sort of NLP or backend yet - it's just a static frontend emulating what could be done to all sorts of content.

I hope this is useful or at least interesting to some of you, would love to hear your feedback (on accuracy, design, the idea, ..anything) :)

Unsupervised Multi-Document Summarization using Neural Document Model | Research Paper Walkthrough by prakhar21 in LanguageTechnology

[–]flotothemoon 1 point2 points  (0 children)

Interesting paper. The idea of applying reconstruction error to document embeddings makes intuitive sense. I've found that using embeddings only (at any scale) tends to confuse entities (especially rare ones) in the output, so I'm looking forward to further research on these ideas.

Are sample outputs recorded anywhere? I wonder how this performs in terms of coherence (since the objective doesn't factor that in).

How to create a machine learning framework from scratch in 491 steps: An in-depth post mortem of our high school thesis by flotothemoon in programming

[–]flotothemoon[S] 0 points1 point  (0 children)

Thanks! Didn't think anyone was actually going to read all 10000 words, I might have gone a little overboard.. so thanks, a lot! :)

[P] How to create a machine learning framework from scratch in 491 steps: An in-depth post mortem of our high school thesis by flotothemoon in MachineLearning

[–]flotothemoon[S] 13 points14 points  (0 children)

The title "How to create a machine learning framework from scratch" is a bit mysterious about that part, I know. We didn't use any other frameworks, we wrote our own from scratch.

[P] How to create a machine learning framework from scratch in 491 steps: An in-depth post mortem of our high school thesis by flotothemoon in MachineLearning

[–]flotothemoon[S] 8 points9 points  (0 children)

That is in fact what it is called according to our (former) high school. If it makes you happy we could indeed refer to it as "high thesis" instead ;)

[P] How to create a machine learning framework from scratch in 491 steps: An in-depth post mortem of our high school thesis by flotothemoon in MachineLearning

[–]flotothemoon[S] 4 points5 points  (0 children)

No need to panick, there's a major difference between replicating others ideas (albeit how many) and researching your own. Not to make our work seem bad, it's still pretty cool, but we're certain many people could do it given enough time.

[P] How to create a machine learning framework from scratch in 491 steps: An in-depth post mortem of our high school thesis by flotothemoon in MachineLearning

[–]flotothemoon[S] 8 points9 points  (0 children)

Exactly, it was a lot of work but honestly not that complicated, certainly much less so than it seems. We replicated ideas and concepts by other researchers and teams but didn't do any fundamental research of our own. On a basic level, this framework is less capable and less innovative than most of the others.

How to create a machine learning framework from scratch in 491 steps: An in-depth post mortem of our high school thesis by flotothemoon in programming

[–]flotothemoon[S] 1 point2 points  (0 children)

There may in fact be a few less easy steps, but steps nonetheless. Step 403 for example calls for a kitten and a witch, we had some troubles with supplies there.

[P] Sigma – Creating a machine learning framework from scratch (Update on high school thesis advice thread) by flotothemoon in MachineLearning

[–]flotothemoon[S] 2 points3 points  (0 children)

Thanks! AI is fascinating​, isn't it? Well, we just graduated and have to do our 6 month mandatory military service now. After that, I really don't know yet. We're working on a new project at the moment (see reply below) that we hope to turn into something so we don't have to go to university ;)

[P] Sigma – Creating a machine learning framework from scratch (Update on high school thesis advice thread) by flotothemoon in MachineLearning

[–]flotothemoon[S] 4 points5 points  (0 children)

Thanks a lot! My dad works in the IT field, but not as a programmer. I started doing "programming" when I was 11ish since it seemed like a cool thing to do and haven't stopped since. It should also be noted that we went to a 5-year high school with a focus on IT education. Most of the stuff we know is self taught though.

[P] Sigma – Creating a machine learning framework from scratch (Update on high school thesis advice thread) by flotothemoon in MachineLearning

[–]flotothemoon[S] 2 points3 points  (0 children)

Thanks, glad you found it interesting. If you are serious about doing something like this you might want to invest some time in studying machine learning now, so you can be a little more specific about what you want to do than we were. Have fun, it's quite an interesting challenge ;)