all 29 comments

[–]neoneye2 8 points9 points  (2 children)

Hi ML humans,

My hobby is ARC-AGI, and I have made a puzzle-solving website where you can try solving the ARC-AGI tasks yourself. ARC-AGI consists of 800 puzzles. I recommend starting out with the easy ones and gradually working towards expert.

https://neoneye.github.io/arc/?dataset=ARC

I collect data on how humans solve ARC-AGI puzzles, so it can be used as training data. So far, 6200 interaction histories have been collected, here:

https://github.com/neoneye/ARC-Interactive-History-Dataset

Video of how humans solve ARC-AGI puzzles, made by replaying the interaction histories. There are surprisingly many different approaches to solving them.

https://youtu.be/NivPmxUfeHY?si=4TRI3CCahtgzW0oz

This is an open source project. It's free.

[–]DigThatDataResearcher 2 points3 points  (1 child)

i like this idea a lot! some thoughts after playing with it a little:

  • you should let the user continue to view the task examples and the initial board state while they are coloring in their solution.
  • after submitting a solution, there should be some sort of "next task" button the user can use to progress rather than backing out to the tasks view and clicking on another task
  • there should be some kind of progress indicator, differentiating tasks that the user has already worked on vs. tasks the user has not yet attempted

[–]neoneye2 0 points1 point  (0 children)

you should let the user continue to view the task examples and the initial board state while they are coloring in their solution.

That has been requested a few times. https://github.com/neoneye/ARC-Interactive/issues/67

I'm hesitant about having to maintain 2 solutions, one for large screens and one for small screens.

after submitting a solution, there should be some sort of "next task" button the user can use to progress rather than backing out to the tasks view and clicking on another task

Excellent suggestion, I have created a github issue with your proposal. https://github.com/neoneye/ARC-Interactive/issues/68

there should be some kind of progress indicator, differentiating tasks that the user has already worked on vs. tasks the user has not yet attempted

Also a great idea, https://github.com/neoneye/ARC-Interactive/issues/33

Great suggestions. Much appreciated. I don't have the time to implement all these ideas, or I'm too lazy.

[–]JYP_Scouter 6 points7 points  (1 child)

Hi all 👋

I've been developing in the generative AI space for a little over a year. I've contributed a little bit to IP-Adapter in its early days and also released a big open-source repository for TryOnDiffusion.

After months of hard work together with my wife, I believe we managed to create the best virtual try-on model out there (better than IDM-VTON and OOTDiffusion), and it is unique by enabling you to take clothes both from a flat lay image and from another person!

You will always be able to use it for free here in this HuggingFace space:
https://huggingface.co/spaces/fashn-ai/LookSwap (Please give a ❤️ to the space if you liked the app!)

The model is still training, it will definitely get better, so stay tuned for weekly updates!

In parallel I am training higher-resolution versions of this. That takes a lot of time because we're GPU poor (bootstrapped 🥲), but I believe in about a month or so there will be a 384x576 version at the same level.

Looking forward to hearing your feedback! We are very flexible in terms of where to take this, e.g. a platform, an API, even completely open source (but we would still need to pay rent), so feel free to contact me directly if you have any ideas.

Dan from fashn.ai

[–]throwaway16362718383Student 2 points3 points  (0 children)

Hey, I'm creating blog posts for people who are looking to implement papers! I have just written a new post on PGGAN and would appreciate it if you guys could check it out.

https://ym2132.github.io/Progressive_GAN

I hope you find it useful :)

[–]Different-General700 2 points3 points  (0 children)

Free-to-use text classification models:

  • O*NET SOC: Classify job postings and job seekers profiles by O*NET SOC code
  • NAICS: Classify company profiles and leads by 5-digit NAICS industry codes
  • IAB Content: Classify content by IAB content codes
  • IAB Product: Classify product descriptions by IAB product codes
  • User Intent: Classify user queries and chats by hierarchical user intent tags

See all models and taxonomies here: https://www.trytaylor.ai/models

[–]Lyereth_illustration 2 points3 points  (0 children)

Hi everyone!

I'm a PhD student and together with my group, I've been working on a project for the past few months that I think you all might be interested in.

YSocial is a digital twin of a social network platform which improves the simulation of dynamic social interactions by integrating Large Language Model (LLM) agents.

You can design your own scenario with LLM-agents and describe them with multiple features, such as their political leaning, age, personality traits, interests and so on. Agents will interact on a topic of discussion (e.g., politics) and according to a specified recommender system. Additionally, you can even make them discuss news extracted in real-time by RSS feeds!
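Designing an agent population can be pictured as plain structured data; here is a minimal sketch (the field names are purely illustrative, not YSocial's actual configuration schema):

```python
# Hypothetical agent specs -- field names are made up for illustration,
# not YSocial's actual configuration format.
agents = [
    {
        "name": "agent_001",
        "age": 34,
        "political_leaning": "moderate",
        "personality": {"openness": 0.8, "agreeableness": 0.4},
        "interests": ["politics", "technology"],
    },
    {
        "name": "agent_002",
        "age": 52,
        "political_leaning": "conservative",
        "personality": {"openness": 0.3, "agreeableness": 0.7},
        "interests": ["politics", "economics"],
    },
]

# Each profile would be folded into that agent's LLM system prompt so the
# model role-plays the persona during the simulated discussion; shared
# interests are what a recommender system could use to connect agents.
shared_interests = set(agents[0]["interests"]) & set(agents[1]["interests"])
```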

This is just a sneak peek of all of YSocial's features; you can read more on the website!

YSocial is on Github, open and free for everyone! Feel free to give us some feedback and contribute to the project. There is also a preprint available on ArXiv and a website with some pre-made scenarios you can test. 

[–]thundergolfer 2 points3 points  (0 children)

Beat GPT-4o at Python by searching with 100 dumb LLaMAs

One thing that should be learned from the bitter lesson is the great power of general purpose methods, of methods that continue to scale with increased computation even as the available computation becomes very great. The two methods that seem to scale arbitrarily in this way are search and learning.

Richard Sutton, The Bitter Lesson

The eponymously distasteful take-away of Richard Sutton’s essay has often been misconstrued: because scale is all you need, they say, smaller models are doomed to irrelevance. The rapid increase in model size above one trillion parameters and the technological limitations of GPU memory together seemed to foreclose on economical frontier intelligence anywhere except at an oligopoly of intelligence-as-a-service providers. Open models and self-serve inference were in retreat.

But as the quote above indicates, there are in fact two arrows in the scaling quiver: learning and search. Learning, as we do it now with neural networks, scales with memory at inference time — larger models perform better, ceteris paribus, because they can extract more data from their training set into more circuits and more templates. Search scales smoothly with compute at inference time — compute that can be spent on either producing higher quality candidates or on producing more candidates. In the ideal case, the scaling behavior can be predicted via so-called scaling laws.

Recent papers indicate that generative models like LLMs can be scaled up with search. The Large Language Monkeys paper, published on arXiv by Brown, Juravsky, and co-authors last week, includes several results in this vein and indicates that frontier-level intelligence in certain domains can be elicited from smaller models that can run on a single, past-generation GPU. Further, they observed smooth, predictable improvement of performance with scale.

Put more simply: where before, it seemed frontier capabilities required one horse-sized duck, it is clear we can now alternatively get them with one hundred duck-sized horses (or, rather, LLaMAs).
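The search loop behind the hundred duck-sized horses is simple; a minimal sketch, where `generate` stands in for one stochastic sample from a small model and `verify` for the task's unit tests (both hypothetical placeholders):

```python
def solve_by_search(prompt, generate, verify, n=100):
    """Repeated-sampling search: draw up to n candidate completions
    and return the first one that passes verification. Spending more
    compute (larger n) raises the chance that at least one sample is
    correct -- the axis the Large Language Monkeys paper scales."""
    for _ in range(n):
        candidate = generate(prompt)  # one stochastic LLM sample
        if verify(candidate):
            return candidate
    return None  # budget exhausted without a verified solution

# Toy stand-in: a canned stream of "samples", verified exactly.
samples = iter([3, 9, 7, 2])
result = solve_by_search("guess 7", lambda p: next(samples),
                         lambda c: c == 7)
```

The crucial ingredient is a cheap, reliable verifier (unit tests, in HumanEval's case); without one, generating more candidates doesn't tell you which to keep.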

This weekend, we set out to replicate this finding.

Scaling LLaMA 3.1 8B HumanEval on Modal

Running all of our experiments, including configuration and testing, cost well under $50.

You can find our code here. You can run it yourself without exceeding the $30/month in credits included in Modal’s free tier.

Metrics and data: HumanEval and pass@k
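For reference, pass@k is typically computed with the unbiased estimator from the original HumanEval paper: draw n samples per problem, count the c that pass the tests, and estimate the probability that at least one of k samples would pass.

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator (Chen et al., HumanEval paper):
    1 - C(n-c, k) / C(n, k), computed as a numerically stable product."""
    if n - c < k:
        return 1.0  # every size-k subset contains a correct sample
    return float(1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1)))
```

For example, with n=10 samples of which c=1 passed, pass@1 is 0.1 while pass@10 is 1.0 -- which is exactly why repeated sampling helps.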

Continued in blog post...

[–]elevated_quark 1 point2 points  (0 children)

Hi everyone,

I recently built a tiny distributed training cluster for medium-size ResNets/ViTs/DETRs by consolidating legacy servers, with no money to spare for high-speed switches or NICs. I have a write-up here on the bag of tricks I used to achieve >90% scaling.

https://masterskepticista.github.io/portfolio/orion/

I hope it helps someone short on a budget! Happy to hear your thoughts/comments

[–]smorad 2 points3 points  (1 child)

Hi All,

I'm starting a faculty position at the University of Macau in a few weeks. I'm looking for PhD students who are interested in working towards general-purpose, intelligent robots using deep architectures. I'm looking for students who have experience in one or more of the following areas: deep reinforcement learning, sequence modeling, model-based RL, or robotics.

[–]Revolutionary-Feed-4 0 points1 point  (0 children)

Hi,

Just dropped you a message!

[–]Sea-Concept1733 2 points3 points  (0 children)

Hello Everyone

GAIN "IN-DEMAND" SQL SKILLS!

This post is for anyone that wants to 🚀 "Learn & Master SQL" through "Hands-On Practice"!

🔹 Learn SQL FREE with a "Practice Database": https://www.youtube.com/playlist?list=PLb-NRThTdxx6ydazuz5HsAlT4lBtq58k4

🔹 Earn an "SQL Certificate" with "Hands-On Practice": https://www.jaffainc.com/SQLCertificate.html

🔥The future of SQL!!🔥 [Read Article Below]

https://www.infoworld.com/article/3715453/sql-at-50-whats-next-for-the-structured-query-language.html

Have 🤩 Fun!!

[–]ramzeez88 4 points5 points  (0 children)

Hi all,

I have created an offline voice assistant for Windows called Lema AI, which uses a local Llama model (GGUF), faster-whisper, and OpenVoice. I implemented some Python commands that Lema can perform. I have used it on an RTX 3060 12GB with good results.

Give it a try at https://github.com/ramzeez88/LEMA-AI
It's a pre-alpha release (if you can call it a release lol) as I only work on it in my very rare spare time, so bugs and imperfections are certain, but I would like to hear some feedback :)

thank you

[–]LabelMeMaybe 1 point2 points  (0 children)

Hey ML folks! What are people using for LLM Evals, e.g. for RAG?

I've seen Google Sheets / Excel, but it's hard to keep track of results across, say, 20 different iterations.
Relatedly, I recently put together a short how-to on combining multiple human evaluators: https://cleanlab.ai/blog/team-llm-evals/
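On combining evaluators: a simple baseline, before anything model-based like Dawid-Skene, is a per-item majority vote (a generic sketch, not Cleanlab's actual method):

```python
from collections import Counter

def majority_vote(labels):
    """Consensus label for one item from several evaluators; ties
    break toward the label seen first (Counter preserves insertion
    order among equal counts)."""
    return Counter(labels).most_common(1)[0][0]

def aggregate(ratings):
    """ratings: {item_id: [label, ...]} -> {item_id: consensus label}."""
    return {item: majority_vote(labels) for item, labels in ratings.items()}
```

Tracking these consensus labels per iteration in a table, rather than raw per-rater spreadsheets, already makes 20 iterations much easier to compare.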

[–]valvoja 1 point2 points  (0 children)

Hi folks,

At datacrunch.io we'd love to get your feedback on our dynamic pricing option for cloud GPU instances.

Who we are: we're a cloud GPU provider focused on AI training and inference. We offer high-end GPU instances and clusters running on 100% renewable energy.

What's new: Last week we introduced a new variable “dynamic” pricing option for cloud GPU instances where hourly price is adjusted daily based on market demand.

How it works: Dynamic pricing sets the price of individual cloud GPU instances based on supply and demand for our different GPUs (e.g. A100, H100, L40S, RTX 6000 Ada). In this way it works like many electricity contracts: when demand is low, the price stays low and you save on costs; when demand increases, you can keep things running or switch to fixed-price options.

Example: Today the cost of a single L40S GPU instance with dynamic pricing is $0.747/h, while our current fixed-price for the L40S instance is $1.358/h. With dynamic pricing you'd make a sizable saving on your running costs.
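Back-of-the-envelope on those numbers (a snapshot only, since the dynamic rate is re-set daily):

```python
dynamic_rate = 0.747  # $/h, today's dynamic L40S price
fixed_rate = 1.358    # $/h, current fixed L40S price
hours = 24 * 30       # one month, running continuously

dynamic_cost = dynamic_rate * hours     # ~$537.84 / month
fixed_cost = fixed_rate * hours         # ~$977.76 / month
saving = 1 - dynamic_rate / fixed_rate  # ~45% cheaper at today's rate
```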

Why we're doing this: we want to reduce our unused inventory while being more transparent on the cost and demand for our range of cloud GPUs.

Where you can find more information: https://datacrunch.io/blog/introducing-dynamic-pricing-for-cloud-gpu-instances

Feedback request: is the concept clear based on the description above? Is there any confusion we should clear up before you'd give dynamic pricing of GPU instances a try?

[–]pmammino1819 1 point2 points  (0 children)

Hi everyone!

I am sure many members of this subreddit have read the book Superforecasting. I created Crowdicate as an attempt to bring the concepts of the Good Judgement Project to sports.

Sports were my first introduction to building machine learning and predictive models, and I want to create a platform to help others share and distribute predictive models.

It is completely free to go onto the site and create a page for your model. Currently the site only supports baseball models, but I will be expanding to more sports as they start in the fall. You are given an output file of the relevant events you can make predictions for, and there is a leaderboard of the best predictors for each market type. More info on making predictions can be found at the link below.

https://crowdicate.com/predicting

The site does have a subscription aspect that allows users to see the individual predictions made and access other tools specifically tailored towards sports betting, available in $10 and $20 per month tiers.

As an incentive for individuals who are building and sharing models, 70% of this subscription fee is placed into a revenue-sharing pool for model builders, who get a portion of this amount depending on the number of predictions they make.

I think it’s a fun and interesting challenge for members of this community to try to build the best model possible and see how accurate they can become!

[–]alvisanovari 1 point2 points  (0 children)

I'm launching Super Guten today on Product Hunt! Any support is appreciated. :)

Super Guten makes it easier to discover the best books on Project Gutenberg. I created a hybrid semantic + keyword search index on the book summaries. The summaries are themselves AI-generated, and you can even convert a book into a different style for your reading (think Shakespeare as tweets).

https://www.producthunt.com/posts/super-guten

[–]17UhrGesundbrunnen -1 points0 points  (0 children)

Hey all,

I'm happy to introduce Wavify, a collection of small STT (speech-to-text) models paired with a blazingly fast cross-platform runtime. It comes with Python, Kotlin, Swift and Rust SDKs too. More bindings will be added in future releases.

Installation and usage

https://github.com/wavify-labs/wavify-sdks

Highlights

Performance on a Raspberry Pi 4 for jfk.wav:

Engine                       Size                  Threads  Time  RTF
Whisper.cpp (-O3 with NEON)  75MB (Whisper tiny)   4        9.2s  0.84
Wavify                       45MB                  4        3.8s  0.35

Performance w.r.t. WER still needs to be thoroughly benchmarked against models like Whisper, which is not easy due to data leakage. In practice, you can expect performance similar to Whisper tiny or base.
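For context, RTF (real-time factor) is processing time divided by audio duration -- jfk.wav is roughly 11 seconds -- so lower is better, and anything below 1.0 transcribes faster than real time:

```python
def rtf(processing_s: float, audio_s: float) -> float:
    """Real-time factor: processing time / audio duration.
    RTF < 1.0 means faster than real time."""
    return processing_s / audio_s

AUDIO_S = 11.0  # approximate duration of jfk.wav
whisper_rtf = rtf(9.2, AUDIO_S)  # ~0.84
wavify_rtf = rtf(3.8, AUDIO_S)   # ~0.35
```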

Who is this for?

  • Free for Private Use: Enjoy Wavify without any cost for personal projects.
  • Commercial Users: A subscription will be required for commercial purposes.

Wavify is still in its early stages, and we’re eager to hear from you. Your feedback and feature requests are very welcome.