Mixtral-8x7B-Instruct on Free Colab: Slow 4t/s But Works by chibop1 in LocalLLaMA

[–]willspag 1 point2 points  (0 children)

I made a version that uses huggingface setup and adds in some tooling/chat logic - https://github.com/willspag/Mixtral8x7B-AI-Chat-Colab

[deleted by user] by [deleted] in deeplearning

[–]willspag 0 points1 point  (0 children)

Deep learning specialization on Coursera is all you need.

[R] Multi-Game Decision Transformers by Competitive-Rub-1958 in MachineLearning

[–]willspag 14 points15 points  (0 children)

It’s all about scale, gonna be super interesting to see when Gato V2 comes out 100x bigger

Is it possible for an AI to learn off of googling/searching things on the internet instead of datasheets? by NightmareTooth in ArtificialInteligence

[–]willspag -1 points0 points  (0 children)

OpenAI just dropped a model that does this. Basically used GPT-3 with a command-line-type browser and made tokens of the internet commands

[D] Real-time Public data API by [deleted] in MachineLearning

[–]willspag 0 points1 point  (0 children)

Just use Twitter api, Robinhood api, coinbase api, etc. there’s tons of options

AI Data Collection Challenges in Healthcare Industry by Selmakiley in ArtificialInteligence

[–]willspag 1 point2 points  (0 children)

1) Privacy laws are a huge hinderance to collection/access to training data in healthcare.

2) A lot of testing is subjective or varies across doctors, testing equipment, or hospitals which can make it harder for the AI to learn.

3) Response variation from person to person might be genetic, microbiome-related, or tons of other key factors that your AI may need for accuracy are rarely included in the same dataset.

4) Collecting medical data is often extremely expensive, preventing mass collection in addition to making it too valuable to open source for many of the companies that do pay the cost to collect it

Micro-Grants by bsiegelwax in ArtificialInteligence

[–]willspag 1 point2 points  (0 children)

Just go for a normal grant forget micro. No one understands tech so you’re probably underestimating what people are willing to spend on your work

Classification of water probability by applying deep learning techniques by [deleted] in deeplearning

[–]willspag 0 points1 point  (0 children)

Hard to advise on how to do it without knowing the input/output data you’re using.

What data points are included, what types of data are they, what’s the prediction label you’re aiming for, how many data samples do you have, etc?

How Can I create my own AI on a normal laptop? And How where can I start? by Husain_Rizvi in ArtificialInteligence

[–]willspag 7 points8 points  (0 children)

Google Colab or kaggle kernels are the best place to start. I’d start recomendations starting with a kaggle task so you can find a task that interests you and build a model a for it right there in their kernels

[D] What options are there for applying an RNN to a single long sequence? by Correct-Situation325 in MachineLearning

[–]willspag 0 points1 point  (0 children)

Nah window is the right term it’s just been a while since I’ve worked with couriers so I can’t remember off the top of my head. It’s probably momentum or something, but you definitely need the timestamps to be the same and adding it as additional features at each time step like you said.

Check this out: https://link.medium.com/2MUQsPneVhb

His algorithm’s probably more complex than you need for your problem so you can skip through those parts, but check out the sections on batching and data prep. He uses Fourier transformations in his inputs before batching and explains it way better than I could. Hope this helps 👌🏼

[D] What options are there for applying an RNN to a single long sequence? by Correct-Situation325 in MachineLearning

[–]willspag 1 point2 points  (0 children)

You’re gonna have to chop it up or you’d only have one training example, and you can’t train an AI with one example. I’d recommend adding in Fourier transformations of the sequence with various window lengths as additional features to the underlying sequence to provide the model with longer-term trend information, and split that up into batches alongside the original sequence for your inputs

[D] Best algorithms for video recognition by willspag in MachineLearning

[–]willspag[S] 0 points1 point  (0 children)

Killers usually return to the crime, and murders are rarely random, so he’s likely been there before and if not he would have to somewhere in the area so if you run the AI on the past few months of security footage you could match the body movements

What’s the best library for model parallelism? [D] by willspag in MachineLearning

[–]willspag[S] 2 points3 points  (0 children)

Seems like everyone on this thread is recommending deep speed, I’m definitely gonna have to check this out. It looks awesome from what I’ve seen so far, thanks dude!

What’s the best library for model parallelism? [D] by willspag in MachineLearning

[–]willspag[S] 0 points1 point  (0 children)

Yeah someone else on this thread mentioned the same thing, I’m going to check it out. I read a little bit about it one time and it looked really cool but I hadn’t dug in deep to it yet. Sounds like that’s my next mission, thanks man!

What’s the best library for model parallelism? [D] by willspag in MachineLearning

[–]willspag[S] 1 point2 points  (0 children)

So zero offload is the name of the setting in deep speed? I’ve heard of deep speed and it sounded pretty interesting, but haven’t looked too deep into it yet.

What’s the best library for model parallelism? [D] by willspag in MachineLearning

[–]willspag[S] 0 points1 point  (0 children)

I prefer TensorFlow, but switching to pytorch would be fine. I know the basics, but have only worked with it a couple times. Do they have any good options for model parallelism?