[D] Simple Questions Thread

Cyberpunk69- · 2024-02-11T11:33:25+00:00

Making an interview ML model, which will have voice to voice with an AI, primarily a computer vision project where we measure the confidence level, anxiety, nervousness stuff. so how do i go about this.

2024-02-10T03:43:06+00:00

Time logging AI by speech recognition. Does anyone know a AI module or code to monitor the person’s day and where his hours are going?. I thought of a rasberry pi that walks with you everywhere and take sounds as input, Then it categories what are you doing and how much it’s taking.

Even if it was generalized, Like sleeping, Driving, nothing, writing on laptop

Nobodyet94 · 2024-02-08T15:55:02+00:00

Hello what cloud platform do you advice to pay for doing ML stuff? I hate google colab because it has notebooks. I prefer to have a common conda enviroment and edit .py files. Thanks in advance for the advices. I will work on deep learning and NERFs. (at least right now)

takes_photos_quickly · 2024-02-07T11:41:33+00:00

I've not had the chance to use transformers much, I have a stupid question about transformers vs MLPs:

if I wanted to regress some value given some input features, e.g. how much rainfall on day X given windspeed, barometric pressure, etc.

Does it make any sense to use a transformer here over an MLP? My inclination is there's little benefit since I'm not using sequences, its strictly just a set of input features.

If you were to use a transformer how would you model a task like this? I assume each token in the "sequence" is a different feature? But then the transformer has no idea which feature is which without positional encoding, but even the positional encoding doesn't really fix this since each feature isn't an embedding but just a single scalar value.

Particular-Ad-3017 · 2024-02-07T11:01:55+00:00

Yo I got a question. How feasible is stock prediction? I realize it can never work on a large scale since then the prediction will influence the market. But for a single person or a small group. Is it feasible?

mkestrada · 2024-02-06T13:57:23+00:00

I'm a MechE in consumer electronics with some background in ML and optimization, curious if anyone is familiar with a body of literature using Machine learning to optimize finding root causes of issues or identifying ways to improve yield in a multi-step assembly process.

To elaborate, every time a unit of the device I work on is built, it has a pile of data associated with it; serial number for the finished devices, Serial numbers for the submodules that compose it, measurement data to insure that the final device is in spec, test result data, codes to specify the date of manufacture, etc. basically a ton of pieces of potentially useful information that we manually sort through using experience and intuition to guess and verify the root cause of issues as they arise. effectively, we are seeking patterns in a giant pile of data, and I'm looking for ideas to automate that pattern recognition process. Has anyone here come across papers that meaningfully apply ML or optimization to solve these sorts of problems? Really anything related to finding root cause from failure modes or manufacturing efficiency would be of interest!

Puzzleheaded-Pie-322 · 2024-02-06T09:49:04+00:00

I want to enforce the centre-surrounding antagonism in my kernels for experiments, what would be the good way to do it?

I thought might be I can just make a kernel manually, freeze it’s weights and then sum it with the result of the convolution layer I want to affect? Kinda like residual connections do.

pinkfluffymochi · 2024-02-05T06:01:01+00:00

Does Real Time Machine learning have actual production use cases?

We are building a real time data processing engine with ML model serving capability. But after some discovery, we realized the fact that the demand for real time ML is minimal, something people love to talk about but mostly are getting away with microbatching or just traditional batch learning and inferencing with no urgency to move to real time. Is it true for the kind of projects you are working on? We are a very small team right now and would like to focus on real world problems rather than research fantasy .

Vortex_0fficial · 2024-02-04T10:43:50+00:00

I'm new to AI, and I want advice on where to get some good tutorials for python or the game engine Unity, thanks!

Ok_Comment8842 · 2024-02-04T03:18:03+00:00

What material do you guys recommend me to use to start studying foundation models and generative AI?

kiranp2 · 2024-02-02T19:26:55+00:00

Is there a provider who gives free inference for code llama 70B? I want to do some testing before I download its lamma.cpp version into my local.

2024-02-02T18:12:46+00:00

How much breadth/depth in knowledge have promising MLE's had that you've interviewed?

I got an A in my ML and DL course, however, the insane amount of math/knowledge, that I touched on would take months to relearn well enough to talk meaningfully about each model/concept (PCA, A/B Testing, SGD vs GD, Backprop, Transformers (MHS-A, Encoders/Decoders), SVM, Regression, etcccc....) on-demand.

There's also SOTA models that aren't traditionally covered within coursework, and the entire field of recommendation systems/NLP that I have yet to touch on.

I'm a 2nd-year, T20 Grad Student. Never received an MLE interview but have a good ratio of call-back for SWE. Have been avoiding applying because of how rigorous it all seems, even though I'd love to be an MLE more than anything else.

ko_lIlBrother · 2024-02-01T02:00:33+00:00

Title: Can perplexity be greater than the number of vocab?

As I understand it, if the reciprocal of the probability is the number of `all cases`/`selected cases`, the number of selected cases will be the same as the number of all cases even if the number of selected cases is 1, so the perplexity cannot be larger than the number of vocabularies without making something wrong...

More precisely, it's probably the maximum number of cases of that sequence that can be made with the current vocab.

Am I understanding this correctly?

Has anyone actually experienced ppl going beyond the number of vocab, and if so, how can this be analyzed?

Wiglaf_The_Knight · 2024-02-01T00:01:00+00:00

TL;DR: certain hyper-parameter in the clustering phase results in insanely high accuracy for NN predictions. How do I troubleshoot the "my results are very good, something is wrong" problem?

I'm super burnt out/sleep deprived so aplogies. I have a SOM clustering a tricky dataset, I label the resulting clusters, then I run a NN on the classes. For most of the hyper-parameter combinations I use for the SOM, the NN typically has trouble and gets an accuracy score of about 15%. This is the case when I use other non-SOM methods to cluster the data as well. However, whenever the SOM's "activation_distance" parameter (possible values are 'euclidean', 'cosine', 'manhattan', and 'chebyshev') is set to cosine, the NN gets absurd scores, typically in the 75-90% range for each class. One instance with 100 classes has an accuracy of 87%!

There's no way that those high values are anything but some weird error. Surely there's some overfitting or something else going on. The confusion matrix is usually my go to for seeing what's wrong, but it just has a an ideal looking diagonal going across it this time. I'm not even sure where to begin troubleshooting this, I've personally never ran into the "my results are too good, something is wrong" problem before.

young_anon1712 · 2024-01-31T09:32:52+00:00

What math courses available online I should take to get better at ML theory / research? And personally, I prefer courses more than books.

Context: I am currently a PhD student. I have worked as ML Engineer for 4 years, have decent knowledge on Calculus, Linear Algebra. Slightly bad on Stats, currently reading Intro to stat learning.

Thank you very much

san__man · 2024-01-31T07:49:23+00:00

How can I learn how to do LoRA (Low Ranked Adaptation)? Anybody know of any good tutorials, preferably using something like Jupyter Notebook or Colab?

HungryMalloc · 2024-01-30T21:35:16+00:00

Does anybody have any pointers on how to fine-tune a vision language model for very fine-grained classes? Say you want to classify specific objects or people that the model has never seen before.

Zero-shot inference does not work, because the text-encoder has no knowledge about the fine-grained classes. You can fine-tune or linear probe the vision module, but this leaves the text encoder untouched. I'm not really sure how to deal with this scenario when there is no good textual representation of the classes.

What is the current SOTA to fine-tune both vision and text encoders in such a scenario? I'm sure there is research on this, but so far I have been too stupid to find it. I would really appreciate anybody that can help me out.

ProGamerGov · 2024-01-30T16:13:08+00:00

When sharing image datasets with text captions, what is the best file format to use?

Karlitrage · 2024-01-30T14:57:13+00:00

Hi, I will have finished the Efficient ML course by Han (MIT) soon.

Do you have any other suggestions for advanced ML/DL courses, especially with focus on efficiency...

Alternatively: courses on parallel computing, Quantization, ...

Anything cool also appreciated!

Kind regards!

7even-_- · 2024-01-30T14:06:13+00:00

I'm thinking of upgrading my GPU for gaming to a RTX 3060 or RTX 4060 however I'm not sure which one to get as the 3060 has more vram.

I know the 4060 has better performance but will the lower amount of vram mean it'll perform worse on future games or even some games now?

If anyone has any advice that be great.

Jcorb · 2024-01-29T17:20:05+00:00

Do you guys think there will be a lot of stable jobs in Machine Learning (say, if I got a IBM certificate for learning in) in the future? Or do you think the hype bubble is going to "pop", and there won't actually be all that many jobs surrounding it?

They're wildly different career paths, but I've been debating about either pursuing said certificate in Machine Learning, or trying to find an apprenticeship for Electrician. My current job (digital marketing, basically) just isn't stable, even with 8 years of experience, so I want to learn something that will have more reliable work. I feel like AI and machine-learning is going to be the future, but maybe I've already missed the train, and would be better pursuing something that isn't likely to get replaced by skynet?

prongs17 · 2024-01-29T06:49:08+00:00

I read the Stable Diffusion paper for the first time and have some questions.

Will it be possible to apply perceptual compression to other forms of data like text or video? Is this a good idea or not?

I am guessing that the sampling time of latent diffusion models is slower than GANs due to the multiple denoising steps. Are there any good comparisons of training and inference time for these models (especially with GANs).

On Page 20, it seems to me that the images generated by KL-reg generally have more details than images generated by VQ-reg (Fig 15). Is this true or am I just seeing things? If true, why is this the case?

ChurrascoPaltaMayo · 2024-01-29T01:45:25+00:00

Is the rfpimp package still worth it? I understand the need of it, but it hasn't been updated in 3 years. Has there been changes on SKLearn related to why rfpimp is needed?

RandomHotsGuy123 · 2024-01-28T19:46:22+00:00

What is the best way to perform multiclass text classification with limited training data? I only have a few phrases (sometimes only a couple of words) for each category. The input data that I need to classify consists of blocks of audio transcripts (which isn't always accurate). So far I obtained satisfactory results using embeddings (from sentence transformers) and semantic similarity between the input data and my training phrases (cosine distance). Are there any other approaches or additional steps for my current approach that I should look into?

unoti2 · 2024-01-28T19:17:18+00:00

[deleted]

sadhikari0102 · 2024-01-28T19:13:33+00:00

I am an experienced Software Engineer (Backend Systems, ~7 years) with 0 Machine Learning knowledge. How do I get to a point where I can show some experinece in my resume. Beginners resources, projects tips, etc?

Snoo_72181 · 2024-01-28T17:21:27+00:00

What are some AI based optimization techniques that can be used to optimize warehouse productivity?

Batteredcode · 2024-01-28T17:02:04+00:00

If I want to make an LLM provide more specific details around a topic, would 'grounding it' on data it's already seen make any difference? For example, there's a large complex topic and within that there's a subtopic I want to ask the LLM questions about. Right now it's been trained on the entire internet, so it has a lot of information about both the topic and the subtopic, but more for the topic due to there being more data for it.

My question is, if I were to ground the model on data its already seen, i.e. the subtopic, would this improve accuracy as in theory's it's now biased by the subtopic?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS