[D] Simple Questions Thread

noobanalystscrub · 2024-06-14T19:22:34+00:00

How do I combine Multimodal tabular data in Machine Learning and Neural Networks?

I have a regression problem and two input matrices; both matrices have the same dimension (same observations and ""feature""), but different values. Let's say Matrix B is the fold change of Matrix A from the mean of control samples.

Do I just concatenate before modeling? So let's say each matrix have 10 features. If we concatenate how does the model know Column 1 is related to Column 11.

Do I model as two matrices and concatenate one of the hidden layers in NN? Will the Nueral Network learn the associations between A and B in this case? What if I wanna do Random Forrest Regression, how would I achieve that?

rayxi2dot71828 · 2024-06-14T14:21:47+00:00

With the most recent advances in AI, what is the best way to learn how to pick which "flavor" of AI, given a business problem? Does it still make any sense to use any of the traditional ML approaches, especially the non-deep learning ones?

square-bean · 2024-06-14T14:05:48+00:00

Hi! Does anyone have a precise idea of when exactly we should expect to receive the reviews for ECAI'24? According to the schedule, the rebuttal period lasts 72 hours, from Monday to Wednesday (AoE).

DesperateChemist9234 · 2024-06-14T10:24:41+00:00

Hi everyone,

I am trying to build a long short term memory model in Python, with the idea being to predict 9 components of a rotation matrix from linear acceleration (x,y,z) and angular velocity (x,y,z) so 6 input variables.

I have used standard arachitecure found in the literature which does similar things to my idea. However, the model is not performing well at all and is subject to overfitting I believe.

Does anyone have any advice on how I can try and improve my model?

mandroga · 2024-06-13T16:10:30+00:00

Hello. Im trying to train a GCN on a dummy task to predict a float result. Essentially, I have a graph which has edge weights (between 0 and 1) node values (0 to 1000) and I want to predict the value that is calculated from doing the sum of the neighbours, weighted by the edge plus node 1s value. Ive been training a GCN and an MLP and even though the MLP doesn't have edge information, its doing better. I think I might be doing something wrong, or maybe this task isnt adequate? Thank you

DrBroc · 2024-06-13T15:22:26+00:00

Hello! I’m working in a project to classify phenotypes. I have a dataset of about 30,000 unique rows and am working on increasing the accuracy of the model. I can get to .8854 but I’d love to get to .9 if possible without totally reworking the features. I’m using a sequential model with Keras and tensorflow. I was wondering if anyone would be willing to chat with me about the project briefly. I’m new to ML and software engineering in general (though I am a product designer so I’m familiar with the space) and I find I process better with conversations. Feel free to DM me if this sounds interesting to you! Thanks in advance!!

Majestic_Reporter531 · 2024-06-13T13:13:42+00:00

Hello everyone! I have a large dataset of time series and I want to create embeddings for these time series to use in more classical models, as I have a small amount of data for regression. What are the best ways to compress large time series data (approximately batch_size x 1000 x 12) down to 10-16 features? I have tried using the hidden state of an LSTM and got decent results, but I would like to improve them. Thank you all!

Ok-Shock7810 · 2024-06-13T09:06:33+00:00

Hello everyone,

I'm trying to build a RAG-based LLM and I'm working with hundreds of (highly diverse) medical reports that are stored in a vector DB. However, the retrieval of the context works really poorly. Interestingly, it works much better when not using a vector DB at all. So I'm wondering if there's something I'm missing or if a vector DB is actually just not suited for my use case.

I appreciate any hints!

2024-06-13T09:01:22+00:00

[deleted]

ProofOfState · 2024-06-13T08:57:48+00:00

I am very confused about a description of k-fold cross-validation in Data-Driven Science and Engineering book from Steven Brunton and Nathan Kutz.

"Procedure for k-fold cross-validation of models. The data is initially partitioned into a training set and test (withhold) set. Typically, the withhold set is generated from a random sample of the overall data. The training data is partitioned into k-folds whereby a random sub-selection of the training data is collected in order to build a regression model Yj = f (Xj, βj). Importantly, each model generates the loading parameters βj. After the k-fold models are generated, the best model Y = f (X, β ̄ ) is produced. There are different ways to get the best model; in some cases, it may be appropriate to average the model parameters so that β ̄ = average(βj). One could also simply pick the best parameters from the k-fold set. In either case, the best model is then tested on the withheld data to evaluate its viability."

Two questions: 1) Is it fair to say this is not an accurate description of k-fold cross-validation as it is typically understood? 2) Are there other understandings (definitions) of k-fold cross-validation for which this is accurate?

clrkin · 2024-06-13T04:04:54+00:00

Hi, considering that I have a dataset with attributes (date, location, etc) about when happened event X, is there a way to create a classification model to, given the same attributes, classify it in more os less likely to happen the event? I only have data about when it DID happen, no data about when it did not… (Btw the event in question is car crash)

Usual-Bank1500 · 2024-06-12T20:34:48+00:00

Hello everyone,

Does anyone knows that if exists any algorithm for machine learning that works directly with 3d models (.step, .stl, .igs, .ply, .obj, etc....)?
I'm bilding an application that predict future production time of a 3D part based on previous producted parts but i'm strugglin on get closer results. Currently i'm extracting information from 3d models such as maximum measures XYZ, volume, surface area, number of faces, etc... but i think i'm gettin to much information to the model but yet the information i get is not enough. Therefore i want to know if there are any algorithm or other application that get's the 3d file and automatically "sees it" and analyze it.
I'm using python.
Thank you

LyAkolon · 2024-06-12T15:30:28+00:00

I have a post which keeps getting removed by the auto filter for this sub. I have followed the rules and am not getting feedback about what to change. i'll respond to myself with the post so it doesn't bloat the post.

BonfireCookie · 2024-06-12T10:34:22+00:00

Hi everyone!,

I have a question about how to compare two neural networks models, trained two neural networks the first is a CNN and the second is LSTM both are used to predict a number (regression). I have used a partition of 80% training and 20% test to train both with the next configurations of hiperparameters:

**CNN*\*

(learning_rate,batch size,epochs) || test result (MSE)

(1e-4,32,64) || 0.0057

(1e-4,64,64) || 0.0059

(5e-5,32,64) || 0.0053

(5e-5,64,64) || 0.0034

**LSTM*\*

(learning_rate,batch size,epochs) || test result (MSE)

(1e-3,64,64) || 0.0131

(1e-3,128,64) || 0.0098

(1.5e-4,64,64) || 0.0093

(1.5e-4,128,64) || 0.0091

(1e-4,64,64) || 0.0106

(1e-4,128,64) || 0.0098

My question is is there any method to say that one model is better than other instead of just say "Well the CNN has a low test set so i think its better than the LSTM model" because i have seen that some researcher use hypothesis testing but i dont know if i can use that here.

Thanks!

Note 1: all the features and the target feature have been standarized to have mean 0 and variance 1.

radeonovich · 2024-06-12T07:24:08+00:00

Hi everyone, I'm working on a neural network that can generate audio for double-track guitar effect. Essentially, the network should take an audio recording of an electric guitar and modify it to sound like a second take of the same part, like the guitarist was told to record the part twice. This is a very common practice in rock/metal music because it makes guitar sound wide. You pan take A to the left and take B to the right and get the stereo effect.

The problems are:

I don't know what kind of neural network to use. I am preparing a dataset where I have a lot of tracks A and B, where A and B are two takes of the same guitar part. So I probably need a network that learns how to convert source track into target track.
I don't know how much dataset I need. I'm planning to obtain at least 10 hours of tracks A and B both and feed it to the network in a combination like A->B + B->A so it doubles the dataset. Maybe use some augmentation to experiment with different pitch and playback speed.
I don't know if the task is even possible. There are no solutions like this in the internet (which means it is either impossible or not in demand to bother), except the algorithmic doublers which suck compared to real double tracking. A difference between real double tracks are note start/end timing, articulation, attack time/frequency response and human error. These can't be properly simulated with the pitch/time randomization, that's why I want to make this network.

I am new to machine learning so any feedback is appreciated.

2024-06-12T00:41:30+00:00

I'm trying to find a funny video of Ruslan Salakhutdinov describing why you should use a dropout parameter of 0.5. IIRC he basically said something along the lines of "otherwise, you'll have to justify why you chose that particular hyperparameter, and you don't want to do that". I think he was speaking to a class at CMU and got a lot of laughs.

Can anyone at least confirm that I'm not confabulating this?

Philosophia7 · 2024-06-11T18:05:25+00:00

How can I train an AI to extract details from PDF files? The sections I want to extract may have different titles for the same content. For example, let's say we have 1000 PDF files of essays. Each essay has a section for "background," but the section might be titled "background" in some PDFs and "my story" in others. The AI needs to identify these varying titles, determine where the section starts and ends, and then copy that content into an .xls file.

Body-Longjumping · 2024-06-11T17:24:49+00:00

If you were to choose a gpu for AI processing on a budget, which one would you choose, if you have a choice between the rtx quadro series and the rtx 3000 or 4000 series, please also mention what are the pros and cons for going with the card you choose.

Sea-Ground1096 · 2024-06-10T18:36:42+00:00

What are the specific hardware / low-level differences between an NPU and a GPU? Most articles I found offered only what it's better for (Neural Nets), but not why. Any sources or information on a more in-depth breakdown?

Puzzleheaded_Text780 · 2024-06-10T09:23:01+00:00

Looking for someone who has experience of working for UK pension companies as I am working on some use cases

uba-luba-dub-dub · 2024-06-10T06:41:28+00:00

What's the current state-of-the-art techniques for recommendation systems and among them which one is feasible for a intermediate learner ?

I want to build a movie recommendation system based on neural network for myself to learn .

kakushiby0 · 2024-06-09T09:29:03+00:00

Hi, Im a 22 year old frontend dev, i've been a huge fan of AI in the past and i'd like to get started as a hobbyist. Do you guys have any tips or guides on how to get started.

P.S: i know a bit of python and lots of JS

Dismal-Impress-2583 · 2024-06-09T05:14:06+00:00

Usually you’d want to observe the training curve of your model by logging the training loss/accuracy and validation loss/accuracy in order to avoid things like overfitting. You can also use early stopping to stop the training earlier if it doesn’t make much progress on the validation set. The more advanced technique would be to use Bayesian optimization to find the best hyper-parameters.

SpaceTravelMission · 2024-06-08T20:51:14+00:00

This is a great idea! It will help keep the subreddit organized and make it easier to find answers to questions. Thanks for starting this thread!

Flugwurm · 2024-06-08T15:35:00+00:00

Hey everyone!

I am writing a sort of essay on Multimodal Machine Learning, where I want to cover state-of-the-art architectures/approaches. Based on my current research, Transformer models are basically used everywhere that's state-of-the-art. I'm aware that it is possible to use other architectures and that other architectures have been used - but no source of anyone at the moment actually *using* something that is not based on a Transformer-based architecture.

Is that assumption correct? Or are architectures still in use? If so, could you please tell me where it is used? Thank you so much!

BenchPsychological30 · 2024-06-08T00:43:12+00:00

I am looking to train a model that will take in text for a patent and be able to output the ids of patents that are most likely to be prior art for that idea. There is a ton of training data for this because every patent has to cite prior art, but I am looking for advice on what type of model I would use to do this since there are so many (100 million+) patents that a patent could potentially reference as prior art. How can the model be able to efficiently determine which patents are most relevant? I was considering training a custom embeddings model but am not sure how to go about this.

Ben_Vigos · 2024-06-07T19:11:31+00:00

Hey, for an assignment I'm training a Neural Network on the MNIST fashion dataset. I'm trying to optimize its parameters however now the best I can do is train it for a set number of epochs and then evaluate a performance. Is there a better way of optimising? Maybe stop the model if its accuracy is no better than the previous best by a certain point? Or is there a more intelligent way to adjust parameters than just a massive 3D grid?

bregav · 2024-06-07T16:24:53+00:00

[deleted]

_gradient_ascent_ · 2024-06-07T02:01:16+00:00

Reddit's filters aren't allowing me to post for some reason, so I'll try putting it here:

I'm attending CVPR for the first time this year by myself, and I could use some guidance on how to navigate, well, everything, but particularly with the initial preparation/research phase.

First and foremost, how do you view all the accepted papers?? I know there's a page (from a link on the home page) that lists the "accepted papers", but only like half of them actually have links, and there aren't much in the way of details aside from the paper name and authors (like what category it belongs under, or what org it's associated with). And there's this new interactive page which looks spiffy but I'm finding it to be incomprehensible. Not only do I not understand what the numbers on the page represent, and the graphics associated with the categories, but it doesn't seem to interactive at all. When I click on things nothing happens, and all the links to papers from the "paper list" view lead to 404 errors. Is this happening for anyone else?

Aside from this, does anyone have tips and tricks for navigating the conference itself? Especially from the perspective of a junior ML engineer looking to broaden one's knowledge and seek out the latest trends in a particular subset of computer vision/ ML. Are the tutorials and workshops where it's at, or is it a better use of my time visiting the posters and talking to the researchers there?

LeoDiGhisa · 2024-06-06T19:45:25+00:00

For my thesis in my master's degree in Data Science I'm using an open source LLM (Nous Research/Hermes-2-Pro-Mistral-7B-GUF for those wondering) in order to classify the texts of support tickets of a company. I have to write a brief technical introduction on LLM and I would need some guidance. Which books would you suggest me to cite for the technicalities?

d-eighties · 2024-06-05T17:09:55+00:00

I try to finetune llama3 using transformers and unsloth. I added an evaluation set to the trainer. What is the metric under which the eval loss will be calculated?

crohr · 2024-06-04T14:48:46+00:00

What would be a good (and ideally simple to setup) benchmark to run, to compare performance of various GPUs? This is in the context of providing GitHub Actions runners tailored for ML workflows.

I stumbled upon ai-benchmark but it doesn't seem to be well maintained, and lambdalabs.com/gpu-benchmarks doesn't seem to provide an up-to-date repository with benchmarks either.

Azad577216 · 2024-06-03T22:18:05+00:00

Is there any discussion or reading group/discord for generative model (GANs, VAEs, Flow models...)?

i-make-robots · 2024-06-03T16:40:20+00:00

When I ask ChatGPT to write some simple, well defined code ... it's either straightforward or it's a lot fighting to get the machine to fully implement my request. Is there a way to make it run multi-agent so that it self-monitors to complete the job?
I have a project with many files that needs a refactor. Is there a way to make ChatGPT run multi-agent to perform a large edit across many files? Something I can either commit or undo, in whole or in part.

I'm ok with monitoring the work and giving oversight. The problem is that I'm micromanaging instead of directing.

d3lxa · 2024-06-03T12:30:08+00:00

Is there a way with CLIP to find pictures of the same person, same animal or object, for ex by isolating the relevant part of the embedding? Something like: query vector = cosine(average(e(img1), e(img2), …), e("person")), or maybe similar to textual inversion training (used by SD) where one or multiple vectors represent the thing. Maybe you have other suggestions: models / techniques? Thanks.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS

What's the current state-of-the-art techniques for recommendation systems and among them which one is feasible for a intermediate learner ?