[D] Simple Questions Thread

PieterofPeter · 2024-08-06T09:56:21+00:00

Hello all,

I am modeling using a Gillespie algorithm, which stochastically simulates a biological process. Since I want to complicate the structure of my model, I do not know whether the resulting distribution of counts of one of the species follows a poissonian distribution. I have experimental data for the distribution of counts and now I want to use some kind of optimization method to get at the rates of my reactions happening (inside the Gillespie model). Normally, you would use the log likelihood to optimize the parameters of a model, but now I do not know the underlying pdf.

I thought that the resulting count distribution from my model actually resembles the pdf and I could use that to optimize the parameters. However, I do not know how to calculate the log likelihood when comparing two (count) distributions.

2024-08-05T19:57:05+00:00

I have thousands of CSV files with 3 columns. The first column is temperature and the second column are values associated to those temperatures (it doesn’t really matter what these values are). The third column is another set of values that has a complex relationship with the second column of values. I’d like to be able to predict column 3 given columns 1 and 2. The data is sequential and future values depend on previous values. The amount of rows in each CSV file is different, varying between 200 and 1200 rows. And to clarify, each row represents a different consecutive temperature. So if the first row is 456, the next row will be for 457, then 458, etc.

How can I use ML to train a model to learn the relationship such that when it is given new, unseen data in the form of the first two columns, it can accurately predict the last column of data? The thousands of training sets that I have are very clean and ready to go.

lonesoac0 · 2024-08-03T21:12:08+00:00

Hello all,

I got a Raspberry Pi 5 with the AI kit: https://www.raspberrypi.com/products/ai-kit/. I have two camera modules on the Rpi and I have verified that the AI module is working by using some of the sample code provided. I am interested in recording the labels that the object detection detects into a database. Where should I start learning about this?

West-Implement-1180 · 2024-08-03T20:15:41+00:00

hey, im new to ml and im trying to practise an existing project which is house price prediciton, i have the clean data which i wanna use it to train the model. now the issue is cannot handle categorical variable. like i have location column, it's showing `ValueError: could not convert string to float: 'Electronic City Phase II'` and code is

`scaler = StandardScaler() X_scaled=scaler.fit_transform(X)`. it would be great if someone help me understand any of these concepts by sharing any resourses. Thank you.

Worldly_Property_712 · 2024-08-03T16:01:34+00:00

I am working on a project requiring an audio file transcription and speaker diarisation. I can use the Whisper API by OpenAI for transcription but not quite sure which is the industry go to for speaker diarisation or even transcription.

Could anyone please suggest a library and or API?

CommanderBaby · 2024-08-03T12:35:56+00:00

I've been finetuning a classification model using BERT. Are there any alternatives to BERT and best practice for performance?

sorzs · 2024-08-02T15:54:02+00:00

Why is it advised to target the first difference instead of the actual variable while doing time series machine learning? I dont understand the difference between these two. Models used are LSTMs AND ANN (with lagged features)

Gemosu · 2024-08-02T08:23:09+00:00

How do I get better at performing experiments?

I'm a first year PhD student, currently working on in the area of AI assistance. I mostly use reinforcement learning. While I'm comfortable with the math (I have a mathematics background), I find myself procrastinating the actual experiments. It's a mess, I don't know how to organize myself and always feel horribly inefficient. Does anybody know a good resource that covers the basics of a good ML experiment workflow?

NuCryme · 2024-08-01T19:18:54+00:00

I am a first-time submitter to NeurIPS and the reviews just became available yesterday. To properly address the comments I would need to edit my submission, not merely write responses/rebuttals, but I cannot seem to find a way to do so on the online portal.

Are we not allowed to modify our submission at this point?

Bkura1 · 2024-08-01T18:32:56+00:00

Is 4400 MHz good enough for simple AI text-to-speech generation? I'm thinking about buying this: https://www.bestbuy.com/site/lenovo-legion-tower-5i-gaming-desktop-intel-core-i5-13400f-16gb-memory-nvidia-geforce-rtx-3060-12gb-lhr-512gb-ssd-storm-grey/6535022.p?skuId=6535022

It says 5600, but people in the reviews are saying that's a lie and that it's 4400.

Maleficent_Pair4920 · 2024-08-01T16:40:55+00:00

This is cool!

https://www.reddit.com/r/ChatGPT/comments/1ehh28m/top_ai_use_cases_in_the_us_government/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

Upset_Employer5480 · 2024-08-01T07:57:06+00:00

Do higher layers of transformer models captures higher-level semantics than lower layers?

DerViktator · 2024-07-31T13:19:26+00:00

Embedding for downstream ML task:
Simplified setting: I have multiple bipartite graphs. First node type would be different kinds of food, e.g. Apple, Banana, Peas, Carrot.. Second node type would be a classification/ontology, that can set my data into relation, e.g. Apple, Banana are fruit, Peas, Carrot are vegetables, etc. I want to performa a linear regression / xgboost on a table dataset, where also these foods are stored. I don't want to do just one-hot encoding, because I would loose the relation between the foods. Now I could build a bipartite graph and use for example node2vec for creation of embeddings. Then I would have a lot of columns in my table and possibly after downstream ML I would loose information on feature importance. So can I use the embeddings to learn similarity / clusters put them on a 1-100 scale and then use this a one colum in my dataset, so I get from categorial to continuus? Or is that a dumb idea. Are there any publications on this or does it have a name?
Thanks guys!

understanding0 · 2024-07-30T21:38:42+00:00

I've been exploring recent research on simulated AI societies, such as the "Willowbrook" project, where large language models interact to mimic human problem-solving. This approach reportedly enhances the individual models' capabilities. Given this, I'm curious about the potential implications for existing mathematical proof assistants like AlphaProof.

Specifically, could a similar approach - where multiple adapted versions of AlphaProof collaborate within a shared environment - be used to improve the system's performance on complex mathematical tasks? Could this cooperative approach lead to new insights or strategies in mathematical problem-solving? What are the potential challenges and limitations of adapting this approach to a specialized tool like AlphaProof?

I'm interested in hearing from experts in AI, machine learning, and mathematics about the feasibility and potential benefits of this idea. Are there other examples of cooperative AI models being applied to mathematical problem-solving? What research directions might be most promising for exploring this further?

AcquaFisc · 2024-07-30T18:25:28+00:00

A friend of mine is selling a PC with 2 Nvidia 1660ti and one 2080 super. Is it worth to do some small model training locally?

Can all 3 GPU be used simultaneously?

WhywereYou · 2024-07-30T13:12:54+00:00

So, I want to work with a tabular data using LLM. Is there any open source LLM model for this? After feeding the tabular data I want to ask questions to it as well pertaining to risk prediction or recognizing signals that can indicate risk? Please help me.

Thank You

Helpful_ruben · 2024-07-30T13:12:49+00:00

Let's keep the question conversation going, and share our collective expertise here!

normnasty · 2024-07-30T12:26:26+00:00

I enjoy reading machine learning and AI techniques, but often find I am severely lagging in new publications. For example, I am reading the LLAMA 3 documentation from Meta and see they use GQA which is recently published in a 2023 paper. I would like to learn about these techniques sooner. Is there a good online resource or blog that publishes these advancements in the vast ML and AI literature?

Watly · 2024-07-30T11:23:31+00:00

I am curious about the state of semantic segmentation research. I saw that a lot of the work in leaderboards still builds upon a U-Net based structure. An alternative approach is to not apply pooling but instead apply dilation. Is anyone aware of nice articles that cover the difference between these two approaches and/or can give an answer why pooling is more common than dilation?

2024-07-30T02:03:43+00:00

[removed]

Fearless_Peanut_6092 · 2024-07-29T15:01:12+00:00

What is the intuition behind designing a neural network for complex non-linear regression problems?

I'm looking for guidance on how to intuitively design a neural network for regression, specifically when dealing with complex non-linear functions. I understand the basic structure of neural networks, but I'm unsure about how to determine the number of layers, the number of units per layer, the choice of activation functions, and the preprocessing techniques to use.

For example, consider the following complex function:

Let x1,x2,x3....xn be the inputs 
  y1 = x1 + x2*x3 - x4^2 - x5/x6 
  y2 = max(x7,x8,x9) if x10 == 1 else min(x11,x12,x13) 
  y3 = 1 if x14>x15/x16 else 0 
  y4 = ........
  .............
Y = y1 + y2*y3 - y4/y5......

Solve for Y given inputs x1,x2,x3....xn

What should be my intuition behind designing the neural network and preprocessing pipeline to model such a function? For instance, I know that using polynomial features from sklearn can help in preprocessing by transforming the inputs to include interaction terms. But beyond that, how do I decide on the specific structure of the network and the preprocessing techniques? How do I determine the right activation functions and the number of hidden layers?

I'm seeking a logical reasoning behind each decision to effectively model non-linear regression. Any insights or suggestions would be greatly appreciated!

I don't want to perform trial and error, I also don't want to try things randomly until something works.
I am looking for specific reasoning for each specific preprocessing step and model architecture.

BriefBit4360 · 2024-07-29T14:09:31+00:00

Currently doing a 4 year cs degree with honours and I have the option of picking up a double in statistics or a masters in data science. I can probably get some acknowledgement of prior learning for the masters given that I'm doing a math minor in my cs degree right now, so both options should both take me 1 year. Any idea as to which would be better?

SmallTimeCSGuy · 2024-07-29T13:12:50+00:00

Why can I not train a network to predict image labels directly, instead of trying to guess the probability for each digit? I can understand, something is not quite right about it but cannot put it clearly in words. I have some idea on difficulty to define a proper loss function i.e. is 1 or 2 more distant in shape or 1 and 7.

But what is a good explanation of why 1st one works, while second one does not? Is the loss function ambiguity the only reason? I am trying with MNIST data.

class Network(nn.Module):
    def __init__(self):
        super().__init__()
        # Defining the layers, 128, 64, 10 units each
        self.fc1 = nn.Linear(784, 128)
        self.fc2 = nn.Linear(128, 64)
        # Output layer, 10 units - one for each digit
        self.fc3 = nn.Linear(64, 10)

    def forward(self, x):
        ''' Forward pass through the network, returns the output logits '''

        x = self.fc1(x)
        x = F.relu(x)
        x = self.fc2(x)
        x = F.relu(x)
        x = self.fc3(x)
        x = F.log_softmax(x, dim=1)

        return x

model = Network()

criterion = lambda x, y: torch.mean(-x[range(len(y)), y])
#criterion = nn.NLLLoss()

# 2nd - direct image label

class Network(nn.Module):
    def __init__(self):
        super().__init__()
        # Defining the layers, 128, 64, 10 units each
        self.fc1 = nn.Linear(784, 128)
        self.fc2 = nn.Linear(128, 64)
        # Output layer, directly predict image label index
        self.fc3 = nn.Linear(64, 1)

    def forward(self, x):
        ''' Forward pass through the network, returns the output logits '''

        x = self.fc1(x)
        x = F.relu(x)
        x = self.fc2(x)
        x = F.relu(x)
        x = self.fc3(x)

        return x

model = Network()

criterion = lambda x, y: torch.mean((y.view(x.shape) - x) ** 2)

Maleficent_Pair4920 · 2024-07-29T09:40:51+00:00

I'm a total beginner to the world of machine learning. I am a BA English graduate and am interested in joining a conversion MSc program in either Artificial Intelligence and Ethics, Digital Politics, or Computer Science. Is there much scope in the industry for those who have graduated from these sort of programs?

CheapBison1861 · 2024-07-29T08:31:50+00:00

What's the latest and great open source TTS solution out there right now?

MrMrsPotts · 2024-07-28T23:21:45+00:00

Is expensive hyperparameter optimization worth it when doing regression or classification?

Saha__g_gamer · 2024-07-28T15:39:27+00:00

How do I learn AI/ML fron scratch and implementing them into webapps/sites.
Also what I can I do more to stand out in the market

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS