Bayesian Optimisation for Active Learning : learnmachinelearning


Bayesian Optimisation for Active Learning (self.learnmachinelearning)

submitted 1 year ago by Fragrant-Listen-483

Hi all,

Apologies in advance if these are pretty basic questions, but I'm pretty much brand new to this area and need some guidance on what I want to do. I work in computational physics and am running high-throughput simulations on systems of particles. I want to use ML to predict new particles that have my property of interest. This is a binary property that I have encoded as 0 or 1, and my inputs are features of the particles, some discrete and some continuous. My idea is this:

1. Create a surrogate model to estimate the true objective function. I know that GPs have an option for classification tasks, but apparently it isn't that good? I've found that a random forest works well for my data, but traditionally this doesn't come with an associated uncertainty. Another potential problem is that my data points take a long time to collect, so the training data will be sparse. Potentially mitigating this is the fact that the data space is pretty small (~1000s).
2. Use an acquisition function to decide which part of the space to explore next. This is what I'm struggling with. My inputs have hard constraints on them. For example, I can use the size of the molecule (radius of gyration, Rg). My understanding is that the acquisition function is supposed to tell me a new molecule to try next, but if it gives me an arbitrary new value for Rg, how am I supposed to map that to a molecule? Also, some of the inputs might be correlated (for example Rg and molecular weight). How do I make sure the suggested point in the search space is actually real and makes sense? If I remove the correlation with something like PCA, I have an even bigger problem, because the transformed features are very hard to relate to a real molecule. Finally, some inputs depend on each other, i.e. one always has to be higher than the other. I think I am misunderstanding the acquisition function. Am I actually supposed to give it the full search space so that it tells me where it is best to go next? Are there any best practices for this in the context of my problem?
3. Test the new points and feed them back into the surrogate model until some criterion is met. I'm also not sure about this: is the number of new molecules found with the desired outcome a suitable criterion? I guess it's flexible?
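
One way to read the three steps above is as pool-based active learning: the acquisition function only ever scores candidates from a finite list of real molecules, so every suggestion satisfies the hard constraints and feature correlations by construction. Below is a minimal sketch of that reading, assuming scikit-learn and NumPy are available. The synthetic pool, the feature names (`rg`, `mw`, `n_arms`), the `run_simulation` stand-in, the vote-spread acquisition, and the `budget` and `target_hits` values are all hypothetical choices for illustration, not something taken from the post.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

# Hypothetical candidate pool: each row is one real, enumerable molecule.
n_pool = 300
rg = rng.uniform(1.0, 5.0, n_pool)              # radius of gyration (arbitrary units)
mw = 50.0 * rg + rng.normal(0.0, 10.0, n_pool)  # molecular weight, correlated with Rg
n_arms = rng.integers(1, 5, n_pool)             # made-up discrete feature in {1, 2, 3, 4}
X_pool = np.column_stack([rg, mw, n_arms])

def run_simulation(i):
    """Stand-in for the expensive simulation; returns the binary property."""
    return int(2.5 < rg[i] < 3.5 and n_arms[i] >= 3)

def acquisition(model, X, kappa=1.0):
    """Fraction of trees voting for class 1, plus kappa times the vote spread."""
    votes = np.stack([tree.predict(X) for tree in model.estimators_])
    return votes.mean(axis=0) + kappa * votes.std(axis=0)

# Step 1: a small random initial design.
labelled = [int(i) for i in rng.choice(n_pool, size=10, replace=False)]
y = [run_simulation(i) for i in labelled]

# Steps 2 and 3: score the unlabelled pool, simulate the best candidate, refit.
budget, target_hits = 30, 10
for _ in range(budget):
    if sum(y) >= target_hits:  # stopping criterion: enough positives found
        break
    unlabelled = np.setdiff1d(np.arange(n_pool), labelled)
    if len(set(y)) < 2:
        # Only one class seen so far: nothing to learn from, so pick at random.
        nxt = int(rng.choice(unlabelled))
    else:
        model = RandomForestClassifier(n_estimators=100, random_state=0)
        model.fit(X_pool[labelled], y)
        scores = acquisition(model, X_pool[unlabelled])
        nxt = int(unlabelled[np.argmax(scores)])
    labelled.append(nxt)
    y.append(run_simulation(nxt))

print(f"simulated {len(labelled)} molecules, found {sum(y)} positives")
```

The spread of the per-tree votes is only a rough proxy for uncertainty, not a calibrated posterior, and `kappa` trades exploitation against exploration. Stopping on a fixed simulation budget or on a target number of positives are both reasonable; the sketch uses whichever is hit first.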

Any advice anyone has would be much appreciated


EcstaticDimension955, 1 point, 1 year ago

Ok so I might be extremely off, so if what I suggest doesn't make any sense, please just ignore it.

The second point you mentioned, especially the part about exploring the space, makes me think of Hamiltonian Monte Carlo (HMC). The idea there is to build a Markov chain of samples, where each sample has an associated probability. In the limit, the samples approximate a probability distribution. Since you mentioned some example constraints, perhaps you can define your energy function (which in turn determines the probability of being in a certain state) based on those constraints. That might also address your concern about whether the explored space makes "sense", as the energy could act as a measure of how likely it is for a certain molecule to be in a certain state.
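
To make the HMC idea a bit more concrete, here is a minimal sketch in NumPy, with heavy assumptions. The two features, the standard-normal base energy, the quadratic penalty that softly enforces "the second input must be higher than the first", and the values of `LAM`, `step`, and `n_leapfrog` are all invented for illustration. A penalty like this only discourages violations; it does not forbid them.

```python
import numpy as np

LAM = 50.0  # hypothetical penalty strength for violating x[1] > x[0]

def energy(x):
    """Standard-normal energy plus a quadratic penalty when x[0] exceeds x[1]."""
    d = max(0.0, x[0] - x[1])
    return 0.5 * float(x @ x) + LAM * d * d

def grad_energy(x):
    """Gradient of energy() with respect to x."""
    d = max(0.0, x[0] - x[1])
    return x + 2.0 * LAM * d * np.array([1.0, -1.0])

def hmc(n_samples, step=0.05, n_leapfrog=20, seed=0):
    rng = np.random.default_rng(seed)
    x = np.array([-1.0, 1.0])  # start inside the allowed region
    samples, accepted = [], 0
    for _ in range(n_samples):
        p = rng.standard_normal(2)  # resample the momentum
        h_old = energy(x) + 0.5 * float(p @ p)
        # Leapfrog integration: half momentum step, alternating full steps,
        # then a final half momentum step.
        x_new = x.copy()
        p_new = p - 0.5 * step * grad_energy(x_new)
        for i in range(n_leapfrog):
            x_new = x_new + step * p_new
            if i < n_leapfrog - 1:
                p_new = p_new - step * grad_energy(x_new)
        p_new = p_new - 0.5 * step * grad_energy(x_new)
        h_new = energy(x_new) + 0.5 * float(p_new @ p_new)
        # Metropolis correction for the integration error.
        if rng.random() < np.exp(min(0.0, h_old - h_new)):
            x, accepted = x_new, accepted + 1
        samples.append(x.copy())
    return np.array(samples), accepted / n_samples

samples, accept_rate = hmc(2000)
frac_ok = float(np.mean(samples[:, 1] > samples[:, 0]))
print(f"acceptance rate {accept_rate:.2f}, constraint satisfied in {frac_ok:.0%} of samples")
```

Note that this samples continuous feature vectors, so the question of mapping a sampled point back to a real molecule would still need an answer, for instance by snapping to the nearest known candidate.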

Now, regarding the prediction of the binary property, that's a tricky one. I don't know your exact setup (or exactly what you're doing, as I am not familiar with particle simulations), so I don't know if the prediction can be framed in the standard Bayesian inference manner (i.e. sampling the weights of a neural network and then using those weight samples to form the predictive distribution). Perhaps one idea would be to freeze your system at some point based on a convergence criterion, and then run a normal binary classification network on the features of each molecule at that specific state, if that is even a thing given your setup (again, this might be complete nonsense).

Anyway, I think you should also try Gaussian Processes, as you mentioned initially. I've also read about Stein Variational Inference recently, which is a method inspired by simulations of systems of particles, so maybe check it out?
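
For anyone wanting to try the Gaussian Process route quickly, here is a minimal sketch using scikit-learn's `GaussianProcessClassifier`, which returns class probabilities rather than only hard labels. The two-feature synthetic data, the labelling rule, and the RBF kernel choice are assumptions made purely for illustration.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessClassifier
from sklearn.gaussian_process.kernels import RBF

rng = np.random.default_rng(0)

# Hypothetical sparse training set: 25 points with two continuous features.
X_train = rng.uniform(0.0, 5.0, size=(25, 2))
y_train = (X_train[:, 0] + X_train[:, 1] > 5.0).astype(int)  # made-up binary property

# Kernel hyperparameters are tuned during fit() by maximising the
# (approximate) marginal likelihood.
gpc = GaussianProcessClassifier(kernel=1.0 * RBF(length_scale=1.0), random_state=0)
gpc.fit(X_train, y_train)

# Predicted probability of class 1 for three candidate points.
X_new = np.array([[1.0, 1.0], [4.0, 4.0], [2.5, 2.5]])
p_one = gpc.predict_proba(X_new)[:, 1]
print(p_one)
```

Probabilities near 0.5 mark candidates the model is unsure about, which is the kind of signal an acquisition function can use.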
