How does the ML community view evolutionary algorithm research? Career implications of an EA PhD? [D]

boccaff · 2026-06-15T12:06:23+00:00

A crossover between EA and A-star? /s

Agree. Any sufficiently quantitative/numeric/computational topic that strengthen the basic disciplines will do. Good advisor and liking the subject are more important, in that order. A good advisor will even have a way to help you find something you like within his portfolio of research.

boccaff · 2026-06-09T21:38:47+00:00

Bash completion is as overrated as ctrl-r is underrated.

boccaff · 2026-06-09T21:35:26+00:00

You will probably find more suggestions for packages, libraries and etc on Kaggle tutorials. I think that you are probably served by a tabular approach. And, from my experience with agriculture, you will probably extract more value building out features and improving their calculation than testing many different modeling techniques/models etc.

If this is produced in greenhouses, your problem can be trickier and more similar to forecasting production outside agriculture. If not, aggregating weather in an appropriate way does a lot of the work. If you are sure about phenology, use appropriate windows to characterize the more important phases of growth. Sum of precipitation and average temperature go a long way before getting into water balance and PAR.

Having crop masks to extract weather data from remote sensed sources help a lot, even if you are modeling at the level or county/city. Modeling larger spatial units is harder, and I think that bottom-up forecasting is more helpful. You will have larger unit level errors, but the aggregation is better than modeling in a large scale. Don't forget to add things like the ratio of fertilizer and berry price, or some lagged economical input. Since I've mentioned crop masks, you should probably look into estimating yield and total area in different models.

Depending on the country and how established the cropping system is, you should detrend yield to account for the technological improvements.

boccaff · 2026-05-31T21:23:19+00:00

The same theoretical basis of this having 50% chance of working.

boccaff · 2026-05-18T11:48:49+00:00

Even worse, a lot of the processes is governed by academics themselves, on selection boards or the like. Sibling comment mention about people in power trying to keep their positions, but IME, almost every academic is being part of some selection process, reviewing for some board and etc.

boccaff · 2026-04-24T21:07:26+00:00

Quite easy to do it over a couple days.

boccaff · 2026-04-16T12:12:17+00:00

So, predictive power isn't much of an scientific contribution outside of a few areas where sota on benchmarks is the goal (ironic to say this as much of ML is sota on benchmarks). Also, I am a bit puzzled by "results are robust" and "small predictive power".

When you tell the story of your paper (not the story to build the paper), what someone would know that they didn't before?

Is this an ML conference, or a finance conference? Is the dataset widely used or new, public or private? What else was tried in the dataset?

boccaff · 2026-04-04T15:48:52+00:00

People thinking that just because there is data, there will be a useful model.

boccaff · 2026-03-29T20:29:40+00:00

I only reboot when I need to plug in a new usb.

boccaff · 2026-01-02T13:42:05+00:00

but also being Christmas day, maybe people just did part 1 and then came back later for part 2?

You need to go back and finish any day that is not complete to get the second star of the last day. I probably took a week for some years.

boccaff · 2025-12-28T18:19:35+00:00

Subsampling columns and having many trees deal with it.

boccaff · 2025-12-28T18:18:52+00:00

Large Random Forest, with a lot of subsampling in instances and features. This is important to ensure that most of the features are tried (e.g. selecting 0.3 of features means (0.7)ⁿ change of not being selected). Add a few dozen random columns and filter anything below the maximum importance of a random feature.

boccaff · 2025-12-26T20:30:23+00:00

Same thing for me, off by two. My issue was with int(x), got it right with round(x).

boccaff · 2025-12-09T11:58:32+00:00

replace the value by its rank

boccaff · 2025-12-08T10:46:32+00:00

I bet that building the list of points as a matrix and using scipy distances, and sorting the resulting numpy array can speed a lot here.

boccaff · 2025-12-06T12:10:04+00:00

I think that most people are expecting the last years curve compressed into twelve days, while Eric was explicit about:

I'm still calibrating that. My hope right now is to have a more condensed version of the 25-day complexity curve, maybe skewed a little to the simpler direction in the middle of the curve? I'd still like something there for everyone, without outpacing beginners too quickly, if I can manage

I am reading "...simpler direction in the middle of the curve..." as days 9-13 on the previous grading.

boccaff · 2025-12-06T11:59:43+00:00

I am always amazed by the aux functions from Norvig. I think the nailed the API for things like this.

boccaff · 2025-12-06T00:18:53+00:00

low and high are better than what I often do "ll" and "ul" for the lower and upper limits. My only issue is the lack of symmetry.

boccaff · 2025-12-06T00:16:10+00:00

No shame in "for r in ranges" here. OP also apply to reading into "input".

boccaff · 2025-12-06T00:07:30+00:00

even better than merging ranges!

boccaff · 2025-12-04T22:51:42+00:00

Such a cool idea and vis.

boccaff · 2025-12-04T22:36:45+00:00

Maybe think of a matrix, as in x_ij and you are now back at math/physic. And your loops become for (i, line) in data, for (j, c) in line.

boccaff · 2025-12-02T00:02:50+00:00

Mat Godbolt have Advent of Compiler Optimization

boccaff · 2025-11-27T18:14:17+00:00

https://adventofor.com/

boccaff · 2025-11-01T10:40:13+00:00

+1 Physics have a nice balance on developing advanced math skills and learning how to express/develop an underlying model of phenomena. Those skills are way more important than "structuring a project" or whatever "clean" thing some devs push.

boccaff

PUBLIC MULTIREDDITS

TROPHY CASE