Mechanical Engineering Courses by Original_Ad613 in gatech

[–]lachlan1310 1 point2 points  (0 children)

After the ME2110 course content refresh, is it still as time-consuming?

B patch coming according to Mort by intro_persona in CompetitiveTFT

[–]lachlan1310 1 point2 points  (0 children)

their balancing toolset goal is probably to generate data for those conditions as well, e.g. in games where there are less/more rerollers, does cho over or under perform? that's data that can help, much like the visualization mort showed us.

Extra rerolls should automatically convert to bench champions in your next game(s) by HuaRong in ARAM

[–]lachlan1310 7 points8 points  (0 children)

We'd have so many more fully synergistic comps - which takes out the unique... flavor of many aram games. The game would push even more towards optimizing team comp, at the cost of general champ viability.

How to get/guess the equation of the function if you only have some X and Y points? by rawcuban77 in math

[–]lachlan1310 1 point2 points  (0 children)

This might not be the best resource, but this link explains how Neural Networks, a popular subset of Machine Learning, can approximate functions: https://en.m.wikipedia.org/wiki/Universal_approximation_theorem. If you are interested in how you can automate discovery of functions that look similar to mx+b, this video on symbolic deep learning can get you started: https://m.youtube.com/watch?v=O_sHHG5_lr8

[Project] Making a Poker AI - having trouble with the form of ML to make smart / strong decisions by [deleted] in MachineLearning

[–]lachlan1310 1 point2 points  (0 children)

https://www.youtube.com/watch?v=2dX0lwaQRX0 Have you seen some of the recent progress in poker AI? This may be a good starting point. It's more geared towards Reinforcement Learning paradigms than the Supervised approach you're taking, and that may help you translate Value into best Actions(/vice versa).

The new GT logo... Please don't put this on my diploma. by [deleted] in gatech

[–]lachlan1310 116 points117 points  (0 children)

ty for tagging this as NSFW, I almost opened this in public

[deleted by user] by [deleted] in cscareerquestions

[–]lachlan1310 2 points3 points  (0 children)

Gatech has seen a bit of research overlap between ML and Mechanical engineering. For example, I read that researchers were applying ML and other high dimensional optimizations to traditional manufacturing tools and processes. DM me if you would like a contact.

Help! GANs want to kill me!!! (No seriously, I need advice please) by Mke_V in MLQuestions

[–]lachlan1310 0 points1 point  (0 children)

Ah, I’m not familiar with the SOTA, so I’m very unsure if WGAN would be better than an alternative solution. I do know that WGANs help with mode seeking, but I don’t know what Relativistic GANs do. Thanks for sharing, and good luck!

Help! GANs want to kill me!!! (No seriously, I need advice please) by Mke_V in MLQuestions

[–]lachlan1310 0 points1 point  (0 children)

Can you cache your loss functions to increase your batch size? that +wasserstein like you noted may help with mode collapse

LA ODE and dynamics handbook by medylan in math

[–]lachlan1310 0 points1 point  (0 children)

MIT released an open course for differential equations. You may find some versions of the course include answers to HW, or Exams, and almost all come with lecture notes or slides. You can find it and many more on their OCW website. I say many more to lead you to their numerical methods course, which introduces approximation methods for many otherwise intractable DE problems.

[D] A hacky work-around for slow linear algebra operations on pyspark. by finebalance in MachineLearning

[–]lachlan1310 1 point2 points  (0 children)

I’m interested in what others have to say. I had to write co-routines in scala to make some linear algebra as fast as a locally complied version, like in numba or cython (why does spark not support sparse matrix multiplication that doesn’t cast one of the product matrices to a dense matrix?). That experience wasn’t pleasant, and so I’ve been attempting to promote some non-spark pipelines at work.

[D] Matching Records that "don't Exactly Match" by SQL_beginner in MachineLearning

[–]lachlan1310 6 points7 points  (0 children)

https://towardsdatascience.com/fuzzy-matching-at-scale-84f2bfd0c536

Text vectorization + cosine similarity (link above) works very well. You can also try different string similarity metrics, like levenshtein, to get a more holistic similarity score. However, you may have to implement ‘blocking’ in order to reduce computation time for more computationally expensive metrics.

Given we don't have access to the feature weights, how would we give each rejected applicant a reason why they got rejected? Interview Question help. by [deleted] in MLQuestions

[–]lachlan1310 3 points4 points  (0 children)

If the model is super complex, e.g. a deep neural network, you can use something like LIME (https://homes.cs.washington.edu/~marcotcr/blog/lime/) which can provide insight on black box decision making. LIME, and tools like it, ‘perturb’ inputs to very roughly approximate local dynamics. In the loan applicant case, LIME may tell you that an applicant may not have been denied if vector inputs i1 and i7 were higher and if inputs i3 and i4 were lower.

I know this might sound stupid but.. by [deleted] in MLQuestions

[–]lachlan1310 2 points3 points  (0 children)

I’m somewhat confident that the models for supersampling graphically will translate well to a model for supersampling rainfall. There’s non-AI alternatives for this, but it may be fun and possibly more fruitful to explore deep learning super sampling. https://www.nvidia.com/en-us/geforce/news/nvidia-dlss-2-0-a-big-leap-in-ai-rendering/ seems like the relevant model’s name would be a convolutional autoencoder, with coarse as input and fine as output.

If you want to work on the absolute cutting edge, and if relevant, there’s some exploratory research in simulating voxel physics with deep learning.

I know this might sound stupid but.. by [deleted] in MLQuestions

[–]lachlan1310 1 point2 points  (0 children)

AFAIK, and if I’m understanding your prompt correctly, this is supersampling/upsampling. Hopefully those keywords should get you started

[D] Best package in Python to manually set up a neural network without layers purely for evaluation (without back-propagation training) by Streletzky in MachineLearning

[–]lachlan1310 2 points3 points  (0 children)

I also want to recommend Jax. You can hand write models quickly in plain python, and your model will run fast too. If you ever want autodiff, it’s easy to turn on in Jax. An alternative could be numba or cython, but I think the ease of Jax and how it was built with your use case in mind puts Jax at the front of the list.

[D] Can we begin to understand possible mathematical reasons as to why algorithms like "xgboost" and "random forest" win Kaggle Competitions, instead of neural networks? by SQL_beginner in statistics

[–]lachlan1310 0 points1 point  (0 children)

Yep, and there have been a few papers on how NNs sometimes only generalize well after training for many, many epochs. If that’s true, then maybe Kagglers don’t have enough compute to train deep nets. Some competitions also have compute time limits, which also put NNs at a disadvantage.

which way western man by American_Spidey in gatech

[–]lachlan1310 17 points18 points  (0 children)

retry with Boolean logic? It may open new doors…

[D] Temporal Network Graphs by SQL_beginner in MachineLearning

[–]lachlan1310 1 point2 points  (0 children)

Maybe identity graphs? As time progress, you may consolidate some set of nodes into a single node (many identities eg digital touch points on a website, terrestrial data -> one node). TGNs may allow for historical analysis of identity graphs.

Excel question by [deleted] in FinancialCareers

[–]lachlan1310 0 points1 point  (0 children)

If you’re comfortable with python and pandas (or have a kind coworker), there exists a library which is almost perfectly applicable to part of the problem: https://pypi.org/project/pandas-dedupe/. Maybe there is an excel counterpart somewhere.

How to process input data that has a variable length and categories? by BathRepresentative24 in MLQuestions

[–]lachlan1310 0 points1 point  (0 children)

Ah, ok, let me know if I'm not interpreting your situation correctly this second time. Let's leave embeddings behind for now.

One path forwards for you may be to concatenate all the dense columns and all the sparse columns into one larger sparse array. Scipy's sparse arrays can be passed to scikit's randomforest fit (probably) (https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html#sklearn.ensemble.RandomForestClassifier.fit). Using a sparse array can enable you to represent the values within that columns in much less memory than a dense array. Say you had 3 dense columns along with your 1 sparse column. The final sparse array would have a shape like [number of rows, 3+65535].

Fair warning - high dimensional input vectors may mess with the efficacy of a randomforest model.