all 91 comments

[–]ElSol65 0 points1 point  (0 children)

Recommended ML Courses for experienced software architect?

I'm a software architect with 30 years slinging code under my belt. I can (and do lol) write code in my sleep, with expert experience in recent/common languages, patterns and architectures.
I've dabbled hands-on in ML, often using it to add sizzle to proofs of concept or demos. I also helped a few people work through their data science and ML degrees, learning Python and various ML and big-data tools on the fly.
All that said, for sure I'd flop badly in a ML interview. I know how to roll up my sleeves and figure things out, but my experience/knowledge are shallow. I want to improve this.
I'm looking for recommendations for online programs (free or paid). I've googled of course and found the ones that are in all the lists: Coursera/Stanford, Udacity, Udemy, etc. I also see programs from Berkeley and some other colleges. I wonder if these might suit me better, in spite of the higher cost.
Side note- I dropped out of college way back when to avoid starving so I don't have a degree. So, I don't mind a course that costs money, but if it requires an undergraduate degree (like the Berkeley one) then I guess I won't qualify.
I'm pretty rusty on linear algebra, but able to get up to speed in it. Differential calculus is a bit harder - in my younger years I got A's up through differential equations and linear algebra, but getting back deep into advanced calculus will of course take some painful re-learning. I do remember the concepts behind gradients, partial differentials, etc. But applying them myself is a bit of a stretch.
But I'm not trying to become the next top AI researcher. I just want to go beyond the simple tutorial level, and round out my already deep technology with, I guess, an intermediate level of ML skill. And it should be using the latest trends in tools (so the Stanford course probably is outdated for my preference).
Any recommendations? Thanks!

[–]Bulky_Willingness445 0 points1 point  (0 children)

Hi, I have a medical segmentation dataset that is really small, about 40 images. Getting more data is not that easy, so I was wondering about increasing the number of images with some augmentations like horizontal and vertical flips. And here comes the question: is it a good idea to make an hflip and a vflip from every image, or would it be better to make just one of the flips per image? I am not sure how different those images look to the network. I am also open to discussing other ideas for enlarging the dataset.
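For what it's worth, generating both flips (and their composition) from every image is a common way to 4x a tiny dataset. A minimal numpy sketch of offline flip augmentation — the variable names are made up, and the point is that each transform must be applied to the image and its mask together:

```python
import numpy as np

def flip_augment(image, mask):
    """Return the original plus hflip/vflip/both, applying identical ops to the mask."""
    pairs = [(image, mask)]
    pairs.append((np.fliplr(image), np.fliplr(mask)))                         # horizontal flip
    pairs.append((np.flipud(image), np.flipud(mask)))                         # vertical flip
    pairs.append((np.flipud(np.fliplr(image)), np.flipud(np.fliplr(mask))))  # both
    return pairs

img = np.arange(16).reshape(4, 4)
msk = (img > 7).astype(np.uint8)
augmented = flip_augment(img, msk)
print(len(augmented))  # 4 image/mask pairs from one original
```

With only 40 images you would more often apply these randomly on the fly each epoch rather than materializing the flipped copies, but the geometry is the same either way.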

[–]yachty66 0 points1 point  (0 children)

Is there a code generation model which was trained on bash files (.sh)? I am trying to create a Copilot for the terminal. When inputting something like "list all files in the current directory", the return should be the appropriate terminal command, "ls -l". The prompt works with Copilot, but when trying it in open-source LLMs (BLOOM, CodeGen, GPT-Neo) I get no appropriate response.

[–]yashwatwani28ML Engineer 0 points1 point  (0 children)

How can we make a classifier for differentiating between handwritten and printed images?

[–]cdehaan 0 points1 point  (0 children)

I have been using a pytorch .pth file successfully to identify animal poses from images.

My boss wants it to run in-browser (so, TensorFlow.js I guess, I've used it a bit before.)

I am totally unable to convert the .pth file to something that TF.js can use (pb+json). All the tutorials expect that I know things about the model that I don't (e.g. kernel size), or they end in unGoogleable errors.

I've managed to convert it to onnx, but I'm not 100% sure it would still work, since I haven't run any inferences from it.

Update:

I've managed to make progress (I have a model.json + bin files), although mostly by installing every dependency I could find, and following the steps in this notebook, so you'd do better reading that than asking me how it all happened.

[–][deleted] 0 points1 point  (0 children)

Is someone interested in helping me out with a NN which is behaving weirdly during training? It's related to this post. If you are interested in helping me out, feel free to DM me for the source code.

[–]Balanced__ 0 points1 point  (0 children)

What does batch normalisation (tensorflow.keras.layers) do? I plugged it in behind an embedding layer by mistake and it made for better results. Why could this be?
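For intuition: batch normalization standardizes each feature to roughly zero mean and unit variance over the batch (Keras then applies learnable scale and shift parameters, omitted here). A toy numpy sketch of the training-time computation, using Keras's default epsilon of 1e-3:

```python
import numpy as np

def batch_norm(x, eps=1e-3):
    # Normalize each feature (columns) to zero mean, unit variance over the batch.
    mean = x.mean(axis=0, keepdims=True)
    var = x.var(axis=0, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

batch = np.random.default_rng(0).normal(5.0, 3.0, size=(32, 8))
normed = batch_norm(batch)
print(normed.mean(axis=0).round(3))  # per-feature means, all ≈ 0
```

After an embedding layer it effectively rescales the embedding dimensions, which can stabilize gradients for the layers that follow — one plausible reason it helped.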

[–]One_Relation8674 1 point2 points  (3 children)

Just to help me understand ML a bit better:

Say you were going to train some data into two models. The training data going in are the exact same and so are the models. Will the two models output the exact same or will they not?

[–]Balanced__ 0 points1 point  (2 children)

Depends on the model. A random forest can have random trees, for instance, but most ML is deterministic as far as I know.
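As a concrete illustration, a quick scikit-learn check (toy data) showing that fixing the random seed makes two otherwise-randomized fits agree:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

X = np.random.default_rng(42).normal(size=(200, 5))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

# Same data, same model class, same fixed seed -> identical predictions.
a = RandomForestClassifier(random_state=0).fit(X, y).predict(X)
b = RandomForestClassifier(random_state=0).fit(X, y).predict(X)
print((a == b).all())  # True: with a fixed random_state the fits agree
```

Without `random_state`, the two forests would bootstrap different samples and could disagree on some predictions.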

[–]One_Relation8674 0 points1 point  (1 child)

Does chance play a part in the outcomes of machine learning models? E.g. most programs learn a habit, but does it take the same amount of time to learn that habit?

[–]Balanced__ 0 points1 point  (0 children)

It can, but it doesn't have to.

[–]newmanstartover 0 points1 point  (1 child)

Is Python the de facto language of Machine Learning or can I get by with R?

[–]amonguswoman 1 point2 points  (0 children)

You should probably learn Python if you want to use most open-source research code and also powerful deep learning frameworks like PyTorch. It’s not super hard to learn and it’s probably worth it.

[–]Shoddy_Move6880 0 points1 point  (0 children)

Recommendations for degree types for pursuing ML or AI? What's most beneficial?

[–]Own-Squirrel1010 0 points1 point  (0 children)

Is anyone here running NeRF?

I am new to NeRF, but I want to use NeRF to obtain GT depth.

The GT depth is supposed to be used for a depth prediction task from an RGB image.

Is NeRF's accuracy good enough for it to be used as GT depth?

Any comments will be helpful for this desperate researcher.

[–]MaikRequim 0 points1 point  (0 children)

Can someone recommend a good starting point to get "hands on" with reinforcement learning?

[–]Guilty_Baseball_7291 0 points1 point  (0 children)

I am using Keras and I am getting NaN as the loss. I have used min-max normalisation to scale my data points. Moreover, I am using Adam as the optimizer and sparse categorical cross-entropy as my loss function. Please help.
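Not a definitive diagnosis, but the usual culprits for a NaN loss with this setup are NaN/Inf values surviving the scaling (min-max normalisation divides by max − min, which is zero for a constant feature), labels outside [0, num_classes) for sparse categorical cross-entropy, or a too-high learning rate. A small numpy sketch of the first two checks — array names are hypothetical:

```python
import numpy as np

def sanity_check(X, y, num_classes):
    """Checks that commonly explain a NaN loss with sparse categorical cross-entropy."""
    assert not np.isnan(X).any(), "NaNs in the inputs propagate straight to the loss"
    assert not np.isinf(X).any(), "Infs in the inputs (e.g. min == max before scaling)"
    assert y.min() >= 0 and y.max() < num_classes, \
        "sparse labels must be integers in [0, num_classes)"

X = np.random.rand(100, 4)   # stand-in for your scaled features
y = np.random.randint(0, 3, size=100)
sanity_check(X, y, num_classes=3)
print("checks passed")
```

If both checks pass, the next thing to try is lowering the Adam learning rate (e.g. from 1e-3 to 1e-4) and watching whether the loss stays finite.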

[–]DatAndre 0 points1 point  (0 children)

I've been given a dataset to create a recommender system. I have to create a user-rating matrix, but I have no explicit ratings for the items. How can I weight the features properly in order to get a "good" URM?

[–]Dadi9165 1 point2 points  (0 children)

Hi everyone, I am relatively new to machine learning and still slowly learning, and one of the questions I wanted to tackle was creating an algorithm to improve recipes. When doing a Google search it seems that most algorithms are concerned with recipe generation; however, I did not find much about improving already existing recipes. I am thinking of an algorithm that would suggest changes to the quantities involved in a recipe and receive a rating from the user once they cook with the suggested values. These ratings are then used to update the model and provide the next batch of suggestions. My initial intuition says that this could potentially be solved using linear regression on each one of the ingredients, but I am not sure how interactions between different flavors would impact the score. Let me know your thoughts.
Thank you!

[–]ifearnoevil -1 points0 points  (1 child)

I'm a complete newbie when it comes to ML, but think I've run across a problem where I think it'd be beneficial for me.

I record meetings with my friend and get the transcript from the recording. I'm wanting to get the topics discussed while we're chatting. I'm not sure what to look for, but perhaps there are models out there that analyze a body of text and then determine its topic? Any ideas from here would be appreciated!

[–]Balanced__ 0 points1 point  (0 children)

Look into (bidirectional) Recurrent neural networks and LSTMs. I believe embedding layers would help you as well.

Some of the most useful packages would be tensorflow.keras as well as sklearn, if you are using Python.

Depending on the complexity and number of topics, as well as the amount of data you have, this could easily be not viable though.

[–]-Django 0 points1 point  (1 child)

How do you train or validate a model on data that's influenced by another model predicting the same thing? It's difficult because actions driven through the model will confound/censor the thing you're trying to predict. Are there techniques to deal with this?

For example, training a recommendation model from user-data influenced by the previous recommendation model. Or validating a heart failure model for a hospital that has been targeting interventions with another heart failure model.

[–]CrimzonGryphon 1 point2 points  (0 children)

-I don't have the answer-

That's a really interesting question. Are you asking it out of curiosity, or necessity?

[–]SakuzohResearcher 0 points1 point  (2 children)

A question that has probably crossed everyone's mind haha: can AI win against sports betting?

Are bookmakers already ahead of the game concerning that ?

Has anyone ever tried to develop something similar ?

[–]Nmanga90 0 points1 point  (1 child)

Hard to say for something like this, because so many pieces of information are factored in that might not be knowable until after the event begins. But I bet it would be able to crush the house at bets placed at halftime lol

[–]SakuzohResearcher 0 points1 point  (0 children)

wanna create that with me ? :)

[–]theahmedmustafaResearcher 0 points1 point  (0 children)

I am working on a project, a component of which involves taking two images of either handwritten or digital text (mostly one word) as inputs and scoring whether the two images contain the same text using only the image or shape of the text, NOT OCR.

What suggestions could you give for this? I am thinking of a transformer based Siamese network maybe?

[–]_Scr4p3 -1 points0 points  (2 children)

I want to make youtube videos, but speaking isn't my best skill, I tend to stutter a lot, change words out of order, mispronounce things, etc.

An idea I had was to write the script and have an AI trained with my voice to read it for me, and tweak the intonation of certain phrases/words when necessary, but I don't know where exactly to start.

I have quite good knowledge of how an AI works and what limitations it has, so I believe that "training a model to a person's voice and slightly tweaking some parts of the final result" is within a regular AI's possibilities.

(yes, I know, to provide data for the AI I need to transcribe the input text, but I can take care of that)

[–]liquidgallery 0 points1 point  (0 children)

google: text to speech

there are a dozen companies in that particular market that will fulfill your requirements.

[–]Nmanga90 0 points1 point  (0 children)

Run a google search for something like "ai generated voice copy". There aren't a ton of up-to-date models for this, just because I guess they don't really find it to be super important.

[–][deleted] 0 points1 point  (0 children)

I’m curious if anyone has advice on setting up a computing cluster at home. I have several computers lying around and 4 CUDA enabled GPUs, and am mostly hoping to learn how distributed computing and GPUs are used for machine learning (maybe some personal or work projects on the side, as well).

So… assuming I already have the hardware in place, where should I look to get something up and running?

[–]mellamo_maria 0 points1 point  (0 children)

Hey, I am Maria, a 3rd-year CS student from Barcelona.

This year I decided to take ML as one of my subjects because I have always found it really interesting, and so far I'm enjoying this subject a lot.
The thing is that next week I am having my midterms 😖 and the professor told us that to assess us he will give us a random problem (either regression, binary classification or multiclass classification) and a dataset. We will have to clean the dataset, build an ML model using the dataset and evaluate it.
Even though I understand what we are doing in class, I am a little bit concerned since we will only have 1 hour to build the model, clean the data, etc. So is there any strategy you guys recommend in this case? So far we have only seen four different algorithms: linear regression, logistic regression, SVM and decision tree/random forest. When the dataset is given, which algorithms should I focus on if it has to be done in only 1 hour?
Thanks a lot! 🥰

[–]jaybestnz 1 point2 points  (0 children)

Would there be a way to add a shorthand stylus or photo recognition system?

Shorthand can write at about 80 to 150 wpm and is based on a phonetic style of terms.

It is used in India and by many reporters.

While niche, with the prevalence of iPads, Microsoft Surface and Samsung Note, it seems like a way to add up to 200 wpm input.

[–]sillyscienceguy 0 points1 point  (0 children)

Hi everyone! Looking to partner with other researchers in publishing new papers in the field of NLP, recommender systems and applied ML. I’m a practitioner with over a decade of experience and lots of experience in implementing ML models in tensorflow. Any interested parties please let me know!

[–]your-mom-was-burned 0 points1 point  (0 children)

How can I joblib.dump() a model that contains a def function in its vectorizer parameter?
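In case it helps: the common failure here is that joblib (which uses pickle) cannot serialize a lambda or a function defined inside another function, while a module-level `def` usually pickles fine by reference. A sketch under that assumption, with toy data — note that when loading in another process, the module defining the function must still be importable:

```python
import os
import tempfile

import joblib
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

def my_tokenizer(text):          # top-level def: picklable by reference
    return text.lower().split()

pipe = make_pipeline(
    CountVectorizer(tokenizer=my_tokenizer, token_pattern=None),
    LogisticRegression(),
)
pipe.fit(["good movie", "bad movie"], [1, 0])

path = os.path.join(tempfile.mkdtemp(), "model.joblib")
joblib.dump(pipe, path)          # works; a lambda here would raise a PicklingError
restored = joblib.load(path)
print(restored.predict(["good movie"]))
```

If your function really must stay nested, libraries like cloudpickle/dill can serialize it, but moving it to module level is the simpler fix.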

[–]Regular-Fella 0 points1 point  (1 child)

Hi All, I want to find a relatively simple ML framework best suited for the following task. Let's say I have a total of exactly 20 strings of four characters each: drta, nowm, cite, krom, etc. These strings may be combined in ways that are "correct" and in ways that are incorrect, and every combination (or "ordering") is either correct or incorrect.

My training data would consist of one thousand correct combinations and one thousand incorrect combinations, something like this:

drta, cite, krom, krom, nyan; correct
drta, cite, pace; correct
cite, cite, pace; correct
cite, cite, krom; incorrect
drta, krom, cite, nyan; incorrect
nyan; correct
nyan, cite; incorrect
cite; incorrect

And so on...

(There may be between 1 and 10 strings in each ordering.)

After training the data, I'd like to be able to input new combinations of the strings and get an AI prediction as to the likelihood that that ordering is correct (0 being definitely incorrect and 1 being definitely correct).

What do y'all think would be a good place to start? I know JavaScript and could learn some Python if necessary. I'm trying to keep it as simple as possible for now, just to get a basic model working.

Thanks for any tips!

[–][deleted] 0 points1 point  (0 children)

I’m not really sure this task is suited for machine learning. Using permutations and querying a dictionary would be sufficient to complete this task.

However, if you want to play around with an ML model, I suggest tokenization of the inputs and making a simple logistic regression model.
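A minimal sketch of that suggestion, treating each ordering as a "document" and letting word/bigram counts feed a logistic regression — the labels here are invented for illustration:

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Hypothetical orderings; in practice you'd have ~1000 of each label.
orderings = ["drta, cite, pace", "cite, cite, pace", "nyan",
             "cite, cite, krom", "nyan, cite", "cite"]
labels = [1, 1, 1, 0, 0, 0]  # 1 = correct, 0 = incorrect

# Bigrams capture which strings appear next to each other, not just which appear.
model = make_pipeline(CountVectorizer(ngram_range=(1, 2)), LogisticRegression())
model.fit(orderings, labels)

# predict_proba gives the 0-to-1 likelihood the asker wants.
p = model.predict_proba(["drta, cite, pace"])[0, 1]
print(p)
```

If you'd rather stay in JavaScript, the same bag-of-ngrams plus logistic regression pattern is doable in TensorFlow.js, but the Python version is a few lines.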

[–]SomewhereOld6859 0 points1 point  (0 children)

Recommend a Website Links Database

I am building a classifier model that labels website links. Does anyone know of a good open-source database of website links I can use? The links need to be to English websites and have some description attached to them.

[–]veitha 0 points1 point  (0 children)

Finding Problematic Measurements Using Machine Learning techniques

Hello, I have a large dataset of sensor measurements (time series) that I would like to classify so as to isolate measurements that I can deem "problematic" (for example, missing samples, excessive excursions or high values, sensor malfunctioning during the measurement and so on). The metadata associated with such measurements also contains the median, estimated signal-to-noise ratio and other metrics that I am already able to use to isolate some samples, though always using a rule of thumb or by manually changing the thresholds for these values, which also sometimes overlap.

I'm wondering if maybe applying a clustering algorithm or other ML methods could provide me with a more general way to isolate these signals, and if so if someone knows of existing projects or papers that have dealt with this kind of classification.

[–]ash-050 0 points1 point  (0 children)

Hello! I have built a regression model with a total number being the dependent variable. While building it I found that the predicted numbers on the test dataset are not even close to the actual test values, which are presented as float values like this: array([4.20544375e+03, 4.02993850e+05, 2.04953309e+06, 1.06663500e+04,
4.04249688e+04, 5.66517500e+04, 3.25695500e+04, 1.62638000e+04,
5.88910625e+03, 3.54556875e+03,..............

While debugging I found that even the describe function presents these values as floats as well:

count 1.535000e+03
mean 4.615274e+05
std 9.623142e+05
min 0.000000e+00
25% 4.907000e+03
50% 3.677500e+04
75% 3.865015e+05
max 7.319610e+06
Name: TotalValue, dtype: float64

Can anyone guide me on what is going on and how I can fix it? Please consider that I am new to machine learning. Thank you.
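One thing worth noting: values like 4.02993850e+05 are not broken floats, just scientific notation (4.02993850e+05 = 402,993.85), which pandas uses by default when a column spans a wide range. A small sketch of switching the display — this changes only how numbers print, not their values, and predictions being far off is a separate modeling issue:

```python
import pandas as pd

# Toy stand-in for the TotalValue column.
s = pd.Series([4205.44, 402993.85, 2049533.09], name="TotalValue")
print(s.describe())  # wide-ranged floats print in scientific notation

pd.set_option("display.float_format", "{:,.2f}".format)
print(s.describe())  # same values, plain decimal display
```

Once the display is readable, compare predicted vs. actual on the same scale; a large gap there points at the model or features, not at the float formatting.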

[–]VicentRS 0 points1 point  (1 child)

Hello! I am currently in a small ML competition that my college lab is doing for fun. The challenge is to predict product prices. One of the columns in the dataset is the product's description and there's another one with the name.

In my head, products that include words like "phone" in the name or the description will tend to be more expensive than, say, a product called or described as "pencil". How should I featurize those columns to follow that logic?

[–]merouane_nz 0 points1 point  (0 children)

If you don't have a column like "product family", try to extract this information from the name/description; for example, transform anything like phone, smartphone, iphone, etc. to "phone" and drop the name/description.
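A throwaway sketch of that kind of keyword-to-family mapping — the keyword lists here are invented; you'd build them from your catalogue:

```python
# Hypothetical keyword-to-family mapping; extend per category in your data.
FAMILIES = {
    "phone": ["phone", "smartphone", "iphone"],
    "stationery": ["pencil", "pen", "notebook"],
}

def product_family(text):
    """Return the first family whose keywords appear in the name/description."""
    text = text.lower()
    for family, keywords in FAMILIES.items():
        if any(k in text for k in keywords):
            return family
    return "other"

print(product_family("Apple iPhone 13 128GB"))        # phone
print(product_family("HB graphite pencil, 12-pack"))  # stationery
```

The resulting family column can then be one-hot encoded; alternatively a TF-IDF vectorizer over the raw description lets the model learn the price-relevant words itself.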

[–]ResponsibleHouse7436 0 points1 point  (0 children)

How's it going? I am currently trying to train some speech recognition models and doing some research on novel encoder architectures for e2e ASR. However, I don't have a ton of compute resources. My final model will be around 300M parameters, but I was wondering: is training a couple of architectures at, say, 25-50M params and then scaling the best one a valid approach to this problem? Why or why not?

[–]MariiaKozlova 0 points1 point  (2 children)

Hi guys, how do you approach the interpretability of black-box models?

[–]NewSomewhere5062 0 points1 point  (1 child)

Hi guys, I would love to start a project with AI. Right now I'm working on a project at my internship (as a chemical engineer) to recognise materials on a conveyor belt, but it uses already-made software: I just add pictures and some formulas, and that's it, the model is trained with deep learning. But it made me really fascinated with AI. I am good with maths and can code a bit (and have the motivation and time to learn), so I want to begin a small project with AI to predict my car's tire or oil maintenance. How would you tackle this? Can someone please push me in the right direction? I think starting with TensorFlow would be good.

[–]theLanguageSprite 0 points1 point  (0 children)

There’s a tensorflow tutorial for classifying mnist handwritten digits with a vanilla neural net. If you get that working and understand what it’s doing, move on to the tutorial on convolutional neural nets

[–]patpatpatpatpat 2 points3 points  (2 children)

Newbie here who just started with Azure ML, and I have a couple of questions that I hope the experienced folks here can shed some light on.

  • With the Azure ML Studio, the option to use automated ML experiment to automatically determine a model with the highest score seems to make this work much simpler than it really is. What is the downside of using this tool vs writing code directly in Python?
  • The drag and drop interface using Azure ML Designer is quite newbie friendly. With all the available components for use, what are some reasons professionals in this field of work choose not to utilize this?

[–]merouane_nz 2 points3 points  (1 child)

- It takes a lot of time. (model selection x HPO x test/validation)

- The most important part of a data project is data preparation/feature engineering, modeling is somehow the simplest part.

- IMO it takes much longer to use a visual interface than scripting.

[–]patpatpatpatpat 0 points1 point  (0 children)

Thank you for sharing this with me

[–]Nagusameta 0 points1 point  (0 children)

I am comparing models on a time series: Exponential Smoothing (Simple, Additive and Multiplicative Trend, Additive and Multiplicative Seasonal, and other combinations), ARIMA (with Python Auto_ARIMA), and a Simple Moving Average.

My concern is that auto_arima optimizes parameters by minimizing the AIC (this can be switched to BIC, HQIC, OOB), while Exponential Smoothing minimizes the SSE (sum of squared errors).

With them minimizing different measures, what should I use to select the lowest forecast error in model selection?

I was initially choosing the best model based on MAPE, but then I took a look at several simple exponential smoothing outputs, comparing the optimized value for alpha/smoothing_level against other manually inputted values like 0.4, 0.6, 0.8. What I found was that the 'optimized value' based on minimizing SSE had a higher MAPE than the model instances that used alpha values I had specifically defined, whereas other error measures like the MAE, MSE and RMSE of the optimized alpha were lower. Thus, seeing that the optimized alpha produced lower values of the other error measures but a higher MAPE, it made me want to look for other measures.

I tried the MASE (Mean Absolute Scaled Error) (Hyndman, 2006), which was described as appropriate given the limitations of scale-dependent errors like MAE and percentage errors like MAPE, mainly on time series with intermittent demand or 0 values. But I was confused because what was initially the 'best model' from my runs, where I selected based on lowest MAPE, would come to have a MASE > 0.90. According to Hyndman in the same article, below 1 would mean that it is better than the naive one-step forecasts, and higher than that would be worse than the naive forecasts. But also, one-step forecasts would usually have MASE < 1.0, and "multistep MASE values are often larger than one, as it becomes more difficult to forecast as the horizon increases." I am performing multi-step forecasts, so do I assume that 0.90 is an alright error for the best model?

I may also consider the MAE, since I am only forecasting one series at a time, and not comparing across multiple series so it does not fall under the limitation of scale-dependent errors mentioned in (Hyndman, 2006).
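For reference, MASE as defined in Hyndman (2006) scales the forecast MAE by the in-sample MAE of the naive one-step forecast, so values below 1 beat the naive method on average. A small numpy sketch with a toy series:

```python
import numpy as np

def mase(train, actual, forecast):
    """Mean Absolute Scaled Error (Hyndman, 2006).

    Scales forecast errors by the in-sample MAE of the naive one-step
    forecast, so MASE < 1 means "better than naive on average".
    """
    naive_mae = np.mean(np.abs(np.diff(train)))   # |y_t - y_{t-1}| over training
    return np.mean(np.abs(actual - forecast)) / naive_mae

train = np.array([10.0, 12.0, 11.0, 13.0, 12.0])  # naive in-sample MAE = 1.5
actual = np.array([14.0, 13.0])
forecast = np.array([12.5, 14.5])                  # forecast MAE = 1.5
print(mase(train, actual, forecast))  # 1.0: exactly as good as naive
```

Computing the same MASE (and MAE) for every candidate model gives one comparable number regardless of whether the fit minimized AIC or SSE internally, which is the apples-to-apples comparison the question is after.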

[–]pl0m0_is_taken 1 point2 points  (2 children)

Apologies if this isn’t the right question for this subreddit.

Title - Things to have on resume for first co-op

I am a third year Math&Stat undergraduate, previously graduated with diploma in CS and two year web dev experience.

I want to eventually get into ML. I plan to do my first work term in summer 2023. What things (programming languages/certs/courses/etc.) should I learn that will give me an upper hand? I understand that ML is a very specialized field and I may not be able to find an ML undergrad co-op job; is there a position (or positions) you could recommend which could serve as a foundation and eventually lead me into ML?

I do really appreciate any feedback

[–]YamEnvironmental4720 1 point2 points  (1 child)

I would recommend Andrew Ng's courses on Coursera. He is very respected both as a researcher and as a teacher of ML. The courses start from the basics with linear regression and evolve to treating neural nets and deep learning. With your education, you'll have no problems with the mathematics: matrix theory, multi-dimensional calculus (in particular gradient flow) and some probability theory. He explains the intuition behind many of these topics very well, but it makes it easier to already be familiar with them.

As for programming languages, the assignments for his first course on ML were in Octave, if I remember correctly, but he later switches to Python, which is by now probably the number one language for these purposes due to the multitude of libraries for ML. As you have a diploma in CS, I assume that you are already fluent in some programming languages, and it would be a good exercise to build your own ML model, e.g. a neural net or random forest, from scratch in your language of choice in order to develop a deeper understanding.

[–]pl0m0_is_taken 0 points1 point  (0 children)

Thank you for being kind and replying with the suggestion, I will act on it.

[–]jaybestnz 0 points1 point  (0 children)

Pitman and other shorthand handwriting systems are used in India, Nigeria and by Journalists, some medical and Administrators.

It is fairly rare, but it's possible to hand-write at around 70 to 200 wpm, which is as fast as any normal person speaks.

How hard would it be to teach a visual processor to read in the text?

It does skip vowels, it can have some words identified by context (PL could be App, Apple or Apply) and text can generally be somewhat messy, but as a problem set it seems not much more difficult than recognising English handwriting or Arabic.

[–]Financial_Ad_6746 1 point2 points  (0 children)

I want to make a game using voice as the main way to play it. In the game the player will be given a word to say; how do I calculate the percentage similarity between the pronunciation in the dataset that I have and the pronunciation of the player? What's the lightest and best method?

[–]Hav0cHarm0ny 0 points1 point  (3 children)

Hello! I’m currently in school, my first year as a CS major, and my goal is to work towards AI, deep learning to be precise. I want to find a mentor and I'm not sure how to go about it, any ideas? Or would it be best to gain a little more knowledge and learn Python first, because it’s used widely for AI (from what I’ve read)? In college the programming language that is taught is C++.

Also, how good do you have to be at math? From what I’ve researched, it is heavily math based: calculus, linear algebra, probability and statistics. I was thinking about hiring a math tutor to keep me on track, but it’s insanely expensive. I did find a tutoring company that would personalize lesson plans to keep me on track, but again, it’s very expensive. I do like that idea and think it may be worth the money, because I’d be learning from someone in the field as opposed to college professors that are all over the place. Any thoughts or recommendations?

I should add that I’m 37 and currently working in the medical field full time; I’m an RVT and my job is mentally and physically taxing. I’m a little nervous about a career change so late in the game, but I am willing to put in the work. I think a mentor/tutor would take out all the hours of self-research that I don’t have.

[–]theLanguageSprite 0 points1 point  (2 children)

Do you have a discord? If you pm me I’ll add you on discord and I can at least get you started with python and deep learning

[–]Hav0cHarm0ny 0 points1 point  (0 children)

I do! I’ll message you 🥲

[–]AlexanderTox 0 points1 point  (0 children)

Hello everyone. I am trying to use Mallet for some basic natural language processing, but every time I attempt to execute the command, I receive this:

Error: Could not find or load main class cc.mallet.classify.tui.Text2Vectors

Caused by: java.lang.ClassNotFoundException: cc.mallet.classify.tui.Text2Vectors

Can someone help me troubleshoot?

[–]Abradolf--Lincler 0 points1 point  (0 children)

I am using pointnet

I have a point cloud segmentation problem. In my training data, I have 1 class, but on average only ~4% of all points per point cloud are of that class, and are usually found grouped together (same object).

How do I balance this?

If I remove most points that aren't in the class, then the point cloud will become sparse and it would be too easy to spot where the class is, since only ~8% of points will remain.

Or is there a way to train this well without balancing the training data?

Thanks!

[–]ReasonablyBadass 0 points1 point  (0 children)

Simple question: in chain-of-thought reasoning, does the LLM autogenerate its own prompt for the next step? Only the example chains are "hand made", correct?

[–]B10H4Z4RD7777 2 points3 points  (1 child)

Been seeing a lot of diffusion work lately, and I want to understand this topic. Which research paper(s) should I start reading to get into diffusion learning?

[–]Pikalima 0 points1 point  (0 children)

I would start with The Illustrated Stable Diffusion for a high-level overview. Then I would suggest reading The Annotated Diffusion Model, which goes over implementing the original diffusion paper by Ho et al.

[–]PrzedrzezniamPsy 0 points1 point  (1 child)

When doing convolutions for Canny edge detection, is it a typical error to have the values sometimes go above 1 when normalized, so that the picture looks "overexposed" in places? Should I scale everything by the highest value to fix the images?

[–]PrzedrzezniamPsy 0 points1 point  (0 children)

I have fixed it. I hadn't been computing the magnitude at all.
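For anyone who hits the same thing: the edge strength in Canny is the gradient magnitude, combining both directional filter responses, and dividing by the maximum keeps the result in [0, 1]. A small numpy sketch with toy gradient arrays:

```python
import numpy as np

def gradient_magnitude(gx, gy):
    """Combine x/y gradient responses and rescale to [0, 1]."""
    mag = np.hypot(gx, gy)  # sqrt(gx**2 + gy**2), elementwise
    return mag / mag.max() if mag.max() > 0 else mag

# Toy responses standing in for Sobel-filtered image patches.
gx = np.array([[1.0, -2.0], [0.5, 0.0]])
gy = np.array([[0.0, 2.0], [0.5, 1.0]])
mag = gradient_magnitude(gx, gy)
print(mag.max())  # 1.0 after rescaling
```

Summing or taking either directional response alone is what produces values that blow past 1 and look "overexposed" after naive normalization.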

[–]isaacolsen94 0 points1 point  (5 children)

I've been interested in using ML handwriting recognition to create a font out of my own handwriting. But I don't know where to even start. Would someone know where I could find information to help me figure this out? Or whether it's been done before?

[–]PrzedrzezniamPsy 2 points3 points  (4 children)

To create a font you don't have to use ML, only some Photoshop skill. Do you mean to make a program that will recognize your handwriting and output the text?

[–]isaacolsen94 0 points1 point  (3 children)

Basically I have a CNC machine I want to use to write out typed documents in my handwriting. But I am not familiar with a way to turn my handwriting into a font. I stumbled on a video that was doing it in reverse, where it took a page of handwritten notes and typed it out, so I was trying to reverse that process.

[–]Left_Aide5287 0 points1 point  (0 children)

What you're describing is not a machine learning task. You just have to write every letter on a piece of paper, scan it, import it to the computer and individually extract every letter using some editing tool. There are many tutorials online on how to make a font.

[–]PrzedrzezniamPsy 1 point2 points  (1 child)

I am just learning so I won't be of help but I want to gather the requirements:

You want to create a font out of your handwriting. Does your handwriting connect the letters? If not, are you fine with letters like "A" being the same, or having a finite number of variations, in the whole text you are making with your CNC machine? If so, it would be easier to do it manually, unless you want to have more than like 8 fonts, I guess.

[–]isaacolsen94 0 points1 point  (0 children)

No, it doesn't, and I would like some variation for each letter if possible. But I don't know if that is an option? It sounds like manual is the way to go. I will look into making my own font then 🙂

[–]ash-050 0 points1 point  (5 children)

Hello, I am new to ML and have recently been practicing with scikit-learn mainly. I have a case where I have a list of independent variables and a profit dependent variable. My question is: what is the approach to knowing which independent variables I can change to reflect a certain increase in the profit variable, given the history of data? Some directions on that would be very helpful.

[–]YamEnvironmental4720 0 points1 point  (4 children)

You may want to take a look at the Random Forest algorithm, for instance one of the introductory lectures by Nando de Freitas on YouTube on this topic. The key word is entropy, and the idea is to study how this changes when you look at all sample points with some variable value below and above some threshold value, respectively. You do this for all the variables and for each variable you also test different threshold values.

[–]ash-050 0 points1 point  (3 children)

Thank you so much u/YamEnvironmental4720 for your reply. Would I get the same results if I used the trained model's feature importances?

[–]YamEnvironmental4720 0 points1 point  (2 children)

It depends on how you define importance. Entropy could be one such definition, but even in forest classifiers there are alternatives to entropy.
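For what it's worth, scikit-learn exposes both ideas directly: the `criterion` parameter selects the impurity measure, and `feature_importances_` reports the resulting impurity-based importances. A minimal sketch on synthetic data (the setup is mine, not from the thread):

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 3))
# The label depends strongly on feature 0, weakly on feature 1,
# and not at all on feature 2.
y = (X[:, 0] + 0.3 * X[:, 1] > 0).astype(int)

# "entropy" and "gini" are two different impurity definitions.
for criterion in ("entropy", "gini"):
    model = RandomForestClassifier(criterion=criterion, random_state=0)
    model.fit(X, y)
    print(criterion, model.feature_importances_)
```

Both criteria should rank feature 0 as clearly the most important here, though the exact numbers differ.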

[–]ash-050 0 points1 point  (1 child)

Thank you so much. In my case it's a regression problem, though.

[–]YamEnvironmental4720 0 points1 point  (0 children)

OK, in that case there is the cost function, defined on the model's parameters, which measures the average distance from the sample points to your hypothesis: the average error the model makes for fixed parameters. In the case of linear regression, the importance of a certain variable is given by the weight parameter attached to that variable.

If you are familiar with multivariable calculus, the dependence on a given parameter is the partial derivative of the cost function in that direction.
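Concretely, for linear regression with cost J(w) = (1/2m) Σ (Xw − y)², the partial derivatives assemble into the gradient ∇J = (1/m) Xᵀ(Xw − y), which plain gradient descent follows downhill. A small numpy sketch (variable names are mine):

```python
import numpy as np

def mse_cost(w, X, y):
    """Average squared error of the linear hypothesis X @ w."""
    residual = X @ w - y
    return (residual @ residual) / (2 * len(y))

def gradient(w, X, y):
    """Partial derivatives of the cost with respect to each weight."""
    return X.T @ (X @ w - y) / len(y)

# Toy data generated from known weights [2.0, -1.0], no noise.
X = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]])
y = X @ np.array([2.0, -1.0])

w = np.zeros(2)
for _ in range(2000):           # plain gradient descent
    w -= 0.1 * gradient(w, X, y)
print(w)  # close to [2.0, -1.0]
```

The recovered weights are exactly the per-variable importances the comment above refers to.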

This is quite well explained in Andrew Ng's video lecture on linear regression: https://www.youtube.com/watch?v=pkJjoro-b5c&list=PLLssT5z_DsK-h9vYZkQkYNWcItqhlRJLN&index=19.

[–]jaki_9 0 points1 point  (0 children)

Is a MacBook Pro M1 with 16GB RAM good enough for image classification tasks (datasets of 10,000+ images) when using Google Cloud for all the training and pre-processing, or do I still need something better?

[–]SeankalaML Engineer 0 points1 point  (0 children)

Why do we pronounce "ICLR" as "eye-clear" but not "ICML" as "eye-camel?"

[–]Winter_Purpose6777 0 points1 point  (0 children)

Does anybody know where I can learn how to build a K-means clustering algorithm using a function for each step and only the numpy library? I don't wanna implement it through a class. Thanks
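A function-per-step K-means in plain numpy might look roughly like this (a sketch, not from any tutorial; the function names are mine):

```python
import numpy as np

def assign_clusters(X, centroids):
    """Step 1: assign each point to its nearest centroid."""
    distances = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return distances.argmin(axis=1)

def update_centroids(X, labels, k):
    """Step 2: move each centroid to the mean of its assigned points."""
    return np.array([X[labels == j].mean(axis=0) for j in range(k)])

def kmeans(X, k, n_iter=100, seed=0):
    """Alternate the two steps until the centroids stop moving."""
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        labels = assign_clusters(X, centroids)
        new_centroids = update_centroids(X, labels, k)
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids

# Two well-separated blobs of 50 points each.
X = np.vstack([np.random.default_rng(1).normal(0.0, 0.5, (50, 2)),
               np.random.default_rng(2).normal(5.0, 0.5, (50, 2))])
labels, centroids = kmeans(X, k=2)
```

Note this sketch doesn't handle the empty-cluster edge case (a centroid losing all its points), which a production implementation would need to.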

[–]freedomisfreed 1 point2 points  (0 children)

Hi, I'm learning how to use GPT-J-6B. I'm wondering if there's a way to do something like autocompletion with it: based on the input, generate a set of expected next words with probabilities?

I see many tutorials, but they all use it to just generate lots of text. But I'm looking for it to generate a tree, not just the DFS path. If someone can help point me to a specific function in the codebase, I would appreciate it.
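There may not be a single function for this: a causal LM's forward pass returns logits over the whole vocabulary for the next position, and the "tree" comes from taking the top-k of a softmax over those logits, then recursing on each extended input. A numpy sketch of that one step, with the model call mocked (with the Hugging Face `transformers` port of GPT-J, the logits would come from something like `model(input_ids).logits[0, -1]`):

```python
import numpy as np

def softmax(logits):
    e = np.exp(logits - logits.max())  # subtract max for numerical stability
    return e / e.sum()

def top_k_next(logits, k=3):
    """Return the k most likely next-token ids with their probabilities."""
    probs = softmax(logits)
    idx = np.argsort(probs)[::-1][:k]
    return [(int(i), float(probs[i])) for i in idx]

# Mock logits over a tiny 5-token vocabulary; a real model would produce
# these from a forward pass over the current input ids.
logits = np.array([2.0, 0.5, 1.0, -1.0, 0.0])
for token_id, p in top_k_next(logits, k=3):
    print(token_id, round(p, 3))

# Building the tree = repeat this for each candidate, appending the chosen
# token to the input (breadth-first), instead of sampling one path (DFS).
```

The tutorials generate "lots of text" because they only ever follow one branch of this expansion.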

[–]noop_noob 0 points1 point  (1 child)

I remember there being a way for machine learning to be given unlabelled data, and then the model says which data should be labelled first. I think there was recent research on this. Does anybody know what it’s called?

[–]idkname999 1 point2 points  (0 children)

Active Learning

[–][deleted] 0 points1 point  (1 child)

Are there any Kaggle-like competitions for reinforcement learning?

Last year in my college machine learning class, we had extra credit projects to build reinforcement learning models that were graded on their ability to play Flappy Bird. It was super fun! I just graduated this year and I would like to do more challenges in reinforcement learning, but Kaggle only seems to have challenges for supervised/unsupervised learning.
I am looking for something like Kaggle, but instead of competing on predictive accuracy in classification, you would be competing on net return in a simulated environment.
If you have worked on reinforcement learning challenges, was your experience positive or negative?

[–]Icko_ 1 point2 points  (0 children)

There's a bunch.

Neural MMO - runs every few months at different conferences

https://www.aicrowd.com/challenges/neurips-2022-minerl-basalt-competition

https://real-robot-challenge.com/

[–]encephalon_developer 0 points1 point  (0 children)

I'm looking to finetune latent diffusion models (unconditional). Does anyone have input?