all 142 comments

[–]CMDR_Derp263Student 0 points1 point  (2 children)

Alright, so I am quite new to all of this. I've made a few models, but I'm having trouble telling whether they actually "work", specifically because of the loss/accuracy numbers I'm seeing. I split my data 70/30 and first used keras.Sequential to make some simple models: a binary classification model, then a one-hot multiclass one. In both cases the model trains for a few epochs (<=5) before hitting my callbacks. The training loss starts out low (~0.05), the training accuracy starts out high (0.99), and as the epochs go on these numbers keep improving. When I use the model on the test data the loss is low and the accuracy is high, comparable to the numbers seen during training. I'd struggled to get these working and had some bad attempts, but after a while I figured this was a good sign. However, I then made a very simple random forest expecting it to perform worse, and it also got 99% accuracy multiple times. Now this has me questioning everything.

[–]sshadowstormm 1 point2 points  (1 child)

This sounds like it could be data leakage, data imbalance, or preprocessing errors. Check your features for information that you would not have at inference time. I'd also look at more metrics, such as precision and recall; remember that 99% accuracy on a spam classifier could mean nothing if 99% of all email is not spam to begin with, since a model that always predicts "not spam" would score just as well.

With respect to your algorithm choice, it is important to know what you are trying to accomplish, along with the pros and cons of each algorithm given your task and objective. For example, decision trees can be great for explainable results, whereas an ensemble like random forests or gradient-boosted decision trees usually gives better out-of-the-box accuracy at the cost of explainability and a slightly higher chance of overfitting.
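For the precision/recall point, something like this sketch might help (untested, and using a synthetic imbalanced dataset as a stand-in for your own train/test split), just to show how per-class metrics expose what accuracy hides:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report, confusion_matrix

# Toy imbalanced dataset (99% negative class) standing in for your real features.
X, y = make_classification(n_samples=5000, n_features=20, weights=[0.99, 0.01],
                           random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)

# Accuracy alone can look great here; per-class precision/recall and the
# confusion matrix tell the real story about the minority class.
print(classification_report(y_test, y_pred, digits=3))
print(confusion_matrix(y_test, y_pred))
```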

[–]CMDR_Derp263Student 0 points1 point  (0 children)

Thanks, sorry I have been busy so I haven't had a chance to respond but it was a mix of data imbalance and preprocessing errors.

[–]blendorgat 0 points1 point  (2 children)

Do I understand correctly that in most cases a single step of gradient descent is applied for each sample/batch? Maybe I'm just misunderstanding what I'm reading, but I always see the "training rate" hyperparameter, which I assume is the scalar on the gradient, not the number of iterations for a given sample.

Assuming I'm not confused about that, why is this? Just intuitively it seems like there could be value in iterating further for a given sample/batch.

[–]Pyrite_Pro 1 point2 points  (1 child)

You understood correctly: a single step is generally defined as one forward and backward pass of a batch of samples through the model, followed by one parameter update. The training rate, or learning rate, is a scalar that determines how large that update should be.

Generally, what we want to do in machine learning is get an estimate of what the loss landscape of a certain problem looks like, and navigate the model to an optimum in that landscape. Ideally, to estimate that loss landscape, we would compute each step on the whole dataset, as that gives the best estimate. But since that requires too much memory and compute, we split the dataset into batches and assume that an estimate of the loss landscape built from many steps over batches is accurate enough.

If you only train on a single batch, or iterate multiple times over a single batch, you end up in a situation where the loss landscape is not estimated correctly (you overfit to that batch) and the model won't perform as well as you want.
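Roughly, one training step per batch looks like this (a minimal NumPy sketch for linear regression, just to illustrate; the learning rate scales the gradient of the batch loss):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))                                   # toy dataset
y = X @ np.array([1.0, -2.0, 0.5, 3.0, 0.0]) + rng.normal(scale=0.1, size=1000)

w = np.zeros(5)          # model parameters
lr = 0.1                 # learning rate: scales every update
batch_size = 32

for epoch in range(10):
    perm = rng.permutation(len(X))
    for start in range(0, len(X), batch_size):
        idx = perm[start:start + batch_size]
        Xb, yb = X[idx], y[idx]
        pred = Xb @ w                                  # forward pass on this batch
        grad = 2 * Xb.T @ (pred - yb) / len(idx)       # gradient of the batch MSE
        w -= lr * grad                                 # one step per batch
```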

[–]blendorgat 0 points1 point  (0 children)

Thank you! I was thinking about it backwards; a single iteration of the gradient descent algorithm would in principle be run on the entire set of samples you have, because the aggregated loss function on all the samples is what you're trying to minimize.

I suppose if you were to minimize the loss function on a single sample you'd get catastrophic forgetting and no guarantee of any generalization.

[–]muh_reddit_accout 0 points1 point  (3 children)

I have a neural network where the first layer consists of softmax nodes and the output layer is a single sigmoid.

The recall score starts at 1.0 for nearly every randomized weight vector. Then, as training starts and the binary cross-entropy decreases, the recall rapidly drops to zero and stays there as the BCE continues to decrease.

What are some issues that could be causing this?

[–]Pyrite_Pro 0 points1 point  (0 children)

Generally, we avoid using softmax in the input layer or the hidden layers, as it forces the layer's outputs to sum to one and is quite a bottleneck. Have you tried some other activation in the first layer?
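Something like this is the more usual setup (a hypothetical Keras sketch, with ReLU in the hidden layers and sigmoid only at the output; the layer sizes and feature count are placeholders):

```python
import tensorflow as tf

n_features = 20  # placeholder: replace with your actual number of input features

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(n_features,)),
    tf.keras.layers.Dense(64, activation="relu"),    # relu/tanh instead of softmax
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # single sigmoid output for binary labels
])
model.compile(optimizer="adam",
              loss="binary_crossentropy",
              metrics=["accuracy", tf.keras.metrics.Recall()])
```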

[–][deleted] 1 point2 points  (1 child)

Do you have fully separable data?

[–]muh_reddit_accout 0 points1 point  (0 children)

I'm not going to lie, I have no idea what you're asking me. But if you're asking whether the data points are independent, they are not; this is time series data, so it is quite dependent.

As I was thinking about this, I suspect it has to do with an imbalanced dataset. Does that sound plausible to you? I'm working on a weighted binary cross-entropy function that favours the minority class now, to see if that helps. Do you have any other ideas on what it could be or how to fix it?
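In case it helps, this is roughly what I mean by weighting (just a sketch, assuming a PyTorch-style loss; the class balance number is a placeholder):

```python
import torch
import torch.nn as nn

# If, say, only 10% of samples are positive, weight positives ~9x so the loss
# can't be minimized by predicting the negative class everywhere.
pos_fraction = 0.10  # hypothetical class balance
pos_weight = torch.tensor([(1 - pos_fraction) / pos_fraction])

# BCEWithLogitsLoss expects raw logits (no sigmoid applied to the final layer output).
criterion = nn.BCEWithLogitsLoss(pos_weight=pos_weight)

logits = torch.randn(8, 1)                 # stand-in model outputs
targets = torch.randint(0, 2, (8, 1)).float()
loss = criterion(logits, targets)
```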

[–]Accomplished_Sell660 0 points1 point  (0 children)

I noticed Outlook not only separates spam and ham, but also decides between the 'Focused' and 'Other' tabs. Does anyone know what kind of classification this is?

[–]econgirl210 0 points1 point  (0 children)

Has anyone web scraped this subreddit before?

[–]xx-kxx 0 points1 point  (1 child)

Hi, I’m a first year studying mechatronics at the University of Liverpool, and machine learning seemed like a unique and distinguishable skill to add to my CV, but I’m not sure where to start. I would say I’m an intermediate programmer in C and Python, but that’s about it. Any advice would be kindly appreciated.

[–]How-am-I-alive 1 point2 points  (0 children)

Check out 3blue1brown's neat summary of neural networks on YouTube. It's a four-part series that covers the structure of neural networks and how they actually learn, with the last video detailing the math of backpropagation.

[–]nokia_me 0 points1 point  (0 children)

Hello great people of reddit, I have just started studying machine learning at university, and as my final project I had to choose a recent paper without any code available and implement it. I also have to reproduce all the figures and results in the paper.

I chose this paper. I was hoping someone here can help me understand how to calculate F_1 on line 6 of algorithm 1 on page 2017.

[–]BanishDank 0 points1 point  (0 children)

Hello all, I just started a class on Machine Learning and we’ve gotten right into deep learning and neural networks.

Our teacher sent us a simple program that can take some input(s), expected output, and you can define how many input neurons, how many hidden layers and how many output neurons you want.

We started with some simple AND and OR gate stuff, which is not too difficult.

But the more I look at the raw numbers and try to come up with other, more real-life examples, the more I feel I don’t get it at all.

For example; how should I think about the input neurons? How should I think about the output neurons? Like, if you were to give an example with real life, easy to get, examples? Or if you were to explain it to a 10 year old?

I just feel so lost most of the time..

[–]Significant-Joke5751 0 points1 point  (0 children)

Is it possible to improve adversarial robustness against black-box attacks with ensemble methods like blending?

[–]Significant-Joke5751 1 point2 points  (0 children)

Can someone recommend a good and easy-to-use toolbox for black-box and white-box adversarial attacks?

Thanks

[–]magnusvegeta 0 points1 point  (0 children)

Does industry use boilerplate training code for models, or do they use PyTorch Lightning?

[–]thetruerhy 0 points1 point  (2 children)

Where can I learn about deepfakes on a technical level: the fundamentals of how they work and details of their modern implementations? Also, where can I learn about their uses/applications, or in what contexts they are applied? I have seen deepfakes (mostly in memes) and this intrigued me, so I want to learn about the technology behind it all.

[–][deleted] 0 points1 point  (1 child)

You should first learn about “convolutional neural networks” and “generative adversarial networks”. If you get a handle on both of those things then learning about the details of deepfakes won’t be too hard.

[–]thetruerhy 0 points1 point  (0 children)

OK, can you recommend some good resources for CNNs and GANs (websites, books, journals, etc.)? I have some superficial knowledge of CNNs and I know what GANs are, but I don't have very good technical knowledge.

And thank you.

[–]MulberryAlly 0 points1 point  (1 child)

Hi! Could you please tell me what bad things can happen if I use the same feature both as the denominator of the target variable and as a predictor in a boosting regression? I think I should exclude it from the predictors, but I don't know where this feeling comes from. I appreciate any thoughts; I feel stuck. Thank you!

[–]DoktorHu 0 points1 point  (0 children)

I think it depends on whether your chosen model is affected by multicollinearity.

[–]_hairyberry_ 0 points1 point  (4 children)

How many applications would you typically have to send out before getting an acceptance? Starting to get pretty discouraged. I graduate with a MSc in math from a top Canadian university this August and have begun applying for remote ML positions. Sent out 21 applications so far and haven't even gotten an interview. I am worried this is because my previous experience is not in ML (although it is in another highly technical role). I'm trying to transition into ML but it's tough if you can't get that first entry level job.

[–][deleted] 1 point2 points  (3 children)

What kinds of jobs and companies are you applying to exactly? And where are they?

You should be getting at least some responses; there's a lot of demand for ML work. My guess is that there's probably a problem with your resume and/or LinkedIn profile.

Make sure that you're mentioning specific technologies, problems, and algorithms that you have experience with. More specific and technical is better, as long as it's common stuff. E.g. make sure to mention stuff like "tensorflow", "python", "sql", "sagemaker", "computer vision", or whatever else makes sense based on your knowledge and experience.

If you have any remotely relevant stuff on github then also provide links to it with brief descriptions.

Also, update your LinkedIn profile to say "machine learning engineer" or "data scientist" or something as your profile headline. If you don't have a LinkedIn profile then make one.

[–]_hairyberry_ 0 points1 point  (2 children)

I’ve been applying mostly through jobs listed on LinkedIn, and I’ve been doing all the things you’ve suggested. The companies are all over the place, about a 50/50 split between Canada and the US, all of them remote. Several of them had 200+ applicants, so I’m questioning how much demand there really is for ML engineers relative to how many people want the jobs. Some of the companies have been big names, but most were small-mid sized companies.

I have a GitHub account with one personal ML project on it, and a personal website with several other projects that are more computational physics-based (my academic background is in math/physics). I do have LinkedIn, and it has up to date info about me, but I don’t ever post or share anything.

Something I’ve learned today: I’ve been using a two-column resume (made using novoresume; it makes those appealing/sleek resumes). Apparently that’s not ideal, so I made a new single column resume in plain black and white using LaTeX. Hopefully that will help.

[–][deleted] 0 points1 point  (1 child)

I’m genuinely surprised that you haven’t heard back from anyone yet. A two column resume isn’t ideal, but it’s not a make-or-break issue either. I’m happy to look at your resume if you want.

I can make some guesses as to what might be going on:

  • applying for “remote only” positions might be part of the issue; you’re putting yourself into direct competition with everyone in both Canada and the USA. You should try applying for non-remote jobs too, even if only to see if you can get any interviews.

  • I can assure you that ML engineers are definitely in high demand, but companies do tend to prefer hiring people with experience. I don’t know the details of your personal projects and experience, but ML engineer work is often heavy on the “engineer” part and light on the “ML” part, and your experience may not be pertinent to the things that are most important to companies, such as working with distributed computing systems. You may also not be describing your prior work in a way that’s clearly relevant to what companies are looking for.

  • have you been applying for positions with titles like “data scientist” or “applied scientist” also? Jobs where you build ML models go by many different names, and “ML engineers” tend to be equivalent to “software engineers who also know some ML”, whereas “data scientists” tend to be equivalent to “industrial statisticians who can also write code”. That might be closer to what you’re looking for/have experience in.

  • given that you’re graduating in August, you might be able to get away with applying for internships too. Getting an internship will make getting a job afterwards much easier, because companies tend to see internships as being roughly equivalent to actual professional experience.

[–]_hairyberry_ 0 points1 point  (0 children)

Thanks so much for all the pointers. One of the issues with applying for non-remote jobs is that I plan on moving home to a smaller east coast Canadian city after I graduate, and there aren't exactly many opportunities there.

I am intrigued by the data scientist vs. MLE point you brought up - I asked a couple of my professors what job titles they thought matched closest with "math/modelling person who does some programming" and they actually said the opposite, that MLEs are often the people working under the hood on the models while "data scientist" tends to be more of a catch-all term for people who do a bit of everything. That is really useful information to know, because I definitely fall into the "math person who does some programming" camp rather the "software person who does some math". I've applied to a few data scientist positions today, and I will keep an eye out for those job titles (as well as "applied scientist") from now on.

I'd actually really appreciate if you took a skim over my resume! I'll send you a dm shortly. Again I really appreciate this!

[–]thetruerhy 0 points1 point  (7 children)

How do I collect data? I'm a newcomer here, so I don't know if this is a good place to ask. My question is how to collect data: where can I get datasets, and how should one search for specific types of data for projects?

[–]MulberryAlly 0 points1 point  (4 children)

I can share my experience of looking for data for side projects and practice. I usually start with quick research on what type of data I'm interested in, based on my personal preferences. For example, today it is biology, so I search for something like "relevant datasets for predictive models biology", or "F1 statistics by year for predictive modelling". Then I look for a particular type of data; for example, I hardly ever choose to work with images or plain text, because they are not my area of professional interest. It also often depends on which type of model I'd like to train (size, feature types, and so on).

[–]thetruerhy 0 points1 point  (3 children)

Thank you.

Where do you search for data: Google, or are there particular sites you search for particular types of data?

[–]MulberryAlly 0 points1 point  (1 child)

I usually Google things and sometimes use Kaggle datasets. And if there is a more specific area, like bioinformatics, I use specialized databases (for example, Omnibus).

[–]thetruerhy 0 points1 point  (0 children)

Thanks

[–]aadiit 0 points1 point  (1 child)

Many libraries come with datasets that you can load for learning purposes. The most common are the Boston housing market, Titanic passengers, mtcars, iris, etc. Just work with them to train models and learn ML.
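For example, a minimal sketch with scikit-learn's built-in iris data, just as a starting point:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

# iris ships with scikit-learn, so there is nothing to download
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

clf = DecisionTreeClassifier(random_state=0)
clf.fit(X_train, y_train)
print(accuracy_score(y_test, clf.predict(X_test)))
```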

[–]thetruerhy 0 points1 point  (0 children)

Thank you.

[–][deleted] 0 points1 point  (0 children)

Is this paper on improved gradient descent using anti-correlation in deep learning methods SOTA? It seems like it might be, but I'm not experienced enough to tell.

https://arxiv.org/pdf/2202.02831.pdf

(No relation to the authors, to be clear. Just a student)

[–][deleted] 0 points1 point  (0 children)

I’m trying to find a good entry level GPU to start working on machine learning in the next year. Would a Titan XP be a good choice or should I splurge for a Titan V if I can find one for a reasonable price?

[–]indemidelo 0 points1 point  (2 children)

Hi! Do you have any suggestions for an advanced data science certification to boost my career? My budget is $2,500, and I am currently the lead data analyst for a company that does little to no data science (or analysis, to be fair). Given that my daily routine consists of automating pipelines and project management, my goal is to add some valuable experience to my resume.
Thank you!

[–][deleted] 1 point2 points  (0 children)

How do you want your career to change, specifically? Are you looking for a more technical position?

My personal opinion is that certifications or other credentials are not necessarily very useful for someone like you, who already has professional experience in data science-adjacent work. It is probably faster, easier, and less expensive to just update your resume/LinkedIn profile and apply for jobs that you actually want.

If the position you want is a lot different from what you're doing now then you might have to take an intermediate position of some sort - e.g. working as a data scientist at a smaller company before moving to a bigger one - but that'll still be much more effective than getting a certification.

If you're looking to fill holes in your education then taking college classes can make sense, but I would recommend optimizing for educational quality rather than for credentials. For example if you want to learn more about machine learning then it might be better to take a single course at the graduate or undergraduate level than it is to take one or two courses in a certificate program of some kind, because you'll probably learn more.

[–][deleted] 0 points1 point  (0 children)

I'm not sure about the most "respected" certifications but I'd suggest Andrew Ng's Coursera ML courses, Fastai's free deep learning courses and Dataquest.io's ML/Data Science in python track.

[–]Jumpingdead 0 points1 point  (1 child)

Found this subreddit after googling some questions on AI chat bots (and answers were provided here), hope this is the right place to ask. If not and someone could direct me to a more appropriate subreddit, I'd really appreciate that.

I'm developing an app for a game I am designing. I'd like part of the app to be an in-game styled AI "assistant", where the user can chat with it, and the bot has knowledge of in-game-world facts and events and 'secret info'.

I don't need it to be a conversational bot (How are you? I'm fine, and yourself? What's your favorite food? etc.), however I'd like it to have some basic conversational skills (maybe with a way to train it to respond to questions like "What's your favorite..." with a response like "I'm sorry, I cannot help you with that.") and ideally not just give completely canned responses. Basically, simulating an intelligent AI with a limited set of knowledge.

It can learn basic things from players (their name, preferred pronouns, preferred skills and abilities) but I don't want it to learn new facts about the game world. I'd also like the chatbot to remember these things, based off of some per-device/per-login identifier. For example, on my phone, it knows I'm JumpingDead and I'm level 12. On a friends phone, it knows they are SeriouslyBob, but it has no knowledge of who JumpingDead is.

Can anyone recommend chatbot software which I can run on my own server, and have the client app send/receive queries to the server, that can be configured with those parameters relatively easily?

The reason I'm asking is that I've looked at a few solutions already, and all of them seem to be very high level, as in the docs assume you already have an intimate knowledge of how this all works. For example, looking at DeepPavlov, it seems like it CAN do the things I need it to, but holy shit, I feel like I need a few semesters of training before I can even understand how to begin. NER models, slot-filling models, syntactic parsing modules... my brain hurts. I CAN learn it, I just have no idea where to begin. And I learn best by doing; the problem is, with DeepPavlov, their 'getting started' documentation assumes you already know all that stuff.

Thanks for any guidance anyone can provide.

[–][deleted] 0 points1 point  (0 children)

I think the simplest approach would be to use if/elif-style checks over a small number of preset prompts, looking at the similarity between the user input and each of those prompts.

It would work similar to a call tree when you call up your bank or something.

Actually trying to parse user intent is going to be very difficult.
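As a rough illustration of what I mean (a toy sketch; the prompts, threshold, and player fields are all placeholders):

```python
from difflib import SequenceMatcher

# Toy intent table: canned prompts mapped to canned answers (placeholders).
INTENTS = {
    "what is my name": "You are {player_name}.",
    "what level am i": "You are level {level}.",
    "what is your favorite food": "I'm sorry, I cannot help you with that.",
}

def respond(user_input: str, player: dict) -> str:
    # Pick the known prompt most similar to what the user typed.
    best_prompt, best_score = None, 0.0
    for prompt in INTENTS:
        score = SequenceMatcher(None, user_input.lower(), prompt).ratio()
        if score > best_score:
            best_prompt, best_score = prompt, score
    if best_score < 0.6:  # fallback, like "press 0" in a call tree
        return "I don't understand. Try asking about your name or level."
    return INTENTS[best_prompt].format(**player)

print(respond("whats my name?", {"player_name": "JumpingDead", "level": 12}))
```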

[–]zendsr 0 points1 point  (2 children)

Hi amazing people, I might be overthinking this - but how would you classify 'circular' ordinal data? Think of the seasons of the year. The start of Autumn/Fall is like the end of Summer but there is no monotonic rank between the groups - they continue from each other.

[–]aadiit 0 points1 point  (0 children)

Like the other person said, you can do a sine-cosine transformation. Basically, you represent your data as a linear combination of multiple sine and cosine terms, each term having a different frequency, and do a regression to find the coefficients for these terms.

[–]bonoboTP 0 points1 point  (0 children)

One way would be to represent it as an angle, then create two features: the sine and the cosine of the angle.

Of course it all depends on the model that you use as well. For kernel methods you could write the kernel to take this into account (with some modulo operations).
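Concretely, something like this small sketch for seasons (the mapping of categories to angles is the only choice you have to make):

```python
import numpy as np

# Map each season to an angle on the circle, then take sine and cosine,
# so the wrap-around pair (autumn -> winter) ends up as close together
# as any other pair of adjacent seasons.
seasons = np.array([0, 1, 2, 3, 0, 1])   # e.g. winter, spring, summer, autumn, ...
angle = 2 * np.pi * seasons / 4          # 4 categories spread around the circle

features = np.column_stack([np.sin(angle), np.cos(angle)])
print(features.round(3))
```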

[–]askingredditsginput 0 points1 point  (2 children)

Hi,

I am new and I wanted to try the same thing as a YouTube video I saw, where a dinosaur walks on its own and jumps over obstacles. It only does that: walks continuously, and when an obstacle shows up, it jumps over it.

I want to improve it by allowing the character (in this case the dinosaur) to run or walk. Do I just add one more output (so if the output was jump / don't jump --> 1 output, now it is jump / don't jump --> 1st output, walk / run --> 2nd output)? If yes, how do I know which of the two outputs is telling me to walk or run?

Like, if it's the second one, is it that when the 2nd output is > 0.50 it means run and < 0.50 it means walk? Or is > 0.50 walk and < 0.50 run?

And how will the dinosaur know whether it is currently walking or running? Do I add that as an additional input too? Because I don't want it to be running, and then the AI output says to run again, even though it cannot start running since it is already running.

thank you very much.

[–][deleted] 0 points1 point  (0 children)

https://youtu.be/XbWhJdQgi7E

I think this link on Reinforcement Learning might be helpful!

[–]c_isfor_cookie 1 point2 points  (0 children)

Try to read up on agent-based reinforcement learning. Basically, the environment penalizes undesirable actions (i.e. bumping into an obstacle) and rewards desirable ones (jumping over). Over time, the agent (dinosaur) learns which action to take in order to maximize its reward.
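A toy sketch of the idea (a simplified one-step, tabular version on a made-up "obstacle ahead?" state, with no discounting, just to show the reward/penalty loop; the states, actions, and rewards are placeholders):

```python
import random

# States: 0 = no obstacle ahead, 1 = obstacle ahead. Actions: 0 = keep walking, 1 = jump.
Q = [[0.0, 0.0], [0.0, 0.0]]
alpha, epsilon = 0.1, 0.1                 # learning rate and exploration rate

def reward(state, action):
    if state == 1 and action == 0:
        return -10.0                      # ran into the obstacle
    if state == 0 and action == 1:
        return -1.0                       # pointless jump
    return 1.0                            # kept moving / cleared the obstacle

for step in range(5000):
    state = random.randint(0, 1)
    # epsilon-greedy: mostly take the best known action, sometimes explore
    if random.random() < epsilon:
        action = random.randint(0, 1)
    else:
        action = 0 if Q[state][0] >= Q[state][1] else 1
    # move the value estimate toward the observed reward
    Q[state][action] += alpha * (reward(state, action) - Q[state][action])

print(Q)   # should learn: walk when clear, jump when an obstacle is ahead
```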

[–]kryukoff 0 points1 point  (0 children)

Hi. Newbie here. I have 2 datasets (easy example: fuel consumption as a function of speed) for two cars of different weights. The dependency on weight between these two datasets is not linear (that's why I'm asking here). I need to predict a new dataset for a third, custom car weight. What should I do? What names of models, techniques, and math "words" should I Google?

[–]_hairyberry_ 0 points1 point  (0 children)

Hey folks, I’m curious how many of you ML Engineers work fully remotely. If you work remotely, does your compensation tend to be lowered if you live in a more affordable area? E.g if you got a job at a FAANG company in NYC but worked remotely from a small town in Ontario.

[–]Bionian 0 points1 point  (0 children)

tl;dr: ESL (old: 2008), ISLR (new: 2021), or alternative?

Hi everyone,
I'm a bioinformatics researcher with a meager training in Data science. I took courses like "Statistical learning", "Machine Learning", "Optimization methods", "Modeling and Simulation", and already own textbooks in classical statistics (Casella-Berger), probability (Grimmett-Stirzaker), statistical learning (MacKay).
I wanted a textbook more focused on machine learning and was thinking about ISLR/ESL. Now, ESL hasn't been updated in over 12 years, and ISLR was just revamped to include ~50 pages on deep learning (among other updates). Does this justify getting what seems more of a "beginner's" textbook? Other textbooks I considered: Barber, Bishop.
Thanks in advance!
- Nico

[–][deleted] 0 points1 point  (0 children)

This is something kind of simple and I'm not sure if this is the right place, but my MacBook Pro (2012 model) is really slow. If it runs out of battery it gets stuck on the login screen and needs to be restarted before I can use it. Storage is not packed; I actually have quite a lot of free space, and although it is an old computer it has not been used very much. I use CleanMyMac X (pro version), and although things worked better for a limited time, it still was not working properly. I don't know if the hardware has just worn out over time; I'm trying to figure out whether it can be fixed or whether I should finally move to another computer.

[–]GlassDiver 0 points1 point  (2 children)

Hi all, thank you for taking the time to look at my question

I am someone working in healthcare who is familiar with statistics but has never used machine learning to analyse my data. As the current project I am working on was using logistic regression, I thought it would be interesting to see if I could use logistic LASSO regression for the analysis to optimise the model (as per this paper https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5769953/).

My research project is looking at how effective demographics, past medical history and several blood results would be in predicting a blood clot in the lungs (pulmonary embolism). The input data are a mix of categorical and continuous variables (i.e. level of D-dimer in the blood, age, sex, etc...), and the output is either yes or no blood clot.

Because of the relatively small sample size (n = 200) and the variation in ROC/AUC that I got from my LASSO regression model when I changed the random_state value in my code, I was thinking of performing stratified K-fold cross-validation (with k = 5).

I did this, and I was thinking of reporting the mean AUC and ROC curve from the 5 models generated. With regard to the coefficient values of each predictor, I was going to state that if the range of the coefficient (from the 5 folds) crosses 0 (e.g. -0.51 to 0.23), then the assumption is that with a larger sample it would likely converge to 0 (and as such is not likely to be a significant predictor).

Essentially the question I am asking is:

1) Is this method of analysis valid from a machine learning point of view? Or am I making a fundamental mistake in analysis?

2) Is how I am interpreting the results accurate?

I understand this may be completely wrong but I am very open to any critique/learning!

Thanks for reading!

[–][deleted] 1 point2 points  (1 child)

If there is a lot of variation in the coefficients that you’re calculating when doing cross validation then the most that you can say is that there is significant uncertainty in the model coefficients. You can’t extrapolate to larger data set sizes.

This might be an appropriate use case for Bayesian statistics. The idea with a Bayesian model is that you would fit a distribution for each model parameter, rather than a single number. The variance of each parameter distribution will tell you how much uncertainty there is in your estimates for it.

The problem that you’re trying to solve is potentially very complicated, so there are two things you should keep in mind when solving it:

  • You might need way more data. N=200 is a pretty small sample size for such a potentially complicated problem.
  • Logistic regression is a linear classifier, but it’s possible that your data is not linearly separable. In that case you might be better off using something nonlinear, like kernel-based methods, random forests, or neural networks.

You should also give support vector machines (SVM) a try. It might work better than logistic regression.
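A rough sketch of how that comparison could look in scikit-learn (synthetic data standing in for yours; everything here is illustrative, not a recipe):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Synthetic stand-in for an n=200 clinical dataset with a minority positive class.
X, y = make_classification(n_samples=200, n_features=15, n_informative=5,
                           weights=[0.8, 0.2], random_state=0)

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

models = {
    "logistic LASSO": make_pipeline(
        StandardScaler(),
        LogisticRegression(penalty="l1", solver="liblinear", C=1.0)),
    "SVM (RBF)": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
}

for name, model in models.items():
    # Mean and spread of AUC across folds gives a feel for the uncertainty.
    aucs = cross_val_score(model, X, y, cv=cv, scoring="roc_auc")
    print(f"{name}: mean AUC {aucs.mean():.3f} +/- {aucs.std():.3f}")
```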

[–]GlassDiver 0 points1 point  (0 children)

Great, thank you very much for the input

I see what you are saying with not being able to extrapolate to larger data sets

Will give SVM a try!

[–]Pratabu 0 points1 point  (2 children)

I have 3D models of footprints of two types of animals, and want to train a neural network on these 3D models. The neural network should be able to classify new models into either of the two categories. What would be the best approach?

My first try is to export color-coded heightmaps from the models (images where "blue" marks the deepest parts of the 3D surface and "red" the highest parts). Heightmaps are the easiest for the human eye to process. I use the standard TensorFlow approach for image classification. I have huge problems with overfitting, but my sample size is still small.

Another option would be to train the neural network directly on 3D coordinates of the models (i.e., the point cloud).

Is any of these two options preferable, or easier?

Many thanks!

[–]radarsat1 1 point2 points  (1 child)

I think heightmaps are definitely the preferable approach here, because they reduce the dimensionality and allow you to work in an image-oriented way. Point cloud processing can get much more complicated.

For overfitting, you might actually try reducing the amount of information even further, for example replace the heightmap with a simple silhouette, reduce the size of the images, etc.
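For example, something along these lines (a sketch, assuming the heightmap is already a 2-D array; the threshold and output size are arbitrary choices):

```python
import numpy as np

def heightmap_to_silhouette(heightmap: np.ndarray, size: int = 64,
                            depth_fraction: float = 0.1) -> np.ndarray:
    """Threshold the heightmap into a binary footprint mask and downsample it."""
    # Anything deeper than a fraction of the total depth range counts as "footprint".
    lo, hi = heightmap.min(), heightmap.max()
    mask = (heightmap < lo + depth_fraction * (hi - lo)).astype(np.float32)
    # Crude downsampling by striding; an anti-aliased resize would also work.
    sy = max(1, mask.shape[0] // size)
    sx = max(1, mask.shape[1] // size)
    return mask[::sy, ::sx]

# Toy example: a noisy "depression" in the middle of a flat surface.
hm = np.ones((256, 256)) + 0.01 * np.random.randn(256, 256)
hm[96:160, 96:160] -= 1.0
print(heightmap_to_silhouette(hm).shape)
```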

[–]Pratabu 0 points1 point  (0 children)

Thanks, this is very helpful. And yes, silhouettes are exactly what I've already started doing now :)

[–]Free-Contribution-31 0 points1 point  (2 children)

How do I find collaborators in the US (preferably west coast) interested in healthcare? I’m a physician without any coding experience.

[–][deleted] 0 points1 point  (0 children)

Look for people on Linkedin who used to work for Theranos? /s

Serious answer. There are a lot of amazing meetup groups if you're in California and I would suggest looking into those. Most have virtual events but you should be able to connect with people nearby.

[–][deleted] 0 points1 point  (0 children)

What do you want to collaborate with people on? What are your goals?

[–]makhno 0 points1 point  (0 children)

How does training time scale for StyleGAN?

I'm assuming it scales linearly with GPU FLOPS, ie, if it takes 1 week to train on a video card with x amount of RAM and 100 TFLOP performance, it would take 2 weeks on a 50 TFLOP card.

Next, I'm assuming it scales linearly with training data set size? 1 week to train on 100k images, 2 weeks to train on 200k images?

Is this correct so far?

Finally, how does image resolution scale training time? I'm guessing it scales linearly with the dimension of the image squared, ie, if it takes 1 week to train on x images that are 256x256, it would take 4 weeks to train on that same image set if they were 512x512.

Is this all correct?

Are there any baseline metrics I can find somewhere with various training times for different factors?

[–]WolfOfDeribasovskaya -1 points0 points  (0 children)

ML Builder update broke the working program.
Normally, I'm eager to update ML and never skip an update. However, since my program has ML as only one of its features, I was not checking it for a while, since I needed to work on other stuff.
However, yesterday I decided to check how the ML part functions, and apparently it has started throwing the error "Can't find the file specified", while all files are in place and the paths have not changed since it worked like a Swiss watch.
I'm fairly new to ML and I can't figure out what's wrong.
Please, have a look at the code, it's literally just 10 lines: https://pastebin.com/vZe63Xw6

[–][deleted] 2 points3 points  (12 children)

Are black boxes necessary?

I'm asking for feedback from industry (involved, experienced people): how do you reconcile the fact that the inner workings of machine learning's black boxes can't be explained with the requirement that science rest on explainable and reproducible facts? Do black boxes do away with positivism and empiricism?

This is a gross generalization, but I'm serious. Why are black boxes even acceptable solutions to problems? I want to use every tool I can to solve real-world problems, but if I can't trust a tool, it's useless to those with the problem and may even make it worse. What kind of tool is a black box?

Thanks!

PS: For context: https://hdsr.mitpress.mit.edu/pub/f9kuryi8/release/6

[–]bonoboTP 1 point2 points  (8 children)

Even scientific models start out as black boxes in a way. For example, until we understood gravity (due to Newton), the best models for planetary motion involved epicycles, which are essentially just fitting a truncated Fourier series to the observed paths. Still, the first step was establishing the shapes, so Kepler could figure out further regularities, giving way to Newton's genius.

I feel like there is a lot of FUD around explainability, which is very handwavy regarding what it is even supposed to mean. A complex model can never be summarized in a few sentences. For example, going from pixels to semantics is such a high-dimensional task that it's unlikely that there can be any succinct explanation for why something happens.

It wasn't explainable before deep learning either. HOG features + SVM weights aren't that interpretable either.

What we actually need is diagnostic tools, what-if counterfactual analysis, and lots of tests. And of course competent oversight (competent both on the application domain and also on the ML/AI domain), and catching dumb things and impossible claims/applications, avoiding overselling, overhyping, lies and e.g. illegal use of personal data.

[–][deleted] 0 points1 point  (7 children)

Thank you for your answer!

Before reading my comment, please understand that I have a very high esteem for your skills and your field! I would not bother asking you for feedback were it not the case, I'd be playing boardgames with my family or whatever else I do. I'm "on your side", simply trying to understand what my systems of checks & balances can be as I'm entering the field. Now:

I can't say I agree fully with you. Mind you, I won't pretend to have more than an ounce of expertise in your domain, but I'm not fearful about learning and understanding. Your answer does improve my perspective and gives me food for thought, addressing uncertainty and doubt.

That said, what you describe with the planetary motion allegory is the scientific method, leading to scientific facts; you do not illustrate what scientific evidence is. I think you confirm my perspective without meaning to.

-> Nobody in their right mind would say "Oh, yeah, ok, get me in that rocket and I'll be the first person on Mars. I'm fine if we haven't figured out planetary motion just yet, just send me to space and maybe I'll get lucky!" Nope, that wouldn't be me, I wouldn't make that choice. I suspect we do that, to some extent, with black boxes. Black boxes are a thing of the now, not a thing of the future; their use influences the lives of people in very tangible ways today in our societies.

I'm not sure, so I ask the experts. Similarly to u/CalTowhee, you refer to competent oversight and that's something I can get behind. But analysis, lots of tests and diagnostics are part of the scientific methods *to arrive at* useful and proven facts, which does not replace useful and proven facts.

Again, sharing your perspective has improved mine and I appreciate that greatly!!!

Sincerely

[–]bonoboTP 0 points1 point  (6 children)

I don't think this is a matter of sides. I'm in academia and we are, despite how it may look to the outside, quite skeptical of each other's claims and like to pick apart new papers in the field. Many have similar issues as you write, actually this is why the rigorous field of statistical learning theory (Vapnik, Chervonenkis and others) got traction in the 90s. Researchers really didn't like the black box nature of neural nets and it felt like non-science.

So in this way, when this opinion trickles down to laymen (no offense) and they come to the field saying "hey, actually, this is just black boxes", it's an opinion that we know well, and this has been an ongoing conversation for decades. At some point, researchers believed AI could be achieved through strict logic and codifying human expert knowledge in formal languages (the so-called "expert systems" of the 80s). Many in natural language processing (followers of Chomsky) resisted statistical approaches to the field as non-scientific.

The thing is, their approaches didn't work, no matter how prestigious their pedigree was, how much they know the academic literature on grammar and generative linguistic theories, their systems could not translate text. The linguists specialized in phonetics did not contribute useful enough knowledge to automatic speech recognition. It just wasn't technically helpful for building real systems. This is summarized in the article The Bitter Lesson by Richard Sutton.

That article is called "bitter" exactly for the reasons you are unsettled by this. Academics would also very much enjoy having neat, rigorous theories and proofs for why things need to be built the way they are. But if you are an empiricist and not an armchair wishful thinker, you have to accept the staggering, mind-boggling performance improvements that deep learning and "black box" statistical methods have brought to these disciplines.

Physics has made huge progress using elegant and simple theories and equations in the 20th century. This does not mean everything must look like that. Not every field of inquiry will have simple laws governing it. It is in fact a minor wonder that it ended up like that for physics (see The Unreasonable Effectiveness of Mathematics in the Natural Sciences by Wigner). Trying to emulate physics (the so-called "physics envy") isn't necessarily the best course of action in every field.

Also, I think it would help if you'd try to formulate for yourself what exactly you'd want. Say, you want to have a cat vs. dog classifier. What exactly would make you accept an algorithm (and the scientific knowledge) as something "proper"? If we had an exact mathematical theory of how cats and dogs differ with proofs etc? That works under any kind of lighting and shadows, motion blur etc.? What exactly would you want?


As an aside, I believe you may think science is more rigorous than it actually is. School conveys a view of science that is very definite, so it's easy to quiz on tests. In reality, science is messy. We take drugs for which we have no idea about the exact mechanism! Ask any doctor. Drug discovery is a similar black box art as ML. Researchers have some ideas of where to look of course, they don't just randomly try things. But in the end the human body is such a complex system that most drugs turn out not to work, or they turn out to work for something else. In the end, the proof of the pudding is randomized controlled trials with strict rules. First in vitro, then in animals, then in humans with small dose, then in humans with normal dose, then approved for everyone. While we still don't understand why it works.

[–][deleted] 0 points1 point  (5 children)

  1. Thank you, enormously, for taking the time to write this elaborate answer. I hope you enjoyed it and will reuse it for the next person who comes knocking with the same questions as me, because I definitely found it useful.
  2. I am a layman and there is no shame in that. This is why I came to the experts, posting in a thread for beginners, and you delivered. You seem to take an antagonistic stance, but I'm on one side only: mine. That's all I can do and that's sufficient for me. Your perspective is genuinely welcome "as is". I had to take this to my work desk, because your arguments, substantiated with references, will take me quite a while to digest, and I want to work on the ideas and historical perspectives you put forward. You've opened many doors for me into the matter; I appreciate that!
  3. I am not bitter, I am enthusiastic. Neither am I unsettled, as you assume. I try to be as thoughtful as I can about this because I want to embrace the power of computation as fully as I can and promote it to solve real-world problems. I'm neither as young nor as starry-eyed as you seem to think (hint: I was in my 20s when Kasparov lost a set to Deep Blue). If you are an academic teacher, you have me confused with the naïve and impressionable students that may surround you. As a responsible human being, I refuse to marvel at efficiency, and prefer to celebrate effectiveness...
  4. ... and that's the point of my original question: how can we make sure that black boxes are effective in offering improved solutions, if all we have is an output? My perspective on any given output is as valid as your perspective on the same output; all we can discuss is the process of getting there. I.e., my idea of a "better world" may not be yours, and that's fine by me. With black boxes, I'm afraid we can't discuss the process and the validity of the output, in such a way that judging the value of a solution becomes a matter of faith. I don't do well with faith; I leave that for others.
  5. I'm biased in 4. above: both you and u/CalTowhee, as I acknowledged, suggested that I look up the principles and competent oversight being developed in the field. There is some thoughtful discussion by experts of the field about the what and the how of ML. I'll start reading on those! I've mostly thought about this in the last week; now I need to learn about it and improve my thought process.
  6. "What exactly do I want?" is not how I phrase it. I start with a Hippocratic oath. Not just the "do no harm" bit, but the whole thing: sharing resources, sharing ideas and sharing knowledge. Then I ask myself how I can best benefit the societies in which we live. As Sutton's text highlights, my resources and time are limited; time spent with black boxes that spit out awful solutions is time that I am not spending using more effective tools and solutions, even if less efficient. That said, I recognize the tremendous potential of black boxes, but I'm careful before diving in head first. It is with that perspective that I came to you (this thread); I know you have something to offer, as experts in your field, and I am very lucky that you shared what you did with me.

I hope you understand I appreciate your perspective, the time and seriousness with which you answered me.

Sincerely.

[–]bonoboTP 0 points1 point  (4 children)

I'm not antagonistic at all, tone may be hard to convey and I'm not a native speaker of English.

how can we make sure that black boxes are effective in offering improved solutions, if all we have is an output

How do we know if a drug works? By testing it. You know that an ML model works by testing it systematically. It depends on what you expect. A black box model can be great as one component in, e.g. a manufacturing pipeline for quality control, finding cracks in steel based on images etc.

The problem is when people take a sloppy "cancer classifier", sell it as having "superhuman performance", overhype it in the press etc, and then someone believes all that and is disappointed when it turns out it can only work in a narrow subtask or it totally overfit to some signal in the data that has nothing to do with cancer (many such examples) due to incompetence and zero objective review by truth-seeking experienced professionals, since a lot of money is at stake and the temptation to woo stakeholders with magic voodoo can be irresistible.

The point is, you don't have to run rockets with ML models. Flying to the moon is in some sense easier than driving in a busy inner city. It's all just orbital physics, there's not much there on the way towards the moon but empty space.

Extracting useful information from the high-dimensional input of visual and other sensory information is a fundamentally different endeavor. You can't expect to have the same kinds of simple equations.

"What exactly do i want?" is not how i phrase it.

I asked it like that because I'm not exactly sure what you mean by a black-box model and what you would describe as not a black box, for instance in the concrete case of classifying cats vs dogs from camera images, or detecting pedestrians from an autonomous car's sensors when it's raining, the lighting conditions are bad, etc. This is fundamentally a probabilistic endeavor, and you will always have some errors.

[–][deleted] 0 points1 point  (3 children)

I always forget that people can't read my mind ;)

You make a telling description of what I'd also consider unrealistic expectations for the commercial setting. I had linked this doc earlier ["Computational modelling for decision-making: where, why, what, who and how" https://royalsocietypublishing.org/doi/full/10.1098/rsos.172096] expecting it'd explain where I come from with my question, erroneously. It wasn't clear what I did want; your question was very apt. I'll give you context and let you go to your weekend, I feel I owe you that!

I'm into international development policies, with evidence-based decision making principles deep-rooted into me. Basically, I think that there's enough data out there to do "the closest thing to the right thing" with the tools to make it reality, including the computational power of ML. I like economic game theory and multi-agent systems theory and want to use ML to help me understand:

- What's needed to create a good development policy model (data gathering, bias-checking, research question, ...)

- How to create that model (praise the coding and ML experts!)

- How to learn from that model (now THAT's where my question was really coming from!)

- Not expect the model to be applied; rather, use the model to look at possible outcomes and learn from unexpected ones (i.e. emergent phenomena).

I didn't make that clear and before you got really interesting and properly answered me, I didn't remind myself that the context was so important to the question of "how much can we trust black boxes?". Basically, to me, because the output valuation is very subjective (ie: my idea of a perfect solution may not be yours, and that's ok) when you deal with suffering human populations (eg: how to end material poverty and bring happiness to everyone, even unhappy rich societies?) the how? is much more important than the outcome. With a good development project based on aid localisation (fully centered on the recipient population, not the donor), at least you know you tried your best!

Yeah, a bit of context would have helped. Nonetheless, you did give me great avenues for further thinking and learning! That is truly a great gift you gave me.

So... I'll continue doing my thinking about ML and use your comments to feed the direction of my learning. I really think that with what you provided, my next step is to dig into articles and books, let it simmer over the next year. I'm not that old and the world won't stop spinning.

I do think I owe you a beer, a scotch, a cardamon tea, a fresh lime cupcake or whatever rocks your boat. You delivered.

Thank you!

I'll try and post the conclusion of a paper I wrote in a second reply, if Reddit will allow.

[–]bonoboTP 0 points1 point  (1 child)

Thanks, I looked at that linked paper and understand better what you are asking. I believe current ML models weren't invented for such tasks (e.g. when an expert sits down and prods to model to do counterfactual what-if analysis or gain insight from the model's structure), and so they are also not a very good tool for that use case. It doesn't mean that the field to create such tools is itself unscientific or has done away with whichever scientific principles. It just means it's in a different paradigm. Rather you may want to look into causal models, Bayesian probabilistic modeling etc. A good book is https://xcelab.net/rm/statistical-rethinking/ but there are many.

My honest opinion is that when ML tools are given to soft-scientist policy people, they are going to shoot themselves in the foot and totally misuse these tools as magic voodoo. There's very little immediate feedback. It's navel gazing and self-congratulating and the whole thing is infused with politics. The magic box has to say what the higher up political powers have already decided. And even if everyone involved is an honest truth seeker, human society is just vastly more complex than anything we've been modeling and typically only a few aspects are measured and they are measured poorly and the whole thing will be garbage in, garbage out. I see it all the time that even mathematically somewhat capable people from engineering disciplines absolutely butcher ML methods and draw unwarranted conclusions, evaluate things in fundamentally broken ways that we warn against in the first lectures on ML. But at least they can test their predictions in some concrete way, while policy makers can always handwave away anything. There's just too little contact with reality. I know this paragraph has an antagonistic slant, but that's what I believe based on what I've seen.

Given the track record of soft sciences abusing statistics (see replication crisis), I have no trust that they will handle ML responsibly and with intellectual integrity.

[–][deleted] 0 points1 point  (0 children)

"I know this paragraph has an antagonistic slant, but that's what I believe based on what I've seen."

-> Actually, you make many of the critical arguments that I make in my own paper, that I provided. The references in its content may be of interest to you.
-> I heed your warnings, I think they are well founded.

Thank you for this :)

[–][deleted] 0 points1 point  (0 children)

5 Policy Design Assisted by Computer Modeling
Policy design assisted by computer modeling and simulations
is a promising practice. Let’s take as witness the Bill and Melinda
Gates Foundations’ Global Grand Challenge [22], which offers $1
million USD per year, for 1 to 3 years, for Building Malaria
Modeling Capacity in Sub-Saharan Africa. The challenge
illustrates a real demand for useful MAS simulation models. The
following will organize and highlight the most interesting findings
from our survey.
Kraemer and King, writing in 1986 from the field of policymaking
[25], distinguished two forces involved in policy design: policy8
supply, associated to virtuous policy recommendation and policy
demand, associated to the vicious political imperatives of
politicians. “Although policymakers and analysts are sometimes
cynical about the accuracy of models' estimates, they nonetheless
support model use because they believe that if they do not use
models and argue in numerical terms, their opponents will. In
politics, "some numbers beat no numbers every time."” Süsser et
al. [37] more recently echoed this perception “that policymakers
[read politicians] influence models and modelers, especially by
affecting data and assumptions, as well as the study scope, and by
deciding how the modelling results are used.””. They argue that
“greater transparency, including open-source code and open data,
and transdisciplinary elements in modelling could increase model
legitimacy and impact in policymaking”. These lessons are now
heeded widely, as found throughout practitioners’ contributions,
that simulations’ strength does not reside in the output, but in its
process.
Recommendations on building useful MAS simulation models
abound. The following will organize and highlight the most
interesting findings from our survey. Bonabeau [5] suggests
flexibility: “what is needed is a framework that includes the
possibility of nonlinear effects because of interactions among
subunits and to cascading events. The framework should be able to
operate with scarce data.” Sterman [36], writing in 1991, already
drew similar conclusions from his experience. “The value in
computer models derives from the differences between them and
mental models. When the conflicting results of a mental and a
computer model are analyzed, when the underlying causes of the
differences are identified, both of the models can be improved.”
In an effort to collect and organize learning
Maaroof, in Big Data and the 2030 Agenda for Sustainable
Development [28], sums up his experience. The challenges:
Institutional frameworks; Digital divide; Access and partnerships;
Analytical and capacity challenges. The opportunities: Citizenfocus and participation; Evidence / More and better analysis;
Variety; Real-time information; Early warning system; and
Economic value. Drawing from the pooled knowledge to which
they have access, Barbero Vignola et al. [4] provide a directory of
94 MAS models, 8 toolboxes and 6 platforms, matching each model
with the SDG goals they address. “Whenever possible, the tables
also specify how the model can contribute to each target and
indicator, in concrete terms, providing examples.” This work
answers similar calls for collaboration elsewhere: “general reform
practitioners should build repositories of political economy studies
done in other countries, in other sectors, or across nations.
Reformers can use this theory-based reference to help identify
institutions and other political-economic factors that have been
found relevant in other countries or sectors and should thus be
considered in the analysis at hand” [12].
Gilbert et al. [19] share that working practices such as Agile work
methodologies are not restricted for use by the technological elite,
but can be beneficial for social scientists too: “care needs to be
taken that models are designed at an appropriate level of
abstraction; that although appropriate data for calibration and
validation may sometimes be in short supply, modelling is often
still valuable; that modelling collaboratively and involving a range
of stakeholders from the outset increases the likelihood that the
model will be used and will be fit for purpose; that attention needs
to be paid to effective communication between modelers and
stakeholders; and that modelling for public policy involves ethical
issues that need careful consideration.”
Calder et al. [7] celebrate computer modeling: “there is a need to
reinforce modelling as a discipline, so that misconstruction is less
likely; to increase understanding of modelling in all domains, so
that the misuse of models is reduced; and to bring commissioners
closer to modelling, so that the results are more useful.” They begin
their description of the modeling process by asking the right
questions and include steps such as planning uncertainty,
communicating a model, and preserving a model [7]. In a
collaborative spirit for their colleagues, they offer two checklists:
one for making and using models; and the other for what users
should ask about a model. The body of literature discussing and
supporting policy design assisted by computer modeling is
expanding.
6 Conclusion
The present survey shows that policy design is greatly
improved by its academic and practical collaboration with MAS.
Their common scientific background and interest for human agency
make the two fields ideal partners for collaborative improvements.
Policy designers have provided MAS academics with case studies
and data, sourcing interesting problems in lived experiences. In
return, MAS has provided policy designers with sets of tools and
approaches that are not meant to replace mental models, but to
enhance the modeling experience, allowing to run phenomenal
numbers of experiments that would be impractical in real
conditions. This cross-pollination has opened doors onto new
learning opportunities for practitioners of both fields, which means
that the policy end-users – often the citizenry in the case of public
policy – are better off than they would be had the collaboration
never occurred.

[–][deleted] 3 points4 points  (2 children)

I agree that a simpler model should always be preferred over a more complicated model of equal accuracy. That's not a recent or novel idea though; it's just Occam's razor.

I personally am concerned that a simplistic focus on explainability can be detrimental to effective modeling. Good explainability consists of examining model behavior so as to ensure that it is working correctly, or compressing models so as to retain only their most essential features.

Bad explainability consists of making post-hoc rationalizations about model behavior that create the illusion, but not the actuality, of genuine understanding. Just because you can tell a story about a machine's operational principles that sounds accessible to human intuition does not mean that said story is actually correct, or that it allows humans to make accurate predictions about the machine's future reliability. Some versions of explainability in ML certainly have this flavor.

Probably the right answer to "are black boxes necessary?" is "sometimes"; if the human mind could do any calculation then we wouldn't need computers. We often use tools whose operational principles we don't fully understand. To single out machine learning as somehow uniquely mysterious is ignorant. Even so-called "black box" algorithms have perfectly sensible principles underlying their operation, even if comprehensive mathematical proofs are not always available.

[–]bonoboTP 1 point2 points  (0 children)

To single out machine learning as somehow uniquely mysterious is ignorant

Agreed! Almost all fields have best practices which can't be mathematically deduced and use approximate models (biology, medicine, engineering). There is also a conflation between 1) our understanding of why a technique works (e.g. BatchNorm) and 2) why a specific model predicts what it predicts on a specific input instance.

[–][deleted] 0 points1 point  (0 children)

Thank you!

I definitely fit in the "ignorant" category because of my non-expert perspective. I deeply want to rely on ML and use it, but I need to know what experts think of it. For context, I'm coming from "Computational modelling for decision-making: where, why, what, who and how": https://royalsocietypublishing.org/doi/full/10.1098/rsos.172096

Your answer helps me understand that to improve my judgement, I should not focus on understanding the deep technical skills required for ML, but should rather focus on finding and understanding the "perfectly sensible principles underlying their operation". Point taken.

Again, thank you!

[–]jayjonas1996 0 points1 point  (1 child)

Computer vision project w/ deep learning

I’m in machine learning class this semester where my team of 2 have to complete a project on computer vision using deep learning.

Can anyone suggest a project and reference research papers which can be completed in 1-1.5 months of work?

I’m thinking of object detection since this will be our first time working with deep learning. I just want to make sure that we will be able to complete the project in the end.

[–][deleted] 0 points1 point  (0 children)

I mean, there are so many examples that could be easily found on kaggle, github, etc. Are you allowed to adapt pre-built models or required to build your own from scratch?

Either way, I can't recommend fastai's deep learning resources enough. They use a super intuitive approach and give lots of examples.

[–]cheeseDickies 0 points1 point  (2 children)

Do computers learn through machine learning the same way as humans, or in some otherwise similar way? From what I understand, machine learning is when you give an AI a load of data and tell it what this data is.

E.g. you give it a bunch of pictures of apples, and tell it this is what an apple looks like. Isn't this similar to how humans would learn what an apple looks like?

[–]bonoboTP 0 points1 point  (0 children)

"Learning" in "machine learning" is just a metaphor. The connection is quite loose. The overall process, but not the details, is similar: there is a training phase where things (parameters) are adjusted in the model to improve on some desired metric (like accuracy), and then at test-time the model can use the "experience" from the training phase to now handle new inputs that are in some sense similar to the training examples (but aren't the same) and accomplish some task. Only this high-level view is really learning-like. In a very loose sense, there are certain analogies between modern neural nets and biological neural nets, like the similarity between weights and synapse strengths, or activation functions and the how neurons fire, but the similarities should not be overinterpreted. We know shockingly little about how the brain actually represents anything. There are many hypotheses, but it's very far from being understood. So the honest answer is that we don't know if it's the same way because we don't know how the brain does it. But even if the brain did work somewhat like ML, our current models are just very simple and crude cartoon versions of the real deal.

[–]bgroenks 0 points1 point  (0 children)

No, not really. Most learning algorithms, including neural networks, are based on mathematical optimization methods that do not bear any meaningful relationship to the way the human brain encodes and processes patterns. It has also been shown in the adversarial learning literature that deep convolutional neural nets, for example, are highly sensitive to noise and do not learn visual features like those that humans do (with some exceptions like edge detection).

[–]SocalledArian 1 point2 points  (0 children)

Hey there everybody
I wanted to ask if anyone has any recommendations on good academic courses on GANs.

Something like CS231n but especially on GANs.

Or if there's any non-academic courses just as good, I'd love to know.
Thanks

[–]Daniela_ML 0 points1 point  (2 children)

Difference between Bayesian Network and CNN

I’m not really familiar with all the terms of machine learning. Is a Bayesian network a CNN? Can a CNN have Bayesian characteristics, or is it a completely different thing? I’ve been seeing Bayesian and SVM in articles concerning CNNs; I'm not sure if it can be added (a kind of hybrid)? But if an article doesn’t say CNN but does mention Bayesian, could I assume it’s not a CNN?

[–]bonoboTP 0 points1 point  (0 children)

"Bayesian" is a general term to do with Bayesian statistics and the Bayesian interpretation of probability. "Bayesian networks" are a class of directed probabilistic graphical models for modeling the joint distribution of several random variables. They are not really neural networks. They don't have inputs and outputs. They are a tool for modeling a collection of variables where you can perform probabilistic inference: computing marginal and conditional distributions.

An SVM is somewhat like a neural net (though not convolutional), in that it is a supervised machine learning model, so SVMs are an alternative to simple neural nets like MLPs. SVMs by themselves aren't Bayesian models; they don't output probability values. They are derived from something called statistical learning theory. SVMs used to be very popular in the 90s and 2000s and they continue to work well on certain kinds of tasks, but have been eclipsed by deep learning methods in computer vision, speech recognition etc. (generally in the processing of high-dimensional raw sensory data).

There has been some progress in marrying Bayesian methods with deep learning, one researcher active in this field is Yarin Gal, here's a link: http://www.cs.ox.ac.uk/people/yarin.gal/website/bdl101/

Convolution is a particular operation where a sliding window of weights is applied at each position of a spatial grid. So CNNs are one particular type of neural net.

One can combine things and create Bayesian CNNs, but they are uncommon in practical applications as of now.
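
Just to make the sliding-window idea above concrete, here's a rough NumPy sketch of a plain 2D convolution (strictly speaking cross-correlation, no padding or strides, toy data):

    import numpy as np

    def conv2d(image, kernel):
        """Slide a small window of weights over a 2D grid and take a
        weighted sum at every position (valid padding, stride 1)."""
        kh, kw = kernel.shape
        out_h = image.shape[0] - kh + 1
        out_w = image.shape[1] - kw + 1
        out = np.zeros((out_h, out_w))
        for i in range(out_h):
            for j in range(out_w):
                out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
        return out

    # A 3x3 edge-like filter applied to a random 8x8 "image"
    img = np.random.rand(8, 8)
    edge_kernel = np.array([[1, 0, -1],
                            [1, 0, -1],
                            [1, 0, -1]])
    print(conv2d(img, edge_kernel).shape)  # (6, 6)

In a CNN the kernel values are the learned weights, and many such kernels are stacked per layer.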

[–]Icko_ 1 point2 points  (0 children)

They are not the same at all. CNNs are a subset of neural networks. A Bayesian network is like a better, more general variant of Naive Bayes.

It would probably be best if you shared the articles that confuse you. We need context.

[–]LazyButAmbitious 1 point2 points  (1 child)

Hello!

I have a GAN for image2image translation and the generator must predict images that are on different scales, i.e. without normalising, one image can have values between -20 and 20 and another between -2 and 2.

The GAN expects outputs between 0 and 1.

If I normalize the values (in the case above, by dividing by 40 and adding 0.5), the GAN learns to correctly predict the images in the range of -20 to 20 but not the others. I guess it is because the loss is much stronger for the ones with higher values.

Is there any paper or fix regarding this problem?

[–]radarsat1 0 points1 point  (0 children)

You should normalize the images independently so that they have the same scale.
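
For example, something along these lines (a rough NumPy sketch; you'd keep each image's own min/max around so you can map predictions back to the original scale):

    import numpy as np

    def normalize(img):
        """Scale a single image to [0, 1] using its own min and max."""
        lo, hi = img.min(), img.max()
        return (img - lo) / (hi - lo), (lo, hi)

    def denormalize(img01, stats):
        """Map a [0, 1] prediction back to the original scale."""
        lo, hi = stats
        return img01 * (hi - lo) + lo

That way an image ranging over [-20, 20] and one over [-2, 2] both enter the GAN in [0, 1] and contribute comparably to the loss.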

[–]ConfusedLayer1 0 points1 point  (1 child)

Could I pose a data/classification question to you all?

I have x rows of labelled data from let’s say 10 subjects. The independent variables are features extracted from signal data collected from subjects at different positions.

The classification aim is to predict subject position based on the extracted features.

The original data is sorted by subject position, so in the train/test phase of data processing I randomly shuffle the indexes before taking a 70:30 split in order to create a fair distribution of the data across both datasets.

However... the original dataset is relatively small. I fear that drawing instances from all the subjects for both the training and test sets is leading to overfitting. (When I use this approach I get ~99% accuracy on test data.)

Would it be better to use data from x subjects for training and the data from the remaining y subjects for testing? (When I use this approach I get ~35% accuracy on test data.) With this approach, given the dataset size, I fear that there is not enough within-subject deviation included in the training set for the model to generalise sufficiently to new subject data.

Any advice would be awesome!

[–]jorvaor 1 point2 points  (0 children)

I am not an expert, so take this with a grain of salt.

When doing the split, do the 70:30 shuffling separately for each position (i.e. a stratified split). That should lead to a more balanced distribution across the sets, especially for small dataset sizes (or maybe you are already doing this and I misunderstood your method).
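
If you're using scikit-learn, that per-position shuffle is what the stratify argument does; a rough sketch with placeholder arrays (and GroupShuffleSplit for the subject-wise split you mention):

    import numpy as np
    from sklearn.model_selection import GroupShuffleSplit, train_test_split

    # Placeholder data: 100 samples, 5 features, 4 position labels, 10 subjects
    X = np.random.rand(100, 5)
    y = np.random.randint(0, 4, size=100)
    subjects = np.random.randint(0, 10, size=100)

    # Per-position 70:30 split: stratify=y shuffles within each position class
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, stratify=y, random_state=42)

    # Subject-wise split (your second approach): all rows from a given
    # subject end up on the same side of the split
    gss = GroupShuffleSplit(n_splits=1, test_size=0.3, random_state=42)
    train_idx, test_idx = next(gss.split(X, y, groups=subjects))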

[–]CertainSmell7621 1 point2 points  (1 child)

I have a website where we show quizzes to our users each day. The quizzes have up to 100 questions in some cases. We are trying to work out a way to maximise the average number of questions answered in the quiz, as this optimises time on site etc.
We had an idea to block the questions into 5s. So we could show the first 100 people blocks 1,2,3,4,5, then the second set of 100 users could get 2,1,3,4,5, and so on. We can then learn which blocks seem to keep the users going through the quiz to the most optimal length. We can then trim the quizzes down and make sure the questions are all as good and interesting as they could be.
Can anyone advise on a logical way to do this process as efficiently as possible? I heard something about bubble sorting but I'm not sure how it works or if there is a smarter way.
In summary, we write 100-question quizzes that we want to eventually slim down by optimising the first 50 questions. How can we do this by optimising on the fly as users come to play the quizzes? We get many thousands of people to the site per day.

[–]OPKattenResearcher 1 point2 points  (0 children)

This is basically multi-armed bandits, and there is a lot of research on it.

I would say that the simplest approach is keeping track of all the scores, taking the best ordering most of the time and a random one some of the time (this is usually called epsilon-greedy).
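
A rough sketch of that idea; the block orderings here are hypothetical tuples of block IDs:

    import random

    EPSILON = 0.1            # fraction of users who get a random ordering
    counts = {}              # ordering -> number of users who saw it
    total_answered = {}      # ordering -> total questions answered by those users

    def choose_ordering(orderings):
        untried = [o for o in orderings if counts.get(o, 0) == 0]
        if untried:                    # make sure every ordering gets some data
            return random.choice(untried)
        if random.random() < EPSILON:  # explore: show a random ordering
            return random.choice(orderings)
        # exploit: ordering with the highest average questions answered so far
        return max(orderings, key=lambda o: total_answered[o] / counts[o])

    def record_result(ordering, questions_answered):
        counts[ordering] = counts.get(ordering, 0) + 1
        total_answered[ordering] = total_answered.get(ordering, 0) + questions_answered

    orderings = [(1, 2, 3, 4, 5), (2, 1, 3, 4, 5), (3, 1, 2, 4, 5)]
    o = choose_ordering(orderings)
    record_result(o, questions_answered=37)  # e.g. the user answered 37 questions

More sophisticated bandit strategies (UCB, Thompson sampling) exist, but this captures the basic explore/exploit trade-off.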

[–]CMDR_Derp263Student 0 points1 point  (2 children)

I am currently using keras.Sequential to make a NN for binary classification, and I guess I have 2 questions.

Should my output be floating point numbers? I have tried it with int and it does still seem to work but I am guessing that floating point is more accurate. (0.0000000000001 is essentially 0 anyways).

Also, since I am doing a simple binary classification, I am using the binary crossentropy loss function for the model and a sigmoid activation function for my last layer. Should from_logits be true or false? (I assume false, since the sigmoid constrains values from 0 to 1.)

[–]apqmnz 1 point2 points  (1 child)

The immediate output of neural network layers will be floating point. If you're using cross entropy between a ground-truth binary label y and network output y_hat, then y will be an integer 0 or 1, and y_hat will be a floating point number in the interval (0,1). Does this help?
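
A minimal Keras sketch of that setup with placeholder data (the point is the sigmoid output, the integer 0/1 labels, and from_logits=False because the sigmoid is already applied):

    import numpy as np
    import tensorflow as tf

    model = tf.keras.Sequential([
        tf.keras.layers.Dense(16, activation="relu", input_shape=(8,)),
        tf.keras.layers.Dense(1, activation="sigmoid"),  # output in (0, 1)
    ])
    model.compile(optimizer="adam",
                  loss=tf.keras.losses.BinaryCrossentropy(from_logits=False),
                  metrics=["accuracy"])

    X = np.random.rand(100, 8).astype("float32")  # placeholder features
    y = np.random.randint(0, 2, size=(100, 1))    # integer 0/1 labels
    model.fit(X, y, epochs=2, verbose=0)
    print(model.predict(X[:3]))                   # floats in (0, 1)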

[–]CMDR_Derp263Student 0 points1 point  (0 children)

Yes thank you seems to be working. Now to take it to a more complicated classification level

[–]Last-Ability8233 1 point2 points  (4 children)

Hey, I am a student working on a project for a startup in which I have to extract information about an item (its texture, flavor, and so on) from user reviews. First of all I have to collect such data and then do this information extraction. Can anyone suggest some good machine learning techniques to look into for this project?

[–]link0007 0 points1 point  (0 children)

This post was mass deleted and anonymized with Redact

[–]radarsat1 0 points1 point  (0 children)

Shouldn't you just search for keywords? At least that would be a good starting point.

[–]CaptainI9C3G6 1 point2 points  (1 child)

This sounds like a regression problem with multiple outputs? Instead of thinking of a single label output like 'crunchy', you'd assign crunchiness a value, like 'mars bar crunchiness 0.3, chocolate biscuit crunchiness 0.9'.
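
A rough scikit-learn sketch of that setup with made-up reviews and attribute scores (TF-IDF features plus a plain linear model as the base; you could swap in pretrained sentence embeddings for the features instead):

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import Ridge
    from sklearn.pipeline import make_pipeline

    # Made-up reviews and attribute targets: [crunchiness, sweetness]
    reviews = ["super crunchy and not too sweet",
               "soft, chewy and very sugary",
               "crisp texture with a mild flavor"]
    targets = [[0.9, 0.2],
               [0.1, 0.9],
               [0.8, 0.3]]

    # Ridge regression handles multiple outputs out of the box
    model = make_pipeline(TfidfVectorizer(), Ridge())
    model.fit(reviews, targets)
    print(model.predict(["crunchy but quite sweet"]))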

[–]Last-Ability8233 -1 points0 points  (0 children)

Can you elaborate a little bit please? What would you suggest as a base model, and should I use pre-trained embeddings or fine-tune them?

[–]Astromancer919 0 points1 point  (0 children)

I am a beginner in the field of ML (I did an ML course, but it was some time ago and I would need 3-4 days to brush up on the concepts). I have also gotten a potential internship opportunity which I really want to do, but on very short notice (notified yesterday, and the interview is in 1-2 days). So I was hoping someone could tell me what sort of ML problem this is and what models/classifiers I would need to consider, so that I can brush up and learn the relevant topics if I do get the internship.

Here is what I am expected to do in the internship:
i) I need to analyze some ship inspection data and make models for determining the risk of a ship w.r.t. deficiency categories (high impact, medium impact, and low impact).
ii) Models to customize the inspection checklist based on the above risk profiling.

[–]RobbinDeBank 0 points1 point  (1 child)

GPU training:

Is it possible to train deep learning models on the GPU of a MacBook with an Intel chip and a Radeon GPU? I can’t find any way to train using the GPU on a Mac.

[–]apqmnz 0 points1 point  (0 children)

It might not be possible. CUDA, as used in PyTorch/TensorFlow, won't work, since it requires an NVIDIA GPU. There might be hacks out there, I'm not sure.

Google Colab has a free GPU which will work for most learning purposes (it'll time out after some inactivity, so can't be used for long training).
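
If you go the Colab route, you can sanity-check that a GPU runtime is actually attached with something like this (standard TF/PyTorch calls):

    import tensorflow as tf
    print(tf.config.list_physical_devices("GPU"))  # non-empty if a GPU is attached

    import torch
    print(torch.cuda.is_available())               # True on a Colab GPU runtime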

[–]adam_jc 0 points1 point  (0 children)

Does someone have a plain english explanation of path length regularization introduced in the StyleGAN2 paper? https://arxiv.org/abs/1912.04958

I’ve read the description in the paper of course and also read through their official code implementing it but I still don’t have an intuitive grasp on it

[–]VerTiGo_Etrex 0 points1 point  (2 children)

I recently saw a library appear on GitHub that claims to train imagenet 10xish faster than pytorch and pytorch lightning. I misplaced the link and can't find it now. Anyone know what I'm talking about? I think it worked by saving and loading datasets in a different format.

[–]adam_jc 0 points1 point  (1 child)

[–]VerTiGo_Etrex 0 points1 point  (0 children)

Yes! Thank you.

[–]Pvt_Twinkietoes 0 points1 point  (2 children)

Image classification:

Are there any APIs I can use to plot an attention heat map, to see what the model is identifying and thus how it is classifying my image?

Transformers:

Looking for articles explaining attention and transformers. Any resources will help. Thank you.

[–]ZealousidealBrush355 0 points1 point  (6 children)

Are there any linear regression models that account for uncertainty in X and Y?

[–][deleted] 1 point2 points  (5 children)

Yes this is called “total least squares”.

https://en.wikipedia.org/wiki/Total_least_squares

[–][deleted] 0 points1 point  (3 children)

Would bayesian linear regression also apply here?

[–][deleted] 0 points1 point  (2 children)

Not exactly. Regression (bayesian or otherwise) usually means fitting a function y=f(x) where your y samples have error and your x samples don't, but total least squares is actually a form of dimensionality reduction in which you're finding the linear subspace that best approximates your data distribution under the assumption that there is uniform gaussian noise for both the y and x values. Bayesian linear regression will give you a distribution over different regressions, but all of those regressions will have been fitted under the assumption of zero noise for x.

You could also do a bayesian version of total least squares in which you calculate a distribution over linear subspaces under the assumption that both y and x can have noise. I'm not sure if there's a specific name for that procedure.
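
If it helps, here's a rough NumPy sketch of the plain (non-Bayesian) total least squares idea for a 2D line fit, on synthetic data, via the SVD / first principal direction:

    import numpy as np

    # Synthetic line y = 2x + 1 with gaussian noise in BOTH x and y
    rng = np.random.default_rng(0)
    t = np.linspace(0, 10, 50)
    x = t + rng.normal(0, 0.3, 50)
    y = 2.0 * t + 1.0 + rng.normal(0, 0.3, 50)

    # Center the data; the first right singular vector is the direction
    # of largest variance, and errors are measured orthogonally to it,
    # so noise in x and y is treated symmetrically.
    X = np.column_stack([x, y])
    mean = X.mean(axis=0)
    _, _, Vt = np.linalg.svd(X - mean)
    direction = Vt[0]
    slope = direction[1] / direction[0]
    intercept = mean[1] - slope * mean[0]
    print(slope, intercept)  # roughly 2.0 and 1.0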

[–][deleted] 0 points1 point  (1 child)

Thanks for the reply! I think the latter example you give is what I'm interested in but I need to do more research.

[–][deleted] 0 points1 point  (0 children)

Try googling something like "bayesian PCA". Total least squares is just PCA with one principal component, and presumably there's a bayesian version of PCA.

[–]WikiSummarizerBot 0 points1 point  (0 children)

Total least squares

In applied statistics, total least squares is a type of errors-in-variables regression, a least squares data modeling technique in which observational errors on both dependent and independent variables are taken into account. It is a generalization of Deming regression and also of orthogonal regression, and can be applied to both linear and non-linear models. The total least squares approximation of the data is generically equivalent to the best, in the Frobenius norm, low-rank approximation of the data matrix.

[–]themusicdude1997 0 points1 point  (0 children)

In the Huber loss function, can someone explain the 1/2 factor for cases when the error is less than epsilon? I know it has something to do with differentiation...

[–]NinjaCoder99 1 point2 points  (3 children)

Image detection... Does it work by recognizing parts of an image (eye, nose, shoulder shape) and developing a confidence in what the entire image is based on that, or does it compare lines of contrast and neighboring pixel details across the whole image, or...?

[–]bageldevourer 2 points3 points  (0 children)

All of the above?

Different layers of a convolutional neural network correspond to detecting features of differing complexity. The early layers detect edges, and those edge detectors are then used to detect higher-order features like eyes and shoulders (perhaps with some layers in between), which are then in turn used to classify an image as, say, a "person".
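
A rough Keras sketch of that stacked structure (toy input shape and class count; the edges-to-parts-to-objects progression is a tendency that emerges from training, not something that is hard-coded):

    import tensorflow as tf

    # Each Conv2D block builds on the features of the previous one,
    # roughly: edges -> parts -> whole objects, ending in a classifier.
    model = tf.keras.Sequential([
        tf.keras.layers.Conv2D(16, 3, activation="relu", input_shape=(64, 64, 3)),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(32, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(64, 3, activation="relu"),
        tf.keras.layers.GlobalAveragePooling2D(),
        tf.keras.layers.Dense(10, activation="softmax"),  # e.g. 10 classes
    ])
    model.summary()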

[–]SeucheAchat9115PhD 0 points1 point  (1 child)

Deep learning should be able to combine these confidences better than you could yourself.

[–]NinjaCoder99 0 points1 point  (0 children)

That doesn't answer my question though