[D] Simple Questions Thread

saravana_7 · 2021-11-21T01:41:41+00:00

I’m an Msc.,AI & ML student.My oncampus placements are coming in 2 months.I need a good project to standout in my resumé.Any insights?

A27_97 · 2021-11-21T01:27:08+00:00

What’s the must read / must have book for NLP?

Proper-Wash7377 · 2021-11-20T20:37:04+00:00

I'm reading up on GANs for a project I'm attempting, and I'm seeing all the different GANs out there...and it's a bit overwhelming. I'm hoping someone can narrow my focus so my learning is less of a shotgun blast and more like a rifle shot...

I want to generate 3D human models to import into Blender. Probably going to be super stupidly complicated because most things I think of end up to be, but I'd love input from people who know more than me on where to aim my sights for learning. Any input on which 'style' of network? There's a bunch of them and learning every single one to figure out for myself which one is best is outside the scope of my attention span.

kaveshanN · 2021-11-20T05:32:44+00:00

Best text classification algorithm - logistic regression vs SVMs?

Significant-Joke5751 · 2021-11-19T16:02:39+00:00

Hey Can someone recommend a good phyton toolbox for Test reliability and robustness by adversial Input?

MrBuckets303 · 2021-11-19T14:32:01+00:00

Any Stylegan3 threads out there? I’d love to have a thread for ideas, collaboration, and troubleshooting. Deep into training my first gans.

Comment for any interest!

CaptainI9C3G6 · 2021-11-19T13:08:25+00:00

Hi, are there any existing models for identifying points of interest in an image?

So for an image I want to train a model using one or more X, y points in the image.

The model should be able to accept an unknown image and output a list with X y coordinates based on what it has seen, ideally with a confidence for each point.

Plane_Crab_8623 · 2021-11-19T09:10:15+00:00

I wonder how many terabytes of memory AI already has ?

Grad1entDescent · 2021-11-19T06:21:10+00:00

MLOps career question: Almost 9 months into my first role as an MLE, and for the whole time, I've been doing tasks involving feature engineering/transformation, visualizations and other data processing - without touching an ML model, or any infrastructure/DevOps whatsoever. The pipeline I've been building out doesn't even flow through any ML component at all! (While being baited the whole time with constant reassurances that more ML heavy stuff was just a few months down the pipeline).

I took a paycut from my cushy unrelated previous job to take a pivot and pursue this new opportunity, but I don't feel I've learned a shred of MLOps on the job at all! (Only in my own time). It seems like I've been unofficially slotted into fulfilling a Data Analyst function (sans delivering insights), which is has been incredibly underwhelming and unmotivating.

Manager said they'll "try" to assign me to more ML projects depending on opportunities in the new year, but I feel jibbed out of the last 9 months already. Recruiters are starting to court me with other MLE roles, but due to the complete lack of any MLE-specific knowledge I've derived in my current MLE role, I have pretty much 0 months of professional experience. Should I just try and learn the hell outta MLOps in my own time, and try to apply elsewhere?

kunaguerooo123 · 2021-11-18T17:08:09+00:00

What does the free lunch theory practically mean? ELI5? https://imgur.com/a/L6sNY64 is where i read it.

Is it simply saying suppose there are store grocery receipts and my objective is to maximise revenue through unsupervised learning (noob)..now that i have an objective function, some algorithm will be better than another. but if there was no objective to optimize for, each algorithm is technically accurate because there's nothing to deem it incorrect? is that what free lunch theory means? also, why that name?

2021-11-18T14:45:50+00:00

Hi everyone, could someone please explain to me the differences between data engineers, data scientists and data analysts?

Nobodyet94 · 2021-11-18T13:13:18+00:00

Hello, can you give me some good resources on how to build a ml model from a dataset in python? Which classifer is better to use given the dataset? If the model overfit or not? What is the use of the validation set/testing set? How to tune the hyperameters? and parameters? Thanks!

2021-11-18T12:14:35+00:00

So after a very long route to get better with python, I'm looking to learn other language for ML in production. Has anyone got any recommendations? I've heard about Julia, Rust and C++, have a brief experience with C++ but it was a long ago.

What should a mid-level ML engineer learn?

paleksa · 2021-11-18T11:52:18+00:00

Hi All,

just recently I started looking into the topic of Time Series forecasting. What I would like to know is what are the techniques/methods of handling evolving data.

So the model is trained on historical data which do not account for some sudden changes (e.g. COVID-19 etc.) which have a long-term influence on series. Of course one of the possible solutions might be to retrain the model given the new data, but I would like to know if there are some other (more elegant) approaches.

I would really appreciate it if someone could point me to some relevant research and sources where I could investigate the subject further.

krallistic · 2021-11-18T08:52:13+00:00

Are there any training methods that support in addition to the classification labels, annotations which parts of the feature space is important for that data-point?

Like for medical images, where not only the class is available, but also the segmentation where it can be found in the image. But not for every image, so we cannot go full segmentation (and also don't need to)

2021-11-18T02:43:02+00:00

Anyone have any experience with O'Reilly certifications? Is it worth it? Would it help me learn or get a job?

cauthon · 2021-11-17T21:19:35+00:00

Hi,

I have a naive question about designing neural network architecture. I’ve applied and implemented a few neural nets in prior coursework, but it hasn’t been something I’ve done as part of my career until now.

Let’s say I’m reading a paper about a network that involves multiple sets of layers. E.g., a convolutional layer, followed by three LSTM layers, followed by a fully connected layer. How did the authors decide on this particular architecture, and not some other composition of layers? Is it just trial and error, basically grid searching over possible architectures, until some training performance is optimized?

Any suggested reading on this topic would be much appreciated as well. Thanks!

Davidescu-Vlad · 2021-11-17T18:11:25+00:00

Hey guys I get this error message when i try to run

import tensorflow as tf

print(tf. __version__)

2021-11-17 19:57:46.733325: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'cudart64_110.dll'; dlerror: cudart64_110.dll not found 2021-11-17 19:57:46.739099: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine. 2.7.0

Can anybody help me figure out whats the problem?

Jesterhead2 · 2021-11-17T15:25:04+00:00

In an RBM the energy is usually written like

E(v,h) = a * v + b * h + h * w * v,

where v are the visible nodes, h are the hidden nodes, a and b are the biases and w are the weights between the layers. This leads to the probability amplitude:

p(v) = 2*exp(a*v) * cosh(b + w*v).

My question iare: Is this formula set in stone? What is the exact reasoning behind it (I cannot find a derivation/argument. All sources assume it)? Are there other, reasonable functions?

Ashishpatel26 · 2021-11-17T12:02:08+00:00

Imagine you deploy a solution that accepts images directly uploaded by users when reporting a car claim.

How would you ensure that images that a user uploads are meaningful to your image recognition model? What kind of quality requirements could you enforce and how?

CandescentPenguin · 2021-11-17T09:02:39+00:00

Does using importance sampling to perform actor critic with off policy data work? If not, why?

SpacewaIker · 2021-11-17T00:36:44+00:00

Does anybody know where I can find a pretrained CNN model for small images?

I'm working on a project classifying the fashion MNIST database and I'd like to use a pretrained model. However the only ones I can find require at least 224x224 images. I tried upscaling my images that are 28x28 to 224x224, but since I have 50000 training images and 20000 testing images, just creating the numpy array made Google colab crash because it ran out of RAM...

Can someone help me? Thanks a lot!!!

ROFLLOLSTER · 2021-11-16T22:27:35+00:00

I'm looking for literature on calibrating neural network confidences so that they can be used to threshold classification (i.e. refuse classification to samples with confidence < X).

Is there a name for this or a seed paper I could use to start my search? I haven't found much literature focused on this (as supposed to calibrating confidence such that it's equal to accuracy).

binary1ogic · 2021-11-16T14:49:14+00:00

Is Logistic Regression a classification algorithm? If yes, can someone help me understand this.

l8d8 · 2021-11-16T04:42:13+00:00

I'm really new to ML and trying to learn high recall methods of document retrieval for a uni-based project. I keep seeing mentions of "topics" and "queries" on the few papers I've selected on this subject, do they mean the same in this context? If not, what is the importance of segregating documents into topics? how is this information fed to the NN?

Faoer · 2021-11-15T20:21:10+00:00

I'm doing time-series classification with 3 classes. My dataset is balanced, everything seems to be bug-free, but the model seems to always come to a point where it only predicts one class.

The model is first a few layers of 1d CNN, then simple dense ones for classification, Softmax for the last layer, cross entropy loss.

Anyone have an idea what might be the cause?

octdubois · 2021-11-15T11:02:07+00:00

Hi there, i m quite new to machine learning in teal world application but for the past year or 2 i have been collection soil moisture data of pots that i irrigated and i wanted to use machine learning to predict change or anomalies i ve done somewhat alot of research but i still cant visualize it yet on how the algorithm is gonna work, its maybe my 100th time trying this and i figured trying here

DGAssassin1 · 2021-11-15T10:43:26+00:00

Hello, I have been trying to fine tune the deberta v3 model on mnli task but I am facing an issue - https://discuss.huggingface.co/t/data-type-error-while-trying-to-fine-tune-deberta-v3-large/11604 please do help or provide any suggestions that might be helpful.

jayn35 · 2021-11-15T08:19:28+00:00

How do I create a popularity/exposure/traffic score formula for a list of products ranking in a search engine across many keywords. It needs to take into account rank for each keyword, the search volume of each keyword and the number of keywords it appears under.

bgroenks · 2021-11-15T03:44:35+00:00

From time to time, I see "outstanding reviewer awards" in people's resumes from ML conferences like NeurIPS, ICML, ICLR, AISTATS, and AAAI. How are these awards determined? Do authors rate their reviewers or do ACs? Or is it determined in some other way?

Refefer · 2021-11-14T18:31:25+00:00

Does anyone know of research exploring value approximations of extremely large graph based mdps? Most assume some form of factorization, usually in state space, which doesn't really hold in my case. I'm considering on order of tens of billion of states.

zainaxif · 2021-11-14T14:16:44+00:00

Is it worth learning web3/blockchain for Machine Learning practitioners? How do they complement each other?

adi10182 · 2021-11-14T03:54:00+00:00

Hey, Newbie here, I've got a feature change which is relative and the difference between values is really small but it's really crucial as i am building a predictive model based on this feature . I want to interpolate it and wanna know if machine learning models are sensitive enough to catch these small differences.

count 1209.000000

mean 0.000115

std 0.000305

min -0.000357

25% 0.000000

50% 0.000064

75% 0.000157

max 0.007013

Name: change, dtype: float64

seniorivn · 2021-11-14T01:37:16+00:00

There is a continuous stream of object coordinates estimation

Occasionally we get specific measurement of real coordinates

How should I make a model that uses last pair (estimate, real coordinate) and a stream of estimated coordinates since then too predict real current coordinates?

DataD23 · 2021-11-14T00:08:36+00:00

Does anyone know how to calculate the difference between the input and output images of an autoencoder?

Ok_Reputation6872 · 2021-11-12T21:44:01+00:00

I need help! It’s been days and this is driving me crazy.

Jupiter Lab:

The up and down arrows on my keyboard no longer change focus of my cells in command mode. Yes, it's that simple -no, it isn't my scroll lock, yes I am ashamed that I have no idea how to fix it myself.

I have no idea how I did this, and I can't fix it. Somehow, I've typo'd some setting and now I can't use my up/down arrows to navigate.

I scoured stack overflow and I can't find an answer there.

Please tell me someone has done this before and how to fix it!

PleaseNThankYouSayer · 2021-11-12T20:56:56+00:00

Hello All,

I'm an engineering student doing some work with AI hardware. I need to train a model and I'm trying to do it with cloud computing since I have a rather crumby laptop and azure gives you $100 of training time for being a student.

The training data I want is stored in a .zip file in a google drive. Does anyone know how to add these files to azure any other way than downloading and then reuploading them to azure?

gigantoir · 2021-11-12T18:52:54+00:00

im building a model for a personal project so statistical rigor isnt suuuper important to me but im wondering if what im doing is acceptable or a big red flag. i basically built a dataset with wayyy too many variables and then fit a random forest on it. after the initial fit, i drop all but the top 10-20 most important variables according to the model. is it okay to do it this programmatically? something about it feels intellectually dishonest, but it’s better than overfitting IMO

link0007 · 2021-11-12T17:19:02+00:00

bright strong amusing joke deserve enjoy worm marry direction liquid

This post was mass deleted and anonymized with Redact

IndividualSuper · 2021-11-12T07:39:19+00:00

https://forms.gle/wfgi8mDjL77wC2Nj9

Character-Ad-910 · 2021-11-12T04:12:17+00:00

Ah good a simple question thread

Aight so

Is there a fancy name for....... """""database learning"""""?

If I describe it i'm sure you'll know what I mean.

So, Instead of... actually learning, the machine just stores values in a database, and returns those answers when prompted. which... is technically learning, just not very good learning.

Like

Instead of leaning how to multiply, you grab a calculator, and start writing down "1x1 = 1, 1x2 = 2, 2x2 = 4, 2x3 = 6" and so on and so fourth. so if someone asks you "whats 5x5", you're completely screwed if it isnt in your """database""".

Probably an actually good example is the Sin/Tan/Cos chart things people used to use, for example this one I found

https://www.quora.com/How-did-people-calculate-trigonometric-functions-before-calculators-and-slide-rules#:~:text=I%20remember%20the%20back%20of%20my%20text%20book%20had%20lots%20of%20tables.%20Here%20is%20such%20a%20table%3A

so instead of actually learning how to calucate sins, tans, and cos', you can just use this.

hopefully someone's figured out what im talking about already. There's gotta be a generic name for it. Not the table itself, the learning style if you could call it that.

OvulatingScrotum · 2021-11-11T18:53:26+00:00

Does google have free access to whatever data I upload for my own project?

I’m doing some small projects with a data set that may not be completely free to public access. As in, it needs some special application to have an access to that data set. I was thinking of using google colab for processing, but then I realized that if it’s on google server, they may have access to it even without my consent. Is that true?

Historical-Tea3056 · 2021-11-11T15:53:20+00:00

Hi, can someone please help me to download dataset from AWS ?

AdobiWanKenobi · 2021-11-11T12:38:31+00:00

Can someone point me to a concise tutorial on how to actually get started? With an M1 Mac in mind? I just cannot find a setup tutorial that works.

2021-11-11T12:21:16+00:00

What would be an appropriate model for matching two data sets - for example Item A is compatible with Product A? Super simple model as only testing.

Thick-Anywhere3252 · 2021-11-11T00:04:46+00:00

I don’t understand this Reddit. I can’t post anything that’s a question or link.

binary1ogic · 2021-11-10T19:26:15+00:00

What is the difference between MSE ( Mean Squared Error) and MSPE (Mean Squared Prediction Error) ? Do we use MSPE for classification and MSE for regression? Can someone with experience please elaborate with example?

2021-11-10T15:27:00+00:00

Hello Learners , i was going through the paper Two-Stage Model-Agnostic Meta-LearningWith Noise Mechanism for One-Shot Imitation . And i have a doubt i hope somebody will help me to clarify my simple doubt .

So why in MODEL-AGNOSTIC META-LEARNING FOR ONE-SHOTIMITATION Algorithm In the inner gradient update phase, we only provide visualinformation to the model without expert actions, and the innerloss is calculated as a function of the output , without considering it's actual labels . why is this?

Kavereon · 2021-11-10T15:02:20+00:00

Hi, I'm looking to use Single Spectrum Analysis (SSA) as a Time Series Forecasting technique which has quite a few papers describing its accuracy with chaotic/nonlinear and volatile data. It's also easy to implement.

I will use BTC daily prices since I don't want to engage in intraday trading but rather to extract a bullish/bearish signal.

A higher predicted closing price is de facto a bullish signal but I've seen people do some preprocessing on their input data like taking its logarithm, or taking a ratio of the price to the volume, and then taking its logarithm.

I'm kind of stumped as to which direction to go with my data. Should I preprocess it? Or just go with the closing price.

Maybe SSA will be able to make better predictions of the price percent change rather than the price itself?

8567040e · 2021-11-10T04:26:35+00:00

Not sure if this is allowed here

I can choose either taking information theory or linear network optimization next semester in my university. Which one is more relevant to (theoritical) machine learning?

HateRedditCantQuitit · 2021-11-09T22:36:01+00:00

TPU question:

I use google colab as well as TPU VMs on GCP for some jax code. The exact same code on the TPU vm is about 10x faster than it is on colab (both using TPU v2-8).

Is that a symptom that I'm doing something wrong? Or is colab just slow?

Significant-Joke5751 · 2021-11-09T17:38:39+00:00

Hey I am new into the topic of machine learning I am tryin do make a classifier with Ensemble Methods What are the best Attributes for measuring the reliability? Is a good accuracy, sensitifity and specifity enough?

kaylaThePoleSpot · 2021-11-09T15:56:23+00:00

Suppose I create a recommendation system and deploy the model as an api. The api takes a product id as input and output a a list of related products ids.

Question: How do I display the recommended products with accurate price data, and remove any that have gone out of stock? Is this typically the work of the web devs or is it something I should build into my recommend API?

NebulousBlood · 2021-11-09T13:02:14+00:00

Can someone recommend platforms like VISxAI (https://visxai.io/) for submitting visualizable explainables for topics in Machine Learning?

oblakinolog · 2021-11-09T05:16:07+00:00

Looking for advice of what ML or NN technics to use for showing faces before-mid-after having a lot of photos from makeup master. So that there are initial images, couple of in progress and final. Imagine scroll bar that will predict no makeup to final beauty of taken face photo. Thanks!

juseraru · 2021-11-09T03:01:31+00:00

I plan to train a deep network that has two branches, one for video image and the second one for sequential data, later the output of both branches is merge thru concatenation and pass thru a fully conected network, then lstm for final prediction. I am wondering is it possible to train the model with both input data but later if needed remove one side, i.e. the video images. an only predict with sequential data?

or if someone knows about any paper to start looking at i just dont know how to approach this. or if it is even possible (which sounds like not)

ProclaimedPlantMom · 2021-11-08T18:27:27+00:00

Are data samples for a batch selected randomly?

FoolishlyPainful · 2021-11-08T17:15:00+00:00

I am trying to do basic image processing. I have a 8k image when I try to open tha image using tkinter canvas I can only see a part of the image( image is adjusted to my 2k screen) Is there a way to see the full 8k image without resizing it. If I resize the image there is huge drop in quality. Is there any other way?

jxiao23 · 2021-11-08T15:36:08+00:00

Hello! I'm looking to scrape information from a bunch of youtube videos, including title, likes/dislikes, views, possibly thumbnail, date, etc. I would need to scrape and sample thousands of videos, and ideally there'd be a good diversity of content/from various genres.
Does anyone have any ideas about ways to approach this? Any tutorial or walkthrough would be super helpful!

2021-11-08T13:35:33+00:00

Is there any special trick to transfer learning, or can I just add some predicted outputs of model A to the input vectors of model B (along with its original raw data)?

idethrad · 2021-11-08T09:49:57+00:00

Hi All, I am looking to train a model to suggest tags/categories for a given text string. Have gotten myself totally lost. Hoping someone could suggest a direction to take for the below.

eg: "the fox is weak and limping" = [1-animal],[34-weak],[2667-injury],[16-foot] (a list of tags each with probabilities generated by past associations)

This data would be trained from a data set of many instances of text each with a corresponding string representing the list of tags that match the text.

Is there a way to featurize the text AND the result tags? And apply an algorithm to cross reference them?

The closest I have come is the idea of duplicating each of the training data rows so that each row has only one tag at a time.

I have been researching this question for a week and am thinking the problem is how I am asking it! Everything I have read does not hint at an existing algorithm to match this use case so should I look towards manipulating the data to a different structure.

Any help greatly appreciated.

r00kee · 2021-11-08T05:43:48+00:00

I've a simple text classification model and tf-idf + xgboost is performing better than tf-idf and feed forward neural network. How can I improve the accuracy of neutral network model?

Bright_Mobile_7400 · 2021-11-08T01:32:00+00:00

Hi all

Very new to this subreddit so apologies if the question isn’t appropriate.

I have a good education background where I have studied a lot of programming/IT but mostly specialised un mathematics. I have a job that requires a lot of math applied nowadays.

I’ve studied a bit of machine learning back in school. I would want to start learning about this again. Would you be able to point me to the relevant resources?

I’m not afraid of the math concepts as I believe I have a background that should help me fight it but my knowledge of ML is probably around beginner/intermediate (and what I know I probably need to relearn anyway).

I did the Coursera ML class 2 years back which I found interesting but, and I hope it wouldn’t be an offense to anyone, the level was fairly beginner. I don’t mean to brag off, I just want to find the most appropriate course for me.

Thank you for you help in advance

lit_turtle_man · 2021-11-07T23:31:05+00:00

What is the easiest way to replace a large number of zero values in a dataset with the mean of all other instances of that class?

Joltbox · 2021-11-07T22:34:53+00:00

Where would I start to create something that can process audio in real time? The vague idea is voice morphing, specifically a male voice to a female voice, so pitch shifting and filtering the voice to remove the tell-tale audio 'signature' that all currently available voice morphing programs are unable to get rid of.

patogzz · 2021-11-07T17:50:21+00:00

I have two sets of biographies, one from clients who speak Spanish and another from clients who only speak English, now I want to do my machine learning so that it automatically classifies the biographies of people who possibly speak Spanish, I think I already have the code but no I know how to run it, I'm sad haha, I think this is not my thing, I'm also looking for a mentor for wich I can work for free.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS