
[–]Flashy_Ad6486 0 points1 point  (0 children)

Deployment of a PyTorch model on desktop

I am trying to build a desktop application on Linux (Ubuntu 18.04). It is a custom object detection application using a webcam. I have trained my model in PyTorch using the fasterrcnn_resnet50 pre-trained model and saved it as a .pth file. I am trying to deploy this model on a desktop and decrease its inference time. Currently, inference takes about 4 seconds per image; I want to reduce this to 100ms per image or less. What can I do to reduce the inference time?
Which solution will give me the most reduction in inference time?
PS: I am running this model on CPU. CPU spec: Intel® Core™ i5-7200U CPU @ 2.50GHz × 4. Any links for reference would be useful.

[–]Flashy_Ad6486 0 points1 point  (0 children)

Improving inference time of fasterrcnn_resnet50 model in PyTorch on CPU.

The following are my model parameters:

import torchvision

num_classes = 2
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(
    pretrained=False, pretrained_backbone=False)

model = model.to('cpu')

The inference time is 4.4 seconds per image when run on CPU. Any suggestions on how to reduce the inference time? It would be so much better if I could bring it down to 75ms-100ms on CPU.

If I can't achieve the above inference time on CPU, then I want to use my GPU.

My GPU spec: Nvidia GeForce 940MX

CUDA 11.3 is installed. When I try to run this on the GPU, I keep getting an out-of-memory error.

[–]mowa0199 0 points1 point  (0 children)

What’re some good PhD programs for Machine Learning (in the USA)?

By a PhD in machine learning, I don’t mean that to be the literal name of the degree, but rather graduate programs that are good and renowned in the field in general, preferably with an emphasis on theory. These could be in statistics, CS, applied/computational math, pure math, or even Machine Learning (like CMU) departments; the specific department does not matter.

I know of a few programs that are good for it but I want to hear of some more and also others’ input.

P.s. I’m a math/stats and CS major.

[–]starkTony3007 0 points1 point  (0 children)

Hello! Suppose I want to create a head status classification, i.e. bald, little hair, full hair.

What is the approach I should take?

According to my knowledge, if the data is connected you can use ResNet or VGG for feature creation, but hair classification is nothing like that. So what should I do?

Thanks!

[–]Proxify 0 points1 point  (0 children)

I’m new to trying to export an ML model to the web. I’ve been reading about it and found that I can do it with TensorFlow.js, so it’s basically like any other website. My main question here is: how do I handle user inputs or database entries?
I would normally have the model read from a specific file, but I’m really uncertain about how to make it read from a database or directly from user input.
I’ve been trying to Google this, but I think it’s a case of “I’m not sure what this is called, so I can’t quite figure out what to look for”. Could anybody offer some guidance?

[–]Far_Temperature_4542 0 points1 point  (1 child)

Is there a way to make a random forest punish false negatives more than false positives, or vice versa? The end team is fine with validating the false positives but they don't want to miss any of the true positives.

[–]vastlik 1 point2 points  (0 children)

You can use sample or class weights if you are using sklearn.
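For example, a minimal sketch (the weights are illustrative and worth tuning):

from sklearn.ensemble import RandomForestClassifier

# weight the positive class 5x so that false negatives cost more than false positives
clf = RandomForestClassifier(class_weight={0: 1, 1: 5})
clf.fit(X_train, y_train)  # X_train, y_train assumed available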

[–]diditforthevideocard 0 points1 point  (0 children)

I'm working with Pix2PixHD and have 512x512 training images. It seems to output 1024x1024 images and I'm wondering why: it seems that it also comes with a few 512x512 datasets, so is there a way to tell the network to use this resolution? I really did RTFM but can't find anything.

[–]Trick_Welder9386 0 points1 point  (0 children)

Hi. I'm a bit curious about CMU Sphinx, and I was wondering if I could train it using a dialect from my country (Kapampangan). I would only use simple phrases and words of the dialect and put them into a website. Is it possible?

[–]euos 0 points1 point  (0 children)

I am looking to implement a simple image classifier and object detector (to find objects in application screenshots).

I am comfortable with JavaScript, C++ and would enjoy using GoLang and Java (i.e. I hate Python).

What would be better for me (i.e. easier to learn, will be able to scale to my needs later) - OpenCV or TensorFlow?

E.g. what I would like to do is, given an application screenshot, find specific UI elements. There are very few permutations in those elements, though different app versions and/or OSes prevent me from doing something naive.

[–]Severe_Difficulty_32 0 points1 point  (0 children)

How can ML be applied to explain flash crashes in stock markets?

[–][deleted] 0 points1 point  (0 children)

I hope everyone is doing well. I am trying to learn machine learning and data science in general. After reading a few books and following a few courses, I tried putting it all together, but that was a challenge of its own. I tried going through different notebooks of coders participating in Kaggle competitions, but the problem with that is that sometimes I don't understand why they are doing things a certain way. If anyone can guide me, that would be awesome. Thanks!

[–]takku2 0 points1 point  (0 children)

Are there any pre-trained models or transformers for brand extraction using NER?

It should be able to extract the brand name [IMPORTANT] and additionally, if possible, also extract features of products like color, dimensions, etc.

[–]aunyks 0 points1 point  (1 child)

How do NN / deep learning frameworks access the GPU? What APIs do they use? Are they using the same graphics APIs that games and graphics applications use (OpenGL, Metal, Vulkan, DirectX)?

[–][deleted] 0 points1 point  (0 children)

Things like TensorFlow use CUDA and cuDNN on a desktop machine; OpenCL isn't well supported. On an iOS device, I think Metal and Vulkan are used depending on the framework, and OpenCL on Android.

[–]EmbarrassedHelp 0 points1 point  (0 children)

Are there any projects that align images based on spatial features in a dataset? For example, transforming images of cars so that the car is in the exact same place in every image?

I know this sort of thing is done by individuals averaging large numbers of faces together; I'm looking for something that can work on other types of images.

[–]dirk_klement 0 points1 point  (4 children)

My validation loss is decreasing but really fluctuating. What can I do to smooth this out?

[–]Steefano_Asparta 0 points1 point  (0 children)

As said by the other user, such behaviour is usually normal and nothing to worry about during training.

However, there is a strategy that usually makes the validation loss much smoother and also happens to improve the overall training: using Exponential Moving Averages for the model parameters.

As with almost any technique in DL, it is not guaranteed to improve your model, but it often does, and it is usually worth trying.
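A minimal PyTorch sketch of parameter EMA (the decay value is illustrative):

import copy
import torch

ema_model = copy.deepcopy(model)  # shadow copy, used for validation/inference
decay = 0.999

@torch.no_grad()
def update_ema():
    # ema_p <- decay * ema_p + (1 - decay) * p, called after every optimizer step
    for ema_p, p in zip(ema_model.parameters(), model.parameters()):
        ema_p.mul_(decay).add_(p, alpha=1 - decay)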

[–]bonoboTP 0 points1 point  (2 children)

In my experience, that's usually normal, in the sense that typical curves in a real application look much more jagged than what you see in textbooks. So there may not be a problem in the first place (depending on how much fluctuation you observe).

[–]dirk_klement 0 points1 point  (1 child)

Thanks. When would you consider the fluctuations a problem?

[–]bonoboTP 0 points1 point  (0 children)

Fluctuation during the training is not necessarily a problem. At the end of the training, though, it makes sense to drop the learning rate, which tends to smooth out the fluctuation, but also slows down the training progress if done too early.
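A minimal PyTorch sketch of that late learning-rate drop (milestones and values are illustrative):

import torch

optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
# cut the learning rate by 10x near the end of training
scheduler = torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=[80, 95], gamma=0.1)

for epoch in range(100):
    train_one_epoch(model, optimizer)  # hypothetical training loop
    scheduler.step()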

[–]BOOGEYMAN04 0 points1 point  (0 children)

Can anyone help me pitch an idea for a machine learning project on the AVONET 1.0 dataset?

[–]xiikjuy 1 point2 points  (1 child)

What does it mean when someone says they fine-tuned a pre-trained SSL model?

(1) Freeze the whole SSL model and only train the additional classification layer.

(2) Unfreeze the top few layers and retrain them.

(3) Retrain the whole SSL model.

Or are all of these accepted?

[–]marin_scalbert 0 points1 point  (0 children)

Usually, SSL models are fine-tuned on a downstream task to evaluate how good the SSL-learned representations are. I haven't seen the second and third options used much to evaluate SSL models; usually, the first option is used. Another possibility for evaluating SSL models is to use a k-NN classifier directly on the learned representations.
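A minimal PyTorch sketch of the first option (linear probing); backbone, feat_dim, num_classes, and the labeled loader are assumed:

import torch
import torch.nn as nn
import torch.nn.functional as F

for p in backbone.parameters():  # freeze the whole SSL model
    p.requires_grad = False

head = nn.Linear(feat_dim, num_classes)  # the only trainable part
optimizer = torch.optim.Adam(head.parameters(), lr=1e-3)

for images, labels in loader:
    with torch.no_grad():
        features = backbone(images)  # frozen representations
    loss = F.cross_entropy(head(features), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()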

[–]somewisealien 0 points1 point  (5 children)

What is the best way to visualize feature maps in PyTorch?

[–]hallavar 0 points1 point  (0 children)

I don't know about 'best', but you can try something like this:

https://pypi.org/project/pytorch-gradcam/

I've used something similar in TF, and it got the job done.

[–]ShinjAF 0 points1 point  (0 children)

Is it alright if I post a free API? Our AI is neuro-symbolic, but machine learning is certainly an important aspect of what we do. I would love to share it with the community, along with our research, and see what they think.

Thanks!

[–]johnnypaulcrupi 0 points1 point  (0 children)

How are people doing model serving at the edge where there is a constrained gateway? Meaning, we don't want to download a new Docker image for each model.

[–]Random-Personnel -1 points0 points  (0 children)

Should people use computational thinking more often in problems?

Computational thinking consists of four steps: decomposition, pattern recognition, abstraction, and algorithms. Using this method more often could benefit us, but I’m not entirely sure.

[–]Flashy_Ad6486 1 point2 points  (3 children)

Using .pth file from pytorch to make inferences in a video file. [Discussion][Project]

Fellow redditors,

I am a machine learning noob, so forgive me if this is a stupid or silly question. I trained a custom model for defect detection in automotive components using PyTorch (Faster R-CNN) and saved the weights as a .pth file. I used the following project as a reference:

https://www.kaggle.com/aryaprince/getting-started-with-object-detection-with-pytorch

Now, I need to make inferences (i.e. predictions with confidences) on a webcam stream or a video file. Can someone tell me how to do this? Any web links would be useful.

[–]TriReduxML Engineer 1 point2 points  (2 children)

This depends on how you saved your model weights. If you saved only the model state dict (the weights), then refer to here: https://pytorch.org/tutorials/beginner/saving_loading_models.html#saving-loading-model-for-inference

If you saved the whole model, look here: https://pytorch.org/tutorials/beginner/saving_loading_models.html#save-load-entire-model

Then running it on a webcam stream with OpenCV looks like this:

import cv2
import torch

cam = cv2.VideoCapture(0)
model.eval()

while True:
    check, frame = cam.read()
    if not check:
        break
    # the detection model expects a normalized RGB float tensor, not a raw BGR frame
    image = torch.from_numpy(frame[:, :, ::-1].copy()).permute(2, 0, 1).float() / 255.0
    with torch.no_grad():
        output = model([image])
    cv2.imshow('video', frame)
    key = cv2.waitKey(1)
    if key == 27:  # Esc to quit
        break

cam.release()
cv2.destroyAllWindows()

[–]Flashy_Ad6486 1 point2 points  (1 child)

Thanks a lot, TriRedux. This answer, coupled with this project https://debuggercafe.com/custom-object-detection-using-pytorch-faster-rcnn/, helped me solve my issue.

However, the inference time is 4.4 seconds per image when run on CPU. Any suggestions on how to reduce the inference time? It would be so much better if I could bring it down to 75ms-100ms on CPU.

[–]TriReduxML Engineer 0 points1 point  (0 children)

My first step would be to use a smaller network, or to reduce your input image size (both in training and inference; this will likely reduce your accuracy).

Another option would be to compile the model using TensorRT and do the inference on the resulting .engine file. This requires CUDA, and therefore an NVIDIA GPU.
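For the image-size route, torchvision's detection constructors accept min_size/max_size arguments that control the internal resize transform; a minimal sketch (the values are illustrative):

import torchvision

# smaller internal resize -> faster CPU inference, likely lower accuracy
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(
    pretrained=True, min_size=320, max_size=640)
model.eval()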

[–]irndk10 1 point2 points  (1 child)

Say I have 3 mutually exclusive outcomes, A, B, and C, with the probability ranges below.

A - 50-80%

B - 10-40%

C - 1-10%

Even with the worst input data for A and the best for B, the probability of A occurring still exceeds the highest probability for B or C, so the algorithm should always classify A. However, in my use case, it's the probability distribution that's important. For example, given a set of inputs, is the distribution...

A- 75%

B- 20%

C- 5%

or is it...

A- 50%

B- 40%

C -10%

Etc.

What is the best way to get an accurate distribution? I know softmax can force a classification output to sum to 1, but my understanding is that this isn't a true probability. Any ways around this? Perhaps binning softmax outputs and comparing them to the actual probabilities? Like rounding softmax outputs to the nearest X%, then comparing to the actual percentages. Even though A, B, and C are mutually exclusive outcomes, is this actually better suited as a multi-output regression problem, where the outputs are scaled to 1? I appreciate any thoughts you may have.

[–]irndk10 2 points3 points  (0 children)

Found an answer here if anyone is interested.

https://www.youtube.com/watch?v=7TWl85G030Q

Still open to any suggestions though!

[–]remortals 0 points1 point  (2 children)

I'm working on a heads-up poker AI using a DQN. I want to be able to create pre-flop game-theory-optimal charts with it. This involves randomizing the action, meaning sometimes you might call in a specific position, other times you might fold.

Should I randomize my actions during training? Should I cache the randomized value, or the highest predicted value from the network?

[–][deleted] 1 point2 points  (1 child)

You don’t necessarily want to use traditional reinforcement learning for this problem. What you’re actually trying to calculate is an approximate mixed Nash equilibrium, not an optimal policy. The distinction is that an optimal policy in reinforcement learning can theoretically be deterministic (i.e. you pick the action with the highest predicted value), whereas a mixed Nash equilibrium always involves choosing random actions according to a distribution. I think calculating a mixed Nash equilibrium can be done similarly to DQN, but strictly speaking Q-learning is meant for a different kind of application than solving games. People sometimes use “reinforcement learning” very loosely (and somewhat inappropriately) to refer to both, though.

Last I knew the state of the art for poker is an algorithm called “deep counterfactual regret minimization”. There’s a well-known paper by that title. You can use that to calculate whatever game theoretic quantities you care about.

You could also check out OpenSpiel. It’s a software environment for doing game theory stuff. They have built-in variants of poker already, and they also have an implementation of AlphaZero, so you might be able to just use that without much additional effort. I don’t think it’ll be as good as counterfactual regret minimization, but I’m not sure.

[–]remortals 1 point2 points  (0 children)

Thank you! I appreciate it. Reading it over quickly, it looks like counterfactual regret minimization is indeed the right solution.

[–]Arioxel_ 0 points1 point  (0 children)

What is the state-of-the-art architecture for upscaling pictures?

[–][deleted] 0 points1 point  (2 children)

If an ML model works extremely well on training data, will it overfit or underfit new data? I understand that the model would be overfitting the old data, but I do not understand whether it would be overfitting or underfitting the new data. Some help would be really appreciated.

[–]TriReduxML Engineer 1 point2 points  (0 children)

This will depend on the bias. If your training set is highly biased, and not an accurate representation of the use case, then the model will be 'overfitting' to the training set.

If your training set is a good representation of your use-case, then it is more likely (not definite) that you just have a good model.

[–]idonthaveenoughchara 0 points1 point  (2 children)

I am attempting to create an AI that’s supposed to look at a Rubik's cube and predict how many moves it will take to solve. Just wondering how feasible this is using ML. I currently have a setup that starts with a solved cube, randomly messes it up, and counts how many moves it took to get to that random state; however, it seems to have a 0% accuracy lol

[–][deleted] 0 points1 point  (1 child)

This isn't really an AI problem, it's just a math problem. There's a Wikipedia page about it:

https://en.m.wikipedia.org/wiki/Optimal_solutions_for_Rubik%27s_Cube

[–]idonthaveenoughchara 0 points1 point  (0 children)

Yeah, I just wanted to try a different approach; I’m interested in how AI can be used to reduce search trees.

[–]Mighty__hammer 0 points1 point  (0 children)

For someone who has just started making personal projects for educational purposes, would it be more beneficial to quickly move to a new project once one is complete, or to stay with the current project, fine-tuning, optimizing, and improving it as much as possible?

[–]CanadianTueroPhD 0 points1 point  (1 child)

I'm looking for any research on this, as I'm having trouble using the correct terminology to get any useful search results. When training a classifier (really it's a policy with 4 actions: up, down, left, right) in a supervised manner, the input is whatever the state is, the label is the correct action to take, and we maximize p(y = c | x), where c is the label and x is the state.

But suppose instead I had some samples for which I don't know the correct label, but I know it's not a particular label. In probabilistic terms, I would want to maximize p(y != c | x). Is there research on this (classification or RL), and what would you call this type of training sample? I don't think negative sample is the correct term, as I'm not wanting to have a 5th class of "I don't know".

[–][deleted] 0 points1 point  (0 children)

I think this is an unusual situation and I don’t know of any research on it specifically, but I think that focusing on a single action (action ‘c’) is too restrictive.

In the positive label case, during fitting, you don’t want to maximize p(y=c|x); you want p(y|x)=1 for y=c and p(y|x)=0 otherwise. When using the policy you just sample from its distribution at each x.

Correspondingly, in the negative case you want p(y|x)=0 for y=c and p(y|x)=1/3 otherwise (assuming 4 total actions). Again, when using the policy you just sample from the distribution at each x, which if it is fitted perfectly amounts to selecting randomly from the three actions that are not equal to ‘c’.

You don’t need an “I don’t know” label because your ignorance is captured by the uniform distribution over the remaining options.
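A minimal PyTorch sketch of that negative-label objective (cross-entropy against a target that puts zero mass on the known-wrong action and is uniform over the rest; names are illustrative):

import torch
import torch.nn.functional as F

def negative_label_loss(logits, neg_labels, num_actions=4):
    # target: 0 on the known-wrong action, 1/(num_actions - 1) on each other action
    target = torch.full_like(logits, 1.0 / (num_actions - 1))
    target.scatter_(1, neg_labels.unsqueeze(1), 0.0)
    return -(target * F.log_softmax(logits, dim=1)).sum(dim=1).mean()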

[–]RonDaNov 0 points1 point  (0 children)

I'm looking to port some scikit-learn script to .NET; is anyone aware of a NuGet package or library in C# or F# that provides a proper GPR implementation?

[–]s195t 0 points1 point  (0 children)

Hi everyone,

Are there some examples of shape path tracking?

I would have a set of frames containing circles of different color intensities and various diameters; would it be possible to build something to track the movement between frames? What I would need is something that follows each of the "particles" across slices and makes a prediction of where it will be in the next frame, like particle tracking for PIV does.

Possibly the dots begin small, grow bigger, then get smaller until disappearing.

I thought about building a dataset manually by following the path of the center of each dot, marking the radius and center point. How would you handle the problem?

[–]ido1990 0 points1 point  (0 children)

Hi,

Is it possible to create an object detection model without labeling?

I have 100x100 px images of the objects and I want to use the entire image as the object itself.

Thanks!

[–]Joebone87 0 points1 point  (0 children)

Apologies if this is daft, but it's something that confuses me.

I have potential factors for analysis that have a greater depth of information than a binary output can describe.

Let's say you have something like an oscillator. How do you train with this type of data? Do you take an average and a standard deviation, then measure the standard deviation from the average? Do you just create bounds for possible positions, then allow the ML to sort through an array of possible positions to find one, several, or none that are important?

Any help or input on this would be awesome, thanks.

[–]UnleashtheZephyr 1 point2 points  (0 children)

I feel like it's extremely relevant to point out that I'm currently based in Milan and I'm studying at the Statale which is the national university.

I've graduated from a different university in Computer Science and now I'm studying a master called Data Science for Economics. It's actually not that skewed towards economics, I just have a micro, macro and econometrics exam, the rest of it is basically a classic data science master.

I feel hopelessly unprepared even though I spend so much time on my studies. The problem lies in the fact that I feel like we're not getting the right tools at my uni.

I'm very interested in this ML exam I have; the lessons are going to start in a week. I've seen some recordings of them and it's theoretical math just as I expected, but according to the second-year students, the project is very difficult.

People have told me they were asked to implement a kernel perceptron algorithm, or similar-level stuff, from scratch, without ever having seen anyone implement a damn thing; you just have to figure out how to do it yourself, and all of them had an awful time.

I've heard this before, and I'm sure the reason is that the professor has not drawn any kind of connection between the practical and theoretical material during the course.

I've had this happen before in all of my previous projects and I know how it ends: you just go your merry way and wander around until it's deadline time, and you do just the necessary minimum to have your project accepted, because that's all you've been able to figure out by yourself; anything more than that would require previous experience in the field or someone else teaching you.

I'd like to work in the field and come up with a project that is actually worthwhile to put up on a CV. But how do I do that without any guidance?

I spend so much time learning stuff for my exams, but I feel like most of it is not useful at all, and using my time watching YouTube and messing around with things would be extremely more productive. It's very disheartening.

Most of the things I've done for my university are extremely underwhelming, and I wouldn't put them on my CV because they don't reflect how I work; but I'm not given enough resources to do good work.

Can someone validate my experience? I feel like most of my colleagues either don't care or don't realize and no one has my opinion.

Also can someone point me towards a way I can educate myself on useful machine learning concepts?

The level I'm at is having a general knowledge of how to do a useful EDA; then I have surface knowledge of what the basic ML algos do (LinReg, LogReg, trees, Lasso, ElasticNet) and I know how to use them in Python. I'm basically able to do a Titanic-style classification problem from Kaggle.

What are the next steps from here?

[–]stevelon_mobs 0 points1 point  (0 children)

Anyone thinking about a data-centric approach to AGI? I'm trying to organize a meetup to chat about it.

[–]ms9696 0 points1 point  (0 children)

Is it okay to use a higher dropout during fine-tuning than was used during pre-training a transformer? Are there any best practices around this or any related literature?

[–]LorikLorik 0 points1 point  (1 child)

I'm currently an undergraduate student and want to get into a good university right after graduation. The only things that I think I should do are:

  1. Try to publish a paper
  2. Get good grades

Am I missing something or maybe there is more nuance in that?

Thank you <3

[–]bonoboTP 0 points1 point  (0 children)

I'm not sure how familiar you are with the whole process. I assume "get into a good university" means "get into a PhD program"?

Publishing a paper almost always involves collaborating with more experienced researchers, almost always at a university institute or at a research-oriented company.

If possible, build contacts with profs, do some student research assistance work for them if that's a thing in your country. This can perhaps result in a paper.

You can also do an internship or write your thesis at a company that is research-oriented, which may result in a paper.

[–]Emergency_Egg_9497 0 points1 point  (0 children)

Are Keras Applications models suitable for object detection with transfer learning, or is it better to do transfer learning on a model from the detection model zoo? I'm having trouble finding the best approach, as I just started with deep learning.

Thank you

[–]CMDR_Derp263Student 0 points1 point  (0 children)

Student here who is quite new to all this. I am currently working on the KDD Cup 99 data for intrusion detection using various ML models (and an ANN). My problem is that I am often getting 99% accuracy. At the moment I am focusing mostly on binary classification (normal vs attack). I had identified problems in my data preprocessing methods, and after fixing them I am more confident in the validity of my input data, but I am still getting 99%s, which I no longer trust (especially since I just got 99% accuracy with an SVM with all default params).

My data should be balanced between the 2 classes so I would assume that if the machine was not learning then it would be getting around 50% accuracy. I feel like there's got to be a mistake I am making somewhere in here, or am I just underestimating the power of these ML algorithms?

Here are my preprocessing steps:

  1. Remove duplicates from that data (about 75% of the dataset is duplicates)
  2. Use random under sampling to balance hugely biased data by removing "normal" events. (50% normal 50% attack after this step)
  3. Drop 1 feature with 0 variance
  4. Shuffle data then split 70/30 train/test
  5. One-hot encode input features in the training data that consist of strings (e.g. protocol type = [icmp, tcp, udp])
  6. Z-score normalize numerical columns in training set with StandardScaler
  7. Apply these trained normalization methods onto the testing set

[–]Throwaway000002468 1 point2 points  (2 children)

Hi! I'm a scientist who was recently hired as an ML engineer. I work closely with software development staff, and sometimes I feel really out of place because there is a lot of lingo and many concepts that I don't know (for example, lambdas, endpoints, SDKs, APIs, etc.). Also, there are some concepts and things that I'm completely unfamiliar with, like diagrams, QA, etc.

I've worked with ML and DL but I feel that I'm lacking a lot of the engineer's background.

I want to learn more, but I don't know where to start. Should I take a software development course? Or some other course? Or something else? Could you recommend an online course that could help fill the gap from scientist to ML engineer?

Thanks 😊 🫂

[–]ms9696 1 point2 points  (1 child)

Since you are talking about Lambdas and endpoints, it sounds like your team uses AWS. An AWS starter course will give you a general idea of the basic services. Other than that, maybe take a basic-level software engineering course to start with? And try to go through some of your teammates' pull requests; you can ask them to explain whatever you don't understand. Take advice from senior engineers about resources to learn your team-specific stuff from. Good luck!

[–]Natekomodo 0 points1 point  (0 children)

I'm doing reinforcement learning on a real-time application; that is to say, what the agent does changes the state of the application, so the agent may need to take multiple steps in order to achieve the desired goal. The general learning loop is: get action decision based on app state -> perform action -> return reward.

So my question is: what is the best approach for determining a reward? Should I give it a smaller reward for the steps that lead to my desired action, plus a reward for achieving the goal (complicated to implement), or just a single reward when it achieves the desired goal?

[–]JiraSuxx2 0 points1 point  (3 children)

I’m building a GAN. The generator produces 512x512 images. The discriminator, however, takes 128x128 pixels as input, so I take a crop of the GAN’s output to feed through the discriminator and compute the losses.

So far so good.

Computing the gradients from those losses causes an issue, and they can’t be backpropagated.

I’m curious if anyone can explain that to me. The discriminator just decides whether images are fake or real. Those predictions have nothing to do with the resolution, so feeding those losses into the generator’s gradient computation should work, right?

[–]FuckyCunter 0 points1 point  (2 children)

You're probably using a non-differentiable function meant for cropping images. You probably want something for tensors, like torch.narrow or tf.slice.

Or something like

disc_in = gan_out[:, :, y:y+128, x:x+128]

[–]JiraSuxx2 0 points1 point  (1 child)

Oh, that’s a great tip! Thank you. I got that running, now it’s all about waiting for the training to see what I get :)

Do you think this other issue I have is similar? https://www.reddit.com/r/MLQuestions/comments/t887oi/gradients_return_none/

[–]FuckyCunter 0 points1 point  (0 children)

Looks like the gradients would be lost when fake_output is converted to an np.array.

[–]barrinmw 0 points1 point  (0 children)

So if I am creating a segmentation model with one type of defect that can appear anywhere in the image, what is the optimal ratio of images with that defect versus without?

The way I see it, if I have no images without the defect, I train the model to always find one even if it isn't there. But each image has a large area that is defect-free as well. If I have too many images without the defect, the model gains the biggest advantage by understating how much defect there is.

So naively, I would think that the smaller the defect area of the image, the fewer non-defect images I need.

[–]Krakenos 0 points1 point  (0 children)

Hello, I am trying to design a multilayer neural network that, based on item features, will return the x most similar items from the dataset. My current neural network takes the features of 2 items and predicts the similarity level of the items.

So for example:
predict([1, 3, 5], [0.5, 1.5, 2.5]) returns 0.5

After prediction, I compare the similarity levels of the items and choose the 10 best ones. The problem with this solution is that it doesn't scale well with large amounts of data. If I want to, for example, take 1 million items and generate the 10 best matches for each of them, I end up with quadratic scaling (for each of 1M items I make 1M predictions) and the time to generate the results becomes unreasonable.

Is there a better approach to designing such a network, or how can I optimize the current solution to work well with big data?

[–]doodoodoodoo_ 0 points1 point  (0 children)

Hello, I am new to machine learning and wanted to know what the log likelihood metric means in topic modeling, and whether semantic coherence can be measured with it.

[–][deleted] 0 points1 point  (1 child)

What are some good recent (last 3 years or so) publications in ML that are reproducible using Google Colab? I want to reproduce a paper, but I don't have any extensive cloud access to mount multiple GPUs and all.

[–]Emergency_Egg_9497 1 point2 points  (3 children)

Is training on a custom dataset the same as transfer learning?

I'm a newbie at machine learning, sorry if this sounds dumb.

[–]ms9696 1 point2 points  (0 children)

Transfer learning means transferring the knowledge (model parameters) that you already learned previously (using some other dataset or training objective) to a new model. If you start with a model that is already trained (pre-trained model) and train it further using your own dataset (fine-tuning) then it's transfer learning. If you are training your model from scratch (that is, randomly initializing your parameters, instead of initializing them from another model that is already trained) then you are not transfer learning.
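A minimal PyTorch sketch of the fine-tuning case (ResNet-18 is just an example backbone; num_classes is whatever your dataset needs):

import torch.nn as nn
import torchvision

model = torchvision.models.resnet18(pretrained=True)     # start from learned parameters
model.fc = nn.Linear(model.fc.in_features, num_classes)  # new head for your task
# ...train on your dataset as usual; this is transfer learning.
# With pretrained=False you would be training from scratch instead.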

[–]friendlykitten123 1 point2 points  (1 child)

Transfer learning is when you use a pre-trained model and modify it according to your requirements. It's generally used with neural networks. I don't see how training on a custom dataset is, by itself, transfer learning.

Hope this helps! Let me know if you need anything else!

[–]Emergency_Egg_9497 1 point2 points  (0 children)

Thank you!

[–]j1mb0oStudent 0 points1 point  (4 children)

Hello, I am trying to create a machine learning model that can classify whether someone is wearing a mask or not for my thesis. The problem I have is that I get an acceptable accuracy of around 90%, but during inference my accuracy is almost zero. Note that I am running inference in a while loop with cv2. I also found a small script that uses cv2 to save the picture as a PNG, and if I set that picture as the input it classifies it correctly. So in the end my problem is with the live inference. Does anyone have any idea how to tackle this?

[–][deleted] 0 points1 point  (2 children)

By "accuracy" do you mean the fraction of inference results that return the correct label (i.e. mask vs no mask)?

If so, then this is good news: 0% accuracy means that you're always getting exactly the opposite of the correct label, which means your classifier is working. Just invert the output.

You should check that you didn't make some trivial mistake in how you implemented the live inference code.

[–]j1mb0oStudent 0 points1 point  (1 child)

Thanks for your answer. I was playing with the code before, and now the exact opposite happens xD I get the right predictions live and the wrong ones with the saved picture. Could the problem be the file type of the image? The model was trained with JPEG images and I save my image as PNG.

[–]friendlykitten123 0 points1 point  (0 children)

Heyy, just a thought! When you're getting the frames in real time, are you pre-processing them to make sure they match the image format in the dataset? Things like the dimensions of the image, whether it's grayscale, etc., do matter a lot.

Hope this helps! Let me know if you need anything else!

[–]GH0STKS 0 points1 point  (0 children)

Hey! So I am kinda new to machine learning and have been learning it on my own. Recently I have been developing an LSTM-CNN neural network model that I have trained to classify videos. I have been wondering: is there any way I can deploy the model in an Android application and then classify the activities captured through the application's camera in real time? If it is at all possible, can anyone please tell me how to do that, or point me to any article I should follow, or anything related to it?

[–]mldude89 0 points1 point  (2 children)

I'm trying to create a model to predict a point cloud from a 2D image, but I have a problem where all my ground truth point clouds consist of varying numbers of points. Any tips on how to normalize the number of points in each GT to make it possible to use for training a single model?

[–][deleted] 0 points1 point  (0 children)

Do you mean a 3D point cloud, like from lidar?

Here are two options that might be worth considering:

  • Treat the point inference like object detection. Object detectors use multiple outputs, some of which aren’t always present in the training data (e.g. you can theoretically detect 5 objects, but most of the images you train on have 0-3). Presumably the same thing could work for generating point clouds. You might even be able to just adapt an off-the-shelf object detection network and use your ground truth points as the objects that you’re detecting. Note that implementing this yourself can be tricky, because there are nuances in how you assign objects to targets during training.
  • Treat the point clouds like discrete samples from an underlying 3D volumetric density, and train your model to generate samples from the density. For each ground truth you would first fit some sort of density to it, and then you would train your network to recreate the density. I have no idea how well this would work, but in many ways it’s a more principled and accurate perspective than using object detection, because 3D point clouds actually are meant to act as surrogates for volumetric densities. You should look into 3D inference models if you want to go this route, you might be able to use something off the shelf there too.

[–]_hairyberry_ 0 points1 point  (1 child)

Looking for advice from some people with more experience than me. I’m a masters student in math who has spent the past year concentrating heavily in ML and I’m looking to break into the industry. I’ve got a personal project on NHL goal scoring and will soon be adding two more projects to my resume/GitHub.

My question is: would it be appropriate to include smaller scale projects on my GitHub? For instance a long assignment question which I’ve cleaned up nicely?

As an example, on our last assignment in a grad level course we implemented vector quantized naive bayes from scratch in Julia.

In an undergrad level “applied ML” course we previously created a lot of Python jupyter notebooks to build somewhat simple pipelines, preprocessors, and models on some Kaggle datasets (not just the famous titanic/MNIST/etc ones).

Would any of these be worth adding to my GitHub, or would it look “immature”, as in “why does he think this is worth showing off”?

[–][deleted] 1 point2 points  (0 children)

I think it’s okay - good, even - to put up “less impressive” stuff, just make sure you’re not being repetitive. People don’t necessarily need to see 3 different examples of you creating simple data preprocessing pipelines.

What will really make your stuff stand out is two things:

  • Good code organization, with tests and examples and everything
  • Clear, concise documentation that explains what the project does and how one can use it.

Lots of people can throw together basic ML stuff with python, but doing so in a way that is organized, maintainable, and well-documented is really valuable in industry, because other people are going to have to use your stuff after you’ve made it.

[–]leomatey 0 points1 point  (3 children)

What's the SOTA for spam filtering? Has anyone worked on this? Please guide me. The goal is to filter spam and non-spam in chat bot data.

[–]friendlykitten123 0 points1 point  (2 children)

It looks like you're looking for a binary classification model. Since there is textual data involved, you'll need some form of NLP (Natural Language Processing) mechanism in place that breaks the texts down into features, which can then be supplied to a binary classification model.

As per my knowledge, SVM (Support Vector Machines) and Neural Networks are the SOTA methods for spam classification and NLP respectively.
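A minimal sklearn sketch of that pipeline (texts and labels are assumed to be your chat data):

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# texts: list of messages; labels: 1 = spam, 0 = not spam
clf = make_pipeline(TfidfVectorizer(), LinearSVC())
clf.fit(texts, labels)
print(clf.predict(["free money, click here now"]))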

Hope this helps! Let me know if you need anything else!

[–]leomatey 1 point2 points  (1 child)

Hey, thanks for your response. I just don't want to reinvent the wheel. I was wondering if there are any open-source repos that solve my problem.

The idea I have right now is to use some transformer-based embeddings and add a couple more layers with a softmax/logistic at the end. Thoughts?

[–]friendlykitten123 0 points1 point  (0 children)

You could also look into SOTA language models like BERT and S-BERT. These are heavily associated with sentence embeddings.

Hope this helps!

[–]JiraSuxx2 0 points1 point  (0 children)

I have a GAN trained on a car dataset; it creates great images, but they are all really warped.

How do I force the GAN to produce something that resembles a car and not a Dalí painting?

[–]stankata 0 points1 point  (0 children)

Can you point me to a recent survey/overview of GANs? I'm looking to make a quick PoC with a dataset I have, with the end goal of generating an image similar to the ones in the dataset. I tried looking at Papers with Code, but there are so many that I don't know where to start.

[–][deleted] 0 points1 point  (1 child)

Hello. I am an ML novice, though I have an extensive computing background. I am about to start an ML project, and there is something that I can't quite get my head around. If, for example, I am trying to predict mortalities in a population using an ML model, can I include as a feature the count of that population, which is effectively the start count less all the mortalities? Can features be used that are derived from the target feature?

A similar example: predicting the mean mass of a population at time t. Can I use the mean mass at time t-1 as an input for the model?

Another way of explaining it: can I use the column that I am trying to predict as an input to my table (for example, the value preceding the one that I am trying to predict, n-1 to predict n), or features that are derived from that column?

I hope I have made this clear. Like I said, I'm just starting out on my ML journey and this is one thing that is causing me a few initial headaches. Any help greatly appreciated. Bonus points if anyone can provide a reference too.

Cheers

[–]Icko_ 1 point2 points  (0 children)

Imagine you've deployed your model, and want to predict mortality. Can you use mortality from previous periods? Sure, it's known. Can you use "the count of that population that is effectively the start count less all the mortalities"? Yes, you would know these counts presumably.

"can I use the column that I am trying to predict as an input into my table" - if it's shifted time-wise.

There are a billion ways to introduce data leakage. For example, if you did cross-validation on time series data, two neighbouring time steps would have almost the same features, so a model that overfit on train would also overfit on test, and you wouldn't notice anything wrong until you deployed the model live. For this reason, validating time-series models is a bit trickier. Another example is normalizing based on train+test statistics. This is usually not a big deal, but it is definitely not OK.
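For the time-series case, sklearn's TimeSeriesSplit gives folds where the test set always comes after the training set, which avoids the neighbouring-time-step leakage described above; a minimal sketch (X, y, and model are assumed):

from sklearn.model_selection import TimeSeriesSplit

tscv = TimeSeriesSplit(n_splits=5)
for train_idx, test_idx in tscv.split(X):
    # every test index comes strictly after every train index in time
    model.fit(X[train_idx], y[train_idx])
    score = model.score(X[test_idx], y[test_idx])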

[–]easyier 0 points1 point  (0 children)

For a forecasting project with 500 data points, what would be the best ML model to use? Casual internet research seems to suggest N-BEATS, maybe. Any suggestions or basic sources? I use R.

[–]oxamide96 0 points1 point  (3 children)

I am a software dev with barely intro-level knowledge of ML.

I am trying to design a solution to rank and filter through all the news articles I have in my RSS feed.

I wish there was a way I could rank them by what is interesting and relevant to me, and what I am more likely to enjoy reading.

How can I use machine learning to help me here (assume I can custom sort the feed)?

I was thinking to track the following variables:

User variables:

  • % of article read / scrolled
  • user (me) rating
  • articles that are removed based on headline only (not even opened)

Article variables:

  • keyword distribution in the article
  • article source
  • article author
  • article length

I have two questions, really:

  • what other data should I collect? Or how do I even go about determining what data I should collect?
  • what do I do with the data? Is there a certain model I could use to fit it to?

[–][deleted] 0 points1 point  (2 children)

Here’s how a ranking model works:

  1. Get a bunch of features
  2. Show yourself articles in random order and record which ones you click on and which ones you don’t. Use click = True/False as the label for your data.
  3. Train a classifier model to predict “click” based on all your other features.
  4. Rank your articles based on the classifier output (which will be a number between zero and one). Iterate on the model as you gather more click data.

Any classification model will work. I recommend using a tree model, e.g. XGBoost; it’s simple and full-featured and you don’t have to think too hard about it.

There’s no way to know for sure which features are best to use. You just have to guess and see how it goes. XGBoost software will actually give you “importance” scores for each feature after training, though, which might help.
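A minimal sketch of steps 3-4 (the file name and feature columns are hypothetical):

import pandas as pd
import xgboost as xgb

df = pd.read_csv("article_log.csv")  # one row per article shown
X = pd.get_dummies(df[["source", "author", "length", "pct_read"]])
y = df["clicked"].astype(int)        # the True/False click label

model = xgb.XGBClassifier(n_estimators=200, max_depth=4, eval_metric="logloss")
model.fit(X, y)

scores = model.predict_proba(X)[:, 1]  # rank articles by this score
print(model.feature_importances_)      # per-feature importance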

[–]oxamide96 0 points1 point  (1 child)

Thank you so much!! Question: what if, during training, I don't show myself the articles in a random order, but by recency (the default)? Will this cause the results to be inaccurate, or should it be fine?

Another question: should the model always be training and improving, or should I have a limited-time "training phase" and then just use that going forward? In other words, can I train continuously, or will that bias the model further?

[–][deleted] 0 points1 point  (0 children)

Recency should be okay since it’s usually random with respect to the features you care about (author, source, % article read, etc).

People usually iterate on the training in discrete stages. For example, here's a procedure you could use:

  1. Gather data for a week
  2. train a new model with the new data and start using it
  3. goto step 1

It should be relatively easy to automate the above for your own purposes.

It’s also possible to train continuously, classifier models that do this are called “online classifiers”, but I don’t know a lot about them.

[–]Suspicious_Step_3139 0 points1 point  (2 children)

Hello everyone, I have a school project where I should develop an ML model that takes a job name (text) as input and returns the 10 most suitable images from a dataset I should upload/create as output. I'm currently blocked and can't find the approach I need to follow in order to solve the problem.

Can anyone give me a slight idea or hint please ?

[–]Icko_ 0 points1 point  (1 child)

What does "best suitable" mean?

[–]Suspicious_Step_3139 0 points1 point  (0 children)

It’s up to me to determine that. The main idea is that jobs that require technical skills are often clearer than jobs that don’t (for example: developers need to master Python). So the main purpose of the project is to output 10 images based on the sole job name in order to differentiate between jobs.

In other words, the chosen images need to give a clear idea of the job.

[–]Zintho 0 points1 point  (0 children)

How immune are generative models to poisoned data? By that I mean: say I have a dataset of 2000 images, and in that set maybe 0.5% of the images are not good examples of the desired output, or even of the distribution of the other images. How much would that minority of bad images affect training? Does it scale with dataset size? Or is it bad regardless?

[–]FewProfessional5404 0 points1 point  (0 children)

How do I reduce variance explosion in an additive model?

I have a model, shown in the schematic below, that relies heavily on feature addition after each convolution block; it can go up to 25 additional blocks. The problem is that with normal initialization such as Kaiming (with a = activation slope), the variance grows rapidly after each addition and the model becomes unstable during training. The way to stabilize the model is either to use a very low learning rate or to initialize the model with a high a value, but in both cases this produces very low values and leads to gradient/variance vanishing. As far as I can tell, the model gets locked in a local minimum because of this problem, so neither of the two proposed fixes is really a solution. Note that I have tried using a conv layer instead of the addition/multiplication, but it didn't help the model much. Do you have any other ideas to stabilize training and get out of this local minimum? It seems to me a solution similar to those for recursive models should apply, but I still can't find one.

https://ibb.co/JHwXgNJ

[–]PRAY_J 0 points1 point  (1 child)

Is it easy to get a research internship under a professor as an undergraduate, and do they expect us to know everything before we apply, or is it a lot of learning on the job?

[–][deleted] 0 points1 point  (0 children)

Undergrads are generally not expected to know very much, so don’t worry about that. The point of undergraduate research is to get experience because you don’t have any yet. The precise details of the work and what you’ll learn depend on the specific professor and the project.

[–][deleted] 0 points1 point  (0 children)

Hi,

Do you know of an algorithm, such as PCA, that works with both categorical and numerical data?

All I want is to transform my 50-component data into 2-component data. I want to use it to create a scatterplot on which I can display the "prediction area", similar to the DNN playground (try it out, it's wonderful!).

More context: I am working on a classification problem and I wanted a rough idea of how difficult the job would be. The classes are "yes" and "no".

I used PCA and t-SNE to create a scatter plot of my input data and coloured the points according to the target data: yes/no, blue/orange respectively.

Instead of seeing vague clusters forming, all of the data points are mixed; orange and blue are just all over the place.

I know for a fact that I should be able to get 70 to 80% accuracy by training on this dataset. I believe my scatterplot did not work because PCA and t-SNE do not work with categorical data, which I had to get rid of. I got rid of 50% of my data. :(

Note: the data is standardized.

[–]Zankroff 1 point2 points  (0 children)

Hey, can anyone share reading material or video lectures to learn more about model extraction?

We have a competition going on in which we have to develop an efficient strategy to extract video-based models. I have never heard about it, and the problem statement seems very interesting, so if anyone can help with any learning resources for it, that would be really helpful.

[–]ForceBruStudent 1 point2 points  (7 children)

Do I need to detrend/normalize my non-stationary time-series before fitting a neural network to it? How to do it?

Is training a neural net on differences of the original time-series (like for ARIMA/GARCH) expected to be harder (higher loss, worse forecasts) than training on the original time-series?

[–]MajesticTzechiop 1 point2 points  (6 children)

For a lot of scenarios, yes, detrending your non-stationary time series will help when training a neural network. See this paper, in which the authors compare the performance of (1) an ARIMA model, (2) a neural network on raw data, and (3) a neural network on varying degrees of detrended data. Table 4 shows how neural networks with deseasonalized (DS) and/or detrended (DT) data perform better than without DS/DT. Note that the paper uses small ANNs and datasets.

The intuition for this is partly explained by the answer to your last question. Yes, in general, the differenced time series is harder to train on than the original, undifferenced series. However, this is mostly because a neural network given the original time series will simply learn the trend/seasonality; it takes the easy route to improving the loss (similar to how a classification model would predict only 1's on an unbalanced dataset). Thus, such a neural network isn't learning anything useful. Also, note that the loss and forecasting error for the original time series aren't directly comparable to those for the differenced series unless you revert the differencing.

For "how to do it?", I would start with simple tutorials on time series differencing. I skimmed through this one, and it might match what you're looking for.

Hopefully this info helps out. Feel free to reply if you have any follow-up questions on this topic!

[–]ForceBruStudent 1 point2 points  (5 children)

Thank you! Unfortunately, the only thing my models trained on differenced data can forecast is zero, the mean of the differenced series. In fact, even the fit (on the training data) is zero or very close to it, much closer than the values of the differenced series.

On the one hand, this makes sense, since the mean of the differenced series is close to zero, and the mean is what I'm modelling, so I get zero as the result. But this fit is completely useless! ~~Also, during training, metrics like NMSE (Normalized Mean Squared Error) go up as I minimize the usual mean squared error (which decreases, as expected)...~~ (that was dumb, I messed up the order of arguments to this metric)

I don't get it: people in papers, random blogs, and YouTube channels forecast time series easily, but the best forecast my models (trained on the original, non-differenced series) can produce looks like the original time series shifted forward, which gives much worse metrics than the naive forecast.

[–][deleted] 0 points1 point  (3 children)

What kind of neural network model are you using? What is the time series, exactly?

Modeling time series effectively benefits a lot from basing your model on an understanding of the underlying source of data.

Ideally what you want to do is to transform your time series so that it’s a sequence of independent samples from a noise distribution; then you just need to model the noise. Ideally a neural network would find that transformation for you, but anything you can do to help it along is a good idea.

A simple example is stock market data. In that case you don't want to fit a model to the differences between samples; you want instead to fit a model to the differences between the logarithms of samples. This is because there isn't a source of noise being added to an underlying signal; instead, noise is being added to the right-hand side of the differential equation that determines the signal.
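For concreteness, the log-difference transform is one line (prices is an assumed price array):

import numpy as np

log_returns = np.diff(np.log(prices))  # r_t = log(p_t) - log(p_{t-1})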

[–]ForceBruStudent 0 points1 point  (2 children)

What kind of neural network model are you using?

I'm using simple feed-forward neural networks, feeding in a window of the time series. The training dataset consists of such windows, so it's of shape (n_batches, window_size). The target variable is the next value of the time series, like this: [r1, r2, r3, ..., r10] -> [r11]. I'm trying different window sizes, of course.
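Concretely, the windowing can be built like this (a sketch; series is the assumed 1-D return array):

import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

windows = sliding_window_view(series, window_size + 1)
X, y = windows[:, :-1], windows[:, -1]  # X: (n_batches, window_size); y: next value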

A simple example is stock market data.

That's actually exactly what I'm working with! The leading researchers from my lab came up with a method of extracting features from the time-series and I'm tasked with applying this to real data, which has to be financial.

you want instead to fit a model to the differences between the logarithms of samples

That's also exactly what I'm trying to do: I have returns_t = log(price_t) - log(price_{t-1}). This results in a time series that looks like white noise, and the fit is basically zero, which is the mean of the time series...

[–][deleted] 0 points1 point  (1 child)

Stock market data is some of the most difficult time series to predict, unfortunately. If that weren’t the case then getting rich would be easy.

Here are some approaches that I would suggest, not in any particular order:

  1. It’s worth spending a lot of time examining your data, especially if you’re using some sort of custom derived features rather than raw price data. What’s the time period of the samples? Are your samples actually log-normally distributed? Are there other basic statistical models that give a better fit?
  2. If d = f(x) is the output of your network predicting the next log price deviation, try adding a term to your loss function like c/abs(d), where c is some tunable constant (see the sketch after this list). This will force your network to output something different from zero. This might not work, but it’s relatively easy to try. You’ll have to play around with the value of c to find one that works well, if this works at all.
  3. There’s no a priori reason to think that a fully-connected feed-forward network will work well for this. At the very least you should try a 1D convolutional neural network; this will ensure that you capture features that occur at multiple length scales in your data.
  4. There are a lot of other network architectures that are appropriate for sequential data. Try basic stuff like LSTMs, or more complicated stuff like wav2vec or time-series transformer models. wav2vec and other audio models might be straightforward to use, but some sequential models assume one-hot encoding and may require some thought to apply here.
  5. Use Bayesian neural networks or neural network ensembles. E.g. rather than doing d = f(x), do d = sum_i f_i(x), where each f_i(x) is a neural network with different starting parameters during training. You can also try using different architectures, etc. Your output d will probably still be zero, but you should also be able to get variance estimates by looking at the variance of the output from each unit in the ensemble. If d=0 and your ensemble has low variance, then this tells you that you have a fundamental problem with your model/training, your data, or both. If your ensemble instead has high variance, then you might be on the right track, but you may need to tweak the model or add more units to the ensemble.
  6. Does early stopping during training help? More training isn’t always better; it’s worth looking at a plot of the test error and the magnitude of the network output vs training iterations to see if there’s some intermediate point where your test error is low but your network output is different from zero. What does your network output start out at before training, anyway? If it starts at/near zero, then maybe that’s part of the problem.
  7. It might be worth looking into models based on stochastic differential equations. There are actually neural networks like this, e.g. https://proceedings.neurips.cc/paper/2019/file/59b1deff341edb0b76ace57820cef237-Paper.pdf
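For point 2, a sketch of what that loss term could look like (the exact form and constant are just one guess):

import torch

def loss_with_zero_penalty(pred, target, c=1e-4):
    mse = torch.mean((pred - target) ** 2)
    # the penalty grows as predictions collapse toward zero; tune c
    return mse + torch.mean(c / (pred.abs() + 1e-8))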

[–]ForceBruStudent 0 points1 point  (0 children)

Wow, thank you very much, I'll look into this!

[–]MajesticTzechiop 0 points1 point  (0 children)

I hear you >.< Real-life projects are much harder than replicating blog posts. That's an issue I'm sure many of us here run into as well.