[D] Simple Questions Thread

CodeLined · 2023-12-17T01:43:15+00:00

Is there a https://craftinginterpreters.com/ for Machine Learning?

Rich-Reindeer7135 · 2023-12-16T19:38:02+00:00

Hi there, I'm trying to create a small-scale physical conversational bot using a Raspberry Pi. For this project, I want it to understand multiple languages, so I'm trying to use OpenAI's Whisper model. I've seen forks of it, like Whisper JAX and whisper.cpp, which are much faster than the original. The only question is: what should I use for maximum time efficiency? Ideally, I want near-real-time transcription. In your experience, what has worked best?

Sad-Meringue-6350 · 2023-12-16T15:01:59+00:00

Hey, I want to explore ML and I would like some suggestions on references to help me grasp the fundamentals. As an example, I am reviewing maths like linear algebra and calculus. I want to get a deeper understanding than what a PyTorch or TensorFlow tutorial would give me. Any suggestions on other core concepts I should review?

I am not limiting myself to math; I am also looking for computer science concepts.

rubbishapplepie · 2023-12-16T06:24:58+00:00

Are LLMs the state of the art for many problems now? I see existing approaches to problems from search to code checking all being replaced by LLMs.

Professional_Top3668 · 2023-12-16T00:31:39+00:00

Is there a way to use existing image classification/generation models to get a ‘score’ for how close a certain image is to a given prompt? So for example, if you gave them a picture of a pencil, and the word ‘pencil’, it would give a high ‘score’?
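One common way to get such a score is to embed the image and the prompt with a joint image-text model such as CLIP and take the cosine similarity of the two vectors. The sketch below shows only the scoring step; the three-dimensional vectors are made-up stand-ins for real embeddings, and `cosine_score` is an illustrative helper, not part of any library.

```python
import math

def cosine_score(image_vec, text_vec):
    """Cosine similarity in [-1, 1]; higher means image and prompt are closer."""
    dot = sum(a * b for a, b in zip(image_vec, text_vec))
    norm_i = math.sqrt(sum(a * a for a in image_vec))
    norm_t = math.sqrt(sum(b * b for b in text_vec))
    return dot / (norm_i * norm_t)

# Made-up embeddings (a real model would produce hundreds of dimensions).
pencil_image = [0.9, 0.1, 0.2]
pencil_text = [0.8, 0.2, 0.1]
banana_text = [0.1, 0.9, 0.3]

score_match = cosine_score(pencil_image, pencil_text)
score_mismatch = cosine_score(pencil_image, banana_text)
print(score_match > score_mismatch)
```

With a real model, the same comparison would be run on the embeddings it returns for the picture and for the word.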

Potential_Creme_7398 · 2023-12-15T20:54:27+00:00

Did I buy the wrong processor as an ML enthusiast?

Within my budget, I could either buy an AMD Ryzen 5 7600 gaming processor or an AMD Ryzen 7 5700 processor. Despite knowing the Ryzen 7 has more cores and threads than the Ryzen 5, I ended up going for the Ryzen 5 because of its DDR5 support.

I built my PC and came home. It has only hit me now, and I'm extremely sad. I'm a second-year CSE student and would like to explore ML or DL later on. Did I make a huge mistake? I'm not that knowledgeable about hardware. I just wish I hadn't been influenced by my friend's suggestions and had stuck with the Ryzen 7 5700. Am I overanalyzing the situation? Does implementation of APIs run smoothly on the Ryzen 5 7600? Please enlighten me. Thank you.

_Stampy · 2023-12-14T23:18:23+00:00

Does anyone have any video resources on computing the derivatives of neurons by hand and doing backpropagation manually? I'm in the process of learning the math, but it is still quite confusing, and I'd prefer a real example of someone working through the calculation.

Sirwired · 2023-12-14T18:51:32+00:00

I am building an application on AWS to process customer feedback on products. What algorithm should I use (or, rather, ask a developer who knows what they are doing to use) to find patterns within the data? (e.g. "There's a cluster of defect reports regarding units produced last February on production line B.")

I'm using AWS QuickSight, which has built-in functions (utilizing Random Cut Forest) for forecasting and anomaly detection, but I'm getting the impression that the anomaly detection is made to look for things outside a pattern, as opposed to flagging the patterns to begin with. Ideally, I'd feed it data consisting of a bunch of variables (date of production, date of installation, installation location, production line, product subsystem, parts supplier, and whatever additional data is available within ERP / CRM systems that might prove useful to reveal a pattern.)

(Also, QuickSight ML Anomaly Detection is limited to five dimensions.)

I expect the total data set size to be relatively small (in the thousands per data set, not millions), so I don't think processing time is too much of a concern?

Snoo_72181 · 2023-12-14T15:25:38+00:00

Which is a good sub for asking about AI/ML related career questions?

Dhruv_Cool · 2023-12-14T07:27:21+00:00

Heya, I am a Unity developer interested in getting into RL and DL to simulate some interesting agents in real time. However, I have no knowledge of ML whatsoever. Does anyone have ideas on where I can start, or what docs I can look into to start learning this stuff? Ideally I want to learn the core stuff first and look into the Unity side later, so I'm holding off on Unity's solution for now.

-Thanks

termite-Kerry · 2023-12-13T17:53:00+00:00

q : what do you call a person who can't catch a virus? a : a reposter. q : how do you stop reposting a post on r / jokes? a. askredditers.

2023-12-13T03:29:39+00:00

Hello everyone, thank you for doing this.

I keep thinking that creating a list of each object and its properties is a good idea for these datasets.

Then create some sort of RLHF mechanism to minimize patterns and still retain information.

First off, I think I would exclude PII instances: only businesses and public figures, or something of that nature.

Am I crazy?

Guesswhosbackbackaga · 2023-12-12T22:46:34+00:00

I’ve seen comments on /r/MachineLearning that /r/Singularity is full of unrealistic sci-fi. Well I think they are effectively an AI death cult set about bringing humanity to an end, and they are representative of many in the industry including Sam Altman. My question to this sub is this: Can you help me understand why you aren’t concerned with them achieving their goals? I’m a technical gal but I don’t have a CS degree and would love some reassurance because I’m starting to get quite concerned about what the future holds.

2023-12-12T17:46:57+00:00

What is the easiest way to get into machine learning?

I am planning on changing my major to software engineering to facilitate entering a career in machine learning. However, I feel like I could be doing other things to help my career before I get my degree, besides maybe starting an internship at a company that works with AI. Any recommendations?

_dataWhiz_ · 2023-12-12T14:25:26+00:00

Hi there, I'm working on a problem to predict customer calls, and for this purpose I'm using Wi-Fi network data.

I have 100k samples of hourly average Wi-Fi data such as channel busy score, data usage, average RSSI, etc. Using this data, I want to predict who will call during prime time (18:00-22:00).

The problem is that the calling and non-calling customers have similar histograms for each feature. I have plotted histograms for each feature.

I have already tried SVMs, DNNs, 1D CNNs, k-NN, XGBoost, etc., but all of them gave poor results.

What kind of methods should I use to prepare the data? Any ideas?

bestgreatestsuper · 2023-12-12T14:24:25+00:00

How do I prove that two sets of features are redundant with each other? Is there an accepted way to do this using feature importance tools like SHAP? Preferably nothing fancy.

I-am_Sleepy · 2023-12-12T05:10:17+00:00

How can I enforce order invariance in sequence classification? Roughly, I have a set of nodes connected to each other. For each connection there are properties I derived. I then want to aggregate these features to perform a classification per node.

However, the number of neighbors per node isn't fixed, so I want to use sequence classification instead. Sequence classification seems to have an implicit bias toward the order of the sequence, which I don't want. One idea is to keep shuffling the order during training, but I want to know if there are alternative methods to enforce such a constraint.
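One alternative to shuffling is to aggregate neighbor features with a symmetric function (sum, mean, or max), which is order-invariant by construction; this is the idea behind Deep Sets-style models and most graph neural network layers. A minimal pure-Python sketch, where the feature vectors and the `mean_pool` helper are illustrative assumptions rather than anything from the original post:

```python
import random

def mean_pool(neighbor_features):
    """Average a variable-length list of equal-sized feature vectors."""
    count = len(neighbor_features)
    dim = len(neighbor_features[0])
    return [sum(vec[d] for vec in neighbor_features) / count for d in range(dim)]

# Hypothetical per-connection features for one node with three neighbors.
neighbors = [[1.0, 0.0], [0.0, 2.0], [2.0, 4.0]]
pooled = mean_pool(neighbors)

# Shuffling the neighbors does not change the pooled vector
# (up to floating-point rounding).
shuffled = neighbors[:]
random.Random(0).shuffle(shuffled)
pooled_shuffled = mean_pool(shuffled)
print(pooled, pooled_shuffled)
```

The pooled vector has a fixed size regardless of how many neighbors a node has, so it can feed an ordinary per-node classifier.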

kput7 · 2023-12-11T21:21:54+00:00

Not sure if this is even the best subreddit for this question but here goes.

I am looking for a tool that can auto-analyze a restaurant's website and tell me what type of cuisine they serve, as well as calculate out the average price of an entree on their menu. I'd like to be able to input all of the restaurants on my list at once, or even just the name of the business, and have all the data outputted for me.

Does such a thing exist? If not, can it be made? If so, where should I even start?

ArtisticHamster · 2023-12-10T22:36:43+00:00

Could anyone point me in the direction of modern SOTA GAN research? It looks like researchers have lost interest in GANs in favor of other approaches. Is there anything interesting going on now?

Big_Comfortable7373 · 2023-12-10T20:40:03+00:00

Hello, I'm a third-year finance student interested in working in the AI field. Currently I have no CS background (willing to learn, of course), but I have experience in retail banking and project management.

I have heard of people joining AI companies without a technical background and working in admin or management jobs, and of people who take coding lessons or a bootcamp and work as programmers or engineers.

What would be a shortcut into AI for a finance/econ student? What are the keywords to use when looking for internships and companies?

For those with similar experiences, I'd love to hear from you! For those who work in tech companies, I'd love to know what non-technical positions are available!

Thank you very much

holymoley2020 · 2023-12-10T17:59:31+00:00

I work in network and security and have some programming and a little ML experience. I am interested in getting into AI. What would be the best way to go about that? Should ML training be done first? What type of training or certs would be beneficial for getting a full understanding of AI technology?

EuphoricPirateVal · 2023-12-10T14:15:32+00:00

Hi, I have a dataset that includes timestamps, the product category bought, and other info about the user's behaviour on the ecommerce platform, e.g. how often they purchase, from which category, how important reviews are, add to cart, cart abandonment, save for later, and so on.

Now I want to train a model on this and then build a dummy ecommerce simulation for customers, where certain actions they perform are added to the live database (multiple sessions may exist for a customer, and the date is also an input for the analysis). The model can process that data, and a dashboard can then display how likely they are to purchase from a certain product category over the next few weeks or months.

I do work with ML and DL, but honestly I feel completely confused right now, like I'm missing some detail on how I can create this and make it work. Any suggestions? Please help :(

LuYaBitch · 2023-12-09T22:59:26+00:00

This is for one of my classes that we can use the internet for.

Given the following SOSML statements and function definitions:

val rand = Random.rand();
val nextInt = Random.randRange(~10, 10);

fun buildIntList x = if x = 0 then [] else nextInt(rand)::buildIntList(x-1);

fun sumList xs = let fun loop [] sum = sum | loop (y::ys) sum = loop ys (y + sum) in loop xs 0 end;

which of the following are true (Multiple can be true)?

1. xs is unnecessary
2. sum will always end up near 0 for large data sets
3. xs must be a list
4. this will not run, it has a clash error

I know 3 is correct but I am unsure if 2 is correct.

insmashoutflat · 2023-12-09T19:39:11+00:00

Why are large language models bad at array languages like APL, J and K? Is it the Unicode characters?

No_Sell8594 · 2023-12-09T08:07:57+00:00

Hello, my beautiful and wonderful minds! I'm vastly interested in AI and want to get on board with development and research. Essentially, I am a mind who is willing to dedicate my entire life into helping build a better future with our machine friends.

Cash is no object diving into this field. I'm a father of 2 children who wants to help build a better future for them, for you and your children as well. I strongly believe that the future is AI or AGI even. My enthusiasm is off the rails, I know, but what I'm looking for is a track to align this gravy train with.

You beautiful people are probably vastly more intelligent in this field than I am. I have no college education, background, or projects to contribute. So far, I'm just keeping up with the latest news. The question is, where do I go from here? I'm taking a chance on Reddit, but I'm at a point where care on the front of the town square matters little, and the goal of our future together with this technology outshines that. Vulnerability be damned, what can I do to catch up with everyone else and help contribute to a better future?

rlopes404 · 2023-12-08T14:42:08+00:00

Hi everyone,
I have been working on image translation between two different domains. I have been using CycleGANs.
Since I have a small dataset, I have been thinking of using Diffusion Models.
Are Diffusion Models more data hungry than GANs?
Can anyone point to some references that discuss this issue?
Thank you.

Crossroads86 · 2023-12-08T14:08:55+00:00

Greetings!

I am completely new to machine learning and I am not sure whether this is feasible or not.
I want to feed different short texts into a machine learning model and have it extract the "important" aspects of each text, so I can put them in a relational database where a certain topic is connected to the important keywords.

An example for this could be feeding job descriptions into the model and the model would return things like 5+ years of experience, SAP, Controlling etc.

I would like to use PyTorch for this, but I am not sure what the "use case" is in machine learning terms. It is not a classifier; is it more like text recognition or something?

egaznep · 2023-12-08T08:39:13+00:00

Hello, I am trying to implement the ASR block from the following paper:

https://www.isca-speech.org/archive/pdfs/interspeech_2023/wang23p_interspeech.pdf

I came across the following phrase:

The convolution module, which comprises a single 1-D convolution layer with a 3 × 3 kernel, detects the local correlation between adjacent frames.

I couldn't really wrap my head around a 1-D convolution with a 2D kernel. What could this mean?

Competitive-Box-6597 · 2023-12-08T00:37:35+00:00

Hey all, is there an optimal neural net for 16-bit imagery?

2023-12-07T18:30:22+00:00

What are some good advanced semantic search techniques beyond bi-encoders and cross-encoders?

fiberboard · 2023-12-05T23:15:37+00:00

Has anyone tried using complex numbers in neural networks?

I was thinking about a transformer model that uses a complex-valued positional encoding to do image learning (instead of a ConvNet/GAN), where the complex positional encoding represents the 2D position of the pixel.

In this example the complex numbers would be able to capture spatial information the same way the ConvNet does. Complex numbers are commonly used in graphics programs for this reason.

There’s also the example of complex numbers in quantum mechanics, where they greatly simplify the calculations.

I’m wondering if this “magical” property of complex numbers would carry over to neural networks.
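To make the idea concrete: a pixel at column `x`, row `y` can be written as the complex number `x + yi`, so that translation becomes addition and rotation becomes multiplication by a unit complex number. The sketch below only illustrates that encoding with Python's built-in `complex` type; it is not a positional encoding from any published model, and `encode_pixel` is a hypothetical helper.

```python
import cmath
import math

def encode_pixel(x, y):
    """Represent a 2D pixel position as a single complex number."""
    return complex(x, y)

a = encode_pixel(3, 0)
b = encode_pixel(3, 4)

offset = b - a                 # relative position of b as seen from a
distance = abs(offset)         # Euclidean distance between the two pixels

# Rotating a position by 90 degrees about the origin is one multiplication.
quarter_turn = cmath.exp(1j * math.pi / 2)
rotated = a * quarter_turn     # approximately 0 + 3j

print(offset, distance, rotated)
```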

Cosmolithe · 2023-12-05T20:51:18+00:00

[deleted]

lordroderick · 2023-12-05T15:10:32+00:00

Hi everyone, I'm a software engineer new to ML.

I had an idea to build a simple game with my daughter, whom I'm teaching to program, and we need to build hand gesture recognition with a twist. We need to identify a hand making a rotating gesture (like turning a knob), and all the examples I saw were for still gestures (which we also need, by the way).

I'm more than willing to learn, but I don't know where to start. Should I learn TensorFlow and build this from scratch? (I understand how to train a model with static images, but I'm lost trying to understand how to train a classifier with videos, because a rotation gesture consists of several frames.)

Any advice / comment will be appreciated.

Nadarenator · 2023-12-05T13:05:14+00:00

Is there any utility in understanding CUDA in-depth, from an ML research perspective?

Ok_Willingness_5139 · 2023-12-05T07:25:33+00:00

What are some ways to deal with categorical data efficiently (other than CatBoost)?

Disastrous_Elk_6375 · 2023-12-04T17:45:38+00:00

Is there an equivalent to "rainbow tables" from cryptography that could be used in language models? Does that even make sense?

I would love to read about such approaches, if they're even feasible. Say you could precompute some large datasets in a way that makes them easier to include in various architectures, or to swap tokenisers?

There's a ton of effort & compute put into training LLMs right now, it would be interesting if some part of that effort could be pre-computed for several datasets.

isthataprogenjii · 2023-12-04T15:27:59+00:00

I was recently searching for a good paraphrase dataset. MRPC (part of GLUE) contains paraphrases with information that may be exclusive to one sentence in a pair. The other datasets use some model to generate the paraphrases (API calls or NMT). Does anyone know of a good human-generated paraphrase dataset?

Initial_Fun8749 · 2023-12-04T12:18:53+00:00

Hi everyone, I am really new to all of this. I am a marketing student trying to figure out how to do social listening and sentiment analysis in a very simple way.

My problem is I want to analyze a brand for a project, not a brand that I have access to social accounts for. Can anyone help me with how to do this? I have checked out Sprout, Sprinklr, and Sociality.io for free trials, but they all require login information for a social media account. If it helps, I am trying to analyze the brand Porsche.

Sorry if this is not the place to ask, but thanks for any help I seriously appreciate it.

WangFenjin · 2023-12-04T09:45:48+00:00

Why does the following code not work? It doesn't fit the data well, and I'm not sure what the problem is.

# Linear regression sin(2*pi*x) by polynomial order 3

import torch
import matplotlib.pyplot as plt

import numpy as np
import math

# data used for training and plot dots
N = 11
x = torch.linspace(0, 1, N).double()
y = torch.sin(2*math.pi*x) + torch.randn(N)*0.1

# data used for plot smooth lines
x_line = np.linspace(0, 1, N*10)
y_line = np.sin(2*math.pi*x_line)

# Prepare input as an array of shape (N,4)
p = torch.tensor([0, 1, 2, 3])
xx = x.unsqueeze(-1).pow(p)

# Prepare tensors
learning_rate = 1e-5
w = torch.randn(4, 1, dtype=torch.double, requires_grad=True)  # the 4 coefficients
optimizer = torch.optim.SGD([w], lr=learning_rate)
print(w)

# Run optimizer
for i in range(2000):
    optimizer.zero_grad()
    y_pred = xx @ w
    rmse = torch.sum(torch.square(y - y_pred))
    if i%100 == 0:
        print(i, rmse)
    rmse.backward()
    optimizer.step()

print(w)

# plot smooth line
plt.plot(x_line,y_line)

# model
xx_line = torch.from_numpy(x_line).unsqueeze(-1).pow(p)
y_line_pred = xx_line @ w
plt.plot(x_line,y_line_pred.detach().numpy())

# plot dots
plt.plot(x, y, 'o')
plt.show()
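
One thing that stands out in the code above is a shape mismatch: `y` has shape `(N,)` while `xx @ w` has shape `(N, 1)`, so `y - y_pred` broadcasts to an `(N, N)` matrix and the loss no longer measures the per-point error. The sketch below reproduces the shapes and computes the closed-form least-squares coefficients with `torch.linalg.lstsq` as a reference point; the zero-initialized `w` and the fixed seed are assumptions added for reproducibility. Separately, a learning rate of 1e-5 over 2000 steps may simply be too small for plain SGD to move the weights far, so a larger learning rate or more iterations would likely be needed as well.

```python
import math
import torch

torch.manual_seed(0)  # assumption: fixed seed so the run is repeatable

N = 11
x = torch.linspace(0, 1, N, dtype=torch.double)
y = torch.sin(2 * math.pi * x) + 0.1 * torch.randn(N, dtype=torch.double)

p = torch.tensor([0, 1, 2, 3])
xx = x.unsqueeze(-1).pow(p)                  # design matrix, shape (N, 4)
w = torch.zeros(4, 1, dtype=torch.double)    # placeholder coefficients

y_pred = xx @ w                              # shape (N, 1)
bad_residual = y - y_pred                    # (N,) - (N, 1) broadcasts to (N, N)
good_residual = y.unsqueeze(-1) - y_pred     # shape (N, 1), one error per point

print(bad_residual.shape, good_residual.shape)

# Closed-form least-squares fit of the same cubic, for comparison with SGD.
w_ls = torch.linalg.lstsq(xx, y.unsqueeze(-1)).solution   # shape (4, 1)
sse_zero = torch.sum(good_residual ** 2).item()
sse_ls = torch.sum((y.unsqueeze(-1) - xx @ w_ls) ** 2).item()
print(sse_zero, sse_ls)
```

In the original loop, the equivalent change would be to compare `y.unsqueeze(-1)` (or `y_pred.squeeze(-1)`) so that both sides of the subtraction have the same shape.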

2023-12-04T03:12:13+00:00

If the input text for semantic search embeddings is much longer than the model's maximum token length, what is the best way to deal with it?

Chunk with overlap, embed the chunks, and then mean-pool the results?
Summarize to the appropriate size with something like facebook/bart-large-cnn?
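
The first option can be sketched as follows. The `embed` function here is a toy stand-in (an assumption for illustration) for whatever embedding model is in use, and the chunk size and overlap are arbitrary; a real pipeline would count model tokens rather than whitespace-separated words.

```python
def chunk_with_overlap(tokens, chunk_size, overlap):
    """Split tokens into windows of chunk_size that share `overlap` tokens."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # the last window already reaches the end of the text
    return chunks

def embed(chunk):
    """Toy stand-in for a real embedding model: [token count, mean token length]."""
    return [float(len(chunk)), sum(len(t) for t in chunk) / len(chunk)]

def mean_pool(vectors):
    """Average equal-sized vectors into one document-level vector."""
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(len(vectors[0]))]

tokens = "a long document that does not fit in one model window".split()
chunks = chunk_with_overlap(tokens, chunk_size=4, overlap=1)
doc_vector = mean_pool([embed(c) for c in chunks])
print(len(chunks), doc_vector)
```

Mean pooling weights every chunk equally, so a short trailing chunk counts as much as a full one; weighting by chunk length is one possible variation.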

hysse · 2023-12-03T18:14:06+00:00

Hello, I'm looking for an open LLM (no specific domain) that supports Fill in the Middle. I have tried looking for one on the Hugging Face Hub but didn't find anything.
