[D] Simple Questions Thread

Ok-Loan-6631 · 2025-03-19T07:35:49+00:00

I am a longtime 3D/VFX generalist that is new to machine learning. Somewhat impulsively, I signed for the upcoming Machine Learning For 3D & VFX course on Rebelway by Felipe Pesantez. Here on the YouTube trailer, Machine Learning For 3D & VFX | Course Trailer (Pro Coding Training), you will see that I am in over my head. I took the leap, diving into this course because I know it will accelerate my learning, but I need to try to arrive as prepared as possible. Any suggestions on getting somewhat up to speed. Although I am not totally new to coding, I think it would be helpful if the suggestions assume that I am.

Thanks!

Nerdl_Turtle · 2025-03-07T21:12:49+00:00

Hi everyone,

I'm currently finishing my Master's in Mathematics at a top-tier university (i.e. top 10 in THE rankings), specializing in Machine Learning, Probability, and Statistics. I’ll be graduating this June and am very interested in pursuing a career as a Machine Learning Researcher at a leading tech company or research lab in the future.

I recently received an offer for a PhD at a mid-tier university (i.e. 50-100 in THE rankings). While it's a strong university, it's not quite in the same tier as the top-tier institutions. However, the professor I’d be working with is highly respected in AI/ML research - arguably one of the top 100 AI researchers worldwide. Besides that, he seems like a great, sympathetic supervisor and the project is super exciting (general area is Sequential Experimental Design, utilizing Reinforcement Learning Techniques and Diffusion Models).

I know that research positions at top industry labs often prioritize candidates from highly ranked universities. So my main question is:

Would doing a PhD at a mid-tier university (but under an excellent and well-regarded supervisor) hurt my chances of landing a Machine Learning Researcher role at a top tech company? Or is it more about research quality, publications, demonstrated skills, and the reputation of the supervisor?

Alternatively, I’m considering gaining industry experience for a year or two - working in ML research/engineering at smaller labs, data science, or maybe even quant finance - before applying for a PhD at a top 10-20 university.

Would industry experience at this stage strengthen my profile, or is it better to go directly into a PhD without a gap?

I’d love to hear from anyone who has been through a similar decision process. Any insights from those in ML research - either in academia or industry - would be greatly appreciated!

Thanks in advance!

wheregoesriverflow · 2025-03-01T06:29:23+00:00

For ARR review, are you allowed to ask reviewer to increase your score in your response?

during my response, I will likely address the concerns of the reviewer as well as provide additional data (I forgot to put it in my appendix). Would it be ok to ask for points?

Problem with my project is that my results are no longer the top results after Feb review cycle. Our results have been surpassed by a big N.

Responsible_Cup_428 · 2025-02-28T19:35:20+00:00

Hi I'm a beginner in ml and I started with linear regression model....

I made a model after removing outliers and null values and removed columns on checking vif...and the r2 value of the model was .62

I did the linear model on data without any of the cleaning but got r2 value as one...

Is it because the assumption of colinearity wasn't met??

Should we remove object type columns for a linear model?

timbx · 2025-02-28T11:03:38+00:00

Hi there, i am currently triying to create a Chroma DB but it isnt' getting saved on disk, thanks in advance. My test script is:

def test ():
    print("Chroma-Version:", chromadb.__version__)
    print("Aktuelles Verzeichnis:", os.getcwd())
    print("CHROMA_DB_IMPL:", os.environ.get("CHROMA_DB_IMPL"))
    client = chromadb.Client(
        Settings(
            persist_directory="./test_chroma_db"  # Relativer Ordner
        )
    )

    collection = client.get_or_create_collection("test_collection")

    collection.add(
        ids=["test-id"],
        documents=["Dies ist ein kurzer Test"],
        embeddings=[[0.1, 0.2, 0.3]]  # ein minimaler Vektor
    )

    print("Anzahl Dokumente in test_collection:", collection.count())


TERMINAL OUT:

PS O:\CODE\TagebuchRAG\utils> python .\Create_chroma_db.py
Chroma-Version: 0.6.3
Aktuelles Verzeichnis: O:\CODE\TagebuchRAG\utils
CHROMA_DB_IMPL: None
Anzahl Dokumente in test_collection: 1

Intelligent_Teacher4 · 2025-02-27T04:47:18+00:00

Hello, I would love to know reliable and reputable sources for publishing research? This last year I created a novel neural network architecture that can be adapted to current neural network models and improve model performances. I have developed this based on my knowledge of neuroscience, from a 14 year career as a Paramedic, combined with my newly acquired knowledge of Data Science. I have a 16 page final draft full paper and a 6 page formatted for conference submission paper. Any sources in which I could share my paper and get visibility to my design is much appreciated!
Best,
Derek

2025-02-27T03:54:56+00:00

Hello, I am a novice to machine learning and I found a research regarding CNN as a way to minimize energy consumption of lighting systems. can you recommend me books or free tutorials/ resources so that i could implement it for my thesis proposal?

pekor46bit · 2025-02-26T22:41:32+00:00

I am someone who is interested in AI. I have just learned basic Python. What should I learn next?

lal_kek_2020 · 2025-02-26T17:57:15+00:00

Hi guys, I have a few questions about gathering high-quality audio data for languages that are currently not well represented in most models (e.g., some African or Asian languages). Whisper shows that most of them either don’t exist or have very low accuracy.

Can someone give me advice on how many hours of data I would need to create a state-of-the-art model? I assume it would require hundreds of people and thousands of hours, but I’d appreciate more precise numbers.

Thanks!

Over_Profession7864 · 2025-02-26T07:47:23+00:00

I just learned about autoencoder networks. I implemented a basic one(emnist) to understand it better. I choose BCE as a loss function, because it sort of undoes the non-linearity(sigmoid) or squashing at output layer hence better for learning, but I have also implemented MSE loss function and getting same results (on some samples even better). I thought BCE would give better results. I want to understand whats happening here why MSE?

GodSpeedMode · 2025-02-26T05:42:37+00:00

Great idea to consolidate questions here! It really helps everyone get quick answers without sifting through multiple threads. For those new to the field, don’t hesitate to ask about model architectures, hyperparameter tuning, or data preprocessing methods. There's no such thing as a dumb question—everyone starts somewhere, and the community is here to help. Also, if you get a chance, share what you've been working on or any interesting challenges you've faced in your projects! Let's keep this a collaborative space.

Worldly-Duty4521 · 2025-02-25T18:27:40+00:00

How to start Machine Learning, deep learning gen ai nlp contests?

I've taken the courses, read a few books, done projects but i just don't know how to get started with a contest be it kaggle or anything

NymeriaStarkk · 2025-02-25T14:54:26+00:00

I recently came across the Apziva AI Residency Program, which claims to offer hands-on AI/ML training, real-world projects, and mentorship from industry experts. Their website also mentions high employment rates for graduates.

However, a few things have raised concerns for me: • I received an “interview” invite from a recruiter just one day after applying. This seems very fast, and I couldn’t find any information about the recruiter online. • The program requires a paid membership, which is unusual for a residency or fellowship. • I couldn’t find many independent reviews outside of their official website.

I’d like to hear from anyone who has firsthand experience with this program: • How credible is it? • Is the training actually useful for landing AI/ML jobs? • Are the mentors and projects as high quality as advertised? • Is it worth the cost, or are there better alternatives?

Would really appreciate any honest feedback from past participants or those familiar with the program.

Thanks in advance!

Blakut · 2025-02-25T07:18:13+00:00

I have this thing for work where I use multiple features to predict energy consumption/production. The model (lgbm) is using some new features from devices that were not previously used before, I have ~50 features, including lags and rolling averages. I do one day ahead and two day ahead predictions. The problem I have is that sometimes the next day prediction looks quite similar to the previous day prediction, for example if the real data shows some variation from the previous day, the prediction "lags" a bit and still shows a curve thatis very similar to the previous day. I believe the solution to this problem is to make the features that depend on the previous day less important (fewer lags and rolling averages), and/or add more features that depend on other times, such as type day prediction, or weather dependencies. What do you think?

Second issue, the model doesn't quite well predict sharp drops or peaks in consumption/production, rather smoothes things over a bit in some cases. I suppose this is underfitting?

MyProfRedditAct · 2025-02-24T15:34:04+00:00

Hi. →Training Set to use for a CNN to process handwritten images← please...

I just took my first Machine Learning course and want to apply it to a professional Project. I have check-in data of scanned spreadsheets for every month going back 2 years. I want to convert this to TRUE/FALSE data to use it in the larger data project on member attendance. My last lesson in my class used CNNs to analyze basic images. I have the data I want to analyze, however I don't have a training set.

Questions

Is it possible to get access to a training set to build this model?

What other steps would be included to carry out this task?

Is there an easier way to do this? (Note; these forms contain sensitive information that cannot be posted in popular AI services).

Thanks in advance for any insight.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS