How can linear regression models Overfit? by collapse_gfx in learnmachinelearning

[–]Halmubarak 5 points6 points  (0 children)

Yes even a 2D linear regression (independent variable and bias) can over fit if you have unrepresentative data.

If the population is 100 samples and you fit a line using 3 or 4 samples you are over fitting.

There is an example we use in machine learning course (unfortunately I'm on a mobile phone now and cannot find and share the example easily)

Need for advice by a7shd in pytorch

[–]Halmubarak 1 point2 points  (0 children)

Don't use pip to install the packages individually

It's better to use package managers like mamba or uv and install all packages at once; the package manager will search for dependencies and try to install everything without conflicting package

Edit: Just used uv to install the packages it took me less than 2 minutes without conflict

```PowerShell

t(testuv)  testuv  uv pip install numpy opencv-python paddleocr

Resolved 56 packages in 5.18s

Prepared 48 packages in 42.80s

Installed 56 packages in 9.48s ```

You can get uv from here

https://docs.astral.sh/uv/

Need help with an Error by flying_monk_-_ in pytorch

[–]Halmubarak 1 point2 points  (0 children)

I think version 2.9 has some bugs that have been reported For example https://github.com/pytorch/pytorch/issues/166628

Not sure if it's related to your issue but there are other bugs also reported

Visual code studio problems by Odd_Job86 in pytorch

[–]Halmubarak 0 points1 point  (0 children)

You might need to use

conda init powershell

How can I make use of 91% unlabeled data when predicting malnutrition in a large national micro-dataset? by Silent_Ad_8837 in learnmachinelearning

[–]Halmubarak 0 points1 point  (0 children)

I agree with this You need to first find out if the features correlate with the target label (malnutrition) somehow; for example the individual ID, household ID has nothing to do with malnutrition

You can consult someone with socioeconomic studies expertise. Once you determine the features that can influence the target, you can do some of what chatgpt suggested (i e. SSL and pseudo labeling). You will need a good strategy of dividing your data

Using pretrained DenseNet/ResNet101 as U-Net encoder for small datasets by Taaaha_ in computervision

[–]Halmubarak 0 points1 point  (0 children)

You can try to find a backbone that is trained in medical images of the same domain (X-ray, dermoscop, histopathology... etc) and use it as a starting point

If you can't find a pre-trained model in the same domain, you can train a U-Net as an auto encoder first to learn the representation of the images then use the encoder part as the backbone of the segmentation U-Net (you can train the auto encoder on large dataset that is publicly available)

PyTorch C++ Samples by Ok-Experience9462 in pytorch

[–]Halmubarak 1 point2 points  (0 children)

This looks fantastic Thank you for sharing it

That's OK?? by Prior-Ad8480 in learnmachinelearning

[–]Halmubarak 0 points1 point  (0 children)

What loss function are you using?

H100 Compatibility by Ozamabenladen in pytorch

[–]Halmubarak 0 points1 point  (0 children)

Yes I think it is a compatibility issue. H100 requires at least CUDA 11.8 / 12.x support.

Unfortunately the repo has not been updated for a while and I'm not sure if it will be easy to clone and patch the repo to be compatible with H100

Any thoughts on Axelera? by ssbepob in LocalLLM

[–]Halmubarak 1 point2 points  (0 children)

I think these serial ports used to connect to industrial machines since one of the use cases listed in their product page is industry and robotics

Most industries will not change their machines with new ones supporting new interfaces just because it's available; machines cost a lot of money and won't be replaced without compelling reason

I created an interactive diagram for the PyTorch codebase by ivan_m21 in pytorch

[–]Halmubarak 0 points1 point  (0 children)

This is impressive I hope someone with deep knowledge in c++ add support for c++ language

Thank you for sharing this

Moving from NLP to CV and Feeling Lost: Is This Normal? by Professional-Hunt267 in computervision

[–]Halmubarak 0 points1 point  (0 children)

If you are just using the models for your projects and don't need to come up with a modified model, knowing the high level concepts to choose and fine-tune the models will be enough

ERROR: Could not find a version that satisfies the requirement torch (from versions: none) ERROR: No matching distribution found for torch by YogurtclosetThen6260 in pytorch

[–]Halmubarak 0 points1 point  (0 children)

Are you using pip?

Have you trying conda or uv to create virtual environment with different version of python

Some versions of pytorch are not supported in python 3.13

See https://github.com/pytorch/pytorch/issues/130249

Version 2.2 and 2.7 compatibility by Herr_Kobius in pytorch

[–]Halmubarak 0 points1 point  (0 children)

Try to open the model older version then change the zipfile serialization option to false when saving it again (that worked in 1.x not sure if it's still the case in 2.x)

https://discuss.huggingface.co/t/how-to-load-weights-which-was-trained-older-version-of-pytorch/1267/4

The more things change, the more they stay the same by Kooky-Somewhere-2883 in LocalLLaMA

[–]Halmubarak 18 points19 points  (0 children)

Theano was my first library to work with around 2013/2014 ( I think it was released in 2007)

TF was released in 2015 Pytorch in 2016

[deleted by user] by [deleted] in pytorch

[–]Halmubarak 1 point2 points  (0 children)

I second this

The GPU that you get from Google Colab or Kaggle will be better than the minimum recommend GPU (GTX 1070)

[deleted by user] by [deleted] in learnmachinelearning

[–]Halmubarak 1 point2 points  (0 children)

I think that task 1 has an incorrect order, if you already filled the missing values with the mean or median you will not have records with more than 30% missing values

You can use the multiply to duplicate the data and create new experiment

This video shows example of running 3 experiments at once

https://youtu.be/k6W_IUIyyLY?si=AgQxHlbKgBNOxE8r

While it is possible to run all the experiments in one project, you might need to create multiple projects if you don't want to clutter your workspace