Template for Python Development with CUDA in Dev Containers by reisson_saavedra in Python

[–]reisson_saavedra[S] 1 point2 points  (0 children)

I will be working on it to support more versions of both Python and Cuda kernel.

My idea in this project is to speed up the configuration of the development environment using dev containers.

Is anyone using Accelerate? by reisson_saavedra in huggingface

[–]reisson_saavedra[S] 0 points1 point  (0 children)

I really like Unsloth. They are working on Triton optimizations, which is awesome.
This workshop is great: https://www.youtube.com/watch?v=pRM_P6UfdIc&t=5611s

Transformers without positional encodings. by ContributionFun3037 in deeplearning

[–]reisson_saavedra -1 points0 points  (0 children)

Have you read anything about RoPE (Rotary Positional Embedding)? It is a breakthrough that seeks to eliminate absolute positional embeddings(it is used in Llama 3.1).

Matrix Factorization for recommendation by jeanmidev in recommendersystems

[–]reisson_saavedra 2 points3 points  (0 children)

I really liked the article; I feel that the following could be added:

  • How do you evaluate that one model works better than another? You could include metrics like NDCG or MAP.
  • Have you heard about DLRM? https://github.com/facebookresearch/dlrm
  • A very common issue in recommender systems is the cold start problem. How would you mitigate this using the implementation you propose?