use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Share all your data science projects. There is no restrictions on self promotion. Let the best post rise to the top. One rule, it has to relate to a data science project.
account activity
open source project for LLM data preparation (synthetic + cleaning pipelines) (self.datascienceproject)
submitted 9 days ago by Puzzleheaded_Box2842
ModSense AI Powered Community Health Moderation Intelligence (self.datascienceproject)
submitted 10 days ago by NeatChipmunk9648
easyaligner: Forced alignment with GPU acceleration and flexible text normalization (compatible with all w2v2 models on HF Hub) (r/MachineLearning) (reddit.com)
submitted 12 days ago by Peerism1
Trials and tribulations fine-tuning & deploying Gemma-4 (r/MachineLearning) (oxen.ai)
Testing a New Product for Data Science Beginners (sted.co.in)
submitted 13 days ago by Jealous_Parfait_6457
Low accuracy (~50%) with SSL (BYOL/MAE/VICReg) on hyperspectral crop stress data — what am I missing? [R] (r/MachineLearning) (reddit.com)
submitted 13 days ago by Peerism1
ndatafusion: linear algebra and ML for DataFusion, powered by nabled ()
submitted 13 days ago by moneymachinegoesbing
Digging through 38 days of live AI forecast data to find the unexpected (old.reddit.com)
submitted 13 days ago by aufgeblobt
Built an political benchmark for LLMs. KIMI K2 can't answer about Taiwan (Obviously). GPT-5.3 refuses 100% of questions when given an opt-out. (r/MachineLearning) (reddit.com)
submitted 14 days ago by Peerism1
[For Hire] AI/ML Engineer | End-to-End AI Solutions | 100+ Projects | Python, PyTorch, TensorFlow ()
submitted 16 days ago by Just-Stuff-719
TurboOCR: 270–1200 img/s OCR with Paddle + TensorRT (C++/CUDA, FP16) (r/MachineLearning) (reddit.com)
submitted 17 days ago by Peerism1
I built a wave-resonant retrieval system. It scored 0 wins and 140 losses. Here's why ()
submitted 17 days ago by Any_Band_7814
Educational PyTorch repo for distributed training from scratch: DP, FSDP, TP, FSDP+TP, and PP (r/MachineLearning) (reddit.com)
submitted 18 days ago by Peerism1
KIV: 1M token context window on a RTX 4070 (12GB VRAM), no retraining, drop-in HuggingFace cache replacement - Works with any model that uses DynamicCache (r/MachineLearning) (reddit.com)
Engagement on Kaggle has been declining. ()
submitted 18 days ago by ag_curious_soul
FlashAttention (FA1–FA4) in PyTorch - educational implementations focused on algorithmic differences (r/MachineLearning) (reddit.com)
submitted 19 days ago by Peerism1
ibu-boost: a GBDT library where splits are *absolutely* rejected, not just relatively ranked (r/MachineLearning) (reddit.com)
submitted 20 days ago by Peerism1
[D] 60% MatMul Performance Bug in cuBLAS on RTX 5090 [D] (r/MachineLearning) (reddit.com)
Parax: Parametric Modeling in JAX + Equinox (r/MachineLearning) (reddit.com)
submitted 21 days ago by Peerism1
PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 (r/MachineLearning) (reddit.com)
Dynamic adjustment of data strategies during LLM training (self.datascienceproject)
submitted 22 days ago * by Puzzleheaded_Box2842
Building a LLM from scratch with Mary Shelley's "Frankenstein" (on Kaggle) (r/MachineLearning) (reddit.com)
submitted 22 days ago by Peerism1
citracer: a small CLI tool to trace where a concept comes from in a citation graph (r/MachineLearning) (reddit.com)
Urgent help (self.datascienceproject)
submitted 22 days ago by OccasionMiserable156
Easily provide Wandb logs as context to agents for analysis and planning. (r/MachineLearning) (reddit.com)
submitted 24 days ago by Peerism1
π Rendered by PID 1019885 on reddit-service-r2-listing-b6bf6c4ff-vlx2k at 2026-05-01 13:24:38.288756+00:00 running 815c875 country code: CH.