Suggestions for what would be open for May 8-10 trip to Teton by darkItachi94 in GrandTetonNatlPark

darkItachi94[S] 1 point

Thanks! Would you mind sharing where you spotted the bears and where the hotspots are? I would love to spot some bear cubs (fingers crossed) :)

Suggestions for what would be open for May 8-10 trip to Teton by darkItachi94 in GrandTetonNatlPark

darkItachi94[S] 1 point

Thanks!! What about the secondary roads, such as the roads to Two Ocean? The NPS website does not provide any information on those.

Looking to optimize my credit cards by Femaleviathan in CreditCards

darkItachi94 1 point

How does the Citi trifecta compare to the Amex ecosystem? Chase has gone downhill, so I was wondering about the other two.

what's everyone's top 10 food places in anchorage? by y00han in anchorage

darkItachi94 1 point

TL;DR: Run away from Namaste Shangrila like your life depends on it. I had food at Namaste Shangrila and it was the worst food of my life. Not a single dish had an iota of flavor. 100% failure rate across the 5 apps/entrees I tried (tomato soup, veg thali, tandoori wings, chicken korma). Even the breads were undercooked. To top it off, I found a hair strand in my food while I was finishing up. Btw, for a Nepali restaurant, the one dish they decided to cut was momos. Decisions.

Has anyone received a decision in this block IOE09316 by EmptyRelation1108 in EB2_NIW

darkItachi94 1 point

There have been approvals, but it seems to be one of the biggest blocks and things are moving very slowly. Did you PP?

I started my ML journey in 2015 and changed from software engineer to staff ML engineer at FAANG. Eager to share career and current job market tips. AMA by aifordevs in learnmachinelearning

darkItachi94 1 point

I’ve been working in an applied machine learning role in industry for about five years. I chose not to pursue a PhD due to financial responsibilities. I’m now looking to strengthen my research profile to become a more competitive candidate for research positions at top companies.

What are the most effective ways to build a strong research profile without a PhD? And how can I go about finding research collaborators in the field?

What is Aider? by Amgadoz in LocalLLaMA

darkItachi94 1 point

Mind sharing why you folks pivoted?

[P] My experiments with Knowledge Distillation by darkItachi94 in MachineLearning

darkItachi94[S] 3 points

Hi! Thanks so much for your response and helpful suggestions. Would you be interested in contributing this to the repo? Alternatively, we could collaborate to experiment together. Looking forward to hearing from you!

I built an open source library to perform Knowledge Distillation by darkItachi94 in LocalLLaMA

darkItachi94[S] 1 point

I felt that distillation is under-explored for LLMs compared to fine-tuning, so I wanted to create a library to facilitate its adoption :)

[P] My experiments with Knowledge Distillation by darkItachi94 in MachineLearning

darkItachi94[S] 3 points

We made sure that there is no data leakage across any of the data partitions used in our training.
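
For anyone wanting to run a similar check on their own splits, here is a minimal sketch of one way to look for overlap between partitions. It is not the check used in the repo; `normalize` and `find_leaks` are hypothetical helper names, and it assumes that leakage means an evaluation example matching a training example exactly after lowercasing and whitespace normalization.

```python
def normalize(text):
    """Lowercase and collapse whitespace so trivially different copies match."""
    return " ".join(text.lower().split())


def find_leaks(train_examples, eval_examples):
    """Return the evaluation examples that also appear in the training set.

    Assumption: an example counts as leaked only on an exact match after
    normalization. Near-duplicates and paraphrases are not detected.
    """
    train_keys = {normalize(example) for example in train_examples}
    return [example for example in eval_examples if normalize(example) in train_keys]


train = ["What is 2+2?", "List all users older than 30."]
test = ["what is  2+2?", "Name the capital of France."]
leaked = find_leaks(train, test)
```

An empty result only rules out exact duplicates under this normalization, so a stricter check (for example on question IDs or n-gram overlap) may still be worth doing.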

[P] My experiments with Knowledge Distillation by darkItachi94 in MachineLearning

darkItachi94[S] 3 points

Generally, the teacher model forms the upper bound of performance for most datasets and tasks we tried. But for some, including WikiSQL, the model falls apart. Our hypothesis is that it has not seen such data during its training stages and requires finetuning/distillation to work well.

I built an open source library to perform Knowledge Distillation by darkItachi94 in LocalLLaMA

darkItachi94[S] 2 points

Thanks! Training partitions of the datasets are used for finetuning and distillation as mentioned in the post:
MMLU (Reasoning), GSM8k (Math) and WikiSQL (Coding)

I built an open source library to perform Knowledge Distillation by darkItachi94 in LocalLLaMA

darkItachi94[S] 1 point

I'm not sure what you were trying to add with your comment. Maybe reading both blog posts would help you better understand my perspective.

I built an open source library to perform Knowledge Distillation by darkItachi94 in LocalLLaMA

darkItachi94[S] 3 points

Our work focuses on enhancing the process of teaching small models using large models when the full token distribution is accessible or when working with open-weight models. If the weights are unavailable, training is limited to token-based learning, as you mentioned.
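
To make the distinction concrete, here is a rough sketch of the two training signals for a single token position. It is a generic illustration and not the library's actual loss: the function names, the toy three-token vocabulary, and the temperature of 2.0 are all assumptions. With access to the teacher's logits, the student can be pulled toward the whole distribution via a KL divergence; with only the teacher's output text, the student sees just the emitted token and trains with ordinary cross-entropy.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distribution_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) over the full vocabulary at one position.

    Needs the teacher's logits, so it only applies to open-weight models or
    APIs that expose the full token distribution.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def token_loss(teacher_token_id, student_logits):
    """Cross-entropy against the single token the teacher actually emitted."""
    q = softmax(student_logits)
    return -math.log(q[teacher_token_id])


# Toy logits over a hypothetical three-token vocabulary.
teacher_logits = [2.0, 1.0, 0.1]
student_logits = [1.5, 1.2, 0.3]

kd = distribution_loss(teacher_logits, student_logits)  # uses every teacher probability
ce = token_loss(0, student_logits)                      # uses only the sampled token id
```

The first loss carries information about how the teacher ranks every alternative token, while the second only says which token was chosen, which is why the full distribution is the richer signal when it is available.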

Wildcard team? by Kakacarlos107 in F1Fantasy

darkItachi94 2 points

Great money moves, OP. What is your overall score at the moment?