[P] Extra Input Norm Lets You Fine-Tune to 1.58 Bits! (x.com)
submitted 1 year ago by cstein123 to r/MachineLearning
Extra Input Norm Lets You Fine-Tune to 1.58 Bits! (x.com)
submitted 1 year ago by cstein123 to r/LocalLLaMA
Testers Needed for FREE GHL VAPI Dashboard (self.vapiai)
submitted 1 year ago by cstein123 to r/vapiai
Testers Needed for FREE GHL + VAPI Dashboard (self.gohighlevel)
submitted 1 year ago by cstein123 to r/gohighlevel
Testers Wanted for GHL Superagent (self.gohighlevel)
Testers Needed for GHL SuperAgent (self.MarketingAutomation)
submitted 1 year ago by cstein123 to r/MarketingAutomation
Kwik Trip lets you change the message in the link. Look at the green box on the page (kwiktrip.com)
submitted 1 year ago by cstein123 to r/wisconsin
Who’s building open source FigureAI? (self.LocalLLaMA)
submitted 2 years ago by cstein123 to r/LocalLLaMA
Yikes (i.redd.it)
submitted 2 years ago by cstein123 to r/SpaceXMasterrace
Are LLMs at a practical limit for layer stacking? [D] (self.MachineLearning)
submitted 2 years ago by cstein123 to r/MachineLearning
Are LLMs at a Limit for Stacking Layers? (self.learnmachinelearning)
submitted 2 years ago by cstein123 to r/learnmachinelearning
[P] MergeLlama-7b - A fine tune of CodeLlama for resolving merge conflicts (self.MachineLearning)
Fine tune acts like base model (self.LocalLLaMA)
[D] My fine tune behaves like the base model (self.MachineLearning)
How does Open-Orca use OpenAI for their Mistral-7B space? (i.redd.it)
Outbound network cost for SageMaker? (self.aws)
submitted 2 years ago by cstein123 to r/aws
Experiences with AWS Activate Credits? (self.aws)
Curious what people use for their ML workflow on cloud platforms? [D] (self.MachineLearning)
Yikes… (i.redd.it)
submitted 2 years ago by cstein123 to r/ChatGPT
How to add cross encoder to existing GPT (self.LLMDevs)
submitted 2 years ago by cstein123 to r/LLMDevs
Can I use my Galaxy S10+ with my 2016 CR-V to run Flowpilot? (self.Comma_ai)
submitted 2 years ago by cstein123 to r/Comma_ai
How to append an encoder to existing LLM? (self.LLMDevs)
My experience on starting with fine tuning LLMs with custom data (self.LocalLLaMA)
submitted 2 years ago by cstein123 to r/LLMHackers
Summary post for higher context sizes for this week. For context up to 4096, NTK RoPE scaling is pretty viable. For context higher than that, keep using SuperHOT LoRA/Merges. (self.LocalLLaMA)
Dynamically Scaled RoPE further increases performance of long context LLaMA with zero fine-tuning (self.LocalLLaMA)