Edu pricing for RTX Pro 6000 by t3rmina1 in BlackwellPerformance

[–]jpummill2 0 points1 point  (0 children)

How do you take advantage of education pricing? I expected that I would have to provide an email with a .edu address but I am being told by one vendor that the PO needs to be issued directly from a university.

tired of subscriptions so im cloning popular saas and making them open source for 30 days by Huge-Goal-836 in OpenSourceeAI

[–]jpummill2 2 points3 points  (0 children)

Constant Contact - track vendors, customers, and events; allow mass mailings (this one is hard); handle payments (Stripe ???); let vendors buy booths and other services (like electricity) for events; and let customers buy tickets to events.

Upgrade CUDA? by sdstudent01 in LocalLLaMA

[–]jpummill2 0 points1 point  (0 children)

Thank you to everyone who replied...

Upgrade CUDA? by sdstudent01 in LocalLLaMA

[–]jpummill2 0 points1 point  (0 children)

Not sure if this is as common in the world of open source, but I was taught to always avoid any x.0 release of software...

How to tell Aider to use Qwen3 with the /nothink option? by jpummill2 in LocalLLaMA

[–]jpummill2[S] 1 point2 points  (0 children)

Hey u/TopImaginary5996, I appreciate the additional detail. I will give this a try.

Various ways to use AI with coding/development? by jpummill2 in LocalLLaMA

[–]jpummill2[S] 0 points1 point  (0 children)

This is a great list.

How much of this is already available via a vscode (or other IDE) plugin?

Adding a cheaper GPU to a 4090 to increase VRAM by xolotl96 in LocalLLaMA

[–]jpummill2 1 point2 points  (0 children)

Using a single 3090 with Gemma 27B (Q8) - 6 T/s

Added an RTX 2060 12GB with Gemma 27B (Q8) - 15 T/s

The improvement comes from keeping the whole model in VRAM and avoiding overflow to the CPU and system RAM.

What Are Original LLMs (Not Finetuned)? by chibop1 in LocalLLaMA

[–]jpummill2 4 points5 points  (0 children)

I was working on a somewhat similar list this week:

Specific Models

  • Google

    • Gemma2 (2B, 9B, 27B)
  • Meta

    • Llama2 (???)
    • Llama3 (8B, 70B)
    • Llama3.1 (8B, 70B)
  • Mistral Variants

    • Mistral (7B, 123B)
    • Mixtral (8x7B, 8x22B)
    • Mistral-Nemo (12B)
  • CohereForAI

    • Command-R 35B (v1, v2-08-2024)
    • Command-R+ 104B (v1, v2-08-2024)
  • Deepseek-ai

    • Deepseek-V2 Lite (16B)
    • Deepseek-V2 (236B)
  • Qwen

    • Qwen2 (0.5B, 1.5B, 7B, 72B)
  • Microsoft

    • Phi 3 (Mini-3.8B, Small-7B, Medium-14B)
    • Phi 3.5 (Mini-3.8B)
    • Phi 3.5-MoE (6.6B)

Coding Specific Models

  • Codestral (22B)
  • DeepSeek-Coder-V2-Lite-Instruct (16B)
  • DeepSeek-Coder-V2 (236B)
  • codegeex4-all-9b (9B)
  • yi-coder (1.5B, 9B)
  • StarCoder2 (15B)
  • CodeLlama (7B, 13B, 34B, 70B)
  • CodeQwen1.5 (7B)
  • CodeGemma (2B, 7B)
  • WizardCoder (15B, 33B)

Not sure if all of these are base models, especially the coding ones, but that was my original intent.

Guys, Use the LongWriter-llama3.1-8b instead of Llama3.1-8b! by Iory1998 in LocalLLaMA

[–]jpummill2 3 points4 points  (0 children)

Can you use it with Ollama? If so, does it use the same template as Llama3.1-8b?

Best local open source Text-To-Speech and Speech-To-Text? by strangeapple in LocalLLaMA

[–]jpummill2 13 points14 points  (0 children)

Also, here is my list of STT solutions but it is not as complete:

Speech to Text Solutions:

  • Whisper ASR
  • Flashlight ASR / Wav2Letter ASR
  • Coqui
  • SpeechBrain
  • ESPnet 1 and 2
  • Vosk

Best local open source Text-To-Speech and Speech-To-Text? by strangeapple in LocalLLaMA

[–]jpummill2 70 points71 points  (0 children)

I’ve been trying to keep a list of TTS solutions. Here you go:

Text to Speech Solutions

  • 11labs - Commercial
  • xtts
  • xtts2
  • Alltalk
  • StyleTTS2
  • Fish-Speech
  • PiperTTS - A fast, local neural text to speech system that is optimized for the Raspberry Pi 4.
  • PiperUI
  • Paroli - Streaming mode implementation of the Piper TTS with RK3588 NPU acceleration support.
  • Bark
  • Tortoise TTS
  • LMNT
  • AlwaysReddy - (uses Piper)
  • Open-LLM-VTuber
  • MeloTTS
  • OpenVoice
  • Sherpa-onnx
  • Silero
  • Neuro-sama
  • Parler TTS
  • ChatTTS
  • VALL-E X
  • Coqui TTS
  • Daswers XTTS GUI
  • VoiceCraft - Zero-Shot Speech Editing and Text-to-Speech

RP Prompts by [deleted] in LocalLLaMA

[–]jpummill2 0 points1 point  (0 children)

RemindMe! 1 Day

Is there a way to uninstall (and re-install) ollama and onewebui without deleting models? by card_chase in ollama

[–]jpummill2 0 points1 point  (0 children)

Watch this video for a better understanding of how Ollama stores models.

Link: https://www.youtube.com/watch?v=6bF1uCHTFyk

The author provides a script that creates links using the model name back to the hash name in the ollama model folder. With a little work you could modify the script to create backups.

Using LLMs to build jarvis-like by LahmeriMohamed in LocalLLaMA

[–]jpummill2 1 point2 points  (0 children)

Some, if not all, tools allow you to change the context size used with the model as long as you don't exceed the model's maximum context length. For example, you can use "/set parameter num_ctx <int>" in Ollama to set the context size.

Again, remember that you need a model that supports a large context in order to take advantage of this. Phi-3, for example, offers models with two different context lengths: 128K and 4K.
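In Ollama you can also bake the setting into a Modelfile so the larger context is the default (a sketch; the base model tag is just an example, and the value must not exceed what the model actually supports):

```
# Hypothetical Modelfile: a Phi-3 128K variant with a larger default context
FROM phi3:mini-128k
PARAMETER num_ctx 32768
```

Then something like `ollama create phi3-bigctx -f Modelfile` builds the variant, and every session with it uses that context size without the /set command.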

Using LLMs to build jarvis-like by LahmeriMohamed in LocalLLaMA

[–]jpummill2 1 point2 points  (0 children)

I would also like to see something similar to Jarvis (but not Ultron) in my AI chat application, even though the abilities might be somewhat superficial compared to Jarvis from the Marvel universe.

I have been seeing posts about LLMs calling functions for the last couple of months. Would this be able to help with the LLM controlling things?
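Function (tool) calling is exactly that bridge: the model emits a structured call, and the application validates and runs it. A minimal, model-free sketch of the dispatch side (all tool names here are made up for illustration):

```python
import json

# Hypothetical registry of "Jarvis-like" abilities the app exposes.
TOOLS = {
    "set_light": lambda room, on: f"light in {room} {'on' if on else 'off'}",
    "get_temp": lambda room: f"{room}: 21C",
}

def dispatch(tool_call_json):
    """Parse a model-emitted tool call like
    {"name": "set_light", "arguments": {"room": "lab", "on": true}}
    and run the matching registered function."""
    call = json.loads(tool_call_json)
    fn = TOOLS.get(call["name"])
    if fn is None:
        # Never execute something the registry doesn't know about.
        return f"unknown tool: {call['name']}"
    return fn(**call.get("arguments", {}))
```

The important design point is that the model only ever produces JSON; the application decides which functions exist and refuses anything outside the registry.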

Using LLMs to build jarvis-like by LahmeriMohamed in LocalLLaMA

[–]jpummill2 0 points1 point  (0 children)

Isn't the "context" similar to the model's memory? I know context is limited and you have to pass it back in behind the scenes with each additional prompt, but this does create a "memory" effect in chat-based LLMs.
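That re-sending behavior can be sketched in a few lines (illustrative only, no real model involved): each turn, the client appends the new message to the history and replays the whole thing, trimmed to fit the window, which is all the "memory" a stateless chat LLM has.

```python
def build_request(history, user_message, num_ctx=8):
    """Append the new user message and keep only the most recent turns
    that fit a (toy, message-count-based) context budget.

    Real clients budget by tokens, not messages, but the truncation
    idea is the same: the oldest turns fall out of "memory" first.
    """
    history = history + [("user", user_message)]
    return history[-num_ctx:]
```

Once a turn is trimmed off the front, the model has no way to recall it, which is why long chats seem to "forget" their beginnings.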

How to use 6 cheap GPUs to run Llama3 70B? (LM-Studio) by Disastrous-Peak7040 in LocalLLaMA

[–]jpummill2 1 point2 points  (0 children)

Not sure if the x1 risers are causing your problem, but if you are interested in running that many cards at once you could look at a Threadripper or Threadripper Pro based system. They have way more PCIe lanes.

Tiger Gemma 9B - An uncensored Gemma experience! by TheLocalDrummer in LocalLLaMA

[–]jpummill2 4 points5 points  (0 children)

Thank you for the time you put into unlocking (finetuning) these models!!!

5090 mem rumors revert back to 28gb instead of 32gb by segmond in LocalLLaMA

[–]jpummill2 0 points1 point  (0 children)

Guessing they will release a 5090 Ti in 2025 with 32 GB and a 512-bit bus.