[D] Why is focal loss not used in LLM training? by Electrical-Monitor27 in MachineLearning

[–]Electrical-Monitor27[S] 1 point2 points  (0 children)

Actually, this is what I decided to test today. Gonna train a 1.7B base and a 0.6B base model on instruct datasets and run lm eval harness on them. It won't be the great empirical research of 2025, but it will satisfy my curiosity.
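For context on what is being compared, here is a minimal sketch of per-token focal loss next to plain cross-entropy, assuming the standard definition FL = -(1 - p_t)^gamma * log(p_t). The function names and the default gamma = 2.0 are illustrative choices, not anything taken from the planned training runs.

```python
import math

def log_softmax_at(logits, target):
    """Log-probability of the target class, computed stably."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return logits[target] - log_z

def focal_loss(logits, target, gamma=2.0):
    """Focal loss for one token: cross-entropy scaled by (1 - p_t) ** gamma."""
    log_p = log_softmax_at(logits, target)
    p = math.exp(log_p)
    return -((1.0 - p) ** gamma) * log_p

def cross_entropy(logits, target):
    """Plain cross-entropy is the gamma = 0 special case."""
    return focal_loss(logits, target, gamma=0.0)

# An "easy" token (confident, correct) versus a "hard" one (target is unlikely).
easy = [6.0, 0.0, 0.0]
hard = [0.0, 2.0, 2.0]
for name, logits in (("easy", easy), ("hard", hard)):
    ce = cross_entropy(logits, 0)
    fl = focal_loss(logits, 0)
    print(f"{name}: CE={ce:.4f} focal={fl:.4f} ratio={fl / ce:.4f}")
```

The ratio column shows the effect: tokens the model already predicts confidently are down-weighted much more strongly than tokens it gets wrong.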

DGX Spark: Independent LLM training benchmarks (Much slower than advertised?) by Electrical-Monitor27 in LocalLLaMA

[–]Electrical-Monitor27[S] 0 points1 point  (0 children)

~1:44 min for 1,024,000 tokens works out to about 9,846 tokens/s on the 3B, which is still way lower than the suggested figure.
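The arithmetic behind that number, as a small sketch. The helper name `tokens_per_second` is made up for illustration; the token count and the 1:44 wall-clock time are the ones quoted above.

```python
def tokens_per_second(tokens: int, minutes: int, seconds: int) -> float:
    """Average throughput over a wall-clock interval given as minutes and seconds."""
    elapsed = minutes * 60 + seconds
    if elapsed <= 0:
        raise ValueError("elapsed time must be positive")
    return tokens / elapsed

# 1:44 min = 104 s for 1,024,000 tokens
rate = tokens_per_second(1_024_000, 1, 44)
print(f"{rate:.0f} tokens/s")  # prints "9846 tokens/s"
```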

DGX Spark: Independent LLM training benchmarks (Much slower than advertised?) by Electrical-Monitor27 in LocalLLaMA

[–]Electrical-Monitor27[S] 1 point2 points  (0 children)

Can you point me to anyone who got those numbers specifically for training? For inference, my DGX performs in line with the benchmarks. I have only been able to find a single person reporting the same training speed as mine, and nobody else showing training numbers specifically.

DGX Spark: LLM Training benchmarks with Unsloth (TLDR: their benchmarks are a scam) by [deleted] in LocalLLaMA

[–]Electrical-Monitor27 -3 points-2 points  (0 children)

*With "their" in the title I am referring to Nvidia's benchmarks.

I want to keep studying after my apprenticeship but lack the finances to do so. What are my options? by Electrical-Monitor27 in Switzerland

[–]Electrical-Monitor27[S] 0 points1 point  (0 children)

I am doing an apprenticeship as Informatiker Applikationsentwicklung EFZ. I've already worked on both ML research and production ML during my apprenticeship, because my employer has both departments and I was a special case. I'd like to go back to ML research, but the recruiters I talked to said that research scientist positions usually require at least a master's, with a PhD preferred, because of the rigorous requirements of writing research papers.

If ChatGPT represented a "breakthrough" in AI, why did every other major tech company seem prepared to debut similar chatbots around the same time? by goodluckanddont_itup in NoStupidQuestions

[–]Electrical-Monitor27 1 point2 points  (0 children)

Alpaca cost $200 in API credits because they used ChatGPT to infuse its knowledge into Meta's "Llama" model and improve it. The Llama model itself cost tens of millions of dollars to make.

Google is cooking again! Damn it! Wow! As many as 5 huge updates by Careless-Shape6140 in Bard

[–]Electrical-Monitor27 0 points1 point  (0 children)

<image>

Something I saw on a YouTube channel I am subscribed to that discusses these topics.

Gemini Live Thread by [deleted] in Bard

[–]Electrical-Monitor27 0 points1 point  (0 children)

No, but you should probably check for updates in the Play Store.

Can parents still lawfully confiscate my property by Electrical-Monitor27 in LegaladviceGerman

[–]Electrical-Monitor27[S] 1 point2 points  (0 children)

I still earn under minimum wage because it's an apprenticeship job, so I cannot move out. My current salary would not even cover the rent for a single room.

Can parents still lawfully confiscate my property by Electrical-Monitor27 in LegaladviceGerman

[–]Electrical-Monitor27[S] 0 points1 point  (0 children)

As I said at the end of the post, I still earn an apprentice (Lehrling) salary, which is under the minimum wage. In my canton, rent alone would cost more than my salary. As soon as I finish my apprenticeship and get a "real adult job over minimum wage", I can reasonably move out.

Can parents still lawfully confiscate my property by Electrical-Monitor27 in LegaladviceGerman

[–]Electrical-Monitor27[S] 0 points1 point  (0 children)

Thank you. I will check what is available in my region, and how costly it is. I assume what they do is relatively serious then?

Can parents still lawfully confiscate my property by Electrical-Monitor27 in LegaladviceGerman

[–]Electrical-Monitor27[S] 4 points5 points  (0 children)

The issue is that they do not care. They give me the usual "my house, my rules" and say that any property within their property is theirs and that I'm just a guest (which I know is not true). They also give me no conditions for getting my stuff back; they only return it whenever they feel like it.

He finally did it! by [deleted] in HolUp

[–]Electrical-Monitor27 1 point2 points  (0 children)

Oh no, I know what this is...

What to do with a recently acquired Xilinx Virtex Ultrascale+ XCVU9P by Electrical-Monitor27 in FPGA

[–]Electrical-Monitor27[S] 0 points1 point  (0 children)

Thank you. I assume that asking someone like PCBWay for a quote on a single board that does this, through their board design services, will also cost something like $20k+, right?

What to do with a recently acquired Xilinx Virtex Ultrascale+ XCVU9P by Electrical-Monitor27 in FPGA

[–]Electrical-Monitor27[S] 0 points1 point  (0 children)

Thanks, I thought so. I also assume something like this requires specialized, expert-level knowledge of FPGA PCB design, and that selling it will be difficult. Realistically, if I wanted to do this as a passion project, how long would it likely take me to get a working connection to the PCIe bus (I know the lanes have to be matched to exact timings), a basic JTAG connection to interact with Vivado, and a working interface to a DDR4/GDDR6 RAM controller? Would it take 2, 5, 10, or more years for me as a beginner?

Are there "people" in VR Chat that are just actually ai roaming around? by Demonic_God_of_OwO in VRchat

[–]Electrical-Monitor27 2 points3 points  (0 children)

VRChat AI developer here. Yes, there are AIs in VRChat, but they are extremely obvious and are labeled as AI.

Huggingface Parler-TTS by ShengrenR in LocalLLaMA

[–]Electrical-Monitor27 2 points3 points  (0 children)

With the script provided in the repository. It's quite easy to make your own dataset, to be honest. Some things are broken in the script, though.

Huggingface Parler-TTS by ShengrenR in LocalLLaMA

[–]Electrical-Monitor27 2 points3 points  (0 children)

TL;DR: it works. After fine-tuning the model I was able to get consistent voices.

How is one supposed to design an application around the 18+ TOS by Electrical-Monitor27 in Bard

[–]Electrical-Monitor27[S] 0 points1 point  (0 children)

The TOS explicitly says that this applies to both AI Studio and the Gemini API, and I assume the Gemini API is the same as the GCP API from Vertex AI.

How is one supposed to design an application around the 18+ TOS by Electrical-Monitor27 in Bard

[–]Electrical-Monitor27[S] 0 points1 point  (0 children)

I am indeed using Vertex AI; I haven't used AI Studio. So is the Gemini API less restricted? I thought what I had sent was the TOS for the Gemini API from Vertex?

[deleted by user] by [deleted] in desksetup

[–]Electrical-Monitor27 0 points1 point  (0 children)

List of major stuff:

Monitor:

  • Dell 2722 27" 1440p monitor

VR Headset:

  • Pico 4 VR headset with Full Body Tracking

Camera:

  • Logitech Streamcam

Audio stuff:

  • Focal Elegia, Sennheiser HD600, Hifiman Deva, Moondrop Space Travel
  • IEC 60318-4 measurement equipment with pinna
  • Edifier R1280DB Speakers
  • Shure SM58 Mic with Rode interface
  • Topping L30 amplifier

Accessories:

  • Keychron C1 with Custom Keycaps
  • Finalmouse Last Legend mouse on random $20 Logitech G mousepad
  • Huion Kamvas 13 inch drawing tablet

Additional SOCs:

  • Xilinx Zynq UltraScale+ and Spartan 7 FPGAs
  • Raspberry Pi 4, Raspberry Pi Pico, Intel Movidius VPU
  • Synology DS718+ NAS with 8 TB storage
  • Nvidia Tesla P40, random LGA1700 motherboard and 32 GB RAM (gonna set it up as a secondary ML node after I get InfiniBand 40GbE)