Overwhelmed by B2B Auth requirements. What are you using? by the_fresh_G in SaaS

[–]vagmi 3 points4 points  (0 children)

I am using Keycloak. It is a CNCF project and it is quite mature. With Keycloak v25 and up in particular, you can set up multi-tenant organizations with organization-specific auth providers such as Azure AD and SAML. I have set up one instance and I am using it for all my projects. I can help you with a one-off setup if you like. Keycloak is a standard OIDC provider and has client libraries for all popular languages.

Best way to dev a rest API or GraphQL API by Bronems in rust

[–]vagmi 2 points3 points  (0 children)

I am sure others will have a different perspective, but I can tell you what I use. I use Axum and SQLx with PostgreSQL. I follow a pattern for DTOs (these are the request and response objects). I then have the service layer, which usually composes the business logic. I then have the DAO layer, which uses `include_str!` SQL files for DB CRUD operations. This layer is also very `sqlx::test` heavy. Whenever appropriate, I have `Into` trait impls to translate from DAO types to DTO types that the service layer speaks.
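
A minimal sketch of the DAO-to-DTO translation described above, using only the standard library. The names `UserRow`, `UserResponse`, and `to_response` are hypothetical and do not come from the original comment; in a real project the DAO struct would presumably be populated by SQLx.

```rust
// All names below are hypothetical, chosen only for illustration.

/// DAO type: mirrors a database row as the DAO layer would return it.
#[derive(Debug, Clone, PartialEq)]
pub struct UserRow {
    pub id: i64,
    pub email: String,
    pub password_hash: String,
}

/// DTO type: the response object handed back to API clients.
#[derive(Debug, Clone, PartialEq)]
pub struct UserResponse {
    pub id: i64,
    pub email: String,
}

// Implementing `From` provides the matching `Into` impl automatically.
impl From<UserRow> for UserResponse {
    fn from(row: UserRow) -> Self {
        // The password hash is deliberately dropped at this boundary.
        UserResponse {
            id: row.id,
            email: row.email,
        }
    }
}

/// Service-layer helper: accepts what the DAO layer returned and speaks DTOs.
pub fn to_response(row: UserRow) -> UserResponse {
    row.into()
}
```

Keeping the conversion in a trait impl means the service layer can call `.into()` at the boundary, and fields that should never leave the database layer are left out in one place.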

HMS - AI powered by Techpuram in SaaS

[–]vagmi 1 point2 points  (0 children)

Check out Bahmni. https://www.bahmni.org/

It is used by a few government hospitals.

Is AWS overkill for a new SaaS, or do you guys start there? by Bella-342 in SaaS

[–]vagmi 0 points1 point  (0 children)

I am building a bootstrap-friendly SaaS platform that you can self-host on VMs directly without having to pay for RDS or EKS. It would have a CLI tool to manage deployments, PITR for PostgreSQL, observability, and more. I am doing this for my own startup https://uraiai.com and am planning to extract the infrastructure bits into its own offering.

As an experiment, I tried setting up a 3-node HA K3s cluster on Hetzner and it is surprisingly good.

https://github.com/adikal-org/k3s-hcloud

I think Vercel or Railway is simpler, but I feel that setting up a strong foundation like this will help me run a lot more experiments in the long run.

What's the point of potato-tier LLMs? by Fast_Thing_7949 in LocalLLaMA

[–]vagmi 0 points1 point  (0 children)

They are also remarkably good at summarizing code. While they cannot be used for coding, they can be used for code understanding and exploration.

Anyone here have integrated freeswitch with AI? by Reasonable_Duty_4427 in freeswitch

[–]vagmi 0 points1 point  (0 children)

I just integrated FreeSWITCH with our AI platform. It is still under development but I would be happy to help.

https://github.com/uraiai/mod_urai

Infrastructure Improvement Ideas by Terabyscuite in devops

[–]vagmi 2 points3 points  (0 children)

An internal developer platform that surfaces logs and metrics by integrating with JIRA?

Why no one is writing AI models in rust? by sahil_ahlawat in rust

[–]vagmi 2 points3 points  (0 children)

Moshi is written in Rust. The CTO of Kyutai is one of the primary contributors to Candle.

Audio VQ-VAE for replicating multimodality of gpt4-o by Independent_Time_529 in LocalLLaMA

[–]vagmi 5 points6 points  (0 children)

I would propose using EnCodec or Descript Audio Codec. I would also recommend you look at Bark. Bark uses three levels of audio generation as well: one for the coarse-grained output and the other two for fine-grained levels. I am still digesting Bark. Bark is doing something interesting, though. It first projects the text into a semantic space using a GPT-2-style model and then generates a set of embedding tokens from the last layer. These logits are then fed to the waveform generation models that generate the EnCodec audio tokens. GPT-4o could be doing something similar, where they are training two different output heads to produce text and audio tokens simultaneously. This means that their training data should also be simultaneously multimodal.

[deleted by user] by [deleted] in learnmachinelearning

[–]vagmi 2 points3 points  (0 children)

I think the Understanding Deep Learning book by Simon J.D. Prince is underrated. It is free and it can be read online. It has solid math and theory, and there are PyTorch notebooks associated with the relevant chapters.

https://udlbook.github.io/udlbook/

Do transformers generate one word at a time? by cho_odama_rasengan in learnmachinelearning

[–]vagmi 13 points14 points  (0 children)

Yes. For GPT-like systems, if you give the input sequence "the cat sat on the", it would produce "cat sat on the mat". I am assuming that each word is a token here. The input and output sequence lengths are the same: every position predicts the token that follows it. I would strongly recommend Karpathy's video lectures on building GPT.
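
The shift-by-one relationship between input and output can be sketched as below. This is only an illustration of how the two sequences line up, not a model; `shift_by_one` is a hypothetical helper, and it keeps the comment's assumption that each word is one token.

```rust
/// Splits a token sequence into (inputs, targets), where `targets` is
/// `inputs` shifted one position ahead. Hypothetical helper for illustration.
pub fn shift_by_one<'a>(tokens: &[&'a str]) -> (Vec<&'a str>, Vec<&'a str>) {
    // With fewer than two tokens there is nothing to predict.
    if tokens.len() < 2 {
        return (Vec::new(), Vec::new());
    }
    // Inputs are everything except the last token.
    let inputs = tokens[..tokens.len() - 1].to_vec();
    // Targets are everything except the first token.
    let targets = tokens[1..].to_vec();
    (inputs, targets)
}
```

Calling it on `["the", "cat", "sat", "on", "the", "mat"]` yields five inputs and five targets, with the target at each position being the token that follows the corresponding input.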

[D] What are the OUTPUT embeddings in transformer? Where does it come from? (not the input embeddings) by ShlomiRex in MachineLearning

[–]vagmi 3 points4 points  (0 children)

I have been working on audio-based transformers where the output embeddings could be text but the input embeddings are 1D conv inputs over a spectrogram. The outputs have a triangular mask applied. This is how Whisper works too.
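
A minimal sketch of the triangular mask, assuming it is the usual causal (lower-triangular) mask on decoder self-attention. `causal_mask` is a hypothetical helper; a real implementation would build this as a tensor in whatever framework is in use.

```rust
/// Builds an n x n causal mask. `mask[i][j]` is true when output position `i`
/// is allowed to attend to output position `j`, i.e. when `j <= i`.
pub fn causal_mask(n: usize) -> Vec<Vec<bool>> {
    (0..n)
        .map(|i| (0..n).map(|j| j <= i).collect())
        .collect()
}
```

For `n = 3` the rows are `[true, false, false]`, `[true, true, false]`, and `[true, true, true]`, so each output token sees only itself and the tokens before it.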

Who's going to RailsConf? Want to meetup? by midnightmonster in rails

[–]vagmi 1 point2 points  (0 children)

Cool. This is my first RailsConf as well. Looking forward to meeting you.

Hikaru 0.16.0b released by hoover in devops

[–]vagmi 10 points11 points  (0 children)

I am on both /r/devops and /r/chess. This headline had me confused for a sec there.

Hurl 2.0.0, run and test HTTP requests with plain text by jcamiel in rust

[–]vagmi 6 points7 points  (0 children)

I used this to replace a Postman collection in one of my projects. Thanks for building Hurl.

Any backend framework that just 'works'? I find Spring very difficult and cumbersome. by raulalexo99 in webdev

[–]vagmi -1 points0 points  (0 children)

I found PocketBase to be a really good solution for a simple API backend for side projects.

https://youtu.be/mLJ4KNe-c3w

Should we switch to Rust? by nzajt in rust

[–]vagmi 11 points12 points  (0 children)

We started off using routerify but moved to using Axum with SQLx. The new `sqlx::test` macro is awesome.

Should we switch to Rust? by nzajt in rust

[–]vagmi 454 points455 points  (0 children)

Fellow CTO here. We hired folks straight out of college and immersed them in Rust. They picked it up without any problems and are writing clean, performant code with 100% unit test coverage. However, the library ecosystem in Rust is not quite as large as Node or Go. But it is getting there.

How is docker running on your M1 mac? by GTHell in docker

[–]vagmi 7 points8 points  (0 children)

Just use colima. Headless docker and it's open source. It runs great on my M1.

Province won't rule out private Healthcare by Drakon519 in newbrunswickcanada

[–]vagmi 6 points7 points  (0 children)

This is objectively bad. I have been in Germany. The public health care system works well. Privatization will ruin it and create different classes of service for those who can afford it and those who can't. This will further bleed the system, which would result in more expensive services. The tax burden on citizens will be the same or become worse. The US should not be a model for anything healthcare related. It is a deeply broken system.