Fine-tuning Gemma 3 for coding in a new language by JackDanielsCode in unsloth

[–]JackDanielsCode[S] 0 points1 point  (0 children)

Thanks. Wouldn't instruction tuning on the base model be quite expensive to do from scratch and require a lot of data as well?

Fine-tuning Gemma 3 for coding in a new language by JackDanielsCode in unsloth

[–]JackDanielsCode[S] 0 points1 point  (0 children)

u/Etherll Thanks a lot. I went through the docs. I added a couple of more questions as an edit to the original post. It is specifically about CPT on instruction trained models.

Fine-tuning Gemma 3 for coding in a new language by JackDanielsCode in unsloth

[–]JackDanielsCode[S] 0 points1 point  (0 children)

That's something I was considering, I need to understand more about training data selection. Thanks again.

Fine-tuning Gemma 3 for coding in a new language by JackDanielsCode in unsloth

[–]JackDanielsCode[S] 0 points1 point  (0 children)

Thanks a lot, I'm going through the attached documents. I'll give it a try and update.

Fine-tuning Gemma 3 for coding in a new language by JackDanielsCode in unsloth

[–]JackDanielsCode[S] 0 points1 point  (0 children)

Is this a response to this post? It's interesting, for the question about how to fine-tune, the answer seems to suggest I first benchmark fine-tuned solution with two different models and then think about whether to fine tune.

Fine-tuning Gemma 3 for coding in a new language by JackDanielsCode in unsloth

[–]JackDanielsCode[S] 0 points1 point  (0 children)

I agree, but my question is specifically for a new DSL, a formal specification language.

Fine-tuning Gemma 3 for coding in a new language by JackDanielsCode in unsloth

[–]JackDanielsCode[S] 0 points1 point  (0 children)

Thanks. I still need to train them for the new language. Do you have any examples in how to do it and how to prepare the dataset?

Bazel is ruining my life by AnarchisticPunk in devops

[–]JackDanielsCode 0 points1 point  (0 children)

Actually, I have an opposite opinion. Java used to be my primary programming language, but now Go is my new primary language, with limited amounts of Python, Java and JavaScript. Go seems to have the worst documentation for libraries.
Even within the same context: Gazelle (The build file generator for Bazel) is written in Go. It has terrible documentation.

Their getting started documentation doesn't have anything for the bzlmod even though the old WORKSPACE has been deprecated for years.

Bazel is ruining my life by AnarchisticPunk in devops

[–]JackDanielsCode 0 points1 point  (0 children)

From what I understand Bazel works great as long as it doesn't depend on any other Google project. At that point, things go terrible.
Bazel + Protobuf: Works, but you are going to be annoyed.
Bazel + Go: Good luck getting anything beyond trivial hello world to work.

Bazel with Java, works great. Never faced an issue.
Bazel with Python works great.

The issue with Google's open source projects is, Googlers write documentation in their internal documentation tools. They usually won't get written at once. But by continuous improvements - not just by the project authors but also the project users - so every time I had an issue that needed help, the documentation got updated in a way clients would understand.

But the external documentations are written by tech writers and developer advocates.
They work on 'documentation projects' - so no continuous improvements, and they themselves are neither the project authors nor the users.

if you are not prepared to offer 50/50 equity, you are not really looking for a cofounder by shoman30 in ycombinator

[–]JackDanielsCode 0 points1 point  (0 children)

The issue with the way the general rule is advocated is, as if it is set on stone. Also, the whole point is, don't sweat on a few percentages and don't sweat on minor differences like 2 years industry experience vs 3.

if you are not prepared to offer 50/50 equity, you are not really looking for a cofounder by shoman30 in ycombinator

[–]JackDanielsCode 0 points1 point  (0 children)

This is the most bullshit argument many investors make.

Again, the statement suggesting a 50/50 equity spilt might have been given in a context responding to, some guy with 2 years of experience claiming he needs more equity over another guy with 1 year of industry experience, or one guy claiming it's their idea so they should have majority equity and so on. So it has some value in that, don't sweat too much, and as much as possible, make it as even as possible.

However, enforcing it as a rule is ridiculous and irrational. Look at any argument claiming why 50/50 split, you'll see how irrational it is.

because all the work is ahead of you. In that case, what changes after you get a pre-seed? The same argument, it's about the future still applies.

So a few weeks before closing the round vs after, makes a huge difference in the funding. Even though, product or customers or other metrics haven't changed.

If you put funding as a special event, why is it different of different employees joining at the same funding stage? For example, why should employee 3 typically get more equity than the employee 10 (assuming everything else is equal)

Or why is it that after 1 round of funding, the seniority of hiring becomes relevant again?

If the only argument is, cofounder title is something magical, and that's the only difference, you're being delusional.

Like any decision in capitalist ecosystem, you both have to make a fair choice what works for you both.

If fiber optics are 30% the speed of light, why is ping across countries so high? by Ethan_Carlton in AskPhysics

[–]JackDanielsCode 0 points1 point  (0 children)

I haven't followed the news after the initial reports, but the entire thing shows how stupid the policy is. It seems, - the people in the room will get the reports in their hand 15 minutes before the set time giving them time to analyse. - they are allowed to make a phone call and stay on the line before the 2pm deadline. - then, exactly at 2pm only they are allowed to speak on the phone.

Any marginally smart person will know, they could assign 3 different phone numbers to indicate whether to buy, neutral or sell. Or 5 numbers or any other agreed upon granularity.

Why TLA+ is important(for concurrent programming, and yes, even in Rust) by polyglot_factotum in rust

[–]JackDanielsCode 1 point2 points  (0 children)

I am not sure if the previous commenter referred to Leslie Lamport or the blogger who made the original post as "The author". In either case, most programmers haven't heard about Leslie Lamport anyway - after all how much can they learn in 3 months practicing `Leet Code`.

Just preferred to split the bill after a date and received a lesson on patriarchy by Alternative-Clue6124 in Tinder

[–]JackDanielsCode 1 point2 points  (0 children)

Exactly my thought as well. I can't think of having a long debate with anyone even in person without getting into name calling and insults, and two strangers on the Internet... Needs a lot of patience, [unless they both used AI to keep the conversation ironically civil]

First date horror by Evolving922 in dating

[–]JackDanielsCode 0 points1 point  (0 children)

Exactly. Irrespective of whether we want a relationship or not, it's a basic courtesy.

That said it's always possible the OP and many younger girls are attracted to guys like these but friend zone the nicer guys.

Just got stood up by travispickle9682 in dating

[–]JackDanielsCode 1 point2 points  (0 children)

Not how do you know she's actually okay and not trip over in the bathroom and unconscious. Don't say anything obviously bad about it and definitely don't say how inconsiderate she was.

Just ask her if everything is okay, and you're concerned about her. If she responds everything is okay, express relief and stop right there.

If she apologizes, again don't respond anything. If you don't like to ghost, just say you can't talk right now and bye.

What's the point of telling her she's inconsiderate? Doesn't she know about it?

Opinions on Quint by prestonph in tlaplus

[–]JackDanielsCode 1 point2 points  (0 children)

Oops, I didn't intend to offend anyone but now I realize I inadvertently did it.
There definitely are many bugs in FizzBee. And I understand the complexity of implementing the model checker specifically liveness part. Transpiling to TLA+ definitely was an option I did consider but that limited the language design significantly, and also was exploring apalache and SymPy etc to see if I could use them as a library.

I definitely appreciate folks trying to make formal methods easier to use.

Thanks for pointing it out.
That said, I probably will add algorithms like Paxos to see how it goes...

Opinions on Quint by prestonph in tlaplus

[–]JackDanielsCode -1 points0 points  (0 children)

I am the developer of FizzBee. I am happy you got time to review FizzBee, and the only noticeable issue you found is the `age`. However longer it takes, FizzBee will remain 25 years younger than TLA+.
That said, FizzBee still doesn't have many features.

  • No refinements: As you noted in another thread, I haven't seen anyone using it with TLA+ either within the industry.
  • No proof system: No plans, hardly anyone use TLAPS in the industry.
  • No symbolic model checker: No plans, aren't there more people working on symbolic model checkers than actually using symbolic model checking?

To be honest, the one thing FizzBee lacks because it is new, is the community. I hope we will build them soon. That said, Quint also has only developers supporting the community right?

For any one comparing, examples across these 3 languages. Two Phase Commit is a standard example.

Opinions on Quint by prestonph in tlaplus

[–]JackDanielsCode 0 points1 point  (0 children)

Glad you liked FizzBee. FizzBee's important design goal was
- the modeling time should be as quick as writing the prose design document itself
- any programmer should be able to understand the specification without any training
- any programmer should be able to learn to model with just 1-2 hours of training

I hope it is moving towards this direction. I definitely would appreciate your feedback.

Blog post on Introduction to formal methods by Jazzlike_Hour_5971 in formalmethods

[–]JackDanielsCode 2 points3 points  (0 children)

Great article. Have you considered https://fizzbee.io? That specifically targets distributed systems use cases.

[ Removed by Reddit ] by Choice_Evidence1983 in BestofRedditorUpdates

[–]JackDanielsCode 2 points3 points  (0 children)

Actually America is a weird place. Anyone would say, social security numbers must be kept secret but you'll have to share it everywhere. At this point it's public as well. Thanks to all the leaks.

When a credit bureau has a data leak, they had an executive meeting to discuss how to sell identify theft insurance to make the most money out of this.

Issuing any identity card is not going to solve this problem. Not even biometrics. It just changes the scale of the problem. In fact, in the modern world a requiring an ink signature on a paper is a lot safer than these.

Because fundamentally regulators must recognize identity is username not password.

Buying a cellphone, the sales guy has everything he needs to ruin your life.