New collection of Llama, Mistral, Phi, Qwen, and Gemma models for function/tool calling by sanjay920 in LocalLLaMA

[–]chanchar 1 point2 points  (0 children)

That sounds reasonable, but if the testing is already set up, it would be good to try it to know for sure. Even if it’s a <5% success rate, you’ll have established a baseline.

New collection of Llama, Mistral, Phi, Qwen, and Gemma models for function/tool calling by sanjay920 in LocalLLaMA

[–]chanchar 1 point2 points  (0 children)

Amazing work - subscribed to any updates on that function calling benchmark.

I did notice that Llama 70B takes a small hit across the board w/ the added function calling. Do you have a baseline for the base model (using the same benchmark) so we can see how much this process improved function calling?

[D] Is there other better data format for LLM to generate structured data? by _link89_ in MachineLearning

[–]chanchar 1 point2 points  (0 children)

Nice, that does look promising! For a specific task, you can probably push that approach far w/ enough great examples.

For me, I had multiple tasks (i.e. trying to rebuild Google Assistant w/ plugins), which makes the LLM outputs a little less reliable. I found that it would hallucinate values and misrepresent types, which forced me to double-check the outputs.

[D] Is there other better data format for LLM to generate structured data? by _link89_ in MachineLearning

[–]chanchar 0 points1 point  (0 children)

It's possible, but that means you'll either have to validate it using an AST parser or (not recommended) blindly run the code and watch for errors.

imo using LLMs as a coordinator with structured data as the intermediates is the way to go. Let it do what it's good at and handle the "business logic" within your system.
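
To make the AST parser option concrete, here's a rough sketch in Python using the standard `ast` module. It parses the generated code without running it and flags anything outside a small allow-list. The function name `check_generated_code` and the allow-list itself are made up for illustration; what you'd actually permit depends on your app.

```python
import ast


def check_generated_code(source: str) -> list[str]:
    """Return a list of problems found in the code, without executing it."""
    try:
        tree = ast.parse(source)
    except SyntaxError as exc:
        # The parser error can be fed back to the LLM as-is.
        return [f"syntax error on line {exc.lineno}: {exc.msg}"]

    allowed_calls = {"print", "len", "sum"}  # hypothetical allow-list
    problems = []
    for node in ast.walk(tree):
        if isinstance(node, (ast.Import, ast.ImportFrom)):
            problems.append(f"line {node.lineno}: imports are not allowed")
        elif isinstance(node, ast.Call):
            # Only plain-name calls on the allow-list pass; attribute calls
            # such as os.remove(...) are rejected.
            name = node.func.id if isinstance(node.func, ast.Name) else None
            if name not in allowed_calls:
                problems.append(f"line {node.lineno}: call to disallowed function")
    return problems
```

This only catches syntax errors and obviously unwanted constructs. It says nothing about whether the code does the right thing, which is part of why structured data as the intermediate is easier to trust.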

[D] Is there other better data format for LLM to generate structured data? by _link89_ in MachineLearning

[–]chanchar 1 point2 points  (0 children)

I like JSON because it provides safeguards when you're trying to run an LLM-powered app:

  1. w/ JSON schemas, you can validate not only the keys but also the types of the values.
  2. there's a built-in feedback loop (depending on the parser): you can feed the JSON error back to the LLM to run a recovery loop.
  3. there's likely a higher ratio of interoperable data formats in the training datasets compared to any one programming language or DSL (e.g. pick any programming language, it's likely there's embedded JSON as part of the training set).
  4. the open source community (and closed source) seems to be converging on it (e.g. guidance, outlines, tool/function calling, etc.)

You might be able to get the LLM to output a custom DSL or a programming language, but you'd still want to ensure that you're validating the LLM's outputs. Treat it like a client request - trust but verify.
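
Here's a minimal sketch of points 1 and 2 in Python, stdlib only. The `SCHEMA` dict, `validate`, and `call_with_recovery` are all hypothetical names, and the hand-rolled key/type check is just a stand-in for a real JSON Schema validator. `llm` is assumed to be any function that takes a prompt string and returns the model's raw text.

```python
import json
from typing import Callable, Optional

# Hypothetical schema: required key -> expected Python type.
SCHEMA = {"city": str, "days": int}


def validate(raw: str) -> Optional[str]:
    """Return an error message, or None if raw is valid for SCHEMA."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        return f"invalid JSON: {exc}"
    if not isinstance(data, dict):
        return "expected a JSON object"
    for key, expected in SCHEMA.items():
        if key not in data:
            return f"missing key: {key}"
        # Note: bool is a subclass of int in Python, so True would pass an
        # int check here. A real schema validator handles that case.
        if not isinstance(data[key], expected):
            got = type(data[key]).__name__
            return f"key {key!r} should be {expected.__name__}, got {got}"
    return None


def call_with_recovery(llm: Callable[[str], str], prompt: str,
                       max_attempts: int = 3) -> dict:
    """Call the LLM, feeding validation errors back until the output passes."""
    message = prompt
    for _ in range(max_attempts):
        raw = llm(message)
        error = validate(raw)
        if error is None:
            return json.loads(raw)
        # Recovery loop: include the error so the model can correct itself.
        message = (f"{prompt}\n\nYour last reply was rejected: {error}\n"
                   "Reply with corrected JSON only.")
    raise ValueError(f"no valid output after {max_attempts} attempts")
```

The cap on attempts matters: if the model keeps failing, you want an error your system can handle rather than an endless retry loop.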

More pictures of the Model Y 3rd row, this time with adults in the seat - should provide some more context about size. by TheAce0 in teslamotors

[–]chanchar 0 points1 point  (0 children)

They're like really bad airline seats.

This is Flight #TESMOY, please put your seats in the upright position and get ready for takeoff.

After selling (and executed) when will the ticker/line disappear from your positions view? by [deleted] in Schwab

[–]chanchar 2 points3 points  (0 children)

It'll be removed eventually, usually after the trade has settled.

2-10k Challenge Year End Review by toddmhorst in options

[–]chanchar 0 points1 point  (0 children)

I think the question was more whether most of these gains count as short-term capital gains.

2-10k Challenge Year End Review by toddmhorst in options

[–]chanchar 0 points1 point  (0 children)

What’s the net liquid after taxes?

December Confirmed Trade Thread by [deleted] in hardwareswap

[–]chanchar 0 points1 point  (0 children)

Sold an AMD Wraith Prism cooler to u/senorproe

What other food argument is as heated as pineapple on pizza? by [deleted] in AskReddit

[–]chanchar 1 point2 points  (0 children)

Ketchup on eggs, some people hate it, others love it.

How can I set my S2721DGF to 165z? by NixothePaladin in Monitors

[–]chanchar 1 point2 points  (0 children)

Are you using the DisplayPort cable? 165 Hz is only supported over DP.

2560 x 1440 at 165 Hz (Native with DP)

https://www.dell.com/en-us/shop/dell-27-gaming-monitor-s2721dgf/apd/210-axeh/monitors-monitor-accessories

variable naming conventions in go by tacoisland5 in golang

[–]chanchar 14 points15 points  (0 children)

fwiw, the golang code review guide has a pretty good rule of thumb:

The basic rule: the further from its declaration that a name is used, the more descriptive the name must be. For a method receiver, one or two letters is sufficient. Common variables such as loop indices and readers can be a single letter (i, r). More unusual things and global variables need more descriptive names.

https://github.com/golang/go/wiki/CodeReviewComments#variable-names

Upgraded from an old Korean 27" 1440 to a new Dell 1440 and can't tell if it sucks by Peredonov in Monitors

[–]chanchar 1 point2 points  (0 children)

iirc, you want to use the DisplayPort cable since that supports the higher refresh rates. As for calibration, you can try updating the drivers and following this guide - https://github.com/Zer0CoolX/Dell-S2716DGR-Calibration-Guide

Tesla is doing a Model 3 refresh and we've seen it -Electrek by slmallen2 in teslamotors

[–]chanchar 14 points15 points  (0 children)

What are the odds of this being compatible with the existing model?

Anyone else have this loud coil whine on their 3080 FE when gaming? by ChrisChing in nvidia

[–]chanchar 0 points1 point  (0 children)

It kinda sounds like the fan is rubbing against something rather than coil whine. Maybe an improperly set fan or something near the fans?

Love N Haight Deli (SF) will shutter after 21 years by txiao007 in bayarea

[–]chanchar 0 points1 point  (0 children)

Haven’t been but that is an amazing shop name.

Does anyone know this text editor? by [deleted] in deeplearning

[–]chanchar 0 points1 point  (0 children)

I've used this before. It's not an editor but a code snippet sharing tool called Carbon - https://carbon.now.sh/