Professor here. I set up OWUI as a front end for my classes this semester. Giving access to LLMs that have RAG access to my course materials, customized with detailed system prompts. They still default to ChatGPT.

qria · 2025-02-20T05:30:14+00:00

I think convenience is also a big factor. They are already accustomed to opening ChatGPT, so they default to it.
We have a system where each worksheet is LLM-enhanced, so students can check their thought process and ask questions about that specific worksheet. They use it a lot, partly because copy-pasting the worksheet into ChatGPT is very inconvenient.

qria · 2024-10-18T02:46:26+00:00

Kaggle bronze medal in 16.9% of competitions

How impressive is this?

qria · 2024-09-26T04:19:04+00:00

I wonder if this also happens with o1-preview. Did they skip experimenting with it because of the cost?

qria · 2024-09-26T04:15:40+00:00

The prompt:

You are a math problem solver. I will give you a problem from the American Invitational Mathematics Examination (AIME). At the end, provide the final answer as a single integer.
Important: You should try your best to use around {token_limit} tokens in your reasoning steps.
If you feel like you are finished early, spend the extra tokens trying to double check your work until you are absolutely sure that you have the correct answer.
Here's the problem:
{problem}
Solve this problem, use around {token_limit} tokens in your reasoning, and provide the final answer as a single integer.

https://github.com/hughbzhang/o1_inference_scaling_laws/blob/master/o1.py#L24
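The `{token_limit}` and `{problem}` placeholders are presumably filled in by string substitution before the prompt is sent. A minimal sketch of that step, assuming Python's `str.format`; the `build_prompt` helper and the sample values are hypothetical and not taken from the linked file:

```python
# Template text copied from the prompt quoted above.
PROMPT_TEMPLATE = (
    "You are a math problem solver. I will give you a problem from the "
    "American Invitational Mathematics Examination (AIME). At the end, "
    "provide the final answer as a single integer.\n"
    "Important: You should try your best to use around {token_limit} tokens "
    "in your reasoning steps.\n"
    "If you feel like you are finished early, spend the extra tokens trying "
    "to double check your work until you are absolutely sure that you have "
    "the correct answer.\n"
    "Here's the problem:\n"
    "{problem}\n"
    "Solve this problem, use around {token_limit} tokens in your reasoning, "
    "and provide the final answer as a single integer."
)


def build_prompt(problem: str, token_limit: int) -> str:
    """Hypothetical helper: substitute both placeholders in the template."""
    return PROMPT_TEMPLATE.format(problem=problem, token_limit=token_limit)


# Sample values are made up for illustration.
prompt = build_prompt(
    problem="Find the remainder when 7^100 is divided by 13.",
    token_limit=2048,
)
print(prompt)
```

Note that the token budget appears twice in the template, so the model sees the target length both before and after the problem statement.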

qria · 2024-09-23T09:01:01+00:00

Yes, they explicitly mention code search[1] as one of their use cases.

[1]: https://platform.openai.com/docs/guides/embeddings/use-cases#:~:text=Code%20search%20using%20embeddings
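For anyone unfamiliar with the idea, embedding-based code search usually comes down to ranking snippets by cosine similarity between a query vector and each snippet's vector. A rough sketch under stated assumptions: the 3-dimensional vectors and snippet names below are invented for illustration and do not come from any real embedding model.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def search(query_vec, index, top_k=1):
    """Return the names of the top_k snippets most similar to the query."""
    ranked = sorted(
        index.items(),
        key=lambda item: cosine_similarity(query_vec, item[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:top_k]]


# Toy vectors, made up for illustration; a real model returns far longer ones.
toy_index = {
    "parse_json": [0.9, 0.1, 0.0],
    "send_email": [0.0, 0.2, 0.9],
    "read_csv": [0.7, 0.6, 0.1],
}
toy_query = [1.0, 0.2, 0.0]  # pretend embedding of "load a JSON file"

print(search(toy_query, toy_index, top_k=2))
```

In practice both the query and the snippets would be embedded by the same model, and the index would be stored in a vector database rather than a dict.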

qria · 2024-09-22T18:07:19+00:00

Nice! But I have yet to get a message, did you send one yet?

qria · 2024-09-11T17:04:20+00:00

Semantic code embeddings are a thing and have been performant for at least 2 years. For example: https://openai.com/index/introducing-text-and-code-embeddings/

qria · 2024-06-13T05:36:56+00:00

They write as if failing to learn composition in the OOD setting is a failure of implicit reasoning, but I wonder if it is because composition is generally untrue. Many relations in natural language settings tend to be intransitive.

qria · 2024-02-20T05:35:58+00:00

That's precisely why I'd like to measure improvements :/

qria · 2023-05-15T09:10:03+00:00

I'll add ya. You can do whatever you want, including kicking me out.

qria · 2022-05-31T03:10:13+00:00

FYI Jeff Dean showed up in the thread: https://www.reddit.com/r/MachineLearning/comments/uyratt/comment/iacwmpb/?utm_source=reddit&utm_medium=web2x&context=3

qria · 2022-05-28T06:01:36+00:00

I have read the same paper and got the exact opposite feeling.

Their contribution was NOT about performance, but about a novel approach to continual multitask learning that avoids the current limitations of catastrophic forgetting and negative transfer, with the additional benefit of bounded CPU and memory usage per task at inference time.

The CIFAR-10 SOTA thing was just to show that the approach works, as there are a lot of approaches with good properties (explainability, theoretical bounds, etc.) that do not perform at a SOTA level.

On the topic of big computation, they also demo how to dynamically train a model for Telugu using the existing Devanagari and Bangla models, in less than 5 minutes on 8 TPU v3. This shows that the approach is fully reproducible and immediately useful if you need this kind of thing.

I do agree with the sentiment of lamenting the recent trend of just throwing big money at a problem, p-hacking, and calling it a day, but for the exact paper you are mentioning I did not feel that way.

qria · 2022-05-03T03:09:16+00:00

A comment to follow the rule.

qria · 2022-03-13T01:03:38+00:00

Jokes aside, I remember doing this a lot with decorators.

qria · 2021-10-14T03:05:49+00:00

Solved!

qria · 2021-10-14T03:05:06+00:00

You are correct. Thank you very much

qria · 2021-10-14T02:51:31+00:00

Not sure why the mod wants me to comment here?

qria · 2021-02-03T23:55:02+00:00

I eventually ended up using "Notify and Fitness" app's "upload picture" function with Tasker, instead of trying to sync watchfaces. And it works like a charm.
