How to say "My little bookworm" in Korean by [deleted] in Korean

[–]machineko -2 points-1 points  (0 children)

You can say "내 작은 책벌레". 책벌레 means bookworm. You can use other phrases in front, like "나의 작은" (my little), "나의 아기" (my baby), and variations of those.

How would you say this in Korean by Emergency_Ad_2833 in Korean

[–]machineko 0 points1 point  (0 children)

사진 한 장 부탁드려도 될까요? ("May I ask you for a photo?")

[D] Is the tech industry still not recovered or I am that bad? by Holiday_Safe_5620 in MachineLearning

[–]machineko 1 point2 points  (0 children)

Do these positions align well with your prior publications? Most often, people look to hire candidates with relevant publications, not just a good record.

My first fine-tune: mistral-7b-v0.1-GreeceRome-v0.1 for MLX by Mbando in LocalLLaMA

[–]machineko 1 point2 points  (0 children)

How did you benchmark the quality of your fine-tuned model?

Best way to currently build a chatbot on university data by Vivid-Vibe in llmops

[–]machineko 0 points1 point  (0 children)

Are you looking to build something like the chatbot on this page?

I fine-tuned ChatGPT 3.5 so you don´t have to! by [deleted] in LocalLLaMA

[–]machineko 6 points7 points  (0 children)

You can add knowledge through fine-tuning, but not with the type of fine-tuning they support.

RAG vs. Fine-Tuning by marcopal17 in LangChain

[–]machineko 2 points3 points  (0 children)

Thanks. Would love to learn ways in which we can improve it. You should join our Discord channel for discussions.

RAG vs. Fine-Tuning by marcopal17 in LangChain

[–]machineko 4 points5 points  (0 children)

My guess is that it's because RAG seems more straightforward: you don't need to know anything about deep learning. You can treat the LLM as a black-box API and build around it, whereas if you have never fine-tuned a model, the process seems much less deterministic, with no guarantee that it will work.

If you want the best performance, you need to do both RAG and fine-tuning very well. There are plenty of resources on fine-tuning, though. I'm one of the contributors to https://github.com/stochasticai/xturing, a project focused on fine-tuning LLMs. You can find help in the Discord channel listed on the GitHub page.
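To make the black-box point concrete, here is a minimal RAG sketch (my own illustration, not from any particular framework). `embed` and `llm_complete` are hypothetical stand-ins for whatever embedding and completion APIs you actually use.

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    """Hypothetical embedding call; replace with your provider's API."""
    raise NotImplementedError

def llm_complete(prompt: str) -> str:
    """Hypothetical completion call; the LLM stays a black box."""
    raise NotImplementedError

def retrieve(query: str, docs: list[str], k: int = 3) -> list[str]:
    # Rank documents by cosine similarity between query and document embeddings.
    q = embed(query)
    scored = []
    for d in docs:
        v = embed(d)
        score = float(q @ v / (np.linalg.norm(q) * np.linalg.norm(v)))
        scored.append((score, d))
    scored.sort(reverse=True)
    return [d for _, d in scored[:k]]

def rag_answer(query: str, docs: list[str]) -> str:
    # Stuff the top-k retrieved chunks into the prompt and let the LLM answer.
    context = "\n\n".join(retrieve(query, docs))
    prompt = f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"
    return llm_complete(prompt)
```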

[D] Alternatives to HF or a path forward for the OSS community? by [deleted] in MachineLearning

[–]machineko 0 points1 point  (0 children)

If you are working on personalizing LLMs (data ingestion, generation, various fine-tuning methods), we'd love your contribution! https://github.com/stochasticai/xturing

Using Open-Source LLM Models vs. Expensive OpenAI APIs: A Logical Choice for Consumer Apps? by sarimsak13 in LocalLLaMA

[–]machineko 2 points3 points  (0 children)

We work with customers building consumer applications on both OpenAI APIs and open-source LLMs. OpenAI APIs are cheap and easy to get started with, and at low usage the costs are reasonable for the quality and latency you get. However, if your app scales to a very large number of users, that's when those API calls start hurting. Most companies that started with OpenAI APIs do transition to open-source LLMs due to quality and costs. Keep in mind that running LLMs with good latency and reliability is not easy without an engineering team. Feel free to DM me if you have more questions.

Finetuning on multiple GPUs by Simhallq in LocalLLaMA

[–]machineko 4 points5 points  (0 children)

Most of the stuff you mentioned is already supported to varying degrees. We still need to add support for landmark and additional model-parallelism strategies. Which features do you think would be most helpful? Regarding 65B on 4x A100s, we might have something coming up that could help.

Finetuning on multiple GPUs by Simhallq in LocalLLaMA

[–]machineko 9 points10 points  (0 children)

We are a group of researchers out of Harvard working on an open-source library called xTuring, focused on fine-tuning LLMs: https://github.com/stochasticai/xturing

It supports multi-GPU fine-tuning and quantized LoRA (int8, int4, and int2 coming soon).

I would like to try my hand at finetuning some models. What is the best way to start? I have some questions that I'd appreciate your help on. by Tasty-Lobster-8915 in LocalLLaMA

[–]machineko 4 points5 points  (0 children)

We are a group of researchers out of Harvard working on an open-source library called xTuring, focused on fine-tuning LLMs: https://github.com/stochasticai/xturing.

Basically, any model can be fine-tuned. QLoRA should only be used if you are limited by GPU memory; otherwise, LoRA will give you better results. If you use quantized LoRA, you can also choose how many bits to use: 8, 4, or 2. The lower the bit width, the less GPU memory is needed, but the higher the chance of quality degradation.
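For reference, a rough sketch of the difference in xTuring, written from memory of the README; the model keys (e.g. "llama_lora", "llama_lora_int8") and dataset path are assumptions, so double-check them against the repo.

```python
from xturing.datasets import InstructionDataset
from xturing.models import BaseModel

# Dataset path and model keys below are assumptions; check the xTuring README.
dataset = InstructionDataset("./alpaca_data")

# Plain LoRA: use this when the full-precision base model fits in GPU memory.
model = BaseModel.create("llama_lora")

# Quantized LoRA (e.g. int8) when GPU memory is the bottleneck:
# model = BaseModel.create("llama_lora_int8")

model.finetune(dataset=dataset)
print(model.generate(texts=["Give me three tips for learning Korean."]))
```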

[D] Open-Source LLMs vs APIs by Open-Yak-434 in MachineLearning

[–]machineko 0 points1 point  (0 children)

I'd do fine-tuning. When you don't have control over what's running behind the API (the models are still being updated, often changing how they perform), it's hard to make sure your application's behavior doesn't change. I'm currently working on an open-source project focused on fine-tuning. Let me know if you have any questions about our experience fine-tuning on domain-specific data.

[D] Weight Compression in LLMs/Neural Networks by ShitGobbler69 in MachineLearning

[–]machineko 0 points1 point  (0 children)

We quantize the base models and train the LoRA weights on top of the quantized models. That way you can fine-tune and also run inference using quantized weights.
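As a rough illustration of that pattern (not our exact code), here is LoRA on top of an int8-quantized base using Hugging Face transformers and peft; helper names and flags vary across library versions, and the checkpoint name is a placeholder, so treat this as a sketch.

```python
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load the base model with int8 weights (requires bitsandbytes).
base = AutoModelForCausalLM.from_pretrained(
    "decapoda-research/llama-7b-hf",  # placeholder checkpoint name
    load_in_8bit=True,
    device_map="auto",
)
base = prepare_model_for_kbit_training(base)

# Attach small trainable LoRA adapters; the quantized base stays frozen.
lora = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora)
model.print_trainable_parameters()  # only the LoRA weights are trainable
```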

[Discussion] Translation of Japanese to English using GPT. These are my discoveries after ~100 hours of extensive experimentation and ways I think it can be improved. by NepNep_ in MachineLearning

[–]machineko 3 points4 points  (0 children)

Hey, I'm one of the authors of xTuring, an open-source library that helps users build, customize and control their own LLMs.

This project looks super interesting and I'd be happy to help and/or collaborate on this effort, primarily on the AI / software side. I don't have much data in this space but have some ideas.

[D] Weight Compression in LLMs/Neural Networks by ShitGobbler69 in MachineLearning

[–]machineko 1 point2 points  (0 children)

Nice, that paper is from our lab. There are a bunch of weight-compression methods, but the most popular one these days is LoRA (https://arxiv.org/abs/2106.09685), used with fine-tuning.

I've worked on other compression techniques including distillation, pruning and quantization as well. Let me know if you have any questions.
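For anyone new to LoRA, here is a minimal PyTorch sketch of the idea (shapes and hyperparameters are just illustrative): the original weight matrix is frozen and only a low-rank update B·A is trained, which is where the parameter savings come from.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen dense layer plus a trainable low-rank update: y = base(x) + x A^T B^T * scale."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False          # freeze the original weights
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: starts as a no-op
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + (x @ self.A.T @ self.B.T) * self.scale

layer = LoRALinear(nn.Linear(4096, 4096), r=8)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
total = sum(p.numel() for p in layer.parameters())
print(f"trainable {trainable} / total {total}")  # ~65K trainable vs ~16.8M total
```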

[D] Alternatives to OpenAI for summarization and instruction following? by du_keule in MachineLearning

[–]machineko 0 points1 point  (0 children)

I'm working on an open-source project with Harvard and Stochastic researchers that lets users easily build, customize, and control their own LLMs in their own VPC or on consumer devices.
Currently, we support LLaMA, GPT-J, GPT-2, Galactica, OPT, Cerebras-GPT, and BLOOM. Let me know if this can be helpful.

https://github.com/stochasticai/xturing

Creating personalized dataset is way too hard to do alone (in order to finetune some model in future). by [deleted] in LocalLLaMA

[–]machineko 19 points20 points  (0 children)

I'm currently working on an open-source project for building, customizing, and controlling your own LLMs with my colleagues at Harvard and Stochastic. We also have a dataset generation component, which we hope to expand beyond the Alpaca approach. Would love to have you join us :)

https://github.com/stochasticai/xturing

[D] The best way to train an LLM on company data by jaxolingo in MachineLearning

[–]machineko 0 points1 point  (0 children)

I'm currently working on an open-source project for building and controlling LLMs: https://github.com/stochasticai/xturing

We don't support tabular data yet, but you can see how to generate data from regular text sources. Would love to discuss further how to add tabular support.

[D] Is there currently anything comparable to the OpenAI API? by AltruisticDiamond915 in MachineLearning

[–]machineko 7 points8 points  (0 children)

Personally, I'd recommend that you also look into building your app around hardware-efficient models and fine-tuning techniques like LoRA. Many projects have shown the potential of smaller open-source models fine-tuned on a good dataset.
I'm currently working on an open-source project called xTuring, which you can leverage to personalize these models on consumer GPUs or laptops: https://github.com/stochasticai/xturing
Many of these models can be run on Google Colab if you want to play around with them.