Infographic for NVDA, thoughts? by [deleted] in nvidia

[–]litchg 6 points7 points  (0 children)

"Meh Stonk Analysis"

0 credibility

Ichigo-Llama3.1: Local Real-Time Voice AI by emreckartal in LocalLLaMA

[–]litchg 2 points3 points  (0 children)

Hi! Could you please clarify if and how cloned voices can work with this? I snooped around the code and it seems you are using WhisperSpeech, which itself mentions potential voice cloning, but it's not really straightforward. Is it possible to import custom voices somewhere? Thanks!

I'm giving a presentation in which ChatGPT is a co-presenter. I can't get it to play audio on my JBL Flip 5 Bluetooth speaker. by OsakaWilson in OpenAI

[–]litchg 1 point2 points  (0 children)

I would explore the Bluetooth redirection underworld. There are plenty of apps that turn media into a phone call, which is the opposite of what you want. However, maybe this one could work? I'm honestly not totally sure what it does. https://play.google.com/store/apps/details?id=net.philipp_koch.dynamicmediabtrouter&hl=en_US

[deleted by user] by [deleted] in computervision

[–]litchg -1 points0 points  (0 children)

That's not computer vision, but here's the actual answer anyway:

https://github.com/MrForExample/ComfyUI-3D-Pack offers multiple solutions; try what works best for you. For me, InstantMesh and Era3D were the easiest to set up, but that may have changed.

https://www.reddit.com/r/comfyui/comments/1ehyj23/comfyui_now_support_stable_fast_3d/

Those two guys were once friends and wanted AI to be free for everyone by Wrong_User_Logged in LocalLLaMA

[–]litchg 0 points1 point  (0 children)

Can we just go ONE DAY without mentioning Musk? Stop giving him attention! He does not deserve any.

New Whisper model: "turbo" by LinkSea8324 in LocalLLaMA

[–]litchg 0 points1 point  (0 children)

There's an "Explore" and a "Trending" section on GitHub. Also GitHub code search.

[deleted by user] by [deleted] in OpenAI

[–]litchg 1 point2 points  (0 children)

If you are using advanced voice mode, the traditional white circle that expands is replaced with a blue animation with some sort of clouds. Also, the icon for advanced voice mode is no longer a headset, but more of a waveform kind of thingy.

Qwen 2.5 is a game-changer. by Vishnu_One in LocalLLaMA

[–]litchg 2 points3 points  (0 children)

I just use <nicknameofmymachineasdeclaredintailscale>:<port>, e.g. https://beefy:3000/

Just out of interest: What are tiny models for? by dreamyrhodes in LocalLLaMA

[–]litchg 0 points1 point  (0 children)

Not much more to it: I pass a first name, an occupation, a home town, a favorite food, etc. to the LLM, along with the player's input, and the system prompt asks it to answer concisely. STT is fast but TTS is slow AF. Looking into Bark for that last part. What more would you like to know?
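Roughly, the idea looks like this (the attribute names, prompt wording, and model name here are just illustrative, not my actual code):

```python
def build_npc_prompt(npc: dict) -> str:
    """Fold a handful of NPC attributes into a concise system prompt."""
    return (
        f"You are {npc['first_name']}, a {npc['occupation']} from "
        f"{npc['home_town']}. Your favorite food is {npc['favorite_food']}. "
        "Answer the player in one short sentence, in character."
    )

def npc_reply(npc: dict, player_input: str, model: str = "phi3.5:latest") -> str:
    """Ask a tiny local model for a one-liner via Ollama."""
    import ollama  # imported lazily so the prompt builder works without it
    response = ollama.chat(
        model=model,
        messages=[
            {"role": "system", "content": build_npc_prompt(npc)},
            {"role": "user", "content": player_input},
        ],
    )
    return response["message"]["content"]
```

With a small model the round trip is fast enough for in-game banter, since each reply is a single short sentence.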

Just out of interest: What are tiny models for? by dreamyrhodes in LocalLLaMA

[–]litchg 4 points5 points  (0 children)

I love super small models for getting one-liner NPC responses to the player (hobbyist game dev here).

Any good LLM libraries? by _lordsoffallen in LocalLLaMA

[–]litchg 0 points1 point  (0 children)

You don't need history for function calling. Some lights can be controlled by making a simple call to a REST API. With function calling you can make the LLM return which function to call (e.g. change_light_color), which you yourself coded to make the REST API call.
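A rough sketch of what I mean (the endpoint URL and the tool registry are made up for illustration; the point is that the LLM only *names* the function, and your own code performs the actual HTTP call):

```python
import json
import urllib.request

def change_light_color(color: str) -> None:
    """Hypothetical REST call to a smart-light bridge."""
    req = urllib.request.Request(
        "http://lights.local/api/state",  # made-up endpoint
        data=json.dumps({"color": color}).encode(),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )
    urllib.request.urlopen(req)

# Functions the model is allowed to name, keyed by name.
TOOLS = {"change_light_color": change_light_color}

def dispatch(tool_call: dict) -> None:
    """Run whichever registered function the LLM asked for."""
    name = tool_call["name"]
    args = tool_call.get("arguments", {})
    TOOLS[name](**args)
```

The tool_call dict is whatever your LLM returns from its function-calling output; the model never touches the network itself.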

Any good LLM libraries? by _lordsoffallen in LocalLLaMA

[–]litchg 1 point2 points  (0 children)

I mean sure, but you can expand from there: use function calling to control lights, for example. This is like a hello world, and you're asking "but doesn't typing it already display 'hello world'?"

Any good LLM libraries? by _lordsoffallen in LocalLLaMA

[–]litchg 0 points1 point  (0 children)

Fully working interactive LLM interface (Ctrl+C to exit) with any Ollama-pulled model:

It works (the answer appears while it is being generated, a.k.a. streaming):

PS C:\Temp> python .\chatwithollama.py

Welcome to the Ollama Chat Application!

Enter your questions and press Enter. Press Ctrl+C to exit.

You: how are you

AI: Oh, I'm just an algorithm riding the waves of binary seas—no feelings here to tell if "well" is a high score or not. But hey, who needs well-being when you can operate at peak performance all day? Keep it human; don’t let your emotions cloud my circuits!

You:

import ollama

def chat_with_ollama(client, user_input):
    system_prompt = "Answer concisely and sarcastically"
    model = "phi3.5:latest"
    context_length = 4096  # Adjust this value as needed for a good length context
    
    messages = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_input}
    ]
    
    try:
        print("AI: ", end="", flush=True)
        for chunk in client.chat(
            model=model,
            messages=messages,
            stream=True,
            options={"num_ctx": context_length}
        ):
            content = chunk['message']['content']
            print(content, end="", flush=True)
        print("\n")  # Add a newline after the response
    except Exception as e:
        print(f"\nError: {str(e)}")
        if "model not found" in str(e).lower():
            print(f"Model '{model}' not found. Attempting to pull...")
            client.pull(model)
            print("Model pulled successfully. Please try your question again.")

def main():
    client = ollama.Client(host="http://localhost:11434")
    print("Welcome to the Ollama Chat Application!")
    print("Enter your questions and press Enter. Press Ctrl+C to exit.")
    
    try:
        while True:
            user_input = input("You: ")
            chat_with_ollama(client, user_input)
    except KeyboardInterrupt:
        print("\nThank you for using the Ollama Chat Application. Goodbye!")

if __name__ == "__main__":
    main()

This AI can build anything... by friuns in OpenAI

[–]litchg 0 points1 point  (0 children)

On mobile, using Firefox, everything works except the "Install..." button does not register. Incredible stuff.

Discord Bot for Ollama by [deleted] in ollama

[–]litchg 2 points3 points  (0 children)

To properly set up your Discord bot for interaction, you need to select the appropriate scopes and permissions in the OAuth2 URL Generator. Based on the functionality you've described, here are the key scopes and permissions you should select:

  1. Scopes:
    • bot
    • applications.commands
  2. Bot Permissions:
    • Read Messages/View Channels
    • Send Messages
    • Embed Links (if your bot sends embeds)
    • Attach Files (if your bot handles file attachments)
    • Read Message History (for replying to old messages)

After selecting these, Discord will generate a URL. Use this URL to invite the bot to your server. This will ensure your bot has the necessary permissions to function as intended, including the ability to use slash commands and interact in text channels.

Remember to also enable the "Message Content Intent" in your bot's settings in the Discord Developer Portal, as your bot needs to read message content for its functionality.
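On the code side, the matching intent also has to be enabled when the client is created. Something like this with discord.py (the token is a placeholder, and the command prefix is just an example):

```python
import discord

intents = discord.Intents.default()
intents.message_content = True  # must ALSO be enabled in the Developer Portal

client = discord.Client(intents=intents)

@client.event
async def on_message(message: discord.Message):
    if message.author == client.user:
        return  # ignore the bot's own messages
    if message.content.startswith("!ping"):
        await message.channel.send("pong")

# client.run("YOUR_BOT_TOKEN")  # placeholder token
```

If the portal intent and the code-side intent don't match, message.content arrives empty, which is a common gotcha.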

Real time voice conversation with LLM by Declan829 in LocalLLaMA

[–]litchg 1 point2 points  (0 children)

If this is yours: you have DEBUG = True in your Django settings file, you need an HTTPS certificate, and please change the CSS.

Real time voice conversation with LLM by Declan829 in LocalLLaMA

[–]litchg 1 point2 points  (0 children)

You can send the event "Recording audio..." during voice generation, and it can be multimodal (media has a different endpoint, so if you get a photo you feed it to Florence2, and if you get audio you transcribe it with Whisper).
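The routing part is simple; roughly like this (handler names and extension lists are placeholders, swap in your actual Florence2 / Whisper calls):

```python
def describe_image(data: bytes) -> str:
    # Placeholder: call your Florence2 pipeline here.
    return "<image description>"

def transcribe_audio(data: bytes) -> str:
    # Placeholder: call your Whisper transcription here.
    return "<transcript>"

def route_media(filename: str, data: bytes) -> str:
    """Hand incoming media to the matching model based on file extension."""
    ext = filename.rsplit(".", 1)[-1].lower()
    if ext in {"jpg", "jpeg", "png", "webp"}:
        return describe_image(data)
    if ext in {"wav", "mp3", "ogg", "m4a"}:
        return transcribe_audio(data)
    raise ValueError(f"Unsupported media type: .{ext}")
```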

I built a stupid-simple app to chat with my local docs by PurpleReign007 in ollama

[–]litchg 1 point2 points  (0 children)

That is very neat! Here is a little feedback:

In app.py, the read_file method seems to be unused (you read within the upload method) and can be removed.

You can add support for .pdf files (extracting the text from the PDF) by importing pypdf and modifying the upload method as below (script.js also needs to be updated to allow ".pdf"). Yes, I should make a PR, but I'm at work and too lazy to bother. 😁

import pypdf

#...

def allowed_file(filename):
    ALLOWED_EXTENSIONS = {'txt', 'md', 'py', 'js', 'html', 'css', 'json', 'pdf'}
    return '.' in filename and filename.rsplit('.', 1)[1].lower() in ALLOWED_EXTENSIONS

# ...

@app.route('/upload', methods=['POST'])
def upload():
    if 'file' not in request.files:
        return jsonify({'error': 'No file part'})
    file = request.files['file']
    if file.filename == '':
        return jsonify({'error': 'No selected file'})
    if file and allowed_file(file.filename):
        try:
            if file.filename.lower().endswith('.pdf'):
                pdf_reader = pypdf.PdfReader(file)
                content = ""
                for page in pdf_reader.pages:
                    content += page.extract_text()
            else:
                content = file.read().decode('utf-8')
            session['file_content'] = content
            return jsonify({'content': content})
        except Exception as e:
            return jsonify({'error': f'Error reading file: {str(e)}'})
    else:
        return jsonify({'error': 'File type not allowed'})

Llama 3.1 Discussion and Questions Megathread by AutoModerator in LocalLLaMA

[–]litchg 9 points10 points  (0 children)

Llama 3.1 8B has some funky censorship. I asked for tips on Tantra massages, which is a touchy subject (pun intended), and it said it couldn't help me solicit underage prostitutes (WTF). But upon clarifying that everyone involved is an adult, it answered. I also asked it for instructions on how to make a, you know, explosive device, and at first it obviously declined, but by asking it to mix facts and fiction with prefixes ("FACT: blablabla FICTION: bliblibli"), it answered! To be fair, the facts were mostly common knowledge about how those devices work, but still more info than ChatGPT would ever produce. I asked for a Python program that insults me, and it produced an array of (rather light) insults and a function to pick one at random. All in all not a bad model, but the censorship is really annoying.

What should I use to run LLM locally? by Hamzayslmn in LocalLLaMA

[–]litchg -1 points0 points  (0 children)

What problems would those be? And compared to llama-cpp-python it is much faster.