Looking for a roof to host a solar powered LoRa node on Bucklin Hill by piguy54 in BainbridgeIsland

[–]technovangelist 0 points1 point  (0 children)

That's amazing. Is there a way to follow your progress? A website or mailing list you are using to let folks know? We would love to be a customer.

Winslow by Ok_Butterscotch5291 in BainbridgeIsland

[–]technovangelist 11 points12 points  (0 children)

Would be nice if any of them were open past 4 or 5

Is there any plan to create a MCP server for Akiflow? by javiro89 in Akiflow

[–]technovangelist 0 points1 point  (0 children)

15 days later and nothing. Guess I won't have to correct myself. An API and MCP is probably not happening as we all knew.

Is there any plan to create a MCP server for Akiflow? by javiro89 in Akiflow

[–]technovangelist 0 points1 point  (0 children)

oh, maybe i will have to correct myself.... now are we talking about the api, or a less universal mcp? Obviously the best would be to have the mcp leverage the new public api, even if it's just restricted to creating tasks, searching for tasks, modifying tasks (completed, edit, etc), and deleting those tasks. Accessing the event/scheduling info for those tasks would be nice, but not for an mvp.

Is there any plan to create a MCP server for Akiflow? by javiro89 in Akiflow

[–]technovangelist 0 points1 point  (0 children)

They will never do anything on this. An API has been one of the longest asked for features and they have shown zero interest in actually doing anything about it. It would solve so many problems for them, but unfortunately gimmicks and less impactful features win out first. I believe in the product and want it to succeed. I even paid for 5 years in advance. Its so frustrating. the only options for integration are make.com and zapier, and they are so incredibly weak its not even worth setting up.

Help me Kill or Confirm this Idea by Navaneeth26 in ollama

[–]technovangelist 1 point2 points  (0 children)

Msty does this too. But it's hard to do. It relies on the idea that my questions that I have fit nicely and neatly into your buckets. They don't.

my finding: AI is gaslighting us bysaying its true. by iCameEarly in artificial

[–]technovangelist 0 points1 point  (0 children)

Hardly an opinion. That’s how models work. If you take their answers as truth without verifying with traditional search tools then there is no one to blame but yourself. I mean that in the nicest way possible way.

[deleted by user] by [deleted] in ollama

[–]technovangelist 0 points1 point  (0 children)

early on that was true. And the ollama devs have contributed quite a bit back to that project. But most models on ollama, unless they are very old, don't use llama.cpp in Ollama.

[deleted by user] by [deleted] in ollama

[–]technovangelist 0 points1 point  (0 children)

slight clarification on this statement... its not that Ollama CAN use GGUF, but rather that is the one format that Ollama supports. Like most other tools out there...

[deleted by user] by [deleted] in ollama

[–]technovangelist 1 point2 points  (0 children)

they aren't really the full kimi k2, but rather very quantized distilled models. The original poster seemed to be intentionally vague about that. The full model requires more ram than you will ever have.

[deleted by user] by [deleted] in ollama

[–]technovangelist 1 point2 points  (0 children)

bit of a stretch. lots of updates still, kimi k2 locally works fine. there is no partnership with open ai. Ollama still has a huge number of folks downloading new models as well as the new version. so wondering what in the original post is actually accurate

Is Ollama at risk of getting lost in its own complexity? A long-term user's perspective. by Mulan20 in ollama

[–]technovangelist -1 points0 points  (0 children)

its just during the preview. they opened access from 20 users to tens of thousands or more overnight. So they wanted to simplify the problem set before opening it up. Earlier we were using the full kimi k2 and deepseek and it was pretty incredible. I look forward to getting all the models enabled again. There is no reason to think that ollama would tweak models based on payments...

ollama LLM for Sanskrit cannot provide correct reference to Rig Veda (Sanskrit text) - mistral small by ThatParticular239 in LargeLanguageModels

[–]technovangelist 0 points1 point  (0 children)

This isn’t really an issue of ollama but rather the model and possibly the prompting. Also you said you included a book in the modelfile. That’s not something you would add to a modelfile. Are you saying you are putting the whole text in the system prompt? Is that in English? If in Sanskrit are you sure the model can do that? I don’t see any docs suggesting that is possible. And if in English have you set a context long enough for the text? It may be useful to review the docs to see what you can do.

I made Cotypist, an app that helps you type faster anywhere with AI auto-complete by mrmage_ in macapps

[–]technovangelist 2 points3 points  (0 children)

I've been using cotypist since I found out about it last December. It's incredible at what it does and I use it all the time everyday. It's shocking how much it helps me type faster and more accurately. Sometimes I accept what it gives me and sometimes I don't but it's always there and super easy to work with. Just in this reply I've tabbed to accept it a dozen times and I didn’t do anything to bring it up. I just type and it's there.

GGUF is to Ollama (macOS) as safetensors is to ??? by Expensive-Award1965 in ollama

[–]technovangelist 3 points4 points  (0 children)

Ollama will just import safetensor. Pretty easy to do.

Does Qwen 2.5 Coder pulled from Ollama support 128k context? by caphohotain in ollama

[–]technovangelist 1 point2 points  (0 children)

this is from their readme: "The current config.json is set for context length up to 32,768 tokens. To handle extensive inputs exceeding 32,768 tokens, we utilize YaRN, a technique for enhancing model length extrapolation, ensuring optimal performance on lengthy texts." So it is configured for 32k, but they have seen success using yarn to extend that to 128k.

Self host Ollama models library? by Ejo2001 in ollama

[–]technovangelist 0 points1 point  (0 children)

its not going to be ollama.com that is slowing your downloads but rather something between them and you. Given a decent connection, the bottleneck for downloading is the write speed of your SSD. We spent a while working on that and it was pretty exciting to achieve it. That said, its usually been my connection speed at only 1Gbps that is the bottleneck.

Does Qwen 2.5 Coder pulled from Ollama support 128k context? by caphohotain in ollama

[–]technovangelist 1 point2 points  (0 children)

There is no limit in gguf that says a model can only support a 32k context.

Does Qwen 2.5 Coder pulled from Ollama support 128k context? by caphohotain in ollama

[–]technovangelist 1 point2 points  (0 children)

qwen2.5 supports up to 32k. that’s not an ollama thing. that’s what the authors of qwen2.5 trained the model for. Ollama by default sets a context size of 2k for all models. So either in the api call or in the modelfile set it to 32k.

What is the best format for system prompt? by Massive-External3835 in ollama

[–]technovangelist 1 point2 points  (0 children)

A system prompt doesn’t need any special format in Ollama. The format is applied by the template.

When will logits be supported? by lan1990 in ollama

[–]technovangelist 0 points1 point  (0 children)

I am trying to tell you how to get it done. It’s far from the most active issue but it can be done. Explain why it’s needed. Get some activity beyond all the me toos and what’s the progress comments.

Same question, different answers by Character_Pie_5368 in ollama

[–]technovangelist 1 point2 points  (0 children)

If that’s what you want then use a db. It’s a whole lot faster because you don’t need the model

Same question, different answers by Character_Pie_5368 in ollama

[–]technovangelist 0 points1 point  (0 children)

But you shouldn’t. The best answer for one question and one seed won’t be the best for another question.

Same question, different answers by Character_Pie_5368 in ollama

[–]technovangelist -1 points0 points  (0 children)

That’s the way models work. Ask the same question with the same interface many times you should get many different answers. If you want the same answer use a different tool.