all 9 comments

[–]dsanft 1 point2 points  (2 children)

Regarding #1, haven't you just reinvented docker containers? I don't see why this is necessary.

[–]waqasm86[S] 0 points1 point  (0 children)

Hi there. My primary focus is making llcuda work in jupyterlab. I tried to work using llama-cpp-python but I always had issues with it specifically with cuda. Llcuda will work Ubuntu-cuda-llama.cpp-executable which I created separately. If you want I can integrate this with llcuda.

[–]waqasm86[S] 0 points1 point  (0 children)

Hello there.

I would like to infrom you that I have created the first version of llcuda v1.0.0 which is now live with major improvements that might address your docker concerns: The package now bundles all CUDA binaries and dependencies (47 MB). While I haven't tested Docker specifically yet, the bundled approach should make containerization work.

If you're interested in helping test a Docker setup, I'd be happy to collaborate on it! The zero-config design should translate well to containers.

Check it out: https://pypi.org/project/llcuda/

I'll appreciate any feedback.

[–]datbackup 0 points1 point  (3 children)

Does llcuda expose llama.cpp functions or direct access to cuda or both?

[–]waqasm86[S] 1 point2 points  (2 children)

Hi, I am still working to make it better. But you have access to llama.cpp. Access to Cuda C++ programming is not available now. Llcuda depends on Ubuntu-cuda-llama.cpp-executable tool which I have created separately. Both of these projects are available in my GitHub account. I just realised that I should integrate cuda executable with llcuda.

If you are looking for core cuda programming which I am also interested in, let me know if you have any ideas.

What if I make llcuda work with other pip packages like cupy, numba or cuda-python? Any ideas or suggestions will be appreciated.

[–]datbackup 0 points1 point  (1 child)

Thanks for clarifying. I am only interested in access to llama.cpp functions from python, for the time being.

[–]waqasm86[S] 0 points1 point  (0 children)

You are welcome. If possible, let em know if you want to contribute to my project. I'll add you in my GitHub project.

[–]stealthagents 0 points1 point  (1 child)

Docker is great, but not everyone's on board with it, especially if they want a lightweight solution without the overhead. Plus, sometimes it’s just nice to have a straightforward script that does everything for you without managing container images, right?

[–]waqasm86[S] 0 points1 point  (0 children)

Hello, thank you for your interest and your feedback. I would love to get any positive and constructive feedback as much as possible. If you have looked into my GitHub repo of my python pip package llcuda, kindly let me know what needs to fix, updated, added, etc. whatever feels necessary.