What is FOSS answer to BingGPT & Google Bard?

Ganesh Venugopal@lemmy.ml · edit-2 1 year ago

What is FOSS answer to BingGPT & Google Bard?

lloram239@feddit.de · edit-2 1 year ago

what do us poor folks on Linux do?

Run llama.cpp and any of the models listed here, that stuff has been around for months.

TheBloke has a lot of models converted to GGUF format which you need for llama.cpp.

Quick Start Guide (requires Nix, otherwise compile llama.cpp manually):

$ GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/TheBloke/guanaco-7B-GGUF
$ cd guanaco-7B-GGUF
$ git lfs pull --include=Guanaco-7B.Q4_0.gguf
$ nix run github:ggerganov/llama.cpp -- -m Guanaco-7B.Q4_0.gguf --instruct
> Write haiku about a penguin
 A penguin walks on ice,
 Takes a plunge in the sea,
 Hides his feet from me!

RickyRigatoni@lemmy.ml · 1 year ago

a package manager that can pull, build, and run from git with one command is pretty neat

257m@lemmy.ml · 1 year ago

I ran it on my pc with a gtx 1070 with cuda enabled and compiled with the cuda compile hint but it ran really slowly how do you get it to run fast?

lloram239@feddit.de · 1 year ago

To make use of GPU acceleration you have to compile it with the proper support (CUDA, OpenCL, ROCM) and add --gpu-layers 16 (or a larger number, however much your VRAM can handle). If that's not enough, than the GPU/CPU is probably to slow.

You can try a smaller model, those run faster, but give worse results.

257m@lemmy.ml · 1 year ago

Thanks I might try that out later.

What is FOSS answer to BingGPT &amp; Google Bard?

What is FOSS answer to BingGPT &amp; Google Bard?

What is FOSS answer to BingGPT & Google Bard?

What is FOSS answer to BingGPT & Google Bard?