0 sats \ 2 replies \ @OT 15 Jun \ parent \ on: Making my local LLM voice assistant faster and more scalable with RAG tech
Even for self-hosting like this guy's doing?
Yes. It's getting much easier. You can spin up a backend now without much tech knowledge. You don't even need a GPU (although one speeds things up significantly).
If you're interested, look into Ollama or LM Studio. They expose APIs for interfacing with the LLMs. Then there are a bunch of clients you can install and point at that endpoint.
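To give an idea of how simple the endpoint side is: Ollama serves an HTTP API on localhost port 11434 by default, and any client just POSTs JSON to it. A minimal sketch using only the Python standard library (the model name "llama3" is just an example; use whatever you've pulled):

```python
import json
import urllib.request

# Ollama's default local endpoint; LM Studio exposes a similar local server.
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_payload(prompt, model="llama3"):
    # "llama3" is an example model name -- substitute any model
    # you've downloaded with `ollama pull`.
    return json.dumps({"model": model, "prompt": prompt, "stream": False})

def ask_ollama(prompt, model="llama3"):
    # Send the prompt and return the model's text response.
    req = urllib.request.Request(
        OLLAMA_URL,
        data=build_payload(prompt, model).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# Example (requires a running Ollama instance):
# print(ask_ollama("Why self-host an LLM?"))
```

Any client app that speaks this kind of API can then be pointed at the same endpoint.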