0 sats \ 4 replies \ @OT 15 Jun \ on: Making my local LLM voice assistant faster and more scalable with RAG tech
Pretty crazy.
I guess us non-nerds are going to get something like this in 5-10 years.
Yes. It's getting much easier. You can spin up a backend now without much tech knowledge. You don't even need a GPU (although one speeds things up significantly).
If you're interested, look into Ollama or LM Studio. They provide APIs to interface with the LLMs. Then there are a bunch of clients you can install and point to this endpoint.
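For example, Ollama serves a simple JSON API on localhost:11434 by default. A minimal sketch of calling it from Python (the model name "llama3" is just an example; use whatever model you've pulled):

```python
import json
import urllib.request

# Ollama's default local endpoint; LM Studio exposes a similar
# (OpenAI-compatible) server on a different port.
OLLAMA_URL = "http://localhost:11434/api/generate"

# "llama3" is a placeholder; swap in any model you've pulled with `ollama pull`.
payload = {
    "model": "llama3",
    "prompt": "Why is the sky blue?",
    "stream": False,  # get one complete JSON response instead of a stream
}

req = urllib.request.Request(
    OLLAMA_URL,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

# Uncomment once an Ollama server is actually running locally:
# with urllib.request.urlopen(req) as resp:
#     print(json.loads(resp.read())["response"])
```

Any client that can POST JSON to that endpoint works, which is why there are so many front-ends you can just point at it.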