pull down to refresh

I really need to get my self-hosted LLama running...
What do you use it for?
Just general purpose research and analysis assistance, but I don't like the idea that what I'm doing now (say researching anti-money laundering laws and coinjoins or ecash legality) being later used to get me in trouble when the standards change.
reply
Maybe use ppq.ai (low volume) or venice.ai (higher volume) instead?
reply
I used to use Venice (free version) but I found the results weren't as good as Gemini or Chat. But I probably should have given the paid Venice version a chance before going to paid version of one of the popular llms.
reply
It used to be that you could pay for venice pro with sats - not sure if that's still the case.
ppq makes you pay per query and is anon and incentivizes LN, so you can basically use gpt-5 there anon, paying per query.
reply
There's Maple
reply
When you say ppq.ai (low volume) do you mean you should only use a small volume of queries? And is that so that the source LLM can't aggregate?
If so, how does venice.ai not have the same issue?
reply
Good question.
Venice is an inference provider, so they run open source models for you. Chat is a monthly/annual subscription. Their API is pay-per-token and involves tons of shitcoinery.
PPQ is a router and token reseller where you pay per token on both chat and API. If have a low volume of tokens (i.e. you don't use it much) this can be, despite their price markup, cheaper than paying for a subscription. If however you use it a lot, it won't be cost-effective.
reply
100 sats \ 1 reply \ @Signal312 2 Sep
Ah, I see, so you're not talking about privacy here, just cost.
One of the reasons I like ppq.ai is that you can switch easily between LLMs.
reply
Yes, privacy is a procedure - neither does KYC of any kind so it's easy to be anon on there.
I don't mind ppq, works as advertised. It's expensive though; there's room for competition there.
reply