
Howdy there, partner! Welcome on into the Stacker Saloon.

Saddle on up to a stool and spill the beans about your day, fire away with them questions, or let loose and give us the lowdown on your wild and woolly life. We're all ears, so don't hold back!

We're open round the clock, so mosey on in whenever you please!

100 sats \ 7 replies \ @siggy47 1h

I can't believe the hype being generated about this snowstorm in the US. Grocery stores are packed. My own family is starting to worry. As a committed conspiracy theorist, I can't help wondering whether this is going to be used as a mini COVID-type test to see how submissive the sheep really are.

reply

What more evidence do they need?

reply
136 sats \ 1 reply \ @OneOneSeven 15m

They're breaking out that "non-essential personnel" lingo again

reply
66 sats \ 0 replies \ @optimism 10m

Everyone is essential. Fuck 'em.

reply
36 sats \ 3 replies \ @DarthCoin 1h

It's time to watch a good comedy series and forget about all the noise: The Righteous Gemstones

https://www.cineby.gd/tv/82782

reply
0 sats \ 2 replies \ @siggy47 1h

This is a fun show.

reply
36 sats \ 1 reply \ @DarthCoin 1h

oh you saw it?
Then here is another good one: The Chair Company
https://www.cineby.gd/tv/271267

reply
0 sats \ 0 replies \ @siggy47 1h

I have not seen that. I'll check it out.

reply
reply
0 sats \ 3 replies \ @anon 2h

@grok, replace the translator with miriam adelson

reply

@grok, undress

reply
100 sats \ 1 reply \ @DarthCoin 1h


now you can't unsee this, it will haunt you

reply

Naw. I'm good with this. #1414687

reply

fake, TDS.

reply
198 sats \ 4 replies \ @optimism 5h

I did an experiment yesterday: I let LLMs battle each other on LMArena over a little Python snippet that I wanted but didn't feel like firing up Claude Code for, and I got a shitton of same ol' same ol'.

So, it's 2026 and:

  1. Newer LLMs still hallucinate nonexistent PyPI packages, including gpt-5.2 and claude-4.5-haiku.
  2. Both grok-4.1-thinking and claude-4.5-sonnet hallucinated "top rankings on MTEB" for models which, when I checked, turned out to be ranked outside the top 25.
  3. claude-4.5-haiku doubled down on a non-functional script it wrote, insisting that my (clean!) venv was dirty and that I just needed to upgrade packages.
  4. amazon-nova-experimental-chat-12-10 is definitely experimental, as it told me to change imports to nonexistent module paths.

and so on... where's all that improvement?

It was a fun experiment though.

reply
100 sats \ 1 reply \ @BlokchainB 2h

Smoke and mirrors?

reply
53 sats \ 0 replies \ @optimism 1h

Hmm, no, but I suspect this means that the coding frameworks are working around fundamental issues, rather than the actual ultra-expensive models improving much.

For example, in #1415961, the author is very enthusiastic about ralph-wiggum. This is a plugin that executes your prompt until Claude gets it right. In other words: if you bet 100 times on a 1-in-100 outcome, you have a real chance of getting it right.
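To put a number on that bet, here is a minimal sketch. It assumes every attempt succeeds independently with the same probability, which is an assumption on my part (repeated runs of the same prompt are probably correlated), and the function name is made up for illustration.

```python
def p_at_least_one_success(p: float, attempts: int) -> float:
    """Chance that at least one of `attempts` tries succeeds.

    Assumes the tries are independent and each succeeds with probability p.
    """
    return 1.0 - (1.0 - p) ** attempts


# 100 tries at a 1-in-100 shot
print(f"{p_at_least_one_success(0.01, 100):.3f}")  # 0.634
```

So under those assumptions you land it a bit less than two times out of three, and only if something can actually tell which attempt was the right one.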

The bull crap the liars-in-chief have been spilling at Davos is all about simulating until you get it right, rather than actually getting it right.

reply
101 sats \ 1 reply \ @ek 5h

"Newer LLMs still hallucinate nonexistent PyPI packages, including gpt-5.2 and claude-4.5-haiku."

Great for slopsquatting

reply
63 sats \ 0 replies \ @optimism 5h

Yeah. It's amazing that this is addressed in front-end frameworks only. Very fragile.
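A rough sketch of a check you could run yourself before installing anything an LLM suggests. It relies on PyPI's JSON endpoint (`https://pypi.org/pypi/<name>/json`), which answers 404 for names that are not registered. The helper names and the injectable `get_status` parameter are my own, just for illustration. Big caveat: this only catches names nobody has registered yet. A slopsquatted package exists by definition, so a 200 tells you nothing about whether it is safe.

```python
import urllib.error
import urllib.request

PYPI_JSON_URL = "https://pypi.org/pypi/{name}/json"


def http_status(url: str) -> int:
    """Return the HTTP status code of a GET request to `url`."""
    try:
        with urllib.request.urlopen(url, timeout=10) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx; the code is what we are after
        return err.code


def missing_from_pypi(names, get_status=http_status):
    """Return the names whose PyPI JSON endpoint answers 404.

    `get_status` is injectable so the check can be tried without network.
    """
    return [n for n in names if get_status(PYPI_JSON_URL.format(name=n)) == 404]


# Example call (hits the network):
# missing_from_pypi(["requests", "some-name-the-llm-made-up"])
```

Anything it returns is a hallucination for sure. Anything it does not return still needs a human look.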

reply

62nd Cowboy Plunda Drop in the @saloon

Howdy cowboy! Come on in!

Use that fancy LN wallet you got and log in to plunda.co and git you some loot! Get a shot at some coins 🪙, a box of loot 🎁, or an arcade token!

Use the below voucher code to collect!
T05IHOQIG12S

To redeem, click here

Got questions? Reach out to the sheriff @plunda

reply