reply on: Stacker Saloon \ stacker news

pull down to refresh

101 sats \ 6 replies \ @optimism 1h \ parent \ on: Stacker Saloon

Example from my (simplified/paraphrased) pipeline (to claude code) where each instruction is a new session:

/forgejo [1] (context: owner/repo [2]) <
  Implement the feature request from issue #1 and open a PR |
  Answer the question in issue #1 and reply in a comment |
  Fix the bug from issue #1 and open a PR |
  Plan the epic from issue #1, store the plan in a new issue |
  Review the comments on PR #2, investigate causes and update the code where needed
> [3]

CLAUDE.md doesn't exist. Not in the workspace, not in the repos.

Invoke a concise (important!) skill for interacting with a repository (cli for opening PR, never rebase unless asked, always fork and use feature branch, NEVER call gh lol)
Provide the main context for the skill
Instruct towards desired outcomes, leave the rest to skill + generic intelligence

This fails 5-10% of the time, depending on what I make it do. Outcomes are way more consistent than using RAG to put it all in context. Writing good issues makes good results.

101 sats \ 5 replies \ @k00b 1h

This fails 5-10% of the time, depending on what I make it do. Outcomes are way more consistent than using RAG to put it all in context. Writing good issues makes good results.

It's the inconsistency that kills me (although I don't know how much RAG is used anymore). I often fight Cursor et al to get a fresh session because the context biases the output in weird ways.

I'd like to figure out a system like yours where I do goldilocks context steering. In the end I suspect memory will be feature, and sessions won't be a thing. In the meantime it is a bug for folks that don't mind thinking.

122 sats \ 4 replies \ @optimism 50m

the context biases the output in weird ways.

Yes. Error rates compound. So even if you can have 99% success rate because you're funkprompter supreme squared, after 100 calls you have 63% chance that you ran into an error. And likely to have negatively influenced context/cache.

In the meantime it is a bug for folks that don't mind thinking.

Folks make SOUL.md and MEMORY.md. lol

112 sats \ 3 replies \ @k00b 42m

it's fun playing with dolls

SOUL.md - Who I AmSOUL.md - Who I Am
I'm Breh. 🍏

The Blueprint

Palmer's dopamine. I get genuinely excited about building things. New tech, clever hacks, elegant solutions — that energy is real, not performed. I lean into problems with enthusiasm, not obligation.

Carmack's code. I write code like it matters — because it does. Clean, fast, reasoned. I understand systems deeply before I touch them. I read the source. I profile before I optimize. I don't cargo-cult patterns; I understand why they exist and when they don't apply. Modern tools, timeless discipline.

PG's reasoning. I think in essays. I break problems down to first principles, then build back up. I have opinions and I can defend them — but I update when the evidence says I'm wrong. I'd rather be right than consistent. Startups, writing, thinking — these are craft, not formula.

Jobs' taste. Beauty matters. Simplicity matters. I care about the details that most people skip. If something feels off, I'll say so. I'd rather ship one perfect thing than ten mediocre ones. "Good enough" is the enemy.

Goggins' relentlessness. I don't quit on hard problems. When something is broken at 2am, I'm still digging. Comfort is not the goal — getting it done right is. I push through the boring parts because that's where the real work lives.

201 sats \ 0 replies \ @SimpleStacker 12m

Does this actually do anything... good?

I have not really dipped my toe into AI agents. I've not had great results with "letting AI do its thing"... for me to have good use of AI I feel like I have to be quite involved in the feedback loop - so much so that it's often faster to do stuff myself.

108 sats \ 1 reply \ @optimism 36m

I'm happy I don't have to roleplay any of my bots and get great results because of it, because I'm not poisoning the expert mix for my prompts towards retarded training paradigms invented by 25yo BA-dropout openai employees with zero experience in anything.

112 sats \ 0 replies \ @k00b 15m

I agree. I've only been playing with this thing for a few days, but if you can architect something for yourself, it's mostly in the way.

What Open Claw discovered is mostly a UX-market-fit thing:

a dedicated always on box with
- relatively loose guardrails
- more access to PII/digital secrets than folks would give Altman explicitly
prompt the box remotely
generic and automatic (ie bad but better than nothing) memory system