It's stuff like this that makes me scratch my head with all the talk of people saying they vibe coded this and vibe coded that.
How the heck do you vibe code a whole app knowing that the LLM will either hallucinate at some point or trust you too much and follow an erroneous request?
I'm not a developer, but I hack on various things.
My take on the various models available in Cursor is:
Claude-3.5 is the most laser-focused: if you ask it "help me write this function to do ABC", it will write a function to do ABC.
Claude-3.7: if you ask it "help me write this function to do ABC", it will write the function to do ABC, and then also clean up some old stale comments, improve functionX to work the same way ABC works, and edit a totally different file that needed the ABC function to use it.
Gemini-2.5: a happy medium between the two. Tends to write very good code (I assume they trained it on Google's whole internal code corpus). Seems best at understanding a complete codebase...
I sometimes switch between them based on my need... like if I really just want to optimize a function, I may choose Claude-3.5 because the "laser focus" keeps it from making too many other changes.
The best advice I can give you is to use a preamble, meaning a set of instructions before every prompt. The preamble should insist that you value accuracy above all and want reliable responses every time. After the preamble, you write your actual prompt.
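A minimal sketch of that advice: prepend a fixed instruction block to every prompt before it goes to the model. The preamble wording and the `build_prompt` helper here are my own invention for illustration, not any particular tool's API.

```python
# Hypothetical example: a fixed preamble prepended to every prompt.
PREAMBLE = (
    "You value accuracy above all else and want reliable, verifiable "
    "answers. If you are unsure, say so instead of guessing.\n\n"
)

def build_prompt(user_prompt: str) -> str:
    # Every prompt sent to the model starts with the same instructions.
    return PREAMBLE + user_prompt

print(build_prompt("Help me write this function to do ABC."))
```

The point is consistency: the model sees the same accuracy-first framing on every request, rather than relying on you to remember to ask for it each time.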
Apparently, it did not know about the Postgres documentation, which clearly states that a @> b means that a is the ancestor of b.
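For ltree, the documented semantics of a @> b (true when a is an ancestor of, or equal to, b) can be mimicked in plain Python as a prefix check on the dot-separated label path; this is just a sketch of the semantics, not how Postgres implements it:

```python
def is_ancestor(a: str, b: str) -> bool:
    # Mimics ltree's `a @> b`: true when path a is an ancestor of
    # (or equal to) path b. ltree labels are dot-separated.
    a_labels = a.split(".")
    return b.split(".")[: len(a_labels)] == a_labels

print(is_ancestor("Top.Science", "Top.Science.Astronomy"))  # True
print(is_ancestor("Top.Science.Astronomy", "Top.Science"))  # False
```

Note the direction: the ancestor goes on the left of @>, which is exactly the detail the model got wrong.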
I already knew that when I asked it the first question, but I wanted to see what it would say given a very neutral question, and it failed miserably with a lot of words.
My trust in it is broken if it ever existed to begin with 👀
I use claude-3.7-sonnet in Cursor, but I don't really know what to select there.