@anon
sign up
@anon
sign up
pull down to refresh
GDPval: Measuring the performance of our models on real-world tasks - OpenAI
openai.com/index/gdpval/
358 sats
\
3 comments
\
@Scoresby
6h
AI
related
GPT-5 Is More Expensive Than Claude — Here’s Why
385 sats
\
10 comments
\
@Tony
10 Aug
AI
The models are powerful as is. But where are the tools?
sharif.io/28-ideas-2025
173 sats
\
1 comment
\
@supratic
27 Sep
AI
Hacker Releases Jailbroken "Godmode" Version of ChatGPT
futurism.com/hackers-jailbroken-chatgpt-godmode
429 sats
\
8 comments
\
@ch0k1
30 May 2024
news
ChatGPT users outraged as GPT-5 replaces the models they love
arstechnica.com/ai/2025/08/chatgpt-users-outraged-as-gpt-5-replaces-the-models-they-love/
411 sats
\
5 comments
\
@jakoyoh629
8 Aug
AI
OpenAI announces new free model GPT-4o & app
venturebeat.com/ai/openai-announces-new-free-model-gpt-4o-and-chatgpt-for-desktop/
1619 sats
\
11 comments
\
@davidw
13 May 2024
AI
freebie
You can ask GPT-5 to pretend it is dumber than it is
487 sats
\
0 comments
\
@Tony
16 Aug
AI
Large Language Models Pass the Turing Test
arxiv.org/pdf/2503.23674
364 sats
\
11 comments
\
@south_korea_ln
15 Apr
AI
OpenAI o1 vs GPT 4o – Is it worth paying 6x more? - Bind AI
blog.getbind.co/2024/09/13/openai-o1-vs-gpt-4o-is-it-worth-paying-6x-more/
110 sats
\
0 comments
\
@ch0k1
15 Sep 2024
tech
Alibaba has released its flagship Qwen3-Max model with a trillion parameters
chat.qwen.ai/
167 sats
\
0 comments
\
@lunin
25 Sep
AI
Claude 3.5 Sonnet
www.anthropic.com/news/claude-3-5-sonnet
411 sats
\
0 comments
\
@k00b
21 Jun 2024
tech
"Benchwashing" - how do you defend against this?
1648 sats
\
10 comments
\
@optimism
9 Aug
AskSN
OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model
www.searchenginejournal.com/openai-secretly-funded-frontiermath-benchmarking-dataset/537760/
341 sats
\
0 comments
\
@frostdragon
21 Jan
tech
The week in AI, August 4-10, 2025
2323 sats
\
12 comments
\
@optimism
11 Aug
AI
Government Spending By Ideology
721 sats
\
6 comments
\
@antic
10 Aug
econ
OpenAI set to launch store as ChatGPT reaches 100mn users
883 sats
\
0 comments
\
@Bitman
7 Nov 2023
tech
LLM Agents can Autonomously Hack Websites
arxiv.org/pdf/2402.06664.pdf
464 sats
\
2 comments
\
@doofus
25 Feb 2024
security
GPT-5 Integration Doubles Autonomous Pentesting Performance on XBOW Platform
xbow.com/blog/gpt-5
154 sats
\
0 comments
\
@Tony
16 Aug
AI
Ilya Sutskever: AI will replace all human labor - YouTube
www.youtube.com/watch?v=zuZ2zaotrJs
257 sats
\
7 comments
\
@lunin
18 Sep
AI
LLM Rankings: programming | OpenRouter
openrouter.ai/rankings/programming
96 sats
\
0 comments
\
@m0wer
28 May
tech
AI does math: Multiplication using o1-mini vs GPT-4o
160 sats
\
5 comments
\
@zuspotirko
18 Sep 2024
tech
OpenAI launches real-time API and new voice model
193 sats
\
1 comment
\
@lunin
29 Aug
AI
more