I made a rough version of this in 2024[0], interesting to see that the idea is still around. I had the ability to set "quality thresholds", but it didn't seem to matter, the frontier models pretty much always agreed with each other and scored the answer highly, I should revisit it since it is a whole different ballgame than it was 2 years ago.
One of the best lessons I've learned was that the happiest I've been (so far) was a time when I was dirt poor, while chasing my dream that everyone assured me ends in poverty.
Things have changed, but it takes some of the financial anxiety away when I remember that I would still give up everything to go back to that time.
Deepseek made their 75% discount permanent, so I can imagine that Anthropic didn't want any of the news stories around this to focus on or mention a price increase.
When I see the domain of a post is neal.fun, I instantly get a huge grin because I know I am about to be delighted. Thank you Neal! The beach yurt with the mushroom soup was a hilarious touch.
When I asked some frontier models, many said that Teresa T is "widely referenced", which is evidence of your popularity and the ripple effects of your posts, so it would be interesting to see the same result from an unknown blog.
> When I asked some frontier models, many said that Teresa T is "widely referenced", which is evidence of your popularity and the ripple effects of your posts
That is some serious Gell-Mann-type amnesia. You’re trusting LLM models to give you accurate information about a subject we’ve already established (and are only talking about because) they can’t be trusted on.
“Widely referenced” is a common term which LLMs obviously pick up. Them outputting those words has no bearing on the truth and says nothing about the “popularity and the ripple effects of [Simon’s] posts”.
[0] https://github.com/Ceroxylon/konsensis
reply