That is a good approach: bottom up, manage complexity. But the general picture is that you set the direction and hold the model responsible, while it does the actual work. Think of your work as the negative of the AI's work: it writes the code, you make sure that code is tested. The better the test harness you build, the better the AI performs. The real task is to constrain the AI into a narrow channel of valid work, a concrete sketch of which follows below.
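As a rough illustration of that "narrow channel" (everything here is hypothetical: the module path, the slugify() contract, the specific tests): the harness is just a suite the agent's code has to satisfy before you even open the diff, e.g. with pytest.

```python
# tests/test_slugify.py
# A minimal sketch of a harness that agent-written code must pass.
# mypkg.text.slugify is a hypothetical stand-in for whatever you asked the
# agent to implement; the contract below is what constrains its output.

import pytest

from mypkg.text import slugify  # agent-written code under test (hypothetical)


@pytest.mark.parametrize(
    "raw, expected",
    [
        ("Hello World", "hello-world"),
        ("  trailing  spaces  ", "trailing-spaces"),
        ("What's up?", "whats-up"),
    ],
)
def test_known_examples(raw, expected):
    # Pin down concrete input/output pairs so the agent cannot drift.
    assert slugify(raw) == expected


def test_idempotent():
    # Re-slugging an already clean slug must be a no-op; this catches the
    # double-separator and trimming bugs that tend to slip through review.
    once = slugify("Mixed--Case  Input!")
    assert slugify(once) == once


def test_output_charset():
    # State the contract, not the implementation: only lowercase letters,
    # digits, and hyphens in the output.
    out = slugify("symbols #@! and spaces")
    assert all(c.islower() or c.isdigit() or c == "-" for c in out)
```

The point is less the specific assertions than that the acceptance criteria live in code you wrote, not in the agent's head.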
In teams of high performers who have built a lot of mutual trust, code reviews are mostly a formality and a stopgap against the big, obvious accidental blunders. "LGTM!"
I do not know or trust the agents that are putting out all this code, and the code review process is very different.
Watching the Copilot code review plugin complain about Agent code on top of it all has been quite an experience.
Didn’t we just see big pretraining gains from Google and likely Anthropic?
I like Dario's view on this: we've seen this story before with deep learning, and then we progressively got better regularization, initialization, and activations.
I'm sure this will follow suit; the graph of improvement is still linear, up and to the right.
Take it with a grain of salt; this is one man's opinion, even if he is a very smart man.
People have been screaming about an AI winter since 2010 and it never happened. It certainly won't happen now that we are close to AGI, which is a necessity for national defense.
I prefer Dario's perspective here, which is that we've seen this story before in deep learning. We hit walls and then found ways around them with better activation functions, regularization, and initialization.
This field is always a progression of hitting roadblocks and finding ways around them. The chart of improvement is still linear, up and to the right, and those gains are the accumulation of many small improvements.