Cool demo. A minor nitpick is that the code (and the article) forgets to handle the special case of a point inside the cube that happens to be exactly (0,0,0). This will result in a divide by zero when the vector is normalized.
Especially if you have already had the conversation with anyone and confidently stated that, yes, the possibility exists but it's so remote that it's just not worth addressing.
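To make the fix concrete, here is a minimal sketch of a normalization helper that guards against the origin case. The function name, the epsilon threshold, and the fallback direction (+Z) are all hypothetical choices, not taken from the demo's code; any deterministic fallback direction would do.

```python
import math

def normalize(x, y, z, eps=1e-12):
    """Return the unit vector in the direction of (x, y, z).

    Guards against the point being exactly (or effectively) at the
    origin, which would otherwise cause a divide by zero.
    """
    length = math.sqrt(x * x + y * y + z * z)
    if length < eps:
        # The point sits at the cube's center: every direction is equally
        # valid, so return a fixed axis (an arbitrary, hypothetical choice)
        # instead of dividing by zero.
        return (0.0, 0.0, 1.0)
    return (x / length, y / length, z / length)
```

The cost of the check is one comparison per call, which seems a fair trade against a NaN propagating through the rest of the pipeline.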
The article concludes that the overall translation score of Llama 4 is below that of Llama 3.3.
However, the included table shows that Llama 4 scores better on all subcategories included in the test - coherence, idiomaticity and accuracy.
Something does not add up. The conclusion just states "...downgrade from Llama 3.3 in every respect" without further explanation.
Looking at the individual language pages, it does come behind pretty often. And in Japanese for example, it has higher scores but also a much higher refusal rate. The summary page doesn't show a refusal rate column, so not all the data is represented there.
> This title is misleading though - where it tries to state that DeepSeek only focuses on research and not revenue.
IMO, the title is quite accurate and supported by the content of the article. The team is prioritizing AI research at the expense of short term profit.
If the title was "DeepSeek is solely a research effort" you would have a case.
Agreed. Per Liang Wenfeng's public comments, his aim was and remains to inspire his fellow Chinese geeks to lead in innovation rather than be satisfied with simply following the lead of, and creating better versions of, foreign (read: Western) mind products.
Lest we forget, this is only an option because others figured out, the expensive way, how most of the things DeepSeek relies upon actually work. You might even call it research.
Yes, but the real question is: Why are they able to do that? I understand we're debating nuances here, but my concern is about the overall impression the title gives. It positions DeepSeek as some kind of higher ideal, yet the article achieves this impression by deliberately overlooking key facts.
For example, why can Google afford to run Waymo, a self-driving car company? Is it because Google prioritizes self-driving cars and safety over profit?
No. It's because Google's core business—selling advertisements, monetizing personal data, and essentially profiting from surveillance—generates enormous amounts of money.
With all of this said, I am a fan of DeepSeek and the amount of openness they have.
TL;DR - The Bureau of Labor Statistics uses different employment categories for programmers and software developers. While the number of programmers has seen a steady decline since 2000, the number of software developers has seen a steady increase.
Currently, it feels like many of the frontier models have reached approximately the same level of 'intelligence' and capability. No one is leaps and bounds ahead of the rest. Microsoft probably figured this is a good time to reconsider their AI strategy.
Even in the OpenAI ecosystem there are models that, while similar in theory, produce very different results, so much so that some models are unusable. So even small differences translate to enormous differences in practice.
I use AI every day for work, mostly models from OpenAI, Anthropic and DeepSeek. In my experience none of them completely dominates the others. You seem to disagree strongly but don't state your argument: which model or company do you think is the clear leader currently, and why?
The AI race is super close and interesting at the moment in my opinion.
> The Exercism problems have proven to be very effective at measuring an LLM's ability to modify existing code
The Aider Polyglot website also states that the benchmark " ...asks the LLM to edit source files to complete 225 coding exercises".
However, when looking at the actual tests [0], it doesn't seem to be about editing code bases; it's rather just solving simple programming exercises. What am I missing?
It is "just" the domestic intelligence agency ordering Apple to backdoor their own system so it can supply data for lawful interception. As I read the article, it's not a UK backdoor in the sense that they can roam around in every user's data. The domestic agencies still need to follow the rules of lawful interception, namely they need a warrant, and it is targeted at UK nationals only. At least that is how I read the article.
When the AI dust settles, I wonder who will be left standing among the groups of developers, testers, scrum masters, project leaders, department managers, compliance officers, and all the other roles in IT.
It seems the general sentiment is that developers are in danger of being replaced entirely. I may be biased, but that seems unlikely to be the outcome in the long term. I can't imagine how such companies would stay competitive against developers who replace their boss with an AI instead.
https://www.bbc.com/news/articles/cwy7vrd8k4eo