That said, random or exhaustive search is a more scientifically useful method than you might think.
The first commercial antibiotics (sulfa drugs) were found by systematically testing thousands of random chemicals on infected mice. This was a major drug-discovery method up until the 1970s or so, when most of the search space of biologically active small molecules had been covered.
A few months ago I went to a similar talk. They took a carboxylic acid from a plant (I forget the name) that has some activity against the caterpillars that eat corn, and made something like 10 or 15 ester derivatives by reacting it with different organic alcohols. They tried different doses on the caterpillars and then built a computer model to predict the activity of similar compounds (QSAR). The idea is to run it over a long list of other organic alcohols and try to find a better compound.
But they chose chemical reactions that are routine in the lab, so they expect the synthesis to be workable in practice, and they keep most of the structure unchanged. So it's closer to what they classify here as "look near the known good points" than to a true random search.
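For what it's worth, here's a minimal sketch of how I'd picture that QSAR step in Python, assuming RDKit descriptors and a ridge regression from scikit-learn; the scaffold, SMILES, and activity numbers below are invented placeholders, not the talk's actual data:

    # Toy QSAR sketch -- descriptors, model, and data are my assumptions,
    # not what the group actually used.
    import numpy as np
    from rdkit import Chem
    from rdkit.Chem import Descriptors
    from sklearn.linear_model import Ridge

    def featurize(smiles):
        mol = Chem.MolFromSmiles(smiles)
        return [Descriptors.MolWt(mol), Descriptors.MolLogP(mol),
                Descriptors.TPSA(mol), Descriptors.NumRotatableBonds(mol)]

    # Esters already synthesized and assayed on the caterpillars (made-up values)
    train_smiles = ["CCOC(=O)c1ccccc1", "CCCCOC(=O)c1ccccc1", "CC(C)OC(=O)c1ccccc1"]
    train_activity = [0.42, 0.67, 0.55]  # e.g. fraction killed at a fixed dose

    model = Ridge(alpha=1.0).fit(
        np.array([featurize(s) for s in train_smiles]),
        np.array(train_activity))

    # Rank esters that could be made from a long list of other alcohols
    candidates = ["CCCCCCOC(=O)c1ccccc1", "OCCOC(=O)c1ccccc1"]
    scores = model.predict(np.array([featurize(s) for s in candidates]))
    for score, smi in sorted(zip(scores, candidates), reverse=True):
        print(f"{smi}: predicted activity {score:.2f}")

The useful part is the last step: the model is cheap to evaluate, so you can rank a long list of not-yet-synthesized esters and only take the top few back into the lab.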
Related, I was talking to a computational chemist at a conference a few years ago. Their work was mostly at the intersection of ML and materials science.
An interesting concept they mentioned was the idea of "injected serendipity" when screening for novel materials with a certain target performance: the screening proceeds as normal, but 10% or so of the screened materials are sampled at random from the wider chemical space.
They claimed this had led them to several interesting candidates across several problems.
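In pseudo-Python, that scheme would look something like the sketch below; the 10% figure is theirs, everything else (names, batch structure) is my guess at how it could be wired up:

    import random

    def build_screening_batch(model_ranked, chemical_space, batch_size=100,
                              serendipity_frac=0.10):
        # Fill most of the batch with the model's top-ranked candidates, but
        # reserve ~10% of slots for materials drawn uniformly at random from
        # the wider chemical space ("injected serendipity").
        n_random = int(round(batch_size * serendipity_frac))
        batch = list(model_ranked[:batch_size - n_random])
        batch += random.sample(chemical_space, n_random)
        return batch

The random slice is what keeps the screen from collapsing onto the model's current beliefs about which parts of the space are worth looking at.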
Jeff Dean literally featured it in a tweet announcing the model. Personally, it feels absurd to believe they've put absolutely no thought into optimizing this type of SVG output, given the disproportionate amount of attention devoted to one specific test for over a year.
I wouldn't really even call it "cheating", since it has improved models' ability to generate artistic SVG imagery more broadly, but the days of this being an effective way to evaluate a model's "interdisciplinary" visual reasoning abilities have long since passed, IMO.
It's become yet another example in the ever-growing list of benchmaxxed targets whose original purpose was defeated by teaching to the test.
...only if you deliberately attempt to extract it by repeatedly prompting it to complete fragments of the book. They had to do quite a bit of work to make this happen.
So? It demonstrates that LLMs retain the copyrighted material in their weights. This is an important thing to consider about LLMs, and it shows that there need to be better protections for the creative industry.
Really? I retain plenty of copyrighted material in my head. What matters is the contexts in which I reproduce it (if any).
A search index might also contain copyrighted material. As long as it's used for search queries as opposed to regurgitation there's no problem. Search indexes and LLMs are both clearly very beneficial tools to have access to.
What does this (thought) experiment accomplish? That is, what point are you trying to make here?
Since we're talking about an electronic system, the search index example is the more directly relevant one. Anyone who wants to object to LLMs is going to need to take care to ensure consistency with their views on Google's search index.
You could do the equivalent if they would let you. They don't. That's the point I was getting at. How the thing is used is what actually matters, not that it has "absorbed" copyrighted material.
I never claimed any change in copyright law. Only that one analogy was more direct than the other for the purpose of the current discussion.
You didn't answer my question. What point were you trying to make with your earlier reply?
Professional performers could certainly be viewed as such in this analogy. They memorize and then reproduce copyrighted material as a matter of course.
And when they do, that's when copyright protections might come into play. But they don't apply to the basic learning that comes with being a human being.
My playing copyrighted music on my synths at home, or singing along to the lyrics, is different from being a professional musician who benefits financially from playing someone else's music in public.
Producing a product = market rules apply
Just living as a human = totally different thing
Yes, I agree. That was my entire point when I said: What matters is the contexts in which I reproduce it (if any).
The issue is not (or at least should not be) that LLMs are trained on material subject to copyright or can be very intentionally coaxed into regurgitating copyrighted material. The issue should be people building or using systems with the explicit intent of reproducing copyrighted material in an unauthorized manner.
> If an LLM is a product, and it contains the work (in this case can spit out Harry Potter) it is derivative. Doesn't matter what it's used for.
That's not the definition of a derivative work in copyright law. Further, even when something legally qualifies as a derivative work, whether it falls within the copyright holder's exclusive rights is, in the US, subject to the statutory exceptions to those rights, notably the fair use exception, which very much does depend on, among other things, what it is used for.
That's dogma on your part. Rather than practical outcomes, you're opting for human exceptionalism. I can't accept that.
Merely containing a work doesn't make something derivative. A photograph could inadvertently capture a copyrighted image in the background, but so long as it isn't the primary focus, I think your line of reasoning there fails.
It is your view that's dogmatic. The law in this area has yet to be fully tested in court, let alone any prospective changes that might be made to it in the near future.
Regardless, I thought this was a discussion about what the law ought to say.
The defense is that the model is not designed to output Harry Potter verbatim, and in fact will not unless you jump through lots of hoops. Image generation would probably provide you with a stronger position here since those setups can easily output likenesses without needing to carefully engineer the prompt to cause them to do so. But even then it is clearly not the intention of the people training or deploying them that they be used that way.
> Hourglass. A subject would fall into this shape category when there is a very small difference in the comparison of the circumferences of her bust and hips AND if the ratios of her bust-to-waist and hips-to-waist are about equal and significant (Simmons, 2002)
> Rectangle. A rectangular subject would have her bust and hip measure fairly equal AND her bust-to-waist and hip-to-waist ratios low. She would not possess a clearly discernible waistline (Simmons, 2002)
Over here (EU), I'd say most women would definitely be "hourglass-shaped" more than any other shape. Maybe for some it would be a tie with "rectangle", but I'm breaking the tie on the grounds that hourglass doesn't have to mean wasp-waisted either. So I couldn't reconcile my anecdotal observation with the stated facts until it dawned on me that these were U.S. stats.
> One 2007 study found that half of women (49%) in the U.S. were considered rectangle-shaped. Only 12% of women had a true hourglass figure.
> Results from the 2007–2008 National Health and Nutrition Examination Survey (NHANES), using measured heights and weights, indicate that an estimated 34.2% of U.S. adults aged 20 years and over are overweight, 33.8% are obese, and 5.7% are extremely obese.
And apparently it's worse for women (35.5% obese) than men (32% obese).
Anyway, I'm not sure what "true hourglass" is even supposed to mean (wasp-waist?); according to the definition, some waistline plus balanced bust and hips => you're hourglass. If you start using "rectangle" as a fallback when in doubt, then of course it's going to rate higher.
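To make the threshold point concrete, here's a toy version of a Simmons-style rule in Python; the cutoffs are invented for illustration, not numbers from the study:

    def classify(bust, waist, hips, eq_tol=0.05, ratio_cut=1.10):
        # Cutoffs are invented for illustration; the quoted definitions don't
        # give the exact numbers used in the studies.
        bust_hips_close = abs(bust - hips) / max(bust, hips) <= eq_tol
        bust_waist, hips_waist = bust / waist, hips / waist
        if (bust_hips_close and abs(bust_waist - hips_waist) <= eq_tol * bust_waist
                and min(bust_waist, hips_waist) >= ratio_cut):
            return "hourglass"
        if bust_hips_close and max(bust_waist, hips_waist) < ratio_cut:
            return "rectangle"
        return "other"

    # Same measurements, different idea of what a "significant" ratio is:
    print(classify(96, 76, 99))                  # -> "hourglass"
    print(classify(96, 76, 99, ratio_cut=1.35))  # -> "rectangle"

Where you set the "significant ratio" cutoff decides whether the same measurements count as hourglass or fall back to rectangle.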
Funnily enough the very study linked is a comparison with another country (Korea):
This is just the price of being on the bleeding edge.
Unfortunately, prompt injection does strongly limit what you can safely use LLMs for. But people are willing to accept the limitations because they do a lot of really awesome things that can't be done any other way.
They will figure out a solution to prompt injection eventually, probably by training LLMs in a way that separates instructions and data.
I've said it before and I'll say it again, all AI discussion feels like a waste of effort.
“yes it will”, “no it won’t” - nobody really knows, it's just a bunch of extremely opinionated people rehashing the same tired arguments across 800 comments per thread.
There’s no point in talking about it anymore, just wait to see how it all turns out.
Now that it's basically here, people really are getting attached to AI bf/gfs. It's only a matter of time until romance scams start using LLMs.