But we don’t know how much larger the models will have to be, how large the data sets will need to be, or how much training is needed, do we? They could have to be inconceivably large.
If you want to correct for this particular problem you might be better off training a face detector, an eye detector, and a model that takes both eyes as input and fixes the reflections. The process would then be (a rough sketch follows below):
- generate image
- detect faces
- detect eyes in each face
- correct reflections in eyes
That is convoluted, though, and would get even more so once you want to correct for multiple such issues. It might also mishandle faces with glass eyes, but you could try to ‘detect’ those with a model trained on the prompt.
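A minimal sketch of that pipeline, assuming OpenCV's stock Haar cascades for the face and eye detection steps; `fix_reflections` is a placeholder for the hypothetical two-eye correction model:

```python
import cv2

# Stock Haar cascades shipped with opencv-python
face_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")
eye_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_eye.xml")

def fix_reflections(eye_patches):
    # Placeholder: a trained model would take both eye crops and
    # return versions with geometrically consistent reflections.
    return eye_patches

def correct_eye_reflections(img):
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    # Step 2: detect faces in the generated image
    for (fx, fy, fw, fh) in face_cascade.detectMultiScale(gray, 1.1, 5):
        # Step 3: detect eyes within each face region
        face_gray = gray[fy:fy + fh, fx:fx + fw]
        eyes = eye_cascade.detectMultiScale(face_gray, 1.1, 5)
        if len(eyes) != 2:
            continue  # need exactly two eyes to cross-check reflections
        patches = [img[fy + ey:fy + ey + eh, fx + ex:fx + ex + ew]
                   for (ex, ey, ew, eh) in eyes]
        # Step 4: correct the reflections and paste the patches back
        for (ex, ey, ew, eh), patch in zip(eyes, fix_reflections(patches)):
            img[fy + ey:fy + ey + eh, fx + ex:fx + ex + ew] = patch
    return img

# Step 1 happens elsewhere (the generator); then:
# corrected = correct_eye_reflections(cv2.imread("generated.png"))
```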
The opposite might also be true: just having better, well-curated data goes a long way. LAION worked for a long time because it's huge, but what if all the garbage images were filtered out and the annotations were better?
The early generations of image and video models used middling data because it was the only data. Since then, literally everyone with data has been working their butts off to get it cleaned up to make the next generation better.
Better data, more intricate models, and improvements to the underlying infrastructure could mean these sorts of "improvements" come mostly "for free".