More

serf · 2026-04-29T01:30:52 1777426252

>I am still baffled by the fact that we have collectively agreed to use agentic harnesses by the same companies that are selling access to their APIs.

the best performing and capable ones are all the ones that aren't tied to a specific api.

serf · 2026-04-28T17:35:43 1777397743

me too. used codex to convert a bunch of riggings between lots of models via blender api.

it felt weak at it , like the corpus wasn't strong with blender/python work to look through , but it got going at it fairly fast with some coaxing.

embedding-shape · 2026-04-28T18:08:36 1777399716

What model you using? With codex and gpt-5.4 set to xhigh (and now gpt-5.5) seems to have zero issues helping me with rigging and fixing glb/fbx models, works as a charm. One time I instructed it to iterate together with screenshots because it was a gnarly task, but usually it figures out everything even when headless.

serf · 2026-04-28T17:14:57 1777396497

isn't this just sort of turning chicken-or-egg?

if an AI uses static analyzers to do work ,is it the tool or the ai ?

if AI is using grep to do the work, is it the AI or grep?

I mean essentially all agent work boils down to "cat or grep?"

serf · 2026-04-28T07:45:41 1777362341

36ish hours a charge w/ HR stuff enabled.

hobo_mark · 2026-04-28T07:51:11 1777362671

Yet, my 10+ years old pebble 2 HR lasts a week, and the new pebble time 2 claims up to a month.

swiftcoder · 2026-04-28T08:07:11 1777363631

Yeah, 36 hours is honestly pretty disappointing. The old Withings ScanWatch easily ran >2 weeks with HR and notifications enabled, I'd have expected similar performance from the Casio.

serf · 2026-04-27T01:56:11 1777254971

in my car circles the 968 was seen as a total pos that was really just sort of trying to compete with the RX-7 and Fairlady, do a worse job at being a good sports car than them, and push the brand into further cheapened territory towards the every-person for the sake of financial incentive while inflating the cost of their premium offering, the 911.

1:1 example, but i'm not sure those were the points being made here.

majormajor · 2026-04-27T02:23:43 1777256623

The 968 is such a weird choice for this when the Boxster exists, did basically everything better, was a major commercial success, and has spawned a line of cars that many argue are better than the 911 except for the name and traditionalist-fandom over exact engine position that prevents Porsche from giving them all the biggest engines and fanciest tech.

But the Boxster didn't try to replace the 911 on day one. Or even go after the other 300ZX/Supra/whatever 2+2s on day one. It was instead nearly a whole-cloth "what if pure 2-seater convertible driver's car, but the best possible version" upscale-Miata initially, which wasn't an existing segment at all, and being roadster-first was a key separator from the also-2-seater Corvette.

(The iPhone or iPad were arguable Apple's Boxster "entry-level that ends up dominating sales and growing into full blown new product lines" anyway, except that the comparison eventually falls down because the form factor difference with the Mac is much more of a fundamental separation. So maybe Apple's Boxster is instead the laptop in the first place, which wiped out most of their desktop workstation business by the early-2010s at latest.)

71bw · 2026-04-27T07:36:01 1777275361

I presume the 968 was chosen because it all seems like the Neo is only the first hurrah into this whole entry-level field for Apple.

keyle · 2026-04-27T04:46:05 1777265165

Yeah this is looking at the 968 with rose tinted glasses. But a lot of the comparison does check out and the Neo is a fine on-ramp for first time macOS users just like the 968.

alamortsubite · 2026-04-27T15:30:13 1777303813

Porsche killed the 944S Turbo because it was accidentally faster than the Carrera and 930, and that was taboo. Its successor, the 968, was the awkward compromise.

serf · 2026-04-24T23:02:32 1777071752

>Such as?

it's crazy that the experiences are still so wildly varying that we get people that use this strategy as a 'valid' gotcha.

AI works for the vast majority of nowhere-near-the-edge CS work -- you know, all the stuff the majority of people have to do every day.

I don't touch any kind of SQL manually anymore. I don't touch iptables or UFW. I don't touch polkit, dbus, or any other human-hostile IPC anymore. I don't write cron jobs, or system unit files. I query for documentation rather than slogging through a stupid web wiki or equivalent. a decent LLM model does it all with fairly easy 5-10 word prompts.

ever do real work with a mic and speech-to-text? It's 50x'd by LLM support. Gone are the days of saying "H T T P COLON FORWARD SLASH FORWARD SLASH W W W".

this isn't some untested frontier land anymore. People that embrace it find it really empowering except on the edges, and even those state-of-the-art edge people are using it to do the crap work.

This whole "Yeah, well let me see the proof!" ostrich-head-in-the-sand thing works about as long as it takes for everyone to make you eat their dust.

hattmall · 2026-04-25T04:20:38 1777090838

People ask for examples because they want to know what other people are doing. Everything you mention here is VERY reasonable. It's exactly the kind of stuff no one is going to be surprised that you are getting good results with the current AI. But none of that is particularly groundbreaking.

I'm not trying to marginalize your or anyone else's usage of AI. The reason people are saying "such as" is to gauge where the value lies. The US GDP is around 30T. Right now there's is something like ~12T reasonably involved in the current AI economy. That's massive company valuations, data center and infrastructure build out a lot of it is underpinning and heavily influencing traditional sectors of the economy that have a real risk of being going down the wrong path.

So the question isn't what can AI do, it can do a lot, even very cheap models can handle most of what you have listed. The real question is what can the cutting edge state of the art models do so much better that is productively value added to justify such a massive economic presence.

leptons · 2026-04-24T23:47:22 1777074442

That's all well and good, but what happens when the price to run these AIs goes up 10x or even 100x.

It's the same model as Uber, and I can't afford Uber most of the time anymore. It's become cost prohibitive just to take a short ride, but it used to cost like $7.

It's all fun and games until someone has to pay the bill, and these companies are losing many billions of dollars with no end in sight for the losses.

I doubt the tech and costs for the tech will improve fast enough to stop the flood of money going out, and I doubt people are going to want to pay what it really costs. That $200/month plan might not look so good when it's $2000/month, or more.

nvader · 2026-04-25T00:39:40 1777077580

Why not try it yourself? Inference providers like BaseTen and AWS Bedrock have perfectly capable open source models as well as some licensed closed source models they host.

You can use "API-style" pricing on these providers which is more transparent to costs. It's very likely to end up more than 200 a month, but the question is, are you going to see more than that in value?

For me, the answer is yes.

leptons · 2026-04-25T02:28:22 1777084102

What makes you think I haven't tried it myself?

The "costs" are subsidized, it's a loss-leader.

jeremyjh · 2026-04-25T22:50:05 1777157405

Bedrock and other third party open weight hosted model costs are not subsidized. What could possibly be the investment strategy for being one of twelve fly-by-night openrouter operators hosting the latest Qwen?

Jach · 2026-04-25T00:20:54 1777076454

It's an important concern for those footing the bill, but I expect companies really in the face of being impacted by it to be able to do a cost-benefit calculation and use a mix of models. For the sorts of things GP described (iptables whatever, recalling how to scan open ports on the network, the sorts of things you usually could answer for yourself with 10-600 seconds in a manpage / help text / google search / stack overflow thread), local/open-weight models are already good enough and fast enough on a lot of commodity hardware to suffice. Whereas now companies might say just offload such queries to the frontier $200/mo plan because why not, tokens are plentiful and it's already being paid for, if in the future it goes to $2000/mo with more limited tokens, you might save them for the actual important or latency-sensitive work and use lower-cost local models for simpler stuff. That lower-cost might involve a $2000 GPU to be really usable, but it pays for itself shortly by comparison. To use your Uber analogy, people might have used it to get to downtown and the airport, but now it's way more expensive, so they'll take a bus or walk or drive downtown instead -- but the airport trip, even though it's more expensive than it used to be, is still attractive in the face of competing alternatives like taxis/long term parking.

Peritract · 2026-04-25T00:02:27 1777075347

None of that is concrete though; it's all alleged speed-ups with no discernable (though a lot of claimed) impact.

> This whole "Yeah, well let me see the proof!" ostrich-head-in-the-sand thing works about as long as it takes for everyone to make you eat their dust.

People will stop asking for the proof when the dust-eating commences.

serf · 2026-04-24T18:35:32 1777055732

same story her with different context.

my father has cardiac issues, serious ones. When a doctor asks what he wants to do he routinely says "Sail around the world, solo!" because that's about the stupidest most risky thing a person with a bad heart could consider.

So now every single doctor reads the transcript and starts with saying "I think it'd be really poorly advised for you to keep considering your worldwide solo voyage."

AI summarization doesn't carry the tone well. Most any but the most serious humans would catch the way he's saying it as a joke.

serf · 2026-04-24T06:48:55 1777013335

it's a weird period we're in where I can read a line like "A Eulogy for .." and just presuppose it'll be about AI and I'm right 90% of the time.

personally I think that phenomena says more about the messaging of the protestors -- on either side, really -- than the true state of things.

now having read it :

1) I don't feel disgusted at novice vibe-coded games. I think it's fun to see people finally have the barrier to entry of programmable computing lifted in front of my eyes.

2) the whole thing reads like gatekeeping. I don't care about your intimate relationship with your extra limb. I'm good at using a hammer, but I welcome improvements. I don't think the next hammer is going to somehow diminish the sanctity of the hammerers union and their divine holy act.

3) Have fun with all the zillions of anti - AI forks. I wish the best , but I have been around the block to know where the future is headed. It's going to be increasingly hard to keep up with the features and bug squashing of AI empowered groups, and crowds are going to move to where the experience is better and the corporations point, not where the ethics lie.

This is one of those losing anti-coporation RMS style fights that I hope the best for, but only a fool would be unable to read the writing on the walls.

serf · 2026-04-23T21:43:00 1776980580

an esp32 on an 1100mah battery will last years on deep sleep, and about a day with wifi on and in high power modes.

a pixel watch 4 says they last 30 hours , ambiguously. they use a battery less than half the size. in reality with constant use they'll drop dead in 6 hours.

the thing is clunky and heavy , anyway -- so if it lasts as long as an off the shelf watch who cares?

also, the primary reason : lilygo shoves ESPs into everything.

explodes · 2026-04-24T00:23:45 1776990225

Several Garmin watches last for weeks (24 days full charge, actual 1-2 weeks with heavy gps and fitness tracking), and I struggle to understand why consumers accept anything less. It seems like consumers don't realize what's available.

herbst · 2026-04-24T07:04:30 1777014270

This. Why would I accept anything else than that?

NoboruWataya · 2026-04-24T09:29:40 1777022980

I think cost is one factor. I have a Vivoactive 4 and I love it but it has a reported battery life of 8 days and I get maybe half that with regular run tracking. I'm guessing the 24 days/1-2 weeks is for a considerably more expensive model.

MrMember · 2026-04-27T14:31:29 1777300289

The Garmin Instinct 2 is around $200 and will last several weeks on a charge even using the GPS regularly. Without GPS use I wouldn't be surprised if it lasted a month.

saghm · 2026-04-24T05:32:28 1777008748

I'm not sure if my Pixel Watch 3 is much more efficient than your 4 or if by "constant use" you literally mean scrolling through it actively for hours at a time, but I only charge mine maybe twice a week. It's on at all times, connected to my phone via Bluetooth and to my Wi-Fi network when I'm home, and I actively manage any push notifications I get from it, but otherwise it seems to idle fairly efficiently.

shadowpho · 2026-04-23T23:32:04 1776987124

I’ll need to check my notes on power consumption. I’ve spent quite a bit of time trying out different modes and configurations… it’s not great. I would not expect years at all. You gotta be very careful about what has to stay on and off.

Furthermore, bugs. To this time there’s random crashes that happen with sleep which limits their use

sho_hn · 2026-04-24T00:02:04 1776988924

I have some relevant experience. I built this a few years ago:

https://imgur.com/a/diy-automatic-e-ink-newspaper-using-rust...

After careful optimization, the v1 got about 6 months out of a 1100 mAh battery. Later improvements and bumping to a 3300 mAh battery got me to 14 months, before my kid yanked it off the wall, total'd the panel and I rebuilt it. The test continues.

That said--op isn't wrong. If power usage is the metric you optimize for, there's much better BOM than an esp32.

brewtide · 2026-04-24T02:09:59 1776996599

Galaxy watch 7 (I think) weather here. Also about 30 hours. It's a charge every day thing, but allowing for some forgetfulness. It's not ideal, but it's manageable and certainly functional.

serf · 2026-04-22T03:26:02 1776828362

counter-point : the companies that are most famous for A/B testing routinely are also the ones with the most notoriously non-existent customer service departments globally, facebook/google/amazon/ebay. Groups that harbor dissatisfied customers by essentially being 'the only show in town.'.

so, what i'm saying is : I think a lot of companies align themselves with the cash first and then measure whether or not the negative image/user impact is manageable .

(in fact I know they operate this way.)