More

tyre · 2025-12-28T19:19:14 1766949554

It would be cool if the bottoms of MacBooks weren’t flat and instead wavy or rippling to increase surface area. There are probably a lot of cool designs (ayyyyy) you could machine in.

Marsymars · 2025-12-28T19:36:47 1766950607

There are laptops like that (e.g. the fanless Lenovo Chromebook Plus 14). No idea if it was actually engineered like that for thermal reasons.

LoganDark · 2025-12-28T20:19:16 1766953156

Is there a value to increasing surface area on the top or bottom case of a MacBook? I'd imagine most of the thermal management is achieved by fan-directed airflow through the internal heatsinks and convection through the keyboard.

Lord-Jobo · 2025-12-29T04:02:06 1766980926

Well the MacBook Air has no fans so it’s a different beast from a design perspective. If I recall, at least with an earlier m series MacBook, notably improved performance could be gained by inserting a thermal pad between the chassis’ bottom panel and the compute module. Apple probably didn’t do this in an effort to avoid uncomfortably hot temperatures contacting people’s thighs.

LoganDark · 2025-12-29T10:12:28 1767003148

Oh, I saw that video too!: https://www.youtube.com/watch?v=NnllZYEhvDQ

It's really cool that performance cores are the same between base, Pro, Max (, Ultra) chips of a generation. That really feels like Apple did it right.

tyre · 2025-12-28T00:19:14 1766881154

Play Hades 2!

tyre · 2025-12-27T20:53:24 1766868804

This is always the case. Negotiate equity, but assume it’s worth zero. It’s not liquid and highly speculative. It’s a nice to have.

edit: which doesn’t mean join companies you don’t believe in! Please do. But don’t expect it to be there, don’t include it in life plans, don’t pay attention to valuations, etc.

nerdsniper · 2025-12-27T21:08:41 1766869721

If the owners try to say the equity is valuable, have them convert it to salary. Then it will be clear they don't believe their own words.

tyre · 2025-12-27T07:45:26 1766821526

Not really a perverse incentive. The government isn’t making any money here. They’re paying someone from their own pocket only to take it away again?

At that point it really is just slavery, which they can already do as protected in the US Constitution.

(I’m not arguing for this. I agree with restitution and believe that sentences longer than a certain point are also pointless and a net negative to society.)

devsda · 2025-12-27T08:25:41 1766823941

> The government isn’t making any money here.

Hypothetically let's say govt is allowed to use unpaid labour outside menial tasks and the prison system is setup in a way to efficiently utilize the skills of their labour pool and is allowed to outsource their skills to private entities at attractive rate for covering prison costs (i.e. more money left for govt spending)

E.g. tradesmen employed on their related jobs. A programmer employed in software jobs or a technician "loaned" to a nearby lab etc.

Don't you think the local/state governments will then have incentive to fill their pool with "missing" talent according to the job requirements.

tyre · 2025-12-27T03:51:14 1766807474

This is a pretty broad generalization!

The fastest iterating people engineers I’ve worked with often have a deep user focus rather than a language affiliation.

tyre · 2025-12-23T17:13:18 1766509998

ATMs do this as well. Always decline the bank doing the conversion.

tyre · 2025-12-23T17:09:04 1766509744

I remember when they were seeking approval to provide blow jobs on flights (free in business class iirc.) The only thing that they won’t up charge. They even tried to get approval to charge for bathroom access.

Wild company, but they are entirely on brand.

To be fair, consumers have driven airlines this way. They’ve shown that they’ll buy based almost entirely on price and suffer any amount of agony in exchange.

I just don’t find basic economy or early flights or shitty airlines worth the bad stress.

arethuza · 2025-12-23T17:16:41 1766510201

The advantage of Ryanair and a lot of the other low cost carriers is that they do a lot of point to point flights between regional hubs - for example we flew Edinburgh to Marrakesh with them a few years back which was fine and I think they were the only airline offering direct flights. Going via Heathrow, Gatwick or CDG would have been a nightmare and we were only going for a few days.

petesergeant · 2025-12-23T17:27:12 1766510832

I assumed you were making some poorly executed joke, but no!

https://www.smh.com.au/national/ryanair-ceo-talks-free-sex-o...

> He then asked the translator the German word for oral sex. After being told there wasn't one, he remarked "terrible sex life in Germany".

wkat4242 · 2025-12-24T04:46:59 1766551619

It's not true though, they call it 'blasen'.

chopin · 2025-12-23T21:18:11 1766524691

Which, of course, isn't true.

tyre · 2025-12-23T00:12:45 1766448765

Given the cost of the system, how long would it take to be less expensive than, for example, a $200/mo Claude Max subscription with Opus running?

mechagodzilla · 2025-12-23T00:23:13 1766449393

It's not really an apples-to-apples comparison - I enjoy playing around with LLMs, running different models, etc, and I place a relatively high premium on privacy. The computer itself was $2k about two years ago (and my employer reimbursed me for it), and 99% of my usage is for research questions which have relatively high output per input token. Using one for a coding assistant seems like it can run through a very high number of tokens with relatively few of them actually being used for anything. If I wanted a real-time coding assistant, I would probably be using something that fit in the 24GB of VRAM and would have very different cost/performance tradeoffs.

mark_l_watson · 2025-12-23T12:11:11 1766491871

For what it is worth, I do the same thing you do with local models: I have a few scripts that build prompts from my directions and the contents of one or more local source files. I start a local run and get some exercise, then return later for the results.

I own my computer, it is energy efficient Apple Silicon, and it is fun and feels good to do practical work in a local environment and be able to switch to commercial APIs for more capable models and much faster inference when I am in a hurry or need better models.

Off topic, but: I cringe when I see social media posts of people running many simultaneous agentic coding systems and spending a fortune in money and environmental energy costs. Maybe I just have ancient memories from using assembler language 50 years ago to get maximum value from hardware but I still believe in getting maximum utilization from hardware and wanting to be at least the ‘majority partner’ in AI agentic enhanced coding sessions: save tokens by thinking more on my own and being more precise in what I ask for.

Workaccount2 · 2025-12-23T02:58:07 1766458687

Never, local models are for hobby and (extreme) privacy concerns.

A less paranoid and much more economically efficient approach would be to just lease a server and run the models on that.

g947o · 2025-12-23T05:23:40 1766467420

This.

I spent quite some time on r/LocalLLaMA and yet need to see a convincing "success story" of productively using local models to replace GPT/Claude etc.

hasperdi · 2025-12-23T08:35:30 1766478930

I have several my own little success stories:

- For polishing Whisper speech to text output, so I can dictate things to my computer and get coherent sentences, or for shaping the dictation to specific format eg. "generate ffmpeg to convert mp4 video to flac with fade in and out, input file is myvideo.mp4 output is myaudio flac with pascal case" -> Whisper -> "generate ff mpeg to convert mp4 video to flak with fade in and out input file is my video mp4 output is my audio flak with pascal case" -> Local LLM -> "ffmpeg ..."

- Doing classification / selection type of work eg. classifying business leads based on the profile

Basically the win for local llm is that the running cost (in my case, second hand M1 Ultra) is so low, I can run large quantity of calls that don't need frontier models.

g947o · 2025-12-23T12:14:37 1766492077

My comment was not very clear. I specifically meant Claude Code/Codex like workflows where the agent generates/run code interactively with user feedback. My impression is that consumer grade hardware is still too slow for these things to work.

hasperdi · 2025-12-23T14:02:03 1766498523

You are right, consumer grade hardware is mostly too slow... although it's a relative thing right. For instance you can get Mac Studio Mx Ultra with 512GB RAM, run GLM-4.5-Air and have a bit of patience. It could work

FuckButtons · 2025-12-23T20:44:27 1766522667

I was able to run a batch job that lasted ~2 weeks of inference time on my m4 max by running it over night against a large dataset I wanted to mine. It cost me pennies in electricity and writing a simple python script as a scheduler.

dimava · 2025-12-23T17:23:20 1766510600

Tokens will cost same on Mac and on API because electricity is not free

And you can only generate like $20 of tokens a month

Cloud tokens made on TPU will always be cheaper and waaay faster then anything you can make at home

reissbaker · 2025-12-23T17:38:49 1766511529

This generally isn't true. Cloud vendors have to make back the cost of electricity and the cost of the GPUs. If you already bought the Mac for other purposes, also using it for LLM generation means your marginal cost is just the electricity.

Also, vendors need to make a profit! So tack a little extra on as well.

However, you're right that it will be much slower. Even just an 8xH100 can do 100+ tps for GLM-4.7 at FP8; no Mac can get anywhere close to that decode speed. And for long prompts (which are compute constrained) the difference will be even more stark.

foobar10000 · 2025-12-24T04:22:17 1766550137

A question on the 100+ tps - is this for short prompts? For large contexts that generate a chunk of tokens at context sizes at 120k+, I was seeing 30-50 - and that's with 95% KV cache hit rate. Am wondering if I'm simply doing something wrong here...

reissbaker · 2025-12-29T11:09:09 1767006549

Depends on how well the speculator predicts your prompts, assuming you're using speculative decoding — weird prompts are slower, but e.g. TypeScript code diffs should be very fast. For SGLang, you also want to use a larger chunked prefill size and larger max batch sizes for CUDA graphs than the defaults IME.

oceanplexian · 2025-12-23T11:20:10 1766488810

It doesn't matter if you spend $200, $20,000, or $200,000 a month on an Anthropic Subscription.

None of them will keep your data truly private and offline.

tyre · 2025-12-22T02:43:54 1766371434

I say this with love, compassion, grace, and all due respect: Who gives a shit?

markdown · 2025-12-22T06:09:47 1766383787

It's a website for web devs. Web devs give a shit.

ecshafer · 2025-12-22T15:16:35 1766416595

Ruby isn't necessarily for web devs. Ruby is popular for all sorts of business line applications. In Japan is popular for lower level programming. You can do game programming via something like Dragon Ruby. Sure its very popular for Rails, but you don't necessarily need to do web dev.

tyre · 2025-12-22T02:36:04 1766370964

Not sure if you're single, but go on some dates. Getting excited about another human being can be a huge boost. You don't need to replace work with other intellectualism (though you certainly can!)

nubg · 2025-12-22T03:31:11 1766374271

Thank you.