Hacker Newsnew | past | comments | ask | show | jobs | submit | IMTDb's commentslogin

Privacy and EU regulation are two very different things.

People smiling while using Siri and holding their phones 2 meters away from their faces looks genuinely disturbing and fake. We are at that point where I hope their next stream will be AI generated so it looks more natural.

People smiling while using Siri look genuinely disturbing.

And talking for exactly 10 seconds while the ai generates to maintain the semblance of a live demo.

Go watch the dev videos, they feel 100% AI. The people are real, but everything else about it is uncanny valley.

Anyone able to restart the stream if you missed the first few minutes or are we living in a world where AI will cure cancer but Apple can't build a "Watch live / Watch from start" button ?


can confirm, not able to scrub backwards. Maybe it requires iRewind™ premium?

https://www.youtube.com/live/hF8swzNR1-o

there’s also a YouTube live stream that lets you go back


YouTube has a "disable DVR" toggle that disables the seek bar. There is a workaround, but I'm not sure if there's one that work in-browser.

This UserJS worked for me with Violentmonkey - https://greasyfork.org/en/scripts/485020-dvr-chan-force-enab...

What is the non-browser workaround? E.g., can streamlink do it?


Does it ? I can pause but I can't seem to go back. Clicking on the timeline does nothing. Clicking on the "live" indicator does nothing.

This isn't letting me drag back either.

Yeah, explicitly disabled. Corporate control freak behavior - that tracks.

You can't scrub back and forth on the timeline until the live stream is finished.

"It's a feature, not a bug" – Apple

No kidding. I missed the first 16 minutes because something needed my attention. I'd like to be able to restart the stream.

What GP meant is that the CTO of a $1b company wold absolutely not fire someone for going Azure because at those scale it's very likely they have a set of customers that exclusively want to work on Azure, so that choice makes sense.

It's easy to do blanket statements like "never choose azure", "avoid GCP at all cost" or "never again on AWS". Until real world comes your way and you are forced to deal with it.

That being said: I'd fire anyone choosing to deploy a workload on GCP.


Another reality is that at that scale you need to diversify your vendor portfolio so you never get stuck in a single-vendor scenario (for contracts, liability or scale). Many companies half this size have infrastructure across all three - AWS, Azure and GCP. The primary reason is redundancy, but that also gives them potential leverage for contract negotiation.

> They don't want you to have access to frontier models. And you will not have it. See Mythos as Exibit A.

"They" fully well know that they current frontier model are maybe 6 month ahead of what people will have access to without their control. See Deepseek as Exibit B

The reason you can't run these locally are more with the fact that those mythos sized models require extreme amount of memory and processing power to run at acceptable speeds. And neither you, nor I can afford to pay for those resources to run those models locally. A big reason is that "running locally" means running on your own hardware. And for almost everyone this means "running on hardware that will spent a big portion of its time just sleeping". Because data center and providers have higher utilization rates, they can easily outpace you. That and the fact that when they place an order it's usually for hundreds of thousands of units.


I am convinced the (mainly chinese) open weights models are the only reason OpenAI and Anthropic release at the pace they do. Without them being on their heels, we would have seen a stagnant duopoly in terms of public releases.

That is why the huge lobby machine is grinding away to make those models illegal.


Although, I wonder how many orders of magnitude in terms of affordability the utilization rate actually gets them. Realistically if you use a self-hosted LLM for your job, you might be using it, what, a solid 6 hours per day? Assuming you can keep it actually fed, while working (so, some agentic thing might be necessary, I guess it will need to be more than VSCode autocomplete and responding to individual prompts). Anyway, that starts you out at 1/4’th the utilization, a 4X price increase might be worth paying for privacy and stability (no sudden change in model behavior, no price changes, no days when the system is over-utilized for reasons outside your control).

Rather I think it is just hard for local LLMs to compete in this early stage when the cloud providers are allowed by investors to be unprofitable.


> Realistically if you use a self-hosted LLM for your job, you might be using it, what, a solid 6 hours per day?

You can grow the utilization rate well beyond that if you don't always care about getting a quick, real-time response. (And if you do, then maybe the cloud model was the better deal after all!)


Isn't Mythos that screw up where Anthropic failed to ship something that was no better than the product OpenAI launched a few weeks later?

And, assuming the allegations are true, don't things like Deepseek and Qwen offer existence proofs that frontier models are (and will forever be) trivially distilled down to run domain-specific tasks on boxes that cost a few months of Claude Max subscription?


They said it was impossible for them to reverse the decision. Nothing has changed; they won’t be in charge of that. The state will.

A significant reason of the “impossibility” of reversing the decision is the regulations around nuclear. Take the problem of micro tears in the concrete. Engie could have maintained the concrete. Because the plant was scheduled for decommission, they did not. So there will be small tears in the concrete. The law does not allow those small tears. Repairing the concrete now is too expensive.

The plant will be owned by the state so now the state has two options: (1) invest a truckload of money to repair the concrete or (2) change the law to allow small tears which have virtually no security consequences anyway.

We all know that the state will choose option (2) but there would be far more opposition if they did so while the plant were owned by a private company that is making profit rather than owned by the state which is operating at a deficit.


The problem is that at that scale, the alternative is building your own data centers. You'd probably want at least 2 in the US, 2 in Europe, 2 in Asia, maybe 1 in Africa and 1 in LATAM. So 8-10, and you need at least half of them ready "on time."

What does "on time" mean? You'll need to negotiate with local authorities, some friendly, some not. Data centers aren't exactly popular neighbors these days. Then negotiate with the local power utility. Fingers crossed the political landscape doesn't shift and your CEO doesn't sign a contract with an army using your product to pick bombing targets, because you'll watch those permits evaporate fast.

Then there's sourcing: CPUs, GPUs, memory, networking. You need all of it. Did you know the lead time for an industrial power transformer is 5+ years? Don't get me started on the water treatment pumps and filters you can't even get permitted without. What will you do in the meantime ? You surely aren't gonna get preferential treatment from AWS / Google / ... if they know you are moving away anyway. Your competition will.

The risk and complexity are just too big. AI/LLM is already an incredibly complex and brittle environment with huge competition. Getting distracted building data centers isn't enticing for these companies, it's a death sentence.


For AI inference you don't need to geographically distribute your data centers. Latency, throughput, and routes don't matter here. When it's 10 seconds for the first token and then a 1KB/sec streamed response, whatever is fine. You can serve Australia from the US and it'll barely matter. You can find a spot far outside populated areas with cheap power, available water, and friendly leadership, then put all of your data centers there. If you're worried about major disasters, you can pick a second city. You definitely don't need a data center in every continent.

You're not wrong about the rest but no AI company would ever build a data center in every continent for this, even if they were prepared to build data centers. AI inference isn't like general purpose hosting.


>Latency, throughput, and routes don't matter here. When it's 10 seconds for the first token and then a 1KB/sec streamed response, whatever is fine. You can serve Australia from the US and it'll barely matter.

This may be true for simpler cases where you just stream responses from a single LLM in some kind of no-brain chatbot. If the pipeline is a bit more complex (multiple calls to different models, not only LLMs but also embedding models, rerankers, agentic stuff, etc.), latencies quickly add up. It also depends on the UI/UX expectations.

Funny reading this, because the feature I developed can't go live for a few months in regions where we have to use Amazon Bedrock (for legal reasons), simply because Bedrock has very poor latency and stakeholders aren't satisfied with the final speed (users aren't expected to wait 10-15 seconds in that part of the UI, it would be awkward). And a single roundtrip to AWS Ireland from Asia is already like at least 300ms (multiply by several calls in a pipeline and it adds up to seconds, just for the roundtrips), so having one region only is not an option.

Funny though, in one region we ended up buying our own GPUs and running the models ourselves. Response times there are about 3x faster for the same models than on Bedrock on average (and Bedrock often hangs for 20+ seconds for no reason, despite all the tricks like cross-region inference and premium tiers AWS managers recommended). For me, it's been easier and less stressful to run LLMs/embedders/rerankers myself than to fight cloud providers' latencies :)

>then put all of your data centers there

>You definitely don't need a data center in every continent.

Not always possible due to legal reasons. Many jurisdictions already have (or plan to have) strict data processing laws. Also many B2B clients (and government clients too), require all data processing to stay in the country, or at least the region (like EU), or we simply lose the deals. So, for example, we're already required to use data centers in at least 4 continents, just 2 more continents to go (if you don't count Antarctica :)


Sounds like you're betting that the performance users experience today will be the same as the performance they'll expect tomorrow. I wouldn't take that bet.


You can build geographically close one tomorrow, when you start earning money today. US-EU latency is like 100ms, AI can handle it just fine


You mean that if you were Anthropic, you'd build the data centers on every continent? Can you explain your reasoning?

We're talking about billions of dollars of extra capex if you take the "let's build them everywhere" side of the bet instead of "let's build them in the cheapest possible place" side. It seems to me that you'd have to be really sure that you need the data center to be somewhere uneconomical. I think if you did build them in the cheap place, it's a safe bet that you'll always have at least enough latency-insensitive workloads to fill it up. I doubt that we would transition entirely to latency-sensitive workloads in the future, and that's what would have to happen for my side of the bet to go wrong. The other side goes wrong if we don't see a dramatic uptick in latency-sensitive inference workloads. As another comment pointed out, voice agents are the one genuinely latency-sensitive cloud inference workload we have right now; they do need low latency for it. Such workloads exist, but it's a slim percentage so far.

I believe I'm taking the safe bet that lets Anthropic make hay while the sun shines without risking a major misstep. Nothing stops them from using their own data centers for cheap slow "base load" while still using cloud partners for less common specialized needs. I just can't see why they would build the international data centers to reduce cloud partner costs on latency-sensitive workloads before those workloads actually show up in significant numbers.


latency absolutely matters? this is such a weird thing to say. for training sure, but customers absolutely want low latency


They want it, sure. Customers want everything if it's free, but this is about what they value with their money. In this thought experiment, you're Anthropic, not the customer. You're making a choice that's best for Anthropic. Will Anthropic lose customers because the latency is higher? No way. Customers want low cost and lots of usage more than they want low latency. In a cutthroat race to the bottom, there's no room to "give away" massively expensive freebies like a data center near every population center when the customer doesn't value those extras with actual money. It's the same reason we all tolerate the relatively slow batched token generation rate--the batching dramatically lowers the cost, and we need low cost inference more than we want fast generation. If the cost goes up we'll actually leave, for real.

After the initial announcement of "fast mode" in Claude Code, did you ever hear about anyone using it for real? I didn't. Vanishingly few people are willing to pay extra for faster inference.

Remember that the time-to-first-token is dominated by the time to process the prompt. It's orders of magnitude more latency than the network route is adding. An extra 200 milliseconds of network delay on a 5-10 second time-to-first-token is not even noticeable; it's within the normal TTFT jitter. It would be foolish to spend billions of dollars to drop data centers around the world to reduce the 200 milliseconds when it's not going to reduce the 5-10 seconds. Skip the exotic locales and put your data centers in Cheap Power Tax Haven County, USA. Perhaps run the numbers and see if Free Cooling City, Sweden is cheaper.


They’re unwilling to pay for fast mode because of the current step function price increase once you hit your quota. It’s a psychological effect. Because most shops I know in the US currently paying $125/mo per seat for Claude would happily - HAPPILY - pay 2x, and begrudgingly pay 10x that amount for the same service. If fast mode was priced 25% or 50% more they’d happily pay for that too. But it’s just not priced that way currently with weird growth subsidization & psychology.


The only AI use case that cares about latency is interactive voice agents, where you ideally want <200ms response time, and 100ms of network latency kills that. For coding and batch job agents anything under 1s isn't going to matter to the user.


tbh, that's a good point about the voice agents that I hadn't considered. I guess there are some latency-sensitive inference workloads. Thanks for pointing that out.


Yeah, also stuff like robotics which might not really exist today but could be big in the future.


You'll want the time-sensitive parts (motor control) to be running locally anyway.


A customer service chatbot can require more than one LLM call per response to the point that latency anywhere in the system starts to show up as a degraded end-user experience.


Easy solution - use hyperscalers with super expensive API charge only when latency really matters. Otherwise build your own DC. Easy to expect customers don't care latency that much over money.


Other than data sovereignty, does the data center location really matter that much? Current inference systems are not exactly low latency.


It’s the power and water needs.

Large data centers consume as much power as a small city. The location decision is about being able to connect to a power grid that is ready to supply that.

Evaporative cooling also needs steady water supply. There are data centers which don’t operate on evaporative cooling but it’s more equipment intensive and expensive.

Latency doesn’t matter. You can get fast enough internet connected to these sites much more easily than finding power.


Location matters for disaster recovery, if they want to survive WWIII. Though I think Data Sovereignty is probably a bigger thing, especially if they're going to be selling to governments around the world.


Why do they need to sell to government around the world. I mean I highly doubt Europe governemnt is in the top 100 customer of any US lab.


* not every task is waiting on the inference. lowering latency on other, serial tasks, can still have a noticable effect. Login, mcp queries, etc.

* data transit across the world can be very slow when there's network issues (a fiber is cut somewhere, congestion, bgp does it's thing, etc). having something more local can mitigate this

* several countries right now have demented leaders with idiotic cult-like followers. Best not to put all your eggs in those baskets.

* wars, earthquakes, fires, floods, and severe weather rarely affect the whole planet at once, but can have rippling effects across a continent.

And frankly, the real question isn't "why spread out the DCs?", its "what reason is there to put them close to each other?".


Btw where does this obsession with datacenters come from? If you can tolerate ~150ms ping (which chatbots certainly can, as their internal processing can take much longer), you can serve US and Europe from a single US location, and the whole planet if you can tolerate ~300ms (Asian websites are usually very slow to load for me, I think it has to do with the way the internet is set up, not any physical limitations, but mostly commercial ones, as Western companies rarely have good market penetration in Asia)


Maybe for right now, but even in the very near future it seems like data center expertise would absolutely be a core competency of any AI leaders.

Heck, look at Facebook. Granted, they got started slightly before AWS, but not by much. Owning all of their own data centers is a huge competitive advantage for them, and unlike most of the other hyperscalers they don't sell compute to other companies (AFAIK).

Again, the commitment is for $100 billion in spend. Building lots of data centers for a lot cheaper than that price should absolutely be doable. Also, geographic distribution isn't nearly as important for AI companies given the way LLMs work. The primary benefit of being close to your data center is reduced latency, but if you think about your average chatbot interface, inference time absolutely swamps latency, so it's not as big a deal. Sure, you'd probably need data centers in different locales for legal reasons, and for general diversification, but, one more time, $100 billion should buy a lot of data centers.


It's interesting that you mention Facebook. They have a ton of their own data centers and yet they are now also spending tens of billions on cloud. It's not that easy to build hundreds of data centers on short notice.


Translation: Antropic never intends to spend $100 billion on AWS.

Every single argument you've brought up is irrelevant in the face of billions of dollars. If you intend to consume $100 billion dollars in data center infrastructure, you're going to find a way to accomplish it while cutting out the middlemen.

Meanwhile if you're flaky and never intend to spend that money, you're going to come up with a way to pay someone else to deal with those problems and quit paying the moment they don't.

You'd never do both at the same time. You'd never commit your money and give them control over your business critical infrastructure.

Hence the deal is a sham. The $100 billion are a lie. Thank you for telling us.


Take the approach Geohot is suggesting. Take a shipping container, make a standard layout, cooling and compute load. Find a cheap source of electricity.. Place it and have compute.


Surely if it was that easy it'd be done?


It has been done... We used to get our POP gear built out from Dell (?) in shipping containers - pre-racked, wired, and cooled - just add network/power feeds. We'd have them dropped places we needed more capacity but there wasn't space available in the DC.


Interesting thanks for sharing


not sure what you are describing, however a random item is that in 2026 low-tech Chile is building sixty datacenters in or near Santiago, in the business news.


I have found out that the main phone providers (Apple, Google, Samsung) have extremely long support period. I really don't get the "planned obsolescence" thing.

As an example, in Jan 2026, Apple published iOS 12.5.8 which provides updates for iPhone 5s which released in Sept 2013. That's 12.5 years ago. The equivalent would be to connect to the internet using ADSL in Jan 2000 with your IBM PS/2 rocking in intel 8086, 512 kb of RAM and expecting an update for your DOS operating system.


>As an example, in Jan 2026, Apple published iOS 12.5.8 which provides updates for iPhone 5s which released in Sept 2013. That's 12.5 years ago. The equivalent would be to connect to the internet using ADSL in Jan 2000 with your IBM PS/2 rocking in intel 8086, 512 kb of RAM and expecting an update for your DOS operating system.

The updates for ios 12 are all security updates, not feature updates, so your comparison to "connect to the internet using ADSL in Jan 2000 with your IBM PS/2 rocking in intel 8086" doesn't really make sense. The phones stuck on ios 15 are basically unusable because many apps don't support it anymore. At best you can download an older version from a few years ago, but that depends on whether the backend servers were updated. Apps that insist you use the latest version (eg. banking/finance apps) basically unusable.


A phone is not unusable because some banking apps don't work on it. It didn't even ship with said apps installed.


Believe it or not, "apps" are an important "feature" of a smartphone, even if it's not theoretically bundled with it. Moreover it's not just banking apps, those are just the first ones to go, but any that don't keep backend compatibility will eventually break.


Isn't it the banks/apps that are choosing not to support the phone, not Apple?


It is neither Apple's nor the third party app's obligation to make the third party app work on ancient phones.


The entire point of the cellphone is that third party apps are required to live a modern life. If I cannot run the apps required to pay for a parking spot or perform a 2FA ritual then there’s really no point in even having a phone. The first party software isn’t compelling enough to justify the pocket space.


You could always keep your phone and get a second dirt-cheap phone just for the 2FA (or use your banks' non-phone 2FA methods). But if we take your requirement that one phone should be able to do everything that new phones can do, it's somewhat tautological that you have to replace your phone frequently to stay on the cutting edge.


IBM PC DOS 2000 was a thing that was published and sold. It would have ran fine on a system similar to what you describe. It addressed the only pressing thing in that space at that time that PC DOS 7 did not: Y2K compliance.

(I never had a PS/2, or ADSL, but I was goofing around with a low-memory 8088 box back then for fun. It had no hard drive. It bootstrapped from floppy, loaded the rest over the LAN with its built-in 10base2 Ethernet jack from my Linux box, and connected to dual-channel ISDN for Internet access. It worked. It even had a graphical web browser.

Being clever with an old iPhone is a very different thing.)


The updates often require more power. Which drains the battery more than it was designed for. Which helps shorten the life of the device.

BTW: DOS was supported until 2001, and Win95 could boot DOS standalone.


Machines were roughly doubling in performance every year back in 2000.

Nowadays they are doubling in performance every... 5 years?


Only in some edge cases, in others it takes even longer than 5 years and that time is getting longer and longer.


Indeed you can still get a battery replaced by Apple for an old iPhone 6.


Congratulations.

How do you handle SSL pinning ? Most of the apps I interact with have some sort of SSL pinning, which is the hard part to circumvent. I tried Kampala but got stuck at the usual place; as soon as I enable it, chatGPT stops working. Most of my iPhone apps stop responding etc.

I would love to try using this tool to build an agent that can simply subscribe me to my gym lessons instead of me having to go on the horrible app. But even that relatively simple (iOS) app stopped working as soon as I enabled the proxy.


Unfortunately we can’t do much around SSL pinning yet. Not sure how deep you want to go, but there are several Frida scripts that patch common pinning implementations.

I also think mitmproxy (open source) has an option to spin up a virtual Android device that can bypass pinning via AVD. I have not tested how reliable it is though.

FWIW, it could also be a cert trust issue. I would try a quick Safari search to confirm the cert is fully trusted. ChatGPT is pinned, but the gym app makes me think it might be a trust or config issue on your device.

Happy to take a look as well. Email me at alex at zatanna dot ai.


> why one person potentially being responsible for hundreds or thousands of deaths is acceptable

I am not sure who exactly is that one person ? Is it Altman, who is according to many people not that knowledgeable in AI in the first place; the scientist who found a breakthrough (who is it ?); is it the president of the United States who is greenlighting the strikes; the general who is choosing the target (based on AI suggestions); the missile designer; the manufacturer; the pilot who flew the plane ?

I get the point of concentrating power in fewer hands, but the whole "all the problems of this world are caused by an extremely narrow set of individuals" always irks me. Going as far as saying there is just one is even mor ludicrous.


I’m fine with holding them all accountable to varying degrees. For example, yes, ultimately the president is responsible, but so is the person who dropped bombs instead of refusing an illegal order; just like the street dealer, gang banger, trafficker, and cartel boss are all guilty of all of their various crimes.

What do you find difficult to understand about that?


Accountability sinks are good value and wealthy people always make sure they have enough of them


Ah the old 'everyone is responsible so nobody is responsible' canard.

I will give you a helpful rule of thumb: when in doubt the guy with a bank account larger than the total lifetime income of hundreds of thousands of people is probably the one to blame.


Ah the old ‘in case of doubt just go after the rich guy’. That makes stuff simple doesn’t it ?

You can establish responsibilities just by counting the number of zeroes in a bank account. On top of this, it works for everything: the same dude is responsible for wars, the climate, world hunger, child cancer and your bathroom mirror being fogged this morning.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: