Tbh, for most companies/orgs the cost/complexity of multi-region just isn't worth it.
The cost of a work day's worth of downtime is rarely enough to justify the expense of deploying across multiple regions or clouds.
Esp if you are public-facing and not internal. You just go 'well, everyone else was down too because of AWS' and your customers go 'ah okay, fair enough'.
Cisco etc. have truly insane pricing on optics, like $1000 for something generic that costs $20-50 from fs.com etc. The only difference is how it presents itself to the switch (i.e., it says it's a Cisco optic), not any actual difference in performance.
Often Cisco etc. will refuse support cases if you aren't using their optics, if the switches/routers even work with them in the first place, which isn't a given, as they'll often refuse to work with non-branded optics.
Really just a money grab by the big network vendors.
This box allows you to flash the firmware on the optic to say it's from whatever brand you want (Cisco, Dell, Aruba, Juniper, etc.) so that you can get it to work in that company's switch/router.
For most SMEs, the brand of optics makes no difference. Maybe keep a few legit branded ones around for debugging and when you need to raise a support case. But otherwise, the generic ones flashed to look like branded ones work just fine.
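For the curious, what these boxes actually rewrite is the module's ID EEPROM (I2C address A0h) as laid out in SFF-8472: the vendor name and part number strings plus a checksum over the base fields (some platforms check the vendor OUI too). A minimal sketch of the rewrite, operating on a raw EEPROM dump rather than real hardware, with an illustrative part number:

    # Sketch of the SFF-8472 fields a "rebranding" box rewrites. Talking to
    # the actual module over I2C is hardware-specific and omitted here.
    def rebrand(eeprom: bytearray, vendor: str, part_no: str) -> bytearray:
        eeprom[20:36] = vendor.encode("ascii").ljust(16)[:16]   # vendor name
        eeprom[40:56] = part_no.encode("ascii").ljust(16)[:16]  # vendor part number
        eeprom[63] = sum(eeprom[0:63]) & 0xFF                   # CC_BASE checksum
        return eeprom

    # e.g. rebrand(dump, "CISCO", "GLC-SX-MMD")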
> Often Cisco etc. will refuse support cases if you aren't using their optics, if the switches/routers even work with them in the first place, which isn't a given, as they'll often refuse to work with non-branded optics.
As others here have pointed out, Cisco reserves the right to do this but doesn't do it in practice. They don't even have a realistic chance to _detect_ a Cisco-programmed FS SFP, since it simply identifies the same as a genuine Cisco module.
If your case was directly related to the SFP (“I can't get a link on this fiber port”), then yes, they could probably refuse it. But if your case is about basically anything else on the switch, they won't care.
> If your case was directly related to the SFP (“I can't get a link on this fiber port”), then yes, they could probably refuse it.
I have zero doubt they will. But also, you prove nothing and are doing yourself and the vendor a disservice if you fake it. There's no telling what your 3rd-party transceiver is doing incorrectly. Better to get one single supported SFP and get that fixed, which will probably fix your other issue too.
FS is so big they're probably fine. Another option is to get one supported SFP, find out if it's encoded to an OEM part, then buy and install the OEM part directly. Easy to twist the arm of your VAR to do this.
> But also, you prove nothing and are doing yourself and the vendor a disservice if you fake it. There's no telling what your 3rd-party transceiver is doing incorrectly.
If I report an IS-IS problem and the root cause is an OEM SFP on a completely unrelated port, then the design of the switch is pretty awful. :-)
I've never heard of a vendor being so difficult. My comment applies only to interface errors (up/down status, rx/tx errors, FEC issues, etc.). Any vendor without an override for 3rd-party SFPs should be rejected at the RFP stage.
"The only difference is how it presents itself to the switch (ie, says its a Cisco optic), not actual difference in performance."
That's not the only difference. I have had situations where I ran equivalent optics side by side, touched one and it was hot, touched the other and it was not. They do contain different components. In that test, the atgbics SFP was cool and the other clone unit was hot. My dealer got me in contact with someone technical at atgbics (the maker of the cool-running unit), who explained the difference: "The DSP might be, say, 13nm where more modern, more expensive ones are 5nm."
But you definitely do not need to pay for "genuine" optics to get high-reliability optics. You just need to shop around the clones; atgbics is a clone.
All three (GB10, GB200 and GB300) are part of the Blackwell family, which means they have Compute Capability >= 10.x. You could potentially develop kernels to optimize MoE inference (given the large unified memory available, 128 GB, that makes the most sense to me) with CUDA >= 12.9, then ship the fatbins to the "big boys". As many people have pointed out across the thread, the Spark doesn't really have the best perf/$; it's rather a small portable platform for experimentation and development.
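As a sketch of what that gating can look like from the Python side (PyTorch assumed; the loader at the end is a hypothetical stand-in for however you ship your fatbins):

    import torch

    # Per the above: GB10, GB200 and GB300 all report compute capability >= 10.x,
    # so one check gates the Blackwell-only MoE path.
    major, minor = torch.cuda.get_device_capability()
    if (major, minor) >= (10, 0):
        load_blackwell_moe_kernels()  # hypothetical: load the CUDA >= 12.9 fatbin
    else:
        raise RuntimeError(f"needs a Blackwell-class GPU, got sm_{major}{minor}")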
Aren't those NetApp shelves pretty old at this point? I see a lot of people recommending against them even for homelab-type uses. You can get those 60-drive SuperMicro JBODs pretty cheap now, and those aren't too old; they would have been my choice.
Plus, the TCO is already way under the cloud equivalent, so you might as well spend a little more to get something much newer and more reliable.
I can put some AWS creds in my terminal and Claude Code is perfectly happy writing AWS CLI commands (or whole Python scripts if necessary) to work out what it needs to know about my infrastructure.
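For illustration, the kind of read-only discovery call that tends to boil down to (boto3 here instead of the CLI; it picks up whatever credentials are in the environment, same as the CLI would):

    import boto3

    # List running instances -- a typical "work out what's here" query.
    ec2 = boto3.client("ec2")
    resp = ec2.describe_instances(
        Filters=[{"Name": "instance-state-name", "Values": ["running"]}]
    )
    for res in resp["Reservations"]:
        for inst in res["Instances"]:
            print(inst["InstanceId"], inst["InstanceType"], inst.get("PrivateIpAddress"))

(Worth handing it read-only credentials if you're letting it loose like this.)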
If you are trying to get facts out of an LLM, you are using it wrong. If you want a fact, it should use a tool (e.g. web search, RAG, etc.) to fetch the source that contains the fact (a Wikipedia page, documentation, etc.) and then parse that document for the fact and return it to you.
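Roughly this shape, where llm() and web_search() are hypothetical stand-ins for whatever model API and search tool you actually use:

    # Sketch of the tool-use pattern: the fact comes out of a fetched document,
    # not out of the model's weights.
    def llm(prompt: str) -> str: ...         # stand-in: your model call
    def web_search(query: str) -> str: ...   # stand-in: your search/RAG tool

    def answer(question: str) -> str:
        plan = llm(f"{question}\nReply 'SEARCH: <query>' or 'ANSWER: <text>'.")
        if plan.startswith("SEARCH:"):
            doc = web_search(plan.removeprefix("SEARCH:").strip())
            return llm(f"Answer using only this source:\n{doc}\n\nQ: {question}")
        return plan.removeprefix("ANSWER:").strip()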
These tools are literally being marketed as AI, yet they present false information as fact. 'Using it wrong' can't be an argument here. I would rather the tool were honest about confidence levels and had mechanisms to research further, then fed that fact back into the 'AI' for the next step.
Although the RTX Pro 6000 is not consumer-grade, it does come with graphics ports (four DisplayPorts) and does render graphics like a consumer card :) So it seems the difference between the segments is becoming smaller, not bigger.
Sure, but it still sits in the "business-grade hardware whose main purpose is AI training or running inference for LLMs" segment the parent mentioned, yet it has graphics connectors. So all I'm saying is that looking at that alone won't tell you which segment a GPU falls into.
I'd like to point at the first-revision AMD MI50/MI60 cards, which were at the time the most powerful GPUs on the market, at least by memory bandwidth.
Defining GPU as "can output a contemporary display-connector signal and is more than just a RAMDAC/framebuffer-to-cable translator, starting with even just some 2D blitting acceleration".