Catloafdev · 2026-06-16T21:38:35 1781645915

This is very far from an obstacle; space-functional radiators and thermal management systems do exist.

Catloafdev · 2026-06-16T00:25:27 1781569527

ctrl+f rentier: 0 hits

ya...

Catloafdev · 2026-06-13T17:25:16 1781371516

If you want frontier-level, the economically reasonable option is OpenRouter or a direct sub to frontier-of-your-choice.

The reality is that, to protect datacenter margins, they do not offer configurations that would let a consumer run that much VRAM on a single setup. Apple used to, and they stopped; those devices are going for ~$20k+ each on eBay now.

You can get very, very capable models on a 3090/4090/5090/6000 series card. But if you want 'frontier level' you are investing ~$22k at a bare minimum if you go new. Buying used, you can probably build your own server for a much lower up-front cost, but it's likely going to use 4-6x+ the electricity.

theossuary · 2026-06-13T17:45:18 1781372718

I truly think by 2028 we'll have integrated chip systems that'll be able to run Opus 4.8 level models at ~500 watts at acceptable performance. Honestly I think now is the worst time to invest in AI hardware. Get your harness ready and processes perfected with hosted models, and wait a few years to buy hardware to transition to running models locally.

baq · 2026-06-13T17:49:31 1781372971

Burning weights onto a chip in an efficient way and exposing that via USB would be acceptable for a good enough model tbh

ajbourg · 2026-06-13T18:02:02 1781373722

This is pretty close to what Taalas is doing.

calgoo · 2026-06-13T19:22:54 1781378574

Trying Taalas is almost scary; there is something unsettling about that speed! Even with that small model, the speed means you could run hundreds of samples in a second and pick the best.

Can't wait for their next release!
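The "run hundreds of samples and pick the best" idea is usually called best-of-N sampling. A minimal sketch, assuming you have some way to generate a candidate and some way to score it; `toy_generate` and `toy_score` below are hypothetical stand-ins, not anything Taalas exposes:

```python
import random
from typing import Callable, TypeVar

T = TypeVar("T")

def best_of_n(generate: Callable[[random.Random], T],
              score: Callable[[T], float],
              n: int,
              seed: int = 0) -> T:
    """Draw n candidates and return the one with the highest score."""
    rng = random.Random(seed)
    candidates = [generate(rng) for _ in range(n)]
    return max(candidates, key=score)

# Hypothetical stand-ins: a "model" that guesses integers,
# and a scorer that prefers guesses close to 500.
def toy_generate(rng: random.Random) -> int:
    return rng.randint(0, 1000)

def toy_score(x: int) -> float:
    return -abs(x - 500)

if __name__ == "__main__":
    print(best_of_n(toy_generate, toy_score, n=200, seed=1))
```

The catch is that this only helps as much as the scorer is trustworthy; with a weak scorer, more samples mostly means more ways to fool it.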

AlexCoventry · 2026-06-14T23:16:03 1781478963

Right now, we seem to be shambling toward a war which would hit globalized industrial processes very hard. Buying decent hardware now might wind up looking like good insurance against that.

CamperBob2 · 2026-06-13T17:55:38 1781373338

> Honestly I think now is the worst time to invest in AI hardware.

That position is not without its own risks, though. Maybe Opus 4.8 will run on a single chip by 2028... and maybe you won't be allowed to touch it.

And what if Xi makes a play for Taiwan? That would be stupid, but so was invading Ukraine with tanks from Temu, and it still happened.

dudisubekti · 2026-06-13T22:04:59 1781388299

Other than Taiwan declaring independence, I don't see any reason why China would rush to take the island.

At the very least they would wait until they have cracked EUV and can mass-produce the chips, and that is still 4-5 years away at the earliest.

CptFribble · 2026-06-13T19:19:48 1781378388

> so was invading Ukraine

The difference is that Putin's hand was forced by age, (possibly) illness, and the last several decades of how he chose to run his country. Putin's power base is a relatively small group of elites and oligarchs who, given the chance, would happily snuff out the man who pushes them out of windows when they get too uppity. He needed the cover of war to maintain the fiction of his type of strongman "only I can save us" leadership.

Xi's power base is the simple fact that his leadership has transformed China into the #2, and now, because of Trump, possibly soon the #1 world superpower. He has also acted aggressively in the last decade to find and remove corruption and prevent individuals from accumulating the kind of wealth and influence that could threaten his power from outside official Party channels. Of course, as I'm not Chinese myself, I have no clue what the internals of Party politics actually look like. But as an outside observer it seems clear that Xi et al. do not actually need Taiwan for anything other than national pride. They know the US would go to the mat to protect it as TSMC is extremely vital to US military power. And since China cannot compete in that arena and has too much to lose, they instead have focused on weakening the US from within, quite successfully of late.

By the time China finally takes Taiwan it will be with little fanfare and little consequence - they won't touch it until the US either has lost its military capabilities, or the US has its own internal chip industry. Anything else is an existential risk for the coastal cities that are China's entire economic advantage.

hurtigioll · 2026-06-13T18:15:22 1781374522

If such hardware becomes available, it will be bought by the datacenters, just like they buy all the RAM today.

daemonologist · 2026-06-13T17:32:47 1781371967

There are also significant economies of scale (namely: utilization and batching), which tend to make inference on a shared server more economical even after the operator takes a cut.

zozbot234 · 2026-06-13T19:28:39 1781378919

You can use batching on consumer hardware, it just requires a KV-cache efficient model (or short context only) and keeping multiple inference flows running in parallel. This is most useful in combination with streamed inference, since the compute intensity of decode with those newer KV-compressed models is high enough that you have limited compute headroom when running at the speed of RAM.

Catloafdev · 2026-06-13T04:17:19 1781324239

Ya, that'd be an awesome project. The only issue is: how do you verify it's not being poisoned? To actually validate it would require more analysis than the training took to run. It would require a trusted network, not an open one, unless that can get solved somehow.

sgsjchs · 2026-06-13T11:53:03 1781351583

Make multiple nodes do the same job, compare results.
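A minimal sketch of that idea, assuming each job is deterministic so that honest nodes return an identical digest of their result (real training jobs often are not bit-reproducible across hardware, so this is a simplification). The function name and the `min_agree` threshold are made up for illustration:

```python
from collections import Counter
from typing import Hashable, Optional, Sequence

def accept_if_agree(results: Sequence[Hashable],
                    min_agree: int) -> Optional[Hashable]:
    """Return the most common result if at least min_agree nodes
    reported it; otherwise return None (reject and reschedule)."""
    if not results:
        return None
    value, count = Counter(results).most_common(1)[0]
    return value if count >= min_agree else None

if __name__ == "__main__":
    # Three nodes ran the same job; one returned something different.
    print(accept_if_agree(["d41d8c", "d41d8c", "ffffff"], min_agree=2))
    # No two nodes agree, so the job is rejected.
    print(accept_if_agree(["aaa", "bbb", "ccc"], min_agree=2))
```

This multiplies the compute bill by the replication factor and only catches bad nodes that do not collude, so it does not fully answer the trusted-network concern.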

Catloafdev · 2026-06-10T16:34:19 1781109259

Firefox thankfully offers sync and imports your Chrome data.

Makes switching easy.

Catloafdev · 2026-06-10T01:30:50 1781055050

And how exactly do you propose making it "difficult enough"?

reissbaker · 2026-06-10T01:40:26 1781055626

The same way Anthropic is making it difficult to compete with them. They intentionally train the model (via PEFT, as called out in the model card) to be dumber when attempting to do things Anthropic doesn't want — in this case, competing with them, but you could apply the same training process for other domains such as actually-malicious use cases.

fc417fc802 · 2026-06-10T03:10:01 1781061001

The same way pursuing a bachelor's degree in order to achieve a nefarious end goal does. Refuse to handhold the user on risky topics and outright refuse to answer if an explicit scenario that appears to be harmful is provided. Provide only textbook level technical explanations for such topics the same as any STEM student has ready access to.

Catloafdev · 2026-06-09T17:29:04 1781026144

It's a relatively new benchmark but from what I can tell it has serious cred behind it. I assume it will be picked up as part of the standard suite of CS-related benchmarks soon enough.

Catloafdev · 2026-06-07T23:59:16 1780876756

If you're talking about frontier AI, they have no issue de-obfuscating any of these accurately.

Chu4eeno · 2026-06-08T13:52:16 1780926736

Yeah, Claude, Grok and Gemini immediately recognized it as a SUBLEQ VM (and worked out from there all the details), only ChatGPT got stumped and just said it was a "visual simulator".

All the big labs have plenty of proprietary (i.e. paying PhD holders good money for writing stuff) and synthetic training data now compensating for the lack of "naturally occurring" SUBLEQ and similar stuff.
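For anyone who hasn't met SUBLEQ: it is a one-instruction machine where each instruction is three cells `a b c`, meaning "subtract mem[a] from mem[b], then jump to c if the result is <= 0, else fall through". A minimal interpreter sketch; halting on a negative jump target is one common convention and an assumption here, as is the `max_steps` guard, and the actual VM in the linked post may differ:

```python
from typing import List, Sequence

def run_subleq(program: Sequence[int], max_steps: int = 10_000) -> List[int]:
    """Run a SUBLEQ program and return the final memory image.

    Assumed conventions: code and data share one memory, and a jump
    to a negative address (or off the end) halts the machine.
    """
    mem = list(program)
    pc = 0
    steps = 0
    while 0 <= pc and pc + 2 < len(mem) and steps < max_steps:
        a, b, c = mem[pc], mem[pc + 1], mem[pc + 2]
        mem[b] -= mem[a]                      # the only operation there is
        pc = c if mem[b] <= 0 else pc + 3     # branch if result <= 0
        steps += 1
    return mem

if __name__ == "__main__":
    # Cells 0-2: mem[7] -= mem[6]   (12 - 5), result > 0 so fall through
    # Cells 3-5: mem[8] -= mem[8]   (gives 0), result <= 0 so jump to -1 (halt)
    # Cells 6-8: data
    prog = [6, 7, 3,   8, 8, -1,   5, 12, 0]
    print(run_subleq(prog)[7])
```

Everything else (addition, copies, loops) is built out of that one subtract-and-branch, which is why obfuscated SUBLEQ dumps look like an undifferentiated wall of integers.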

Catloafdev · 2026-06-07T17:50:25 1780854625

Your hostility isn't helping things either. "You've been here long enough to understand" followed by an incorrect usage of "goal post moving" makes you look like an agitator.

Catloafdev · 2026-06-05T20:28:10 1780691290

Being able to run the 12B on 8 GB of VRAM is huge. It's crazy to see how fast these small local models have evolved.