There is an old saying about the future being here but unevenly distributed. The pods look nice and would be even nicer with some kind of daily meal kit for a few extra dollars.
I mean, you're not wrong. Do people not know how these drugs work? They're not magic; they make you never want to eat and take away the "feeling like shit" when you don't.
That's not the experience my wife has had. She's been on one of these drugs for a while and still gets insanely hungry, but she's able to feel "full" with a much smaller meal than before taking the drug. If she overeats, she feels like shit.
That's a little too sci-fi for me but I am sure many youngsters with higher risk tolerance would be happy to pay a subscription fee for more streamlined nutrition delivery systems.
Well done. I'd change a few things to make it technically more precise. In a few places you use words like "learnable" parameter, but I think this tends to confuse people more than it helps them understand what is going on. People can learn, but parameters can only be modified according to some rule that minimizes or maximizes some objective function of those parameters. People who understand the technical details use words/phrases like "learning" as shorthand, but in an introductory post like this it is useful to be technically precise and avoid anthropomorphisms that can confuse beginners.
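To make the distinction concrete, here is a minimal sketch (my own illustration, not taken from the post under review): a "learnable" parameter is just a number repeatedly modified by a fixed update rule that reduces an objective function, in this case plain gradient descent on f(w) = (w - 3)^2.

```python
# A parameter is not "learning" anything: it is updated by a fixed rule
# (gradient descent) that minimizes an objective function of that parameter.

def objective(w):
    return (w - 3.0) ** 2      # minimized at w = 3

def gradient(w):
    return 2.0 * (w - 3.0)     # derivative of the objective w.r.t. w

w = 0.0        # initial parameter value
lr = 0.1       # step size
for _ in range(100):
    w -= lr * gradient(w)      # the update rule: no cognition, just arithmetic

print(round(w, 4))  # converges toward 3.0
```

Each step is deterministic arithmetic; "learning" is only a convenient label for repeated application of the minimization rule.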
I wonder how long it will take before it devolves into complete incoherence. It already seems incoherent, so it will probably be completely unreadable within a few updates.
The value proposition of Cerebras is that they can compile existing graphs to their hardware and allow inference at lower costs and higher efficiencies. The title does not say anything about creating or optimizing new architectures from scratch.
That's correct, and if you read the whole thing you will realize that it is followed by "... to leap over GPUs", which indicates that they're not literally referring to optimizing the weights of the graph on a new architecture, or freshly initialized variables on an existing one.
"Trains" has no other sensible interpretation in the context of LLMs. My impression was that they had trained models better than those trained on GPUs, presumably because they trained faster and for longer than Meta, but that interpretation was far from what the content actually delivered.
Also interesting to see the omission of deepinfra from the price table, presumably because it would be cheaper than Cerebras, though I didn't even bother to check at that point because I hate these cheap clickbaity pieces that attempt to enrich some player at the cost of everyone's time or money.
Good luck with their IPO. We need competition, but we don't need confusion.
What are you confused about? Their value proposition is very simple and obvious: custom hardware with a compiler that transforms existing graphs into a format that runs at lower cost and higher efficiency, because it utilizes a special instruction set only available on Cerebras silicon.
The title is clickbait, but that's how marketing works whether we like it or not. The achievement is real: Cerebras improved their software, and inference is much faster as a result. I find it easy to forgive annoying marketing tactics when they're being used to promote something cool.
It is textbook bait and switch. If the achievement is important, use the correct title. An advance in actual training performance, or a better model, is very important and interests a different set of people, with deeper pockets, than those who care about inference.
On May 3, 2021 I wrote a note to myself about the type of people OpenAI was hiring: "looks like OpenAI is getting into the military business by hiring former CIA clandestine operator Will Hurd" https://en.wikipedia.org/wiki/Will_Hurd. Seems like I was right, but this should be expected, because every corporation is in one way or another linked to the military-industrial complex.
In the most general case there is no technique that can determine if two programs are equivalent other than running both programs on some set of inputs and verifying that the outputs (after termination) are the same. Every other technique must cut out all possible sources of non-termination to get around the halting problem in order to make the resulting equivalence relation on the set of programs effectively computable and constructively provable.
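A minimal sketch of that testing-based check (the function names are my own illustration, not from any particular tool): run both programs on a sample of inputs and compare outputs. A mismatch proves the programs are inequivalent; agreement on the sample proves nothing about equivalence in general, which is exactly the limitation described above.

```python
# Differential testing: the only generally applicable equivalence check.
# A counterexample refutes equivalence; agreement on samples does not prove it.

def prog_a(n):
    return sum(range(n + 1))    # 0 + 1 + ... + n, computed by iteration

def prog_b(n):
    return n * (n + 1) // 2     # the same sum, via the closed-form formula

def find_counterexample(inputs, f, g):
    """Return the first input where f and g disagree, or None if none is found."""
    for x in inputs:
        if f(x) != g(x):
            return x
    return None

print(find_counterexample(range(1000), prog_a, prog_b))  # None: no counterexample found
```

Note that the sample must consist of inputs on which both programs terminate; restricting to such inputs is precisely the move that sidesteps the halting problem.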