The Next Platform
  • Home
  • Compute
  • Store
  • Connect
  • Control
  • Code
  • AI
  • HPC
  • Enterprise
  • Hyperscale
  • Cloud
  • Edge
Latest
  • [ September 4, 2024 ] TACC Fires Up “Vista” Bridge To Future “Horizon” Supercomputer HPC
  • [ September 3, 2024 ] The First AI Benchmarks Pitting AMD Against Nvidia Compute
  • [ August 30, 2024 ] Dell’s AI Server Business Now Bigger Than VMware Used To Be AI
  • [ August 29, 2024 ] Reduce Manual Effort, Achieve Better Coverage With AI And Formal Techniques AI
  • [ August 28, 2024 ] Nvidia Says “Blackwell” GPU Issues Are Fixed, Ramp Starts In Fiscal Q4 Compute
  • [ August 28, 2024 ] Interview: Post-Earnings Insight With Nvidia CFO Colette Kress AI
  • [ August 27, 2024 ] VMware Wants To Redefine Private Cloud With VCF 9 Control
  • [ August 27, 2024 ] IBM Shows Off Next-Gen AI Acceleration, On Chip DPU For Big Iron Compute
Homeinference

inference

Compute

The First AI Benchmarks Pitting AMD Against Nvidia

September 3, 2024 Timothy Prickett Morgan 3

Rated horsepower for a compute engine is an interesting intellectual exercise, but it is where the rubber hits the road that really matters. …

AI

Stacking Up Intel Gaudi Against Nvidia GPUs For AI

June 13, 2024 Timothy Prickett Morgan 12

Updated: Here is something we don’t see much anymore when it comes to AI systems: list prices for the accelerators and the base motherboards that glue a bunch of them together into a shared compute complex. …

AI

Talking AI Costs And Addressable Markets With SambaNova

February 14, 2024 Timothy Prickett Morgan 1

The only way to accurately predict the future is to live it, but just the same, prognostication is one of the things that we humans love to do. …

AI

How AWS Can Undercut Nvidia With Homegrown AI Compute Engines

December 4, 2023 Timothy Prickett Morgan 1

Amazon Web Services may not be the first of the hyperscalers and cloud builders to create its own custom compute engines, but it has been hot on the heels of Google, which started using its homegrown TPU accelerators for AI workloads in 2015. …

AI

Groq Says It Can Deploy 1 Million AI Inference Chips In Two Years

November 27, 2023 Timothy Prickett Morgan 2

If you are looking for an alternative to Nvidia GPUs for AI inference – and who isn’t these days with generative AI being the hottest thing since a volcanic eruption – then you might want to give Groq a call. …

AI

Big Blue Can Still Catch The AI Wave If It Hurries

November 6, 2023 Timothy Prickett Morgan 5

It has been two and a half decades since we have seen a rapidly expanding universe of a new kind of compute that rivals the current generative AI boom. …

AI

Optimizing AI Inference Is As Vital As Building AI Training Beasts

September 11, 2023 Timothy Prickett Morgan 8

The history of computing teaches us that software always and necessarily lags hardware, and unfortunately that lag can stretch for many years when it comes to wringing the best performance out of iron by tweaking algorithms. …

AI

Chiplet Cloud Can Bring The Cost Of LLMs Way Down

July 12, 2023 Timothy Prickett Morgan 18

If Nvidia and AMD are licking their lips thinking about all of the GPUs they can sell to the hyperscalers and cloud builders to support their huge aspirations in generative AI – particularly when it comes to the OpenAI GPT large language model that is the centerpiece of all of the company’s future software and services – they had better think again. …

AI

Meta Platforms Crafts Homegrown AI Inference Chip, AI Training Next

May 18, 2023 Timothy Prickett Morgan 5

As we pointed out a year ago when some key silicon experts were hired from Intel and Broadcom to come work for Meta Platforms, the company formerly known as Facebook was always the most obvious place to do custom silicon. …

AI

A Peek Into The Future Of AI Inference At Nvidia

March 31, 2023 Timothy Prickett Morgan 5

The best kinds of research are those that test new ideas and that also lead to practical innovations in real products. …

Posts navigation

1 2 … 4 »
About

The Next Platform is part of the Situation Publishing family, which includes the enterprise and business technology publication, The Register.

TNP  offers in-depth coverage of high-end computing at large enterprises, supercomputing centers, hyperscale data centers, and public clouds. Read more…

Newsletter

Featuring highlights, analysis, and stories from the week directly from us to your inbox with nothing in between.
Subscribe now

  • RSS
  • Twitter
  • Facebook
  • LinkedIn
  • Email the editor
  • About
  • Contributors
  • Contact
  • Sales
  • Newsletter
  • Books
  • Events
  • Privacy
  • Ts&Cs
  • Cookies
  • Do not sell my personal information

All Content Copyright The Next Platform