FintechoPro

Nvidia reveals new A.I. chip, says costs of running LLMs will ‘drop significantly’

By News Room, August 9, 2023

On Tuesday, Nvidia announced a new chip designed to run artificial intelligence models as it seeks to fend off competitors in the AI hardware space, including AMD, Google and Amazon.

Currently, Nvidia dominates the market for AI chips with over 80% market share, according to some estimates. The company’s specialty is graphics processing units, or GPUs, which have become the preferred chips for the large AI models that underpin generative AI software, such as Google’s Bard and OpenAI’s ChatGPT. But Nvidia’s chips are in short supply as tech giants, cloud providers and startups vie for GPU capacity to develop their own AI models.

Nvidia’s new chip, the GH200, has the same GPU as the company’s current highest-end AI chip, the H100. But the GH200 pairs that GPU with 141 gigabytes of cutting-edge memory, as well as a 72-core ARM central processor.

“We’re giving this processor a boost,” Nvidia CEO Jensen Huang said in a talk at a conference on Tuesday. He added, “This processor is designed for the scale-out of the world’s data centers.”

The new chip will be available from Nvidia’s distributors in the second quarter of next year, Huang said, and should be available for sampling by the end of the year. Nvidia representatives declined to give a price.

Working with AI models is typically split into at least two parts: training and inference.

First, a model is trained on large amounts of data, a process that can take months and sometimes requires thousands of GPUs, such as Nvidia’s H100 and A100 chips. Then the model is used in software to make predictions or generate content, a process called inference. Like training, inference is computationally expensive, requiring a lot of processing power every time the software runs, such as when it generates text or an image. But unlike training, which is only required when the model needs updating, inference takes place near-constantly.

“You can take pretty much any large language model you want and put it in this and it will inference like crazy,” Huang said. “The inference cost of large language models will drop significantly.”

Nvidia’s new GH200 is designed for inference since it has more memory capacity, allowing larger AI models to fit on a single system, Nvidia VP Ian Buck said on a call with analysts and reporters on Tuesday. Nvidia’s H100 has 80GB of memory, versus 141GB on the new GH200. Nvidia also announced a system that combines two GH200 chips into a single computer for even larger models.

“Having larger memory allows the model to remain resident on a single GPU and not have to require multiple systems or multiple GPUs in order to run,” Buck said.
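As a rough illustration of Buck’s point, the sketch below estimates how many chips are needed just to hold a model’s weights, using the 80GB and 141GB figures quoted above. Everything else is an assumption for illustration and not from Nvidia: the 70-billion-parameter model size, 16-bit weights (2 bytes per parameter), decimal gigabytes, and the helper names `weights_gb` and `chips_needed`. It counts weights only; real inference also needs memory for caches and activations, so the actual fit is tighter than this suggests.

```python
import math

# Memory capacities in GB, as quoted in the article.
CAPACITY_GB = {"H100": 80, "GH200": 141}


def weights_gb(params_billion: float, bytes_per_param: float = 2.0) -> float:
    """Approximate weight memory in decimal GB (1 GB = 1e9 bytes).

    Assumes every parameter is stored at `bytes_per_param` bytes
    (2.0 corresponds to 16-bit weights). Weights only; no cache or activations.
    """
    # (params_billion * 1e9 params) * bytes_per_param / 1e9 bytes per GB
    return params_billion * bytes_per_param


def chips_needed(params_billion: float, capacity_gb: float,
                 bytes_per_param: float = 2.0) -> int:
    """Minimum number of chips whose combined memory holds the weights."""
    return math.ceil(weights_gb(params_billion, bytes_per_param) / capacity_gb)


if __name__ == "__main__":
    model_size_billion = 70  # hypothetical model size, chosen for illustration
    for chip, capacity in CAPACITY_GB.items():
        n = chips_needed(model_size_billion, capacity)
        print(f"{chip} ({capacity} GB): {n} chip(s) for "
              f"{weights_gb(model_size_billion):.0f} GB of weights")
```

Under these assumptions the hypothetical model’s weights take about 140GB, which spans two 80GB chips but fits within a single 141GB one, with almost no headroom left over.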

The announcement comes shortly after Nvidia’s primary GPU rival, AMD, announced its own AI-oriented chip, the MI300X, which can support 192GB of memory and is being marketed for its capacity for AI inference. Companies including Google and Amazon are also designing their own custom AI chips for inference.

© 2026 FintechoPro. All Rights Reserved.
