Saturday, March 7, 2026
No Result
View All Result
Blockchain 24hrs
  • Home
  • Bitcoin
  • Crypto Updates
    • General
    • Altcoins
    • Ethereum
    • Crypto Exchanges
  • Blockchain
  • NFT
  • DeFi
  • Metaverse
  • Web3
  • Blockchain Justice
  • Analysis
Crypto Marketcap
  • Home
  • Bitcoin
  • Crypto Updates
    • General
    • Altcoins
    • Ethereum
    • Crypto Exchanges
  • Blockchain
  • NFT
  • DeFi
  • Metaverse
  • Web3
  • Blockchain Justice
  • Analysis
No Result
View All Result
Blockchain 24hrs
No Result
View All Result

NVIDIA Unveils Llama 3.1-Nemotron-51B: A Leap in Accuracy and Efficiency

Home Blockchain
Share on FacebookShare on Twitter




Luisa Crawford
Sep 24, 2024 10:02

NVIDIA’s Llama 3.1-Nemotron-51B units new benchmarks in AI with superior accuracy and effectivity, enabling excessive workloads on a single GPU.





NVIDIA has introduced the discharge of a groundbreaking language mannequin, Llama 3.1-Nemotron-51B, which guarantees to ship unprecedented accuracy and effectivity in AI efficiency. Derived from Meta’s Llama-3.1-70B, the brand new mannequin employs a novel Neural Structure Search (NAS) method, considerably enhancing each its accuracy and effectivity. In line with the NVIDIA Technical Weblog, this mannequin can match on a single NVIDIA H100 GPU even below excessive workloads, making it extra accessible and cost-effective.

Superior Throughput and Workload Effectivity

The Llama 3.1-Nemotron-51B mannequin outperforms its predecessors with 2.2 occasions quicker inference speeds whereas sustaining almost the identical degree of accuracy. This effectivity permits for 4 occasions bigger workloads on a single GPU throughout inference, due to its diminished reminiscence footprint and optimized structure.

Optimized Accuracy Per Greenback

One of many vital challenges in adopting giant language fashions (LLMs) is their inference price. The Llama 3.1-Nemotron-51B mannequin addresses this by providing a balanced tradeoff between accuracy and effectivity, making it an economical resolution for numerous functions, starting from edge programs to cloud knowledge facilities. This functionality is especially advantageous for deploying a number of fashions by way of Kubernetes and NIM blueprints.

Simplifying Inference with NVIDIA NIM

The Nemotron mannequin is optimized with TensorRT-LLM engines for larger inference efficiency and is packaged as an NVIDIA NIM inference microservice. This setup simplifies and accelerates the deployment of generative AI fashions throughout NVIDIA’s accelerated infrastructure, together with cloud, knowledge facilities, and workstations.

Below the Hood – Constructing the Mannequin with NAS

The Llama 3.1-Nemotron-51B-Instruct mannequin was developed utilizing environment friendly NAS know-how and coaching strategies, permitting for the creation of non-standard transformer fashions optimized for particular GPUs. This method features a block-distillation framework to coach numerous block variants in parallel, guaranteeing environment friendly and correct inference.

Tailoring LLMs for Numerous Wants

NVIDIA’s NAS method permits customers to pick their optimum steadiness between accuracy and effectivity. As an example, the Llama-3.1-Nemotron-40B-Instruct variant was created to prioritize velocity and value, reaching a 3.2 occasions velocity improve in comparison with the mother or father mannequin with a average lower in accuracy.

Detailed Outcomes

The Llama 3.1-Nemotron-51B-Instruct mannequin has been benchmarked in opposition to a number of business requirements, demonstrating its superior efficiency in numerous situations. It doubles the throughput of the reference mannequin, making it cost-effective throughout a number of use circumstances.

The Llama 3.1-Nemotron-51B-Instruct mannequin supplies a brand new set of alternatives for customers and corporations aiming to make the most of extremely correct basis fashions cost-effectively. Its steadiness between accuracy and effectivity makes it a horny choice for builders and showcases the effectiveness of the NAS method, which NVIDIA plans to increase to different fashions.

Picture supply: Shutterstock



Source link

Tags: 3.1Nemotron51BAccuracyEfficiencyLeapLlamaNVIDIAUnveils
Previous Post

Hackers Target OpenAI’s X Account, Promote Phishing Scam

Next Post

Ethereum price surge lifts Lido TVL by 10% despite 26k ETH withdrawals

Related Posts

ElevenLabs Launches Generative Voice AI Tool for Custom Synthetic Voices
Blockchain

ElevenLabs Launches Generative Voice AI Tool for Custom Synthetic Voices

March 6, 2026
Expert Tips to Become a Web3 Expert
Blockchain

Expert Tips to Become a Web3 Expert

March 6, 2026
OpenAI Deploys ChatGPT on Pentagon’s GenAI.mil Platform for 3M Defense Personnel
Blockchain

OpenAI Deploys ChatGPT on Pentagon’s GenAI.mil Platform for 3M Defense Personnel

March 6, 2026
OpenAI Launches €500K Grant for Youth AI Safety Research in EMEA
Blockchain

OpenAI Launches €500K Grant for Youth AI Safety Research in EMEA

March 5, 2026
NVIDIA Releases Flash Attention Optimization Guide for Blackwell GPUs
Blockchain

NVIDIA Releases Flash Attention Optimization Guide for Blackwell GPUs

March 4, 2026
OpenAI Releases GABRIEL Toolkit to Transform Social Science Research
Blockchain

OpenAI Releases GABRIEL Toolkit to Transform Social Science Research

March 3, 2026
Next Post
Ethereum price surge lifts Lido TVL by 10% despite 26k ETH withdrawals

Ethereum price surge lifts Lido TVL by 10% despite 26k ETH withdrawals

AUTOMA 2024 Tackles Key Challenges in Oil & Gas

AUTOMA 2024 Tackles Key Challenges in Oil & Gas

Facebook Twitter Instagram Youtube RSS
Blockchain 24hrs

Blockchain 24hrs delivers the latest cryptocurrency and blockchain technology news, expert analysis, and market trends. Stay informed with round-the-clock updates and insights from the world of digital currencies.

CATEGORIES

  • Altcoins
  • Analysis
  • Bitcoin
  • Blockchain
  • Blockchain Justice
  • Crypto Exchanges
  • Crypto Updates
  • DeFi
  • Ethereum
  • Metaverse
  • NFT
  • Regulations
  • Web3

SITEMAP

  • About Us
  • Advertise With Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact Us

Copyright © 2024 Blockchain 24hrs.
Blockchain 24hrs is not responsible for the content of external sites.

  • bitcoinBitcoin(BTC)$67,994.00-3.37%
  • ethereumEthereum(ETH)$1,984.08-3.43%
  • tetherTether(USDT)$1.000.00%
  • binancecoinBNB(BNB)$629.71-1.55%
  • rippleXRP(XRP)$1.37-2.07%
  • usd-coinUSDC(USDC)$1.00-0.01%
  • solanaSolana(SOL)$84.55-3.19%
  • tronTRON(TRX)$0.283591-1.04%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.02-1.05%
  • dogecoinDogecoin(DOGE)$0.090377-2.46%
No Result
View All Result
  • Home
  • Bitcoin
  • Crypto Updates
    • General
    • Altcoins
    • Ethereum
    • Crypto Exchanges
  • Blockchain
  • NFT
  • DeFi
  • Metaverse
  • Web3
  • Blockchain Justice
  • Analysis
Crypto Marketcap

Copyright © 2024 Blockchain 24hrs.
Blockchain 24hrs is not responsible for the content of external sites.