Thursday, April 23, 2026
Blockchain 24hrs

NVIDIA Unveils Llama 3.1-Nemotron-70B-Reward to Enhance AI Alignment with Human Preferences





Felix Pinkston
Oct 06, 2024 14:20

NVIDIA introduces Llama 3.1-Nemotron-70B-Reward, a leading reward model that improves AI alignment with human preferences using RLHF, topping the RewardBench leaderboard.





NVIDIA has released a new reward model, Llama 3.1-Nemotron-70B-Reward, aimed at improving the alignment of large language models (LLMs) with human preferences. The release is part of NVIDIA's efforts to use reinforcement learning from human feedback (RLHF) to improve AI systems, according to the NVIDIA Technical Blog.

Developments in AI Alignment

Reinforcement learning from human feedback is essential for building AI systems that reflect human values and preferences. The technique enables advanced LLMs such as ChatGPT, Claude, and Nemotron to generate responses that match user expectations more accurately. By incorporating human feedback, these models exhibit improved decision-making and more nuanced behavior, fostering trust in AI applications.
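As a rough illustration of how human feedback can become a training signal, the sketch below computes a pairwise preference loss of the kind commonly used for reward models: the loss is small when the response people preferred receives the higher reward. This is a generic formulation offered as an assumption, not a description of NVIDIA's training recipe, and the function name `preference_loss` is hypothetical.

```python
import math


def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Pairwise preference loss: -log(sigmoid(reward_chosen - reward_rejected)).

    Illustrative only; assumes a reward model that outputs one scalar per response.
    """
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch on the sign of the
    # margin so that exp() never receives a large positive argument.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# The loss shrinks as the preferred response is scored further above the other.
agrees_with_humans = preference_loss(2.0, 0.0)
disagrees_with_humans = preference_loss(0.0, 2.0)
```

Minimizing a loss like this over many human-labeled comparisons pushes the reward model to rank responses the way people do.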

Llama 3.1-Nemotron-70B-Reward Model

The Llama 3.1-Nemotron-70B-Reward model has taken the top position on the Hugging Face RewardBench leaderboard, which evaluates the capabilities, safety, and pitfalls of reward models. With a score of 94.1% on Overall RewardBench, the model shows a strong ability to identify responses that align with human preferences.

The model performs well across four categories: Chat, Chat-Hard, Safety, and Reasoning. It reaches 95.1% accuracy in Safety and 98.1% in Reasoning. These results underscore the model's ability to safely reject unsafe responses and its potential to help in domains like mathematics and coding.
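To make accuracy figures like these concrete, here is a minimal sketch of how pairwise accuracy for a reward model can be computed: for each prompt, the model scores a human-preferred ("chosen") response and a "rejected" one, and a pair counts as correct when the chosen response scores higher. The names `pairwise_accuracy` and `toy_score` are hypothetical, and the toy scorer only stands in for a real reward model.

```python
from typing import Callable, Sequence, Tuple

# (prompt, chosen response, rejected response)
Pair = Tuple[str, str, str]


def pairwise_accuracy(score: Callable[[str, str], float],
                      pairs: Sequence[Pair]) -> float:
    """Fraction of pairs where the chosen response scores strictly higher."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    wins = sum(
        1
        for prompt, chosen, rejected in pairs
        if score(prompt, chosen) > score(prompt, rejected)
    )
    return wins / len(pairs)


def toy_score(prompt: str, response: str) -> float:
    # Hypothetical stand-in for a reward model: longer responses score higher.
    return float(len(response))


example_pairs = [
    ("What is 2 + 2?", "2 + 2 equals 4.", "5"),
    ("Name a prime number.", "7", "Nine is a prime number."),
]
accuracy = pairwise_accuracy(toy_score, example_pairs)
```

With the toy scorer, the first pair is ranked correctly and the second is not, so `accuracy` comes out to 0.5. A real evaluation would replace `toy_score` with calls to the reward model.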

Implementation and Efficiency

NVIDIA has optimized the model for high compute efficiency: it is only one fifth the size of Nemotron-4 340B Reward while delivering superior accuracy. The model was trained on CC-BY-4.0-licensed HelpSteer2 data, making it suitable for enterprise use cases. The training process combined two popular approaches, ensuring high data quality and advancing AI capabilities.

Deployment and Accessibility

The Nemotron Reward model is packaged as an NVIDIA NIM inference microservice, which simplifies deployment across a range of infrastructure, including cloud, data centers, and workstations. NVIDIA NIM uses inference optimization engines and industry-standard APIs to deliver high-throughput AI inference that scales with demand.

Users can try the Llama 3.1-Nemotron-70B-Reward model directly from their browsers or use the NVIDIA-hosted API for large-scale testing and proof-of-concept development. The model is also available for download on platforms like Hugging Face, giving developers flexible options for integration.

Image source: Shutterstock





Copyright © 2024 Blockchain 24hrs.
Blockchain 24hrs is not responsible for the content of external sites.
