feedzop-word-mark-logo
searchLogin
Feedzop
homeFor YouIndiaIndia
You
bookmarksYour BookmarkshashtagYour Topics
Trending
Terms of UsePrivacy PolicyAboutJobsPartner With Us

© 2026 Advergame Technologies Pvt. Ltd. ("ATPL"). Gamezop ® & Quizzop ® are registered trademarks of ATPL.

Gamezop is a plug-and-play gaming platform that any app or website can integrate to bring casual gaming for its users. Gamezop also operates Quizzop, a quizzing platform, that digital products can add as a trivia section.

Over 5,000 products from more than 70 countries have integrated Gamezop and Quizzop. These include Amazon, Samsung Internet, Snap, Tata Play, AccuWeather, Paytm, Gulf News, and Branch.

Games and trivia increase user engagement significantly within all kinds of apps and websites, besides opening a new stream of advertising revenue. Gamezop and Quizzop take 30 minutes to integrate and can be used for free: both by the products integrating them and end users

Increase ad revenue and engagement on your app / website with games, quizzes, astrology, and cricket content. Visit: business.gamezop.com

Property Code: 5571

Home / Technology / Google AI Learns Smarter Reasoning

Google AI Learns Smarter Reasoning

17 Jan

•

Summary

  • New AI technique steers internal activations for reasoning.
  • Internal RL bypasses token-by-token prediction limits.
  • This could enable autonomous agents for complex tasks.
Google AI Learns Smarter Reasoning

Researchers at Google have introduced a new method called internal reinforcement learning (internal RL) to improve AI's ability to handle complex reasoning tasks. This technique steers the AI model's internal activations, guiding it towards developing high-level, step-by-step solutions rather than relying on traditional next-token prediction. This approach aims to overcome the limitations of autoregressive models, which struggle with long-horizon planning and sparse rewards.

The internal RL method utilizes an "internal neural network controller" that modifies the model's internal activations. This controller learns high-level actions through unsupervised, self-supervised learning by analyzing sequences of behavior and inferring the underlying intent. The researchers found that applying this controller to a frozen pre-trained model was more effective, enabling it to discover key subgoals without human labels.

Experiments demonstrated that internal RL significantly outperforms traditional methods like GRPO on complex tasks with sparse rewards. This advancement could lead to the development of autonomous agents capable of handling intricate reasoning and real-world robotics, potentially offering a more efficient path to advanced AI capabilities.

Disclaimer: This story has been auto-aggregated and auto-summarised by a computer program. This story has not been edited or created by the Feedzop team.
Google's internal RL technique guides an AI model's internal activations to develop step-by-step reasoning solutions, improving performance on complex tasks.
Internal RL focuses on steering the AI's internal states towards abstract goals, unlike next-token prediction which generates output one word at a time.
Internal RL could enable more capable autonomous agents for complex reasoning tasks, real-world robotics, and multi-modal AI without constant human guidance.

Read more news on

Technologyside-arrowGoogleside-arrowArtificial Intelligence (AI)side-arrow
trending

Srinagar flights cancelled due weather

trending

Sakhalin Russia sees 'sun dogs'

trending

Tripura rooftop solar generation

trending

Amsterdam cruise terminal may close

trending

Alexander Zverev advances in Australia

trending

Gaethje beats Pimblett, wins title

trending

Alcaraz seeks Australian Open title

trending

Umar Nurmagomedov defeats Figueiredo

trending

Jean Silva edges Arnold Allen

You may also like

Google Search Gets Personal: Your Data, Your Results

23 Jan • 10 reads

article image

CEOs Fear Revenue Slump: Tech Fears Grip Business Leaders

21 Jan • 78 reads

article image

AI Progress Shifts: System Design Over Model Size

18 Jan • 28 reads

article image

Robots Learn Like Humans with New AI Model

13 Jan • 58 reads

article image

AI's Backbone: Arista Fuels Data Superhighways

25 Dec, 2025 • 142 reads

article image