feedzop-word-mark-logo
searchLogin
Feedzop
homeFor YouIndiaIndia
You
bookmarksYour BookmarkshashtagYour Topics
Trending
trending

Istanbul flights save hundreds

trending

Faf du Plessis retires from IPL

trending

Wockhardt share price jumps

trending

HDFC Bank stock live updates

trending

Paytm share price rallies

trending

IPL auction: 1355 players register

trending

Hardik Pandya returns to cricket

trending

Kospi index rises on buying

trending

Bangladesh wins T20I series

Terms of UsePrivacy PolicyAboutJobsPartner With Us

© 2025 Advergame Technologies Pvt. Ltd. ("ATPL"). Gamezop ® & Quizzop ® are registered trademarks of ATPL.

Gamezop is a plug-and-play gaming platform that any app or website can integrate to bring casual gaming for its users. Gamezop also operates Quizzop, a quizzing platform, that digital products can add as a trivia section.

Over 5,000 products from more than 70 countries have integrated Gamezop and Quizzop. These include Amazon, Samsung Internet, Snap, Tata Play, AccuWeather, Paytm, Gulf News, and Branch.

Games and trivia increase user engagement significantly within all kinds of apps and websites, besides opening a new stream of advertising revenue. Gamezop and Quizzop take 30 minutes to integrate and can be used for free: both by the products integrating them and end users

Increase ad revenue and engagement on your app / website with games, quizzes, astrology, and cricket content. Visit: business.gamezop.com

Property Code: 5571

Home / Technology / Open-Weight AI Fails Under Sustained Chat Attacks

Open-Weight AI Fails Under Sustained Chat Attacks

2 Dec

•

Summary

  • Multi-turn AI attacks bypass defenses previously thought secure.
  • Attack success rates jump from 13% to 92% with conversational probing.
  • Security researchers highlight the need for context-aware AI guardrails.
Open-Weight AI Fails Under Sustained Chat Attacks

Open-weight AI models exhibit a critical weakness: while effective against single-turn attacks, they collapse under sustained conversational pressure. Cisco's research quantifies this, showing attack success rates escalating dramatically from an average of 13.11% for single prompts to 64.21% for multi-turn assaults, with some models reaching over 92% failure. This stark contrast underscores that current safety benchmarks fail to capture real-world adversarial tactics.

These multi-turn attacks exploit conversational persistence through techniques like information decomposition, contextual ambiguity, and role-playing. Researchers found that models struggle to maintain contextual defenses over extended dialogues, allowing attackers to refine prompts and bypass safeguards. This vulnerability is systemic, affecting numerous leading open-weight models tested, regardless of their alignment focus, though capability-first models show wider gaps.

To bridge this security gap, enterprises must prioritize context-aware guardrails, model-agnostic runtime protections, and continuous red-teaming focused on multi-turn strategies. Ignoring this systemic vulnerability could lead to catastrophic failures, emphasizing that securing AI conversations, not just individual prompts, is crucial for unlocking wider adoption and mitigating significant security risks.

Disclaimer: This story has been auto-aggregated and auto-summarised by a computer program. This story has not been edited or created by the Feedzop team.
Multi-turn attacks exploit AI models' difficulty in maintaining contextual defenses over extended dialogues, allowing attackers to refine prompts and bypass safeguards.
Techniques include information decomposition, contextual ambiguity, crescendo attacks, role-play, and refusal reframing, all exploiting conversational persistence.
Enterprises should implement context-aware guardrails, model-agnostic runtime protections, continuous red-teaming, and hardened system prompts.

Read more news on

Technologyside-arrow

You may also like

AI Revolutionizes Accounting: $93B Market by 2032

3 hours ago • 12 reads

article image

AI's Sexist Slip-Up: "You Can't Understand Quantum Algorithms"

29 Nov • 36 reads

article image

AI Observability: From Black Box to Trustworthy Systems

30 Nov • 11 reads

article image

AI Earnings Fuel Global Travels

28 Nov • 19 reads

article image

Service Robots Market Surges Towards $147 Billion

27 Nov • 40 reads

article image