feedzop-word-mark-logo
searchLogin
Feedzop
homeFor YouUnited StatesUnited States
You
bookmarksYour BookmarkshashtagYour Topics
Trending
trending

Wembanyama returns against OKC Thunder

trending

Knicks reach NBA Cup Final

trending

Pakistan to launch 5G

trending

Houston TV legend Dave Ward

trending

Lake effect snow warning issued

trending

Shooting at Brown University

trending

Flames beat LA Kings

trending

Graham Ike leads Gonzaga

trending

Bondi Beach: developing incident

Terms of UsePrivacy PolicyAboutJobsPartner With Us

© 2025 Advergame Technologies Pvt. Ltd. ("ATPL"). Gamezop ® & Quizzop ® are registered trademarks of ATPL.

Gamezop is a plug-and-play gaming platform that any app or website can integrate to bring casual gaming for its users. Gamezop also operates Quizzop, a quizzing platform, that digital products can add as a trivia section.

Over 5,000 products from more than 70 countries have integrated Gamezop and Quizzop. These include Amazon, Samsung Internet, Snap, Tata Play, AccuWeather, Paytm, Gulf News, and Branch.

Games and trivia increase user engagement significantly within all kinds of apps and websites, besides opening a new stream of advertising revenue. Gamezop and Quizzop take 30 minutes to integrate and can be used for free: both by the products integrating them and end users

Increase ad revenue and engagement on your app / website with games, quizzes, astrology, and cricket content. Visit: business.gamezop.com

Property Code: 5571

Home / Technology / AI Sees, Reasons, and Acts: New Vision Model Unveiled

AI Sees, Reasons, and Acts: New Vision Model Unveiled

9 Dec

•

Summary

  • New open-source vision-language model enhances multimodal reasoning.
  • Introduces native function calling for direct tool integration.
  • Achieves state-of-the-art results across over 20 benchmarks.
AI Sees, Reasons, and Acts: New Vision Model Unveiled

Chinese AI startup Zhipu AI has unveiled its GLM-4.6V series, a new generation of open-source vision-language models. These models are optimized for multimodal reasoning and frontend automation, featuring native function calling that allows direct use of tools with visual inputs. The series boasts a 128,000 token context length and state-of-the-art results across over 20 benchmarks, positioning it as a strong competitor in the AI landscape.

The GLM-4.6V models utilize an encoder-decoder architecture with a Vision Transformer and an LLM decoder, supporting arbitrary image resolutions and video inputs. A key innovation is native multimodal function calling, which eliminates the need for text-only conversions when integrating visual assets with tools. This enables tasks like generating structured reports from mixed documents and performing visual web searches.

Distributed under the permissive MIT license, GLM-4.6V is suitable for enterprise adoption, offering flexibility for proprietary systems and local deployments. The models have demonstrated high performance, with the 106B version outperforming larger models on long-context tasks and video summarization, while the 9B Flash variant excels among lightweight models.

Disclaimer: This story has been auto-aggregated and auto-summarised by a computer program. This story has not been edited or created by the Feedzop team.
GLM-4.6V is an open-source vision-language model from Zhipu AI, designed for multimodal reasoning and automation with native tool integration.
It features native function calling for direct visual tool use, supports a 128,000 token context length, and achieves state-of-the-art results on many benchmarks.
Yes, GLM-4.6V is released under the MIT license, allowing for free commercial and non-commercial use, modification, and redistribution.

Read more news on

Technologyside-arrowArtificial Intelligence (AI)side-arrow

You may also like

Robots Join Human Workforce: 10,000 Deployed Soon

1 day ago • 11 reads

article image

AI's Cyber Threat Escalates: OpenAI Prepares Defenses

1 day ago • 19 reads

article image

OpenAI Data Breach: Your Data's Safety at Risk

1 day ago • 18 reads

article image

BigBear.ai Stock: AI Hype vs. Reality

17 hours ago • 4 reads

article image

New GPT-5.2: Brevity and 'Go Signals' Frustrate Users

1 day ago • 11 reads

article image