xAI releases Grok-1.5V, a multimodal AI model with vision support
xAI, an artificial intelligence company under Musk, announced the launch of its first multimodal AI model, Grok-1.5V. In addition to its powerful text processing capabilities, Grok can also process various visual information, including documents, charts, screenshots, and photos. In benchmark tests in multiple fields, Grok-1.5V's performance is comparable to existing cutting-edge multimodal models. Especially in xAI's newly launched RealWorldQA benchmark test, Grok surpassed similar models in its ability to understand the real-world space. The RealWorldQA dataset contains more than 700 images and aims to evaluate the basic understanding ability of multimodal models in the physical world. Grok-1.5 will soon be open to early testers and existing users.
Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
You may also like
Today’s Crypto Highlights: Ripple Courts Circle, BTC Updates & More
Behind the Scenes: Ripple's Circle Acquisition Attempt and Unfolding Crypto Adoption Developments

Bitcoin Shift Causes Market Disruption – Analyzing a Whale’s $170M Move and its Impact
Unraveling the Underlying Implications of a Massive Bitcoin Transfer - Possible Harbinger of Major Market Movements?

Moon soon? XRP's strongest spot premium aligns with 70% rally setup
Price predictions 5/2: BTC, ETH, XRP, BNB, SOL, DOGE, ADA, SUI, LINK, AVAX
Trending news
MoreCrypto prices
More








