Uh-oh, X's Grok AI can now 'understand' images

New "Vision" Grok will be available to testers and select users.
By
Chase DiBenedetto
 on 
A phone displaying the Grok xAI logo, which is a white box with a black slash across it.
X's new Grok "Vision" can process complex visual information.Credit: Jaap Arriens/NurPhoto via Getty Images

Elon Musk's AI chatbot can now "understand" images, including information-riddled diagrams and charts. Sorry, doesn't everyone use the platform once known as Twitter for multi-disciplinary research and optimizing their work flows??

Introduced as Grok-1.5V — Or Grok 1.5 "Vision," the company's "first-generation multimodal model" — the bot will be able to not only respond to your uploaded pictures and screenshots but also reason through complex documents, science diagrams, charts, screenshots, and photographs, the company says. Additionally, Grok-1.5V will gain "real-world spatial understanding" to better understand the physical world depicted in the images uploaded by its users.

"Advancing both our multimodal understanding and generation capabilities are important steps in building beneficial AGI that can understand the universe," the company wrote in its' announcement. "In the coming months, we anticipate to make significant improvements in both capabilities, across various modalities such as images, audio, and video."

Example use cases include translating a diagram into Python code, turning a child's drawing into a bedroom story, pinpointing the largest object among a group of many, and telling a driver if they have enough space to drive around an obstacle.

Mashable Light Speed
Want more out-of-this world tech, space and science stories?
Sign up for Mashable's weekly Light Speed newsletter.
By clicking Sign Me Up, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up!

Grok-1.5V is released along with xAI's RealWorldQA, an image and prompt dataset designed to test other GenAI models against Grok's real world reasoning.

Competition is the least of Grok's worries, however. Despite xAI's continued investment, Grok has yet to stick with early users and staff — a new report alleges its own developers struggle to use the slow xAI API. That same report, published by Fortune this week, highlighted X employee concerns about Musk suggesting Grok write paid user's posts for them, despite warnings from developers and staff. Last week, Grok came under fire for generating fake news headlines from an alternate reality where Iran had assailed Tel Aviv with a military arsenal — not its first time.

While GenAI chatbots hallucinating realities and generating fake news is par for the course, Grok's gaffe is indicative of yet another site wide issue. The bot, a par for the course response to ChatGPT from Musk, is integrating into a platform that has slowly whittled away at its defenses against AI gone bad. Combined with X's all around poor reputation for moderation and the CEO's own refusal to address misinformation in aid of the site's "citizen journalists," Grok occupies a precarious spot in the platform's besieged information ecosystem.

Grok-1.5V will be available to early testers and select users soon.

Chase sits in front of a green framed window, wearing a cheetah print shirt and looking to her right. On the window's glass pane reads "Ricas's Tostadas" in red lettering.
Chase DiBenedetto
Social Good Reporter

Chase joined Mashable's Social Good team in 2020, covering online stories about digital activism, climate justice, accessibility, and media representation. Her work also captures how these conversations manifest in politics, popular culture, and fandom. Sometimes she's very funny.


Recommended For You
Grok now has a memory
Grok logo

Elon Musk unveils Grok 3 and 'Deep Search' tool
Grok 3 logo

How Grok 3 compares to ChatGPT, DeepSeek and other AI rivals
grok 3 app on a smartphone in front of a computer screen showing the grok 3 interface.

Grok blocked sources accusing Elon Musk of spreading misinformation
A zoomed-in photo of the Grok AI Assistant desktop page.


More in Tech
A tariff survival guide to buying refurbished smartphones, TVs, and headphones
Blue tinted Asus laptop, Apple AirPods Max headphones, and Amazon Kindle arranged on blue and green backdrop

LinkedIn and Adobe announce partnership to help creators protect their work against AI
A phone showing the LinkedIn app in the app store.

The Anker Solix C1000 portable power station is better than half price
Anker Solix C1000 portable power station on gradient background

Save over $900 on the Ecovacs Deebot X1 Omni robot vacuum
Ecovacs Deebot X1 Omni robot vacuum on gradient background

This obnoxiously large Samsung TV is on sale for over $2,000 off — get the lowest-ever price
Samsung Crystal UHD DU9000 TV on gradient background

Trending on Mashable
NYT Connections hints today: Clues, answers for April 27, 2025
Connections game on a smartphone

Wordle today: Answer, hints for April 27, 2025
Wordle game on a smartphone

NYT Strands hints, answers for April 27
A game being played on a smartphone.

NYT Connections hints today: Clues, answers for April 26, 2025
Connections game on a smartphone

How to watch the 2025 MotoGP Grand Prix of Spain online for free
Ducati Lenovo's Italian rider Francesco Bagnaia races ahead of other riders
The biggest stories of the day delivered to your inbox.
These newsletters may contain advertising, deals, or affiliate links. By clicking Subscribe, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up. See you at your inbox!
close