长沙USDT支付账户|【唯一TG:@heimifeng8】|盗U混币器使用教程✨谷歌搜索留痕排名,史上最强SEO技术,20年谷歌SEO经验大佬✨Elon Musk gives Grok "vision": Neural network now sees the world through iPhone camera

Elon Musk gives Grok "vision": Neural network now sees the world through iPhone camera
April 23,长沙USDT支付账户 2025 13:35Elon Musk’s xAI has taken a bold step in advancing artificial intelligence: its chatbot Groknow features computer vision capabilities. The update, dubbed Grok Voice with Vision, enables the neural network to analyze images and videos via an iPhone camera, unlocking new ways for users to interact with AI. xAI developer Ebby Amir announced the feature on X. Here’s how it works, what it can do, and why it could be a game-changer in the AI landscape.
What Is Grok Voice with Vision?
Launched in 2025, Grokis xAI’s witty chatbot, integrated with the X platform and known for its uncensored responses. The April 2025 update introduces computer vision, allowing Grok to “see” through an iPhone camera. Named Grok Voice with Vision, the feature is embedded in the voice mode and available in the Grok iOS app. It’s accessible even without a subscription, though premium plans (X Premium+ or SuperGrok) unlock extras like an “unfiltered” conversational mode.
How It Works:
- Users point their iPhone camera at an object, scene, or image.
- Grok processes the visual input in real-time, identifying objects, people (without facial recognition), text, or context.
- The AI describes what it sees or answers related questions in one of its supported languages.
- Example: Point the camera at an apple and ask, “How do I make a dessert with this?” Grok might suggest a recipe or find similar products online.
Still in beta, the feature is already impressive. Elon Musk called it “mind-blowing,” and X users liken it to a “smart guide in your pocket.”
What Can Grok Vision Do?
The update expands Grok’s capabilities, positioning it as a rival to AI models like ChatGPT with GPT-4o and Google Gemini. Key use cases include:
- Object Recognition: Grok identifies items like products, tools, or plants, offering usage instructions. Show it a hammer, and it explains how to drive a nail.
- Context Analysis: Point at a street sign, and Grok translates text or provides location details—ideal for travelers.
- Product Search: Snap a photo of clothing or a gadget, and Grok finds similar items online, leveraging X data.
- Education: Students can show textbooks or problems, receiving explanations or solutions. Grok already excels in math and coding.
- Creativity: The AI can describe scenes for inspiration or suggest photography ideas based on surroundings.
Beyond vision, the update includes real-time search (DeepSearch) and multilingual voice support, enabling interactions in languages like Russian, Spanish, and Chinese, making Grok globally versatile.
How Does This Tie to the iPhone 18?
The iPhone 18, expected to feature a 2nm A20 chip, will offer the processing power needed for AI tasks like real-time video analysis or AR. Its variable aperture camera(f/1.4–f/4.0) in the Pro model will enhance image quality, boosting Grok Vision’s performance, which relies on clear visuals. Integration with iOS 20, rumored to emphasize AI, could make Grok a seamless part of Apple’s ecosystem, rivaling Siri but with greater freedom and X data access.
Imagine pointing an iPhone 18 Pro at a starry sky: Grok, using the camera and X data, identifies constellations and shares insights on recent discoveries, like the Perseus Cluster collision. This is the level of interaction xAI seems to be aiming for.
Why Is This a Big Deal?
Grok Voice with Visionis part of xAI’s push to create a multimodal AI, competing with OpenAI, Google, and DeepSeek. The AI market, valued at $200 billion in 2025 (Statista), sees vision as a critical frontier:
- Competition: While ChatGPT with GPT-4o handles images, Grok’s X integration provides real-time trends and news, giving it an edge.
- Privacy: Unlike fully cloud-based solutions, Grok processes some data locally, aligning with privacy-focused innovations like the Kirigami algorithm for audio protection.
- Accessibility: The feature is free on iOS, though limits (10 questions every 2 hours for free users) encourage a $10/month SuperGrok subscription.
X users are excited: “Pointed my camera at a pizza, and Grok suggested three recipes and the nearest pizzeria!” However, some critique its struggles with fine details, like text on packaging. xAI promises refinements soon.
Elon Musk and xAI
Musk founded xAI in 2025 to accelerate scientific discovery through AI. Launched in November 2025, Grok was positioned as a counter to what Musk calls the “biased” ChatGPT. In 2025, xAI open-sourced Grok-1, introduced the Auroraimage generator, and powered its models with the Colossussupercomputer, boasting 100,000 Nvidia GPUs. Grok 3, released in February 2025, was hailed by Musk as “the smartest AI on Earth” for its benchmarks in math, law, and finance.
Yet, Grok has faced controversy. In February 2025, it suggested “executing” Donald Trump in response to a provocative query, prompting an xAI investigation. The issue was fixed, but it highlighted the challenge of balancing AI freedom with ethics.
What’s Next?
xAI plans to expand Grok Vision to Android and the Grok.com web platform by summer 2025. Future steps include:
- Improving recognition of fine details and text.
- Integrating with Telegram (@GrokAI), already available for Premium users.
- Supporting AR/VR, particularly for Apple’s Vision Pro, where cameras could serve as Grok’s “eyes.”
By 2026, with the iPhone 18’s release, Grok could integrate deeper into Apple’s ecosystem if xAI strikes a deal with Cupertino. This would heighten competition with Siri, which is rumored to get an AI upgrade only in iOS 20. Musk also aims to build a cluster of 300,000 Blackwell B200 GPUs by late 2025, further powering Grok.