OpenAI is adding a feature to ChatGPT that lets the chatbot analyze and respond to real-time video feeds. The feature, announced during a livestream event, arrives seven months after the technology was first previewed.
The updated ChatGPT will be able to identify objects through a smartphone camera and respond verbally to what it sees. For instance, users could get help replying to messages in open applications or step-by-step guidance for tasks like making coffee.
This new video functionality is set to be available starting Thursday for paid subscribers of ChatGPT Plus and Pro, with plans for a rollout to OpenAI’s enterprise and educational customers in January.
OpenAI’s launch of ChatGPT two years ago was a pivotal moment for text-based chatbots and sparked investment in the field. The company and its competitors are now developing multimodal features that combine audio, images, and video to make digital assistants more engaging.
This announcement is part of a series of product reveals by OpenAI, which includes the introduction of a new subscription plan and an AI-powered video generation tool named Sora.
Overall, the addition of real-time video interaction makes ChatGPT more versatile and offers users a more personalized experience. Beyond improving user engagement, the feature opens the way for applications that could improve productivity and help with everyday tasks.