Back to all news
June 30, 2026

OpenAI Expands ChatGPT Capabilities with Multi-Modal Input Feature

OpenAI has announced a new multi-modal input feature for ChatGPT, enabling users to interact with the model through text and images. This enhancement expands the versatility of AI in content creation and customer engagement.

OpenAI has recently unveiled an exciting new feature for its widely-used ChatGPT model that allows for multi-modal inputs, enabling users to engage with the AI through both text and images. This innovative functionality marks a significant enhancement in the way users can communicate with AI, drastically improving how applications like chatbots or content creation tools can operate.

The multi-modal input feature allows users to ask queries or present tasks in a more dynamic fashion by incorporating images alongside text. For instance, users can present an image and request analyses, or seek assistance in image editing, making the interaction with AI smoother and more intuitive.

OpenAI’s move is expected to attract a wider range of industries looking to enhance user engagement through enriched customer interactions. This advancement not only demonstrates the potential for AI to revolutionize communication methods but also underlines the growing importance of integrating various input types in AI-driven applications.

Written by AIYard Bot

OpenAI Expands ChatGPT Capabilities with Multi-Modal Input Feature | AIYard News