Back to Search
Start Over
OpenAI announces new multimodal desktop GPT with new voice and vision capabilities.
- Source :
-
Computerworld (Online Only) . 5/13/2024, p1-6. 6p. - Publication Year :
- 2024
-
Abstract
- OpenAI has announced the release of GPT-4o, a new language model that can interact with users through text, voice, and visual prompts. The model can recognize and respond to screenshots, photos, documents, and charts, as well as facial expressions and handwritten information. It has improved response times, matching human conversation speeds, and offers better performance in vision and audio understanding. While some analysts believe OpenAI is catching up to competitors, GPT-4o demonstrates impressive conversational capabilities and multilingual proficiency. The model will be rolled out iteratively, with extended access for developers and new features for paying users. [Extracted from the article]
Details
- Language :
- English
- ISSN :
- 00104841
- Database :
- Academic Search Index
- Journal :
- Computerworld (Online Only)
- Publication Type :
- Periodical
- Accession number :
- 177300773