📄️ Basic Principles
All our models have multimodal capabilities. This means that they understand not only text, but also images. This capability opens up a host of possibilities that conventional machine learning methods can only achieve with costly data labeling and training. There are many applications for this technology, from Optical Character Recognition (OCR) to image classification. Just click on the "Multimodal" tab in the Playground to get started!
Still not sure where to start? Find some example prompts below.