📄️ Basic Principles
Some of our models have multimodal capabilities. This means that they understand not only text, but also images. This capability opens up a host of possibilities that conventional machine learning methods can only achieve with costly data labeling and training. There are many applications for this technology, from object detection to image classification. Just click on the "Multimodal" tab in the Playground to get started!
📄️ Examples
Still not sure where to start? Find some completion example prompts below.