TypingMind | GPT Vision
GPT-4 with Vision allows the model to take in images and answer questions about them. Historically, language model systems have been limited by taking in a single input modality, text. For many use cases, this constrained the areas where models like GPT-4 could be used. GPT-4 with vision is an augmentative set of capabilities for the model.
TypingMind | GPT Vision endpoints
| Method | Endpoint | Description |
|---|---|---|
| POST |
GPT Vision / |
Allows the model to take in images and answer questions about them |
TypingMind | GPT Vision pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | $1 / month | 100 / minute |
|
| PRO | $5 / month | 200 / minute |
|
| ULTRA Recommended | $25 / month | 300 / minute |
|
| MEGA | $50 / month | 500 / minute |
|