LMMs
GPT-4 Vision
Sends an image and a textual prompt to the OpenAI API and returns the API's textual response.
This function integrates an image with a user-defined prompt to generate a response using OpenAI's API.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
api_key |
str
|
The API key for authenticating requests to the OpenAI API. |
required |
image |
ndarray
|
The image to be sent to the API. used in OpenCV. |
required |
prompt |
str
|
The textual prompt to accompany the image in the API request. |
required |
Returns:
Name | Type | Description |
---|---|---|
str |
str
|
The textual response from the OpenAI API based on the input image and prompt. |
Raises:
Type | Description |
---|---|
ValueError
|
If there is an error in encoding the image or if the API response contains an error. |
Source code in maestro/lmms/gpt4.py
74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 |
|