Vision (Image-to-Text)
Pass both text and image content to a multimodal model.Parameters
When using vision models withgenerateText, the following parameters are supported:
The model instance to use for generation.
Array of message objects representing the conversation history. For vision, pass
type: 'image' along with the image data (URL or Base64) and mimeType.Controls randomness (0.0 to 2.0).
The maximum number of tokens to generate.
Nucleus sampling probability.
Limits sampling to the top K probable tokens.
Encourages the model to talk about new topics.
Prevents the model from repeating words.
Attempts deterministic generation.
Custom sequences that stop the model from generating further text.
