All Classes and Interfaces
Class
Description
Abstract class for the Bedrock API.
Encapsulates the metrics about the model invocation.
Abstract class that serves as a base for chat memory advisors.
Abstract
StructuredOutputConverter
implementation that uses a pre-configured
DefaultConversionService
to convert the LLM output into the desired type
format.Deprecated.
Abstract implementation of the
EmbeddingModel
interface that provides
dimensions calculation caching.AbstractFilterExpressionConverter is an abstract class that implements the
FilterExpressionConverter interface.
Deprecated, for removal: This API element is subject to removal in a future version.
since 1.0.0-M1 in favor of
AbstractToolCallSupport
The AbstractMessage class is an abstract implementation of the Message interface.
Deprecated.
Use the
AbstractMessageOutputConverter
instead.Abstract
StructuredOutputConverter
implementation that uses a pre-configured
MessageConverter
to convert the LLM output into the desired type format.Abstract base class for tool call support.
Represents the response received when requesting an access token.
The data of the chat client request that can be modified before the execution of the
ChatClient's call method
Parent advisor interface for all advisors.
Context used to store metadata for chat client advisors.
Interface for an
ObservationConvention
for chat client advisors.Java client for the Bedrock Jurassic2 chat model.
Ai21 Jurassic2 models version.
AI21 Jurassic2 chat request parameters.
Penalty with float scale value.
Penalty with integer scale value.
Ai21 Jurassic2 chat response.
Explains why the generation process was halted for a specific completion.
The generatedToken fields.
The prompt includes the raw text, the tokens with their log probabilities, and the top-K alternative tokens
at each position, if requested.
The textRange field indicates the start and end offsets of the token in the decoded text string.
Provides detailed information about each token in both the prompt and the completions.
The topTokens field is a list of the top K alternative tokens for this position, sorted by probability,
according to the topKReturn request parameter.
Collection of attribute keys used in AI observations (spans, metrics, events).
Collection of event names used in AI observations.
Collection of metric attributes used in AI observations.
Collection of metric names used in AI observations.
Metadata associated with an AI operation (e.g.
Types of operations performed by AI systems.
Collection of systems providing AI functionality.
Utility methods for creating native runtime hints.
Types of tokens produced and consumed in an AI operation.
AllOfGenerateResponseDetails
AllOfStreamResponseDetails
Based on Bedrock's Anthropic
Claude Messages API.
Anthropic models version.
AnthropicChatRequest encapsulates the request parameters for the Anthropic messages model.
AnthropicChatResponse encapsulates the response parameters for the Anthropic
messages model.
AnthropicChatStreamingResponse encapsulates the streaming response parameters for
the Anthropic messages model.
Encapsulates a delta.
The streaming type of this message.
Encapsulates the metrics about the model invocation.
Message comprising the conversation.
The role of the author of this message.
The source of the media content.
The type of this message.
Input messages.
Check the Models
overview and model
comparison for additional details and options.
The source of the media content.
The ContentBlock type.
The evnt type of the streamed chunk.
The role of the author of this message.
Special event used to aggregate multiple tool use events into a single event with
list of aggregated ContentBlockToolUse.
Usage statistics.
Anthropic models version.
AnthropicChatRequest encapsulates the request parameters for the Anthropic chat model.
AnthropicChatResponse encapsulates the response parameters for the Anthropic chat model.
The
ChatModel
implementation for the Anthropic service.The options to be used when sending a chat request to the Anthropic API.
Anthropic Chat autoconfiguration properties.
Anthropic API connection properties.
RateLimit
implementation for OpenAI.The AnthropicRuntimeHints class is responsible for registering runtime hints for
Anthropic API classes.
Usage
implementation for AnthropicApi.The ApiUtils class provides utility methods for working with API requests and
responses.
Lets the generative know the content was generated as a response to the user.
Represents a response returned by the AI.
Represents an audio transcription prompt for an AI model.
QianFan abstract authentication API.
Converts
Filter.Expression
into Azure Search OData filter syntax.AzureOpenAI audio transcription client implementation for backed by
OpenAIClient
.Segment of the transcribed text and its corresponding details.
Extracted word and it's corresponding timestamps.
Audio transcription metadata implementation for AzureOpenAI.
ChatModel
implementation for Microsoft Azure AI backed by
OpenAIClient
.The configuration information for a chat completions request.
Azure Open AI Embedding Model implementation.
The configuration information for the embedding requests.
Usage
implementation for Microsoft Azure OpenAI Service embedding.Represents the metadata for image generation using Azure OpenAI.
ImageModel
implementation for Microsoft Azure AI backed by
OpenAIClient
.The configuration information for a image generation request.
Configuration properties for Azure OpenAI image generation options.
Represents metadata associated with an image response from the Azure OpenAI image
model.
Utility enumeration for representing the response format that may be requested from the
Azure OpenAI model.
Usage
implementation for Microsoft Azure OpenAI Service chat.Uses Azure Cognitive Search as a backing vector store.
Contract for batching
Document
objects so that the call to embed them could be
optimized.An implementation of
StructuredOutputConverter
that transforms the LLM output
to a specific object type using JSON schema.Deprecated.
Use the
BeanOutputConverter
instead.Auto-configuration
for Bedrock Jurassic2 Chat Client.Java
ChatModel
for the Bedrock Jurassic2 chat generative model.Request body for the /complete endpoint of the Jurassic-2 API.
Penalty object for frequency, presence, and count penalties.
Configuration properties for Bedrock Ai21Jurassic2.
Auto-configuration
for Bedrock Anthropic Chat Client.Java
ChatModel
and StreamingChatModel
for the Bedrock Anthropic chat
generative.Configuration properties for Bedrock Anthropic Claude 3.
Auto-configuration
for Bedrock Anthropic Chat Client.Java
ChatModel
and StreamingChatModel
for the Bedrock Anthropic chat
generative.Configuration properties for Bedrock Anthropic.
Configuration properties for Bedrock AWS connection.
Auto-configuration
for Bedrock Cohere Chat Client.Bedrock Cohere Chat autoconfiguration properties.
Auto-configuration
for Bedrock Cohere Embedding Model.EmbeddingModel
implementation that uses the
Bedrock Cohere Embedding API.Bedrock Cohere Embedding autoconfiguration properties.
Auto-configuration
for Bedrock Llama Chat Client.Java
ChatModel
and StreamingChatModel
for the Bedrock Llama chat
generative.Configuration properties for Bedrock Llama.
The BedrockRuntimeHints class is responsible for registering runtime hints for Bedrock
AI API classes.
Auto-configuration
for Bedrock Titan Chat Client.Bedrock Titan Chat autoconfiguration properties.
Auto-configuration
for Bedrock Titan Embedding Model.EmbeddingModel
implementation that uses the
Bedrock Titan Embedding API.Bedrock Titan Embedding autoconfiguration properties.
Usage
implementation for Bedrock API.BestOfSequence
The Call Around Advisor Chain is used to invoke the next Around Advisor in the chain.
Create a CassandraChatMemory like
CassandraChatMemory.create(CassandraChatMemoryConfig.builder().withTimeToLive(Duration.ofDays(1)).build());
For example @see org.springframework.ai.chat.memory.CassandraChatMemoryGiven a string sessionId, return the value for each primary key column.
The CassandraVectorStore is for managing and querying vector data in an Apache
Cassandra db.
Indexes are automatically created with COSINE.
Configuration for the Cassandra vector store.
Given a string document id, return the value for each primary key column.
Given a list of primary key column values, return the document id.
The Categories class represents a set of categories used to classify content.
This class represents the scores for different categories of content.
Client to perform stateless requests to an AI Model, using a fluent API.
A mutable builder for creating a
ChatClient
.Auto-configuration
for ChatClient
.Builder for configuring a
ChatClient.Builder
.Configuration properties for the chat client builder.
Callback interface that can be used to customize a
ChatClient.Builder
.An
ObservationFilter
to include the chat prompt content in the observation.Context used to store metadata for chat client workflows.
Interface for an
ObservationConvention
for chat client workflows.Documented conventions for chat client observations.
Abstract Data Type (ADT) encapsulating information on the completion choices in the AI
response.
The ChatMemory interface represents a storage for chat conversation history.
An
ObservationFilter
to include the chat completion content in the observation.Handler for including the chat completion content in the observation as a span event.
Marker interface, to be used to store info on the model such as the current context
length.
Handler for generating metrics from chat model observations.
Utilities to process the prompt and completion content in observations for chat models.
Context used to store metadata for chat model exchanges.
Interface for an
ObservationConvention
for chat model exchanges.Documented conventions for chat model observations.
Events for chat model operations.
High-cardinality observation key names for chat model operations.
Low-cardinality observation key names for chat model operations.
An
ObservationFilter
to include the chat prompt content in the observation.Handler for including the chat prompt content in the observation as a span event.
Auto-configuration for Spring AI chat model observations.
Configuration properties for chat model observations.
The ChatOptions represent the common options, portable across different chat models.
A PromptTemplate that lets you specify the role as a string should the current
implementations and their roles not suffice for your needs.
The chat completion (e.g.
Models common AI provider metadata returned in an AI response.
Single-class Chroma API implementation based on the (unofficial) Chroma REST API.
Add embeddings to the chroma data store.
Chroma embedding collection.
Request to create a new collection with the given name and metadata.
Request to delete embedding from a collection.
Single query embedding response.
Object containing the get embedding results.
Get embeddings from a collection.
Request to get the nResults nearest neighbor embeddings for provided
queryEmbeddings.
A QueryResponse object containing the query results.
Converts
Filter.Expression
into Chroma metadata filter expression format.ChromaVectorStore
is a concrete implementation of the VectorStore
interface.Java client for the Bedrock Cohere chat model.
Cohere models version.
CohereChatRequest encapsulates the request parameters for the Cohere command model.
Builder for the CohereChatRequest.
Prevents the model from generating unwanted tokens or incentivize the model to include desired tokens.
(optional) Specify how and if the token likelihoods are returned with the response.
Specifies how the API handles inputs longer than the maximum token length.
CohereChatResponse encapsulates the response parameters for the Cohere command model.
Generated result along with the likelihoods for tokens requested.
The reason the response finished being generated.
Cohere Embedding API.
Cohere Embedding model ids.
The Cohere Embed model request.
Cohere Embedding API input types.
Specifies how the API handles inputs longer than the maximum token length.
Cohere Embedding response.
CompatGenerateRequest
Data structure that contains content and metadata.
Converts the Document text and metadata into an AI, prompt-friendly text
representation.
ContentFormatTransformer processes a list of documents by applying a content formatter
to each document.
Implementation of the
CallAroundAdvisorChain
and
StreamAroundAdvisorChain
.The default implementation of
ChatClient
as created by the
ChatClient.Builder.build()
} method.DefaultChatClientBuilder is a builder class for creating a ChatClient.
Default conventions to populate observations for chat client workflows.
Default conventions to populate observations for chat model operations.
Default conventions to populate observations for embedding model operations.
Default conventions to populate observations for image model operations.
Default conventions to populate observations for vector store operations.
Details
A document is a container for the content and metadata of a document.
EmbeddingModel is a generic interface for embedding models.
Represents a request to embed a list of documents.
ElasticsearchAiSearchFilterExpressionConverter is a class that converts
Filter.Expression objects into Elasticsearch query string representation.
The ElasticsearchVectorStore class implements the VectorStore interface and provides
functionality for managing and querying documents in Elasticsearch.
Provided Elasticsearch vector option configuration.
Represents a single embedding vector.
EmbeddingModel is a generic interface for embedding models.
Handler for generating metrics from embedding model observations.
Context used to store metadata for embedding model exchanges.
Interface for an
ObservationConvention
for embedding model exchanges.Documented conventions for embedding model observations.
High-cardinality observation key names for embedding model operations.
Low-cardinality observation key names for embedding model operations.
Auto-configuration for Spring AI embedding model observations.
Embedding response object.
Common AI provider metadata returned in an embedding response.
A RateLimit implementation that returns zero for all property getters
A EmpytUsage implementation that returns zero for all property getters
ErrorResponse
Represents an evaluation request, which includes the user's text, a list of content
data, and a chat response.
A utility to reformat extracted text content before encapsulating it in a
Document
.The
Builder
class is a nested static class of
ExtractedTextFormatter
designed to facilitate the creation and
customization of instances of ExtractedTextFormatter
.The FactCheckingEvaluator class implements a method for evaluating the factual accuracy
of Large Language Model (LLM) responses against provided context.
Writes the content of a list of
Document
s into a file.Portable runtime generative for metadata filter expressions.
Triple that represents and filter boolean expression as
left type right
.Filter expression operations.
Represents expression grouping (e.g.
String identifier representing an expression key.
Mark interface representing the supported expression types:
Filter.Key
,
Filter.Value
, Filter.Expression
and Filter.Group
.Represents expression value constant or constant array.
DSL builder for
Filter.Expression
instances.Converters a generic, portable
Filter.Expression
into a
VectorStore
specific expression language
format.Parse a textual, vector-store agnostic, filter expression language into
Filter.Expression
.Helper class providing various boolean transformation.
This class provides an empty implementation of
FiltersListener
, which can be
extended to create a listener which only needs to handle a subset of the available
methods.This class provides an empty implementation of
FiltersVisitor
, which can be
extended to create a visitor which only needs to handle a subset of the available
methods.This interface defines a complete listener for a parse tree produced by
FiltersParser
.This interface defines a complete generic visitor for a parse tree produced by
FiltersParser
.Gets or Sets FinishReason
This class extends PDFTextStripper to provide custom text extraction and formatting
capabilities for PDF pages.
Implementations of this interface provides instructions for how the output of a
language generative should be formatted.
Deprecated.
Use the
FormatProvider
instead.Represents a model function call handler.
A Spring
ApplicationContextAware
implementation that provides a way to retrieve
a Function
from the Spring context and wrap it into a FunctionCallback
.Note that the underlying function is responsible for converting the output into format
that can be consumed by the Model.
Builder for
FunctionCallingOptions
.A VectorStore implementation backed by GemFire.
GenerateParameters
GenerateRequest
GenerateResponse
Represents a response returned by the AI.
The Generation class represents a response from a moderation process.
The SAP HANA Cloud vector engine offers multiple use cases in AI scenarios.
The
HanaCloudVectorStoreConfig
class represents the configuration for the
HanaCloudVectorStore.The
HanaVectorEntity
is an abstract class that represents a mapped superclass
for entities that have a vector representation stored in a HANA vector repository.The
HanaVectorRepository
interface provides methods for interacting with a HANA
vector repository, which allows storing and querying vector embeddings.An implementation of
ChatModel
that interfaces with HuggingFace Inference
Endpoints for text generation.Interface for generating unique document IDs.
Context used to store metadata for image model exchanges.
Interface for an
ObservationConvention
for image model exchanges.Documented conventions for image model observations.
Events for image model operations.
High-cardinality observation key names for image model operations.
Low-cardinality observation key names for image model operations.
An
ObservationFilter
to include the image prompt content in the observation.Auto-configuration for Spring AI image model observations.
Configuration properties for image model observations.
ImageOptions represent the common options, portable across different image generation
models.
The image completion (e.g.
Represents metadata associated with an image response.
Info
The InMemoryChatMemory class is an implementation of the ChatMemory interface that
represents an in-memory storage for chat conversation history.
A SHA-256 based ID generator that returns the hash as a UUID.
A class that reads JSON documents and converts them into a list of
Document
objects.Utility class for JSON processing.
Estimates the number of tokens in a given text or message using the JTokkit encoding
library.
Keyword extractor that uses generative to extract 'excerpt_keywords' metadata field.
Returns a new list of content (e.g list of messages of list of documents) that is a
subset of the input list of contents and complies with the max token size constraint.
StructuredOutputConverter
implementation that uses a
DefaultConversionService
to convert the LLM output into a
List
instance.Deprecated.
Use the
ListOutputConverter
instead.Java client for the Bedrock Llama chat model.
Llama models version.
LlamaChatRequest encapsulates the request parameters for the Meta Llama chat model.
LlamaChatResponse encapsulates the response parameters for the Meta Llama chat model.
The reason the response finished being generated.
StructuredOutputConverter
implementation that uses a pre-configured
MappingJackson2MessageConverter
to convert the LLM output into a
java.util.Map<String, Object> instance.Deprecated.
Use the
MapOutputConverter
instead.Reads the given Markdown resource and groups headers, paragraphs, or text divided by
horizontal lines (depending on the
MarkdownDocumentReaderConfig.horizontalRuleCreateDocument
configuration) into
Document
s.Common configuration for the
MarkdownDocumentReader
.The Media class represents the data and metadata of a media attachment in a message.
Utility class for merging ChatCompletions instances and their associated objects.
The Message interface represents a message that can be sent or received in a chat
application.
Helper that for streaming chat responses, aggregate the chat response messages into a
single AssistantMessage.
Memory is retrieved added as a collection of messages to the prompt
Converts a list of messages to a prompt for bedrock models.
The MessageType enum represents the type of message in a chat application.
Converts
Filter.Expression
into Milvus metadata filter expression format.Parameters for Milvus client connection.
Configuration for the Milvus vector store.
Gemini supports the following MIME types:
image/gif
image/png
image/jpeg
video/mov
video/mpeg
video/mp4
video/mpg
video/avi
video/wmv
video/mpegps
video/flv
https://cloud.google.com/vertex-ai/docs/generative-ai/model-reference/gemini
Single class implementation of the MiniMax Chat Completion API and
MiniMax Embedding API.
Represents a chat completion response returned by model, based on the provided input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
An array of content parts with a defined type.
The role of the author of this message.
The relevant tool call.
Creates a model response for the given chat conversation.
An object specifying the format that the model must output.
Helper factory that creates a tool_choice of type 'none', 'auto' or selected function by name.
MiniMax Chat Completion Models:
MiniMax Model.
List of multiple embedding responses.
MiniMax Embeddings Models:
Embeddings.
Creates an embedding vector representing the input text.
MiniMax Embeddings Types
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Log probability information for the choice.
Message content tokens with log probability information.
The most likely tokens and their log probability, at this token position.
Usage statistics for the completion request.
Common value constants for MiniMax api.
MiniMaxChatOptions represents the options for performing chat completion using the
MiniMax API.
MiniMax Embedding Model implementation.
This class represents the options for MiniMax embedding.
The MiniMaxRuntimeHints class is responsible for registering runtime hints for MiniMax
API classes.
Helper class to support Streaming function calling.
Usage
implementation for MiniMax.Single-class, Java Client library for Mistral AI platform.
Represents a chat completion response returned by model, based on the provided
input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based
on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
The role of the author of this message.
The relevant tool call.
Creates a model request for chat conversation.
An object specifying the format that the model must output.
Specifies a tool the model should use.
List of well-known Mistral chat models.
Represents an embedding vector returned by embedding endpoint.
List of multiple embedding responses.
List of well-known Mistral embedding models.
Creates an embedding vector representing the input text.
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Log probability information for the choice.
Message content tokens with log probability information.
The most likely tokens and their log probability, at this token position.
Usage statistics.
Represents a Mistral AI Chat Model.
Provides the Mistral AI Embedding Model.
The MistralAiRuntimeHints class is responsible for registering runtime hints for
Mistral AI API classes.
Helper class to support Streaming function calling.
Usage
implementation for Mistral AI.The Model interface provides a generic API for invoking AI models.
Describes an AI model's basic characteristics.
Context used when sending a request to a machine learning model and waiting for a
response from the model provider.
Interface representing the customizable options for AI model interactions.
Utility class for manipulating
ModelOptions
objects.Interface representing a request to an AI model.
Interface representing the response received from an AI model.
This interface provides methods to access the main output of the AI model and the
metadata associated with this result.
Generate metrics about the model usage in the context of an AI operation.
The Moderation class represents the result of a moderation process.
An interface that represents metadata associated with the results of a moderation
generation process.
Represents a single message intended for moderation, encapsulating the text content.
The ModerationModel interface defines a generic AI model for moderation.
Represents the options for moderation.
A builder class for creating instances of ModerationOptions.
Represents a prompt for moderation containing a single message and the options for the
moderation model.
Represents a response from a moderation process, encapsulating the moderation metadata
and the generated content.
Defines the metadata associated with a moderation response, extending a base response
interface.
Represents the result of a moderation process, indicating whether content was flagged,
the categories of moderation, and detailed scores for each category.
Converts
Filter.Expression
into MongoDB Atlas metadata filter expression
format.Single-class, Java Client library for Moonshot platform.
Represents a chat completion response returned by model, based on the provided
input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based
on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
The role of the author of this message.
The relevant tool call.
Creates a model response for the given chat conversation.
Helper factory that creates a tool_choice of type 'none', 'auto' or selected
function by name.
Moonshot Chat Completion Models:
MOONSHOT_V1_8K - moonshot-v1-8k
MOONSHOT_V1_32K - moonshot-v1-32k
MOONSHOT_V1_128K - moonshot-v1-128k
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Usage statistics.
The MoonshotRuntimeHints class is responsible for registering runtime hints for
Moonshot API classes.
Helper class to support Streaming function calling.
Converts
Filter.Expression
into Neo4j condition expression format.An enum to configure the distance function used in the Neo4j vector index.
Configuration for the Neo4j vector store.
Root of the hierarchy of Model access exceptions that are considered non-transient -
where a retry of the same operation would fail unless the cause of the Exception is
corrected.
EmbeddingModel
implementation that uses the
OCI GenAI Embedding API.The configuration information for OCI embedding requests
Java Client for the Ollama API.
Chat request object.
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Ollama chat response object.
Deprecated, for removal: This API element is subject to removal in a future version.
Use
OllamaApi.EmbeddingsRequest
instead.Deprecated, for removal: This API element is subject to removal in a future version.
Use
OllamaApi.EmbeddingsResponse
instead.Generate embeddings from a model.
The response object returned from the /embedding endpoint.
Deprecated, for removal: This API element is subject to removal in a future version.
Deprecated, for removal: This API element is subject to removal in a future version.
Chat message object.
The role of the message in the conversation.
The relevant tool call.
The function definition.
Auto-configuration
for Ollama Chat Client.ChatModel
implementation for Ollama.Ollama Chat autoconfiguration properties.
Ollama connection autoconfiguration properties.
EmbeddingModel
implementation for Ollama.Ollama Embedding autoconfiguration properties.
Helper class for common Ollama models.
Helper class for creating strongly-typed Ollama options.
The OllamaRuntimeHints class is responsible for registering runtime hints for Ollama AI
API classes.
Usage
implementation for OllamaSingle class implementation of the
OpenAI Chat Completion
API and OpenAI
Embedding API.
Represents a chat completion response returned by model, based on the provided
input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based
on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
An array of content parts with a defined type.
The role of the author of this message.
The relevant tool call.
Creates a model response for the given chat conversation.
An object specifying the format that the model must output.
JSON schema object that describes the format of the JSON object.
Helper factory that creates a tool_choice of type 'none', 'auto' or selected function by name.
Represents an embedding vector returned by embedding endpoint.
List of multiple embedding responses.
OpenAI Embeddings Models:
Embeddings.
Creates an embedding vector representing the input text.
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Log probability information for the choice.
Message content tokens with log probability information.
The most likely tokens and their log probability, at this token position.
Usage statistics for the completion request.
Breakdown of tokens used in a completion
Thrown on 4xx client errors, such as 401 - Incorrect API key provided, 401 - You must
be a member of an organization to use the API, 429 - Rate limit reached for requests,
429 - You exceeded your current quota , please check your plan and billing details.
Common value constants for OpenAI api.
Enumeration
of OpenAI API response headers.Turn audio into text or text into audio.
Request to generates audio from the input text.
The format to audio in.
Builder for the SpeechRequest.
The voice to use for synthesis.
The Transcription
Object represents a verbose json transcription response returned by model,
based on the provided input.
Segment of the transcribed text and its corresponding details.
Extracted word and it corresponding timestamps.
Request to transcribe an audio file to text.
The format of the transcript and translation outputs, in one of these options:
json, text, srt, verbose_json, or vtt.
Request to translate an audio file to English.
TTS is an AI model that converts text to natural sounding spoken text.
Whisper is a
general-purpose speech recognition model.
OpenAI audio speech client implementation for backed by
OpenAiAudioApi
.Options for OpenAI text to audio - speech synthesis.
Configuration properties for OpenAI audio speech.
Audio speech metadata implementation for OpenAI.
OpenAI audio transcription client implementation for backed by
OpenAiAudioApi
.Audio transcription metadata implementation for OpenAI.
Open AI Embedding Model implementation.
OpenAI Image API.
OpenAI Image API model.
OpenAiImageModel is a class that implements the ImageModel interface.
OpenAI Image API options.
OpenAI Image autoconfiguration properties.
OpenAI Moderation API.
OpenAiModerationModel is a class that implements the ModerationModel interface.
OpenAI Moderation API options.
OpenAI Moderation autoconfiguration properties.
RateLimit
implementation for OpenAI.Utility used to extract known HTTP response headers for the OpenAI API.
The OpenAiRuntimeHints class is responsible for registering runtime hints for OpenAI
API classes.
Helper class to support Streaming function calling.
Usage
implementation for OpenAI.
Integration of Oracle database 23ai as a Vector Store.
Deprecated.
Use the
StructuredOutputConverter
instead.Groups the parsed PDF pages into
Document
s.The ParagraphManager class is responsible for managing the paragraphs and hierarchy of
a PDF document.
Represents a document paragraph metadata and hierarchy.
Uses the PDF catalog (e.g.
Utility methods for
String
parsing.Common configuration builder for the
PagePdfDocumentReader
and the
ParagraphPdfDocumentReader
.Re-implement the PDFLayoutTextStripperByArea on top of the PDFLayoutTextStripper
instead the original PDFTextStripper.
The PdfReaderRuntimeHints class is responsible for registering runtime hints for PDFBox
resources.
Converts
Filter.Expression
into PgVector metadata filter expression format.Uses the "vector_store" table to store the Spring AI vector data.
Defaults to CosineDistance.
By default, pgvector performs exact nearest neighbor search, which provides perfect
recall.
Converts
Filter.Expression
into Pinecone metadata filter expression format.A VectorStore implementation backed by Pinecone, a cloud-based vector database.
Configuration class for the PineconeVectorStore.
Auto-configuration class for PostgresMlEmbeddingModel.
PostgresML EmbeddingModel
Configuration properties for Postgres ML.
PrefillToken
Converts
Filter.Expression
into test string format.The Prompt class represents a prompt used in AI model requests.
Memory is retrieved added into the prompt's system text.
Abstract Data Type (ADT) modeling metadata gathered by the AI during request
processing.
Abstract Data Type (ADT) modeling filter metadata for all prompts sent during an AI
request.
Qdrant vectorStore implementation.
Deprecated, for removal: This API element is subject to removal in a future version.
since 1.0.0 in favor of
QdrantVectorStore
.Represents an access token for the QianFan API.
Single class implementation of the QianFan Chat Completion API and Embedding API.
Represents a chat completion response returned by model, based on the provided input.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Message comprising the conversation.
The role of the author of this message.
Creates a model response for the given chat conversation.
An object specifying the format that the model must output.
QianFan Chat Completion Models:
QianFan Model.
Represents an embedding vector returned by embedding endpoint.
List of multiple embedding responses.
QianFan Embeddings Models:
Embeddings.
Creates an embedding vector representing the input text.
Usage statistics for the completion request.
QianFanAuthenticator is a class that authenticates and requests access token for the
QianFan API.
QianFanChatOptions represents the options for performing chat completion using the
QianFan API.
The ApiUtils class provides utility methods for working with API requests and
responses.
QianFan Embedding Client implementation.
This class represents the options for QianFan embedding.
QianFan Image API.
QianFan Image API model.
QianFanImageModel is a class that implements the ImageModel interface.
QianFan Image API options.
QianFan Image autoconfiguration properties.
The QianFanRuntimeHints class is responsible for registering runtime hints for QianFan
API classes.
Usage
implementation for QianFan.Context for the question is retrieved from a Vector Store and added to the prompt's
user text.
A random ID generator that returns a UUID.
Abstract Data Type (ADT) encapsulating metadata from an AI provider's API rate limits
granted to the API key in use and the API key's current balance.
Converts
Filter.Expression
into Redis search filter expression format.The RedisVectorStore is for managing and querying vector data in a Redis database.
Configuration for the Redis vector store.
Deprecated.
since 1.0.0 M3 please use
CallAroundAdvisor
or
StreamAroundAdvisor
instead.Service that helps caching remote
Resource
s on the local file system.Miscellaneous Resource utility methods.
Represents a
Model
response that includes the
entire response along withe specified response entity type.Interface representing metadata associated with an AI model's response.
Interface representing metadata associated with the results of an AI model.
RetryUtils is a utility class for configuring and handling retry operations.
A
CallAroundAdvisor
and StreamAroundAdvisor
that filters out the
response if the user input contains any of the sensitive words.Similarity search request builder.
https://www.elastic.co/guide/en/elasticsearch/reference/master/dense-vector.html
max_inner_product is currently not supported because the distance value is not
normalized and would not comply with the requirement of being between 0 and 1
A simple logger advisor that logs the request and response messages.
SimpleVectorStore is a simple implementation of the VectorStore interface.
The Speech class represents the result of speech synthesis from an AI model.
The
SpeechMessage
class represents a single text message to be converted to
speech by the OpenAI TTS API.The
SpeechModel
interface provides a way to interact with the OpenAI
Text-to-Speech (TTS) API.The
SpeechPrompt
class represents a request to the OpenAI Text-to-Speech (TTS)
API.Creates a new instance of SpeechResponse with the given speech result.
Types of Spring AI constructs which can be observed.
Exponential Backoff properties.
Represents the StabilityAI API.
Represents metadata associated with the image generation process in the StabilityAI
framework.
StabilityAiImageModel is a class that implements the ImageModel interface.
StabilityAiImageOptions is an interface that extends ImageOptions.
The StreamAroundAdvisorChain is used to delegate the call to the next
StreamAroundAdvisor in the chain.
StreamDetails
Helper class to support streaming function calling.
The StreamingModel interface provides a generic API for invoking an AI models with
streaming response.
The
StreamingSpeechModel
interface provides a way to interact with the OpenAI
Text-to-Speech (TTS) API using a streaming approach, allowing you to receive the
generated audio in a real-time fashion.StreamResponse
Converts the (raw) LLM output into a structured responses of type.
Enum representing different styles for images.
Title extractor with adjacent sharing that uses generative to extract
'section_summary', 'prev_section_summary', 'next_section_summary' metadata fields.
A message of the type 'system' passed as input.
A document reader that leverages Apache Tika to extract text from a variety of document
formats, such as PDF, DOC/DOCX, PPT/PPTX, and HTML.
Java client for the Bedrock Titan chat model.
Titan models version.
TitanChatRequest encapsulates the request parameters for the Titan chat model.
Titan request text generation configuration.
TitanChatResponse encapsulates the response parameters for the Titan chat model.
The reason the response finished being generated.
Titan response result.
Titan chat model streaming response.
Java client for the Bedrock Titan Embedding model.
Titan Embedding model ids.
Titan Embedding request parameters.
TitanEmbeddingRequest builder.
Titan Embedding response.
Token
Token count based strategy implementation for
BatchingStrategy
.Estimates the number of tokens in a given text or message.
Helper class that reuses the
AbstractToolCallSupport
to implement the function
call handling logic on the client side.Helper used to provide only the function definition, without the actual function
call implementation.
Represents the context for tool execution in a function calling scenario.
The ToolResponseMessage class represents a message with a function content in a chat
application.
Utilities to prepare and process traces for observability.
An implementation of the AbstractEmbeddingModel that uses ONNX-based Transformer models
for text embeddings.
Configurations for the
HuggingFaceTokenizer
used to convert sentences into
tokens.Root of the hierarchy of Model access exceptions that are considered transient - where
a previously failed operation might be able to succeed when the operation is retried
without any intervention by application-level functionality.
A utility class that provides methods for resolving types and classes related to
functions.
Converts
Filter.Expression
into Typesense metadata filter expression format.Abstract Data Type (ADT) encapsulating metadata on the usage of an AI provider's API
per AI request.
A message of the type 'user' passed as input Messages with the user role are from the
end-user or developer.
The
VectorStore
interface defines the operations for managing and querying
documents in a vector database.Memory is retrieved from a VectorStore added into the prompt's system text.
Collection of attribute keys used in vector store observations (spans, metrics,
events).
Auto-configuration for Spring AI vector store observations.
Utilities to process the query content in observations for vector store operations.
Context used to store metadata for vector store operations.
Documented conventions for vector store observations.
High-cardinality observation key names for vector store operations.
Low-cardinality observation key names for vector store operations.
Collection of event names used in vector store observations.
Configuration properties for vector store observations.
Collection of systems providing vector store functionality.
An
ObservationFilter
to include the Vector Store search response content in the
observation.Handler for including the query response content in the observation as a span event.
Types of similarity metrics used in vector store operations.
Auto-configuration for Vertex AI Gemini Chat.
VertexAiEmbeddingConnectionDetails represents the details of a connection to the Vertex
AI embedding service.
Configuration properties for Vertex AI Embedding.
Utility class for constructing parameter objects for Vertex AI embedding requests.
Auto-configuration for Vertex AI Gemini Chat.
Configuration properties for Vertex AI Gemini Chat.
Configuration properties for Vertex AI Gemini Chat.
The VertexAiGeminiRuntimeHints class is responsible for registering runtime hints for
Vertex AI Gemini API classes.
Implementation of the Vertex AI Multimodal Embedding Model.
VertexAI Embedding Models: - Text
embeddings - Multimodal
embeddings
Class representing the options for Vertex AI Multimodal Embedding.
Configuration properties for Vertex AI Gemini Chat.
Vertex AI API client for the Generative Language model.
A list of floats representing the embedding.
Message generation request body.
The response from the model.
Content filtering metadata associated with processing a single request.
Reasons why content may have been blocked.
The base unit of structured text.
A collection of source attributions for a piece of content.
A citation to a source for a portion of a specific response.
All of the structured input text passed to the model as a prompt.
An input/output example used to instruct the Model.
Information about a Generative Language Model.
API error response.
Error details.
A class representing a Vertex AI Text Embedding Model.
VertexAI Embedding Models: - Text
embeddings - Multimodal
embeddings
Configuration properties for Vertex AI Gemini Chat.
The VertexRuntimeHints class is responsible for registering runtime hints for Vertex AI
API classes.
Java Client for the watsonx.ai API.
WatsonX.ai autoconfiguration class.
ChatModel
implementation for watsonx.ai.Helper class for creating watsonx.ai options.
Chat properties for Watsonx.AI Chat.
Java class for Watsonx.ai Chat Request object.
Java class for Watsonx.ai Chat Response object.
Java class for Watsonx.ai Chat Results object.
WatsonX.ai connection autoconfiguration properties.
EmbeddingModel
implementation for Watsonx.ai.The configuration information for the embedding requests.
Watsonx.ai Embedding autoconfiguration properties.
Java class for Watsonx.ai Embedding Request object.
Java class for Watsonx.ai Embedding Response object.
Java class for Watsonx.ai Embedding Results object.
The WatsonxAiRuntimeHints class is responsible for registering runtime hints for
Watsonx AI API classes.
Converts
Filter.Expression
into Weaviate metadata filter expression format.A VectorStore implementation backed by Weaviate vector database.
Configuration class for the WeaviateVectorStore.
https://weaviate.io/developers/weaviate/concepts/replication-architecture/consistency#tunable-consistency-strategies
Single class implementation of the ZhiPuAI Chat Completion API and
ZhiPuAI Embedding API.
Represents a chat completion response returned by model, based on the provided input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
An array of content parts with a defined type.
The role of the author of this message.
The relevant tool call.
Creates a model response for the given chat conversation.
An object specifying the format that the model must output.
Helper factory that creates a tool_choice of type 'none', 'auto' or selected function by name.
ZhiPuAI Chat Completion Models:
ZhiPuAI Model.
Represents an embedding vector returned by embedding endpoint.
List of multiple embedding responses.
ZhiPuAI Embeddings Models:
Embeddings.
Creates an embedding vector representing the input text.
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Log probability information for the choice.
Message content tokens with log probability information.
The most likely tokens and their log probability, at this token position.
Usage statistics for the completion request.
ZhiPuAiChatOptions represents the options for the ZhiPuAiChat model.
ZhiPuAI Embedding Model implementation.
The ZhiPuAiEmbeddingOptions class represents the options for ZhiPuAI embedding.
ZhiPuAI Image API.
ZhiPuAI Image API model.
ZhiPuAiImageModel is a class that implements the ImageModel interface.
ZhiPuAiImageOptions represents the options for image generation using ZhiPuAI image
model.
The ZhiPuAiRuntimeHints class is responsible for registering runtime hints for ZhiPu AI
API classes.
Helper class to support Streaming function calling.
Usage
implementation for ZhiPuAI.Common value constants for ZhiPu api.
AbstractConversionServiceOutputConverter
instead.