All Classes and Interfaces

Class
Description
Abstract class for the Bedrock API.
Encapsulates the metrics about the model invocation.
Abstract class that serves as a base for chat memory advisors.
 
Abstract StructuredOutputConverter implementation that uses a pre-configured DefaultConversionService to convert the LLM output into the desired type format.
Deprecated.
Abstract implementation of the EmbeddingModel interface that provides dimensions calculation caching.
AbstractFilterExpressionConverter is an abstract class that implements the FilterExpressionConverter interface.
Deprecated, for removal: This API element is subject to removal in a future version.
since 1.0.0-M1 in favor of AbstractToolCallSupport
The AbstractMessage class is an abstract implementation of the Message interface.
Deprecated.
Abstract StructuredOutputConverter implementation that uses a pre-configured MessageConverter to convert the LLM output into the desired type format.
 
 
Abstract base class for tool call support.
Represents the response received when requesting an access token.
The data of the chat client request that can be modified before the execution of the ChatClient's call method
 
 
 
Parent advisor interface for all advisors.
Context used to store metadata for chat client advisors.
 
 
Interface for an ObservationConvention for chat client advisors.
 
 
 
Java client for the Bedrock Jurassic2 chat model.
Ai21 Jurassic2 models version.
AI21 Jurassic2 chat request parameters.
 
Penalty with float scale value.
Penalty with integer scale value.
Ai21 Jurassic2 chat response.
 
Explains why the generation process was halted for a specific completion.
The generatedToken fields.
The prompt includes the raw text, the tokens with their log probabilities, and the top-K alternative tokens at each position, if requested.
The textRange field indicates the start and end offsets of the token in the decoded text string.
Provides detailed information about each token in both the prompt and the completions.
The topTokens field is a list of the top K alternative tokens for this position, sorted by probability, according to the topKReturn request parameter.
Collection of attribute keys used in AI observations (spans, metrics, events).
Collection of event names used in AI observations.
Collection of metric attributes used in AI observations.
Collection of metric names used in AI observations.
Metadata associated with an AI operation (e.g.
 
Types of operations performed by AI systems.
Collection of systems providing AI functionality.
Utility methods for creating native runtime hints.
Types of tokens produced and consumed in an AI operation.
AllOfGenerateResponseDetails
AllOfStreamResponseDetails
Based on Bedrock's Anthropic Claude Messages API.
Anthropic models version.
AnthropicChatRequest encapsulates the request parameters for the Anthropic messages model.
 
AnthropicChatResponse encapsulates the response parameters for the Anthropic messages model.
AnthropicChatStreamingResponse encapsulates the streaming response parameters for the Anthropic messages model.
Encapsulates a delta.
The streaming type of this message.
Encapsulates the metrics about the model invocation.
Message comprising the conversation.
The role of the author of this message.
 
The source of the media content.
The type of this message.
 
 
 
Input messages.
 
 
 
 
Check the Models overview and model comparison for additional details and options.
 
The source of the media content.
The ContentBlock type.
 
 
 
 
 
 
 
 
 
 
 
The evnt type of the streamed chunk.
 
 
 
 
 
 
The role of the author of this message.
 
 
Special event used to aggregate multiple tool use events into a single event with list of aggregated ContentBlockToolUse.
Usage statistics.
 
 
Anthropic models version.
AnthropicChatRequest encapsulates the request parameters for the Anthropic chat model.
 
AnthropicChatResponse encapsulates the response parameters for the Anthropic chat model.
The ChatModel implementation for the Anthropic service.
The options to be used when sending a chat request to the Anthropic API.
 
 
 
Anthropic Chat autoconfiguration properties.
Anthropic API connection properties.
RateLimit implementation for OpenAI.
The AnthropicRuntimeHints class is responsible for registering runtime hints for Anthropic API classes.
Usage implementation for AnthropicApi.
 
 
 
The ApiUtils class provides utility methods for working with API requests and responses.
Lets the generative know the content was generated as a response to the user.
 
 
Represents a response returned by the AI.
 
 
Represents an audio transcription prompt for an AI model.
 
 
QianFan abstract authentication API.
 
Converts Filter.Expression into Azure Search OData filter syntax.
AzureOpenAI audio transcription client implementation for backed by OpenAIClient.
 
 
 
 
Segment of the transcribed text and its corresponding details.
Extracted word and it's corresponding timestamps.
 
 
 
Audio transcription metadata implementation for AzureOpenAI.
 
ChatModel implementation for Microsoft Azure AI backed by OpenAIClient.
The configuration information for a chat completions request.
 
 
 
Azure Open AI Embedding Model implementation.
The configuration information for the embedding requests.
 
 
Usage implementation for Microsoft Azure OpenAI Service embedding.
Represents the metadata for image generation using Azure OpenAI.
ImageModel implementation for Microsoft Azure AI backed by OpenAIClient.
The configuration information for a image generation request.
 
 
Configuration properties for Azure OpenAI image generation options.
Represents metadata associated with an image response from the Azure OpenAI image model.
Utility enumeration for representing the response format that may be requested from the Azure OpenAI model.
 
Usage implementation for Microsoft Azure OpenAI Service chat.
Uses Azure Cognitive Search as a backing vector store.
 
 
 
 
Contract for batching Document objects so that the call to embed them could be optimized.
An implementation of StructuredOutputConverter that transforms the LLM output to a specific object type using JSON schema.
Deprecated.
Use the BeanOutputConverter instead.
Auto-configuration for Bedrock Jurassic2 Chat Client.
Java ChatModel for the Bedrock Jurassic2 chat generative model.
 
Request body for the /complete endpoint of the Jurassic-2 API.
 
Penalty object for frequency, presence, and count penalties.
 
Configuration properties for Bedrock Ai21Jurassic2.
Auto-configuration for Bedrock Anthropic Chat Client.
Java ChatModel and StreamingChatModel for the Bedrock Anthropic chat generative.
Configuration properties for Bedrock Anthropic Claude 3.
Auto-configuration for Bedrock Anthropic Chat Client.
Java ChatModel and StreamingChatModel for the Bedrock Anthropic chat generative.
Configuration properties for Bedrock Anthropic.
 
Configuration properties for Bedrock AWS connection.
Auto-configuration for Bedrock Cohere Chat Client.
 
 
 
Bedrock Cohere Chat autoconfiguration properties.
Auto-configuration for Bedrock Cohere Embedding Model.
EmbeddingModel implementation that uses the Bedrock Cohere Embedding API.
 
 
Bedrock Cohere Embedding autoconfiguration properties.
Auto-configuration for Bedrock Llama Chat Client.
Java ChatModel and StreamingChatModel for the Bedrock Llama chat generative.
 
 
Configuration properties for Bedrock Llama.
The BedrockRuntimeHints class is responsible for registering runtime hints for Bedrock AI API classes.
Auto-configuration for Bedrock Titan Chat Client.
 
 
 
Bedrock Titan Chat autoconfiguration properties.
Auto-configuration for Bedrock Titan Embedding Model.
EmbeddingModel implementation that uses the Bedrock Titan Embedding API.
 
 
 
Bedrock Titan Embedding autoconfiguration properties.
Usage implementation for Bedrock API.
BestOfSequence
 
The Call Around Advisor Chain is used to invoke the next Around Advisor in the chain.
Create a CassandraChatMemory like CassandraChatMemory.create(CassandraChatMemoryConfig.builder().withTimeToLive(Duration.ofDays(1)).build()); For example @see org.springframework.ai.chat.memory.CassandraChatMemory
 
 
 
 
Given a string sessionId, return the value for each primary key column.
 
The CassandraVectorStore is for managing and querying vector data in an Apache Cassandra db.
Indexes are automatically created with COSINE.
 
Configuration for the Cassandra vector store.
 
Given a string document id, return the value for each primary key column.
Given a list of primary key column values, return the document id.
 
 
 
The Categories class represents a set of categories used to classify content.
 
This class represents the scores for different categories of content.
 
Client to perform stateless requests to an AI Model, using a fluent API.
 
A mutable builder for creating a ChatClient.
 
 
 
 
 
 
 
Auto-configuration for ChatClient.
Builder for configuring a ChatClient.Builder.
Configuration properties for the chat client builder.
 
Callback interface that can be used to customize a ChatClient.Builder.
An ObservationFilter to include the chat prompt content in the observation.
Context used to store metadata for chat client workflows.
 
Interface for an ObservationConvention for chat client workflows.
Documented conventions for chat client observations.
 
 
Abstract Data Type (ADT) encapsulating information on the completion choices in the AI response.
The ChatMemory interface represents a storage for chat conversation history.
 
An ObservationFilter to include the chat completion content in the observation.
Handler for including the chat completion content in the observation as a span event.
Marker interface, to be used to store info on the model such as the current context length.
Handler for generating metrics from chat model observations.
Utilities to process the prompt and completion content in observations for chat models.
Context used to store metadata for chat model exchanges.
 
Interface for an ObservationConvention for chat model exchanges.
Documented conventions for chat model observations.
Events for chat model operations.
High-cardinality observation key names for chat model operations.
Low-cardinality observation key names for chat model operations.
An ObservationFilter to include the chat prompt content in the observation.
Handler for including the chat prompt content in the observation as a span event.
Auto-configuration for Spring AI chat model observations.
Configuration properties for chat model observations.
The ChatOptions represent the common options, portable across different chat models.
 
A PromptTemplate that lets you specify the role as a string should the current implementations and their roles not suffice for your needs.
The chat completion (e.g.
 
Models common AI provider metadata returned in an AI response.
 
Single-class Chroma API implementation based on the (unofficial) Chroma REST API.
Add embeddings to the chroma data store.
Chroma embedding collection.
Request to create a new collection with the given name and metadata.
Request to delete embedding from a collection.
Single query embedding response.
Object containing the get embedding results.
Get embeddings from a collection.
Request to get the nResults nearest neighbor embeddings for provided queryEmbeddings.
 
A QueryResponse object containing the query results.
 
An implementation of BindingsPropertiesProcessor that detects Bindings of type: "chroma".
 
Converts Filter.Expression into Chroma metadata filter expression format.
ChromaVectorStore is a concrete implementation of the VectorStore interface.
 
 
Java client for the Bedrock Cohere chat model.
Cohere models version.
CohereChatRequest encapsulates the request parameters for the Cohere command model.
Builder for the CohereChatRequest.
Prevents the model from generating unwanted tokens or incentivize the model to include desired tokens.
(optional) Specify how and if the token likelihoods are returned with the response.
Specifies how the API handles inputs longer than the maximum token length.
CohereChatResponse encapsulates the response parameters for the Cohere command model.
Generated result along with the likelihoods for tokens requested.
The reason the response finished being generated.
 
Cohere Embedding API.
Cohere Embedding model ids.
The Cohere Embed model request.
Cohere Embedding API input types.
Specifies how the API handles inputs longer than the maximum token length.
Cohere Embedding response.
 
 
CompatGenerateRequest
Data structure that contains content and metadata.
Converts the Document text and metadata into an AI, prompt-friendly text representation.
ContentFormatTransformer processes a list of documents by applying a content formatter to each document.
 
 
The default implementation of ChatClient as created by the ChatClient.Builder.build() } method.
 
 
 
 
 
 
 
 
DefaultChatClientBuilder is a builder class for creating a ChatClient.
Default conventions to populate observations for chat client workflows.
Default conventions to populate observations for chat model operations.
 
 
Default conventions to populate observations for embedding model operations.
Default conventions to populate observations for image model operations.
Default conventions to populate observations for vector store operations.
Details
A document is a container for the content and metadata of a document.
 
EmbeddingModel is a generic interface for embedding models.
Represents a request to embed a list of documents.
 
 
 
 
ElasticsearchAiSearchFilterExpressionConverter is a class that converts Filter.Expression objects into Elasticsearch query string representation.
The ElasticsearchVectorStore class implements the VectorStore interface and provides functionality for managing and querying documents in Elasticsearch.
 
Provided Elasticsearch vector option configuration.
 
Represents a single embedding vector.
EmbeddingModel is a generic interface for embedding models.
 
Handler for generating metrics from embedding model observations.
Context used to store metadata for embedding model exchanges.
 
Interface for an ObservationConvention for embedding model exchanges.
Documented conventions for embedding model observations.
High-cardinality observation key names for embedding model operations.
Low-cardinality observation key names for embedding model operations.
Auto-configuration for Spring AI embedding model observations.
 
 
 
Embedding response object.
Common AI provider metadata returned in an embedding response.
 
 
 
 
 
A RateLimit implementation that returns zero for all property getters
A EmpytUsage implementation that returns zero for all property getters
 
ErrorResponse
Represents an evaluation request, which includes the user's text, a list of content data, and a chat response.
 
 
A utility to reformat extracted text content before encapsulating it in a Document.
The Builder class is a nested static class of ExtractedTextFormatter designed to facilitate the creation and customization of instances of ExtractedTextFormatter.
The FactCheckingEvaluator class implements a method for evaluating the factual accuracy of Large Language Model (LLM) responses against provided context.
Writes the content of a list of Documents into a file.
Portable runtime generative for metadata filter expressions.
Triple that represents and filter boolean expression as left type right.
Filter expression operations.
Represents expression grouping (e.g.
String identifier representing an expression key.
Mark interface representing the supported expression types: Filter.Key, Filter.Value, Filter.Expression and Filter.Group.
Represents expression value constant or constant array.
DSL builder for Filter.Expression instances.
 
Converters a generic, portable Filter.Expression into a VectorStore specific expression language format.
Parse a textual, vector-store agnostic, filter expression language into Filter.Expression.
 
 
 
Helper class providing various boolean transformation.
This class provides an empty implementation of FiltersListener, which can be extended to create a listener which only needs to handle a subset of the available methods.
This class provides an empty implementation of FiltersVisitor, which can be extended to create a visitor which only needs to handle a subset of the available methods.
 
This interface defines a complete listener for a parse tree produced by FiltersParser.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
This interface defines a complete generic visitor for a parse tree produced by FiltersParser.
Gets or Sets FinishReason
This class extends PDFTextStripper to provide custom text extraction and formatting capabilities for PDF pages.
Implementations of this interface provides instructions for how the output of a language generative should be formatted.
Deprecated.
Use the FormatProvider instead.
Represents a model function call handler.
A Spring ApplicationContextAware implementation that provides a way to retrieve a Function from the Spring context and wrap it into a FunctionCallback.
 
Note that the underlying function is responsible for converting the output into format that can be consumed by the Model.
 
 
 
 
 
A VectorStore implementation backed by GemFire.
 
 
 
 
 
GenerateParameters
GenerateRequest
GenerateResponse
Represents a response returned by the AI.
The Generation class represents a response from a moderation process.
The SAP HANA Cloud vector engine offers multiple use cases in AI scenarios.
 
The HanaCloudVectorStoreConfig class represents the configuration for the HanaCloudVectorStore.
 
 
The HanaVectorEntity is an abstract class that represents a mapped superclass for entities that have a vector representation stored in a HANA vector repository.
The HanaVectorRepository interface provides methods for interacting with a HANA vector repository, which allows storing and querying vector embeddings.
 
 
An implementation of ChatModel that interfaces with HuggingFace Inference Endpoints for text generation.
 
Interface for generating unique document IDs.
 
 
 
 
 
Context used to store metadata for image model exchanges.
 
Interface for an ObservationConvention for image model exchanges.
Documented conventions for image model observations.
Events for image model operations.
High-cardinality observation key names for image model operations.
Low-cardinality observation key names for image model operations.
An ObservationFilter to include the image prompt content in the observation.
Auto-configuration for Spring AI image model observations.
Configuration properties for image model observations.
ImageOptions represent the common options, portable across different image generation models.
 
 
The image completion (e.g.
Represents metadata associated with an image response.
 
Info
The InMemoryChatMemory class is an implementation of the ChatMemory interface that represents an in-memory storage for chat conversation history.
A SHA-256 based ID generator that returns the hash as a UUID.
 
A class that reads JSON documents and converts them into a list of Document objects.
Utility class for JSON processing.
Estimates the number of tokens in a given text or message using the JTokkit encoding library.
Keyword extractor that uses generative to extract 'excerpt_keywords' metadata field.
 
Returns a new list of content (e.g list of messages of list of documents) that is a subset of the input list of contents and complies with the max token size constraint.
StructuredOutputConverter implementation that uses a DefaultConversionService to convert the LLM output into a List instance.
Deprecated.
Use the ListOutputConverter instead.
Java client for the Bedrock Llama chat model.
Llama models version.
LlamaChatRequest encapsulates the request parameters for the Meta Llama chat model.
 
LlamaChatResponse encapsulates the response parameters for the Meta Llama chat model.
The reason the response finished being generated.
StructuredOutputConverter implementation that uses a pre-configured MappingJackson2MessageConverter to convert the LLM output into a java.util.Map<String, Object> instance.
Deprecated.
Use the MapOutputConverter instead.
Reads the given Markdown resource and groups headers, paragraphs, or text divided by horizontal lines (depending on the MarkdownDocumentReaderConfig.horizontalRuleCreateDocument configuration) into Documents.
Common configuration for the MarkdownDocumentReader.
 
The Media class represents the data and metadata of a media attachment in a message.
 
Utility class for merging ChatCompletions instances and their associated objects.
The Message interface represents a message that can be sent or received in a chat application.
Helper that for streaming chat responses, aggregate the chat response messages into a single AssistantMessage.
 
Memory is retrieved added as a collection of messages to the prompt
 
Converts a list of messages to a prompt for bedrock models.
 
The MessageType enum represents the type of message in a chat application.
 
Converts Filter.Expression into Milvus metadata filter expression format.
 
Parameters for Milvus client connection.
 
Configuration for the Milvus vector store.
 
 
 
 
 
Gemini supports the following MIME types: image/gif image/png image/jpeg video/mov video/mpeg video/mp4 video/mpg video/avi video/wmv video/mpegps video/flv https://cloud.google.com/vertex-ai/docs/generative-ai/model-reference/gemini
Single class implementation of the MiniMax Chat Completion API and MiniMax Embedding API.
Represents a chat completion response returned by model, based on the provided input.
 
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
An array of content parts with a defined type.
 
The role of the author of this message.
The relevant tool call.
Creates a model response for the given chat conversation.
An object specifying the format that the model must output.
Helper factory that creates a tool_choice of type 'none', 'auto' or selected function by name.
MiniMax Chat Completion Models: MiniMax Model.
List of multiple embedding responses.
MiniMax Embeddings Models: Embeddings.
Creates an embedding vector representing the input text.
MiniMax Embeddings Types
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Log probability information for the choice.
Message content tokens with log probability information.
The most likely tokens and their log probability, at this token position.
Usage statistics for the completion request.
Common value constants for MiniMax api.
 
ChatModel and StreamingChatModel implementation for MiniMax backed by MiniMaxApi.
MiniMaxChatOptions represents the options for performing chat completion using the MiniMax API.
 
 
 
MiniMax Embedding Model implementation.
This class represents the options for MiniMax embedding.
 
 
The MiniMaxRuntimeHints class is responsible for registering runtime hints for MiniMax API classes.
Helper class to support Streaming function calling.
Usage implementation for MiniMax.
Single-class, Java Client library for Mistral AI platform.
Represents a chat completion response returned by model, based on the provided input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
The role of the author of this message.
The relevant tool call.
Creates a model request for chat conversation.
An object specifying the format that the model must output.
Specifies a tool the model should use.
List of well-known Mistral chat models.
Represents an embedding vector returned by embedding endpoint.
List of multiple embedding responses.
List of well-known Mistral embedding models.
Creates an embedding vector representing the input text.
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Log probability information for the choice.
Message content tokens with log probability information.
The most likely tokens and their log probability, at this token position.
Usage statistics.
 
An implementation of BindingsPropertiesProcessor that detects Bindings of type: "mistralai".
Represents a Mistral AI Chat Model.
 
 
 
 
Provides the Mistral AI Embedding Model.
 
 
 
 
The MistralAiRuntimeHints class is responsible for registering runtime hints for Mistral AI API classes.
Helper class to support Streaming function calling.
Usage implementation for Mistral AI.
Model<TReq extends ModelRequest<?>,TRes extends ModelResponse<?>>
The Model interface provides a generic API for invoking AI models.
Describes an AI model's basic characteristics.
Context used when sending a request to a machine learning model and waiting for a response from the model provider.
Interface representing the customizable options for AI model interactions.
Utility class for manipulating ModelOptions objects.
Interface representing a request to an AI model.
Interface representing the response received from an AI model.
This interface provides methods to access the main output of the AI model and the metadata associated with this result.
Generate metrics about the model usage in the context of an AI operation.
The Moderation class represents the result of a moderation process.
 
An interface that represents metadata associated with the results of a moderation generation process.
Represents a single message intended for moderation, encapsulating the text content.
The ModerationModel interface defines a generic AI model for moderation.
Represents the options for moderation.
A builder class for creating instances of ModerationOptions.
Represents a prompt for moderation containing a single message and the options for the moderation model.
Represents a response from a moderation process, encapsulating the moderation metadata and the generated content.
Defines the metadata associated with a moderation response, extending a base response interface.
Represents the result of a moderation process, indicating whether content was flagged, the categories of moderation, and detailed scores for each category.
 
Converts Filter.Expression into MongoDB Atlas metadata filter expression format.
 
 
 
 
 
Single-class, Java Client library for Moonshot platform.
Represents a chat completion response returned by model, based on the provided input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
The role of the author of this message.
The relevant tool call.
Creates a model response for the given chat conversation.
Helper factory that creates a tool_choice of type 'none', 'auto' or selected function by name.
Moonshot Chat Completion Models: MOONSHOT_V1_8K - moonshot-v1-8k MOONSHOT_V1_32K - moonshot-v1-32k MOONSHOT_V1_128K - moonshot-v1-128k
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Usage statistics.
 
 
 
 
 
 
 
 
The MoonshotRuntimeHints class is responsible for registering runtime hints for Moonshot API classes.
Helper class to support Streaming function calling.
 
 
Converts Filter.Expression into Neo4j condition expression format.
 
An enum to configure the distance function used in the Neo4j vector index.
Configuration for the Neo4j vector store.
 
 
 
Root of the hierarchy of Model access exceptions that are considered non-transient - where a retry of the same operation would fail unless the cause of the Exception is corrected.
 
 
 
 
EmbeddingModel implementation that uses the OCI GenAI Embedding API.
 
The configuration information for OCI embedding requests
 
 
Java Client for the Ollama API.
Chat request object.
 
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Ollama chat response object.
Deprecated, for removal: This API element is subject to removal in a future version.
Deprecated, for removal: This API element is subject to removal in a future version.
Generate embeddings from a model.
The response object returned from the /embedding endpoint.
Deprecated, for removal: This API element is subject to removal in a future version.
 
Deprecated, for removal: This API element is subject to removal in a future version.
Chat message object.
 
The role of the message in the conversation.
The relevant tool call.
The function definition.
Auto-configuration for Ollama Chat Client.
An implementation of BindingsPropertiesProcessor that detects Bindings of type: "ollama".
ChatModel implementation for Ollama.
Ollama Chat autoconfiguration properties.
 
Ollama connection autoconfiguration properties.
EmbeddingModel implementation for Ollama.
 
Ollama Embedding autoconfiguration properties.
Helper class for common Ollama models.
Helper class for creating strongly-typed Ollama options.
The OllamaRuntimeHints class is responsible for registering runtime hints for Ollama AI API classes.
Usage implementation for Ollama
Single class implementation of the OpenAI Chat Completion API and OpenAI Embedding API.
Represents a chat completion response returned by model, based on the provided input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
An array of content parts with a defined type.
 
The role of the author of this message.
The relevant tool call.
Creates a model response for the given chat conversation.
An object specifying the format that the model must output.
JSON schema object that describes the format of the JSON object.
 
 
Helper factory that creates a tool_choice of type 'none', 'auto' or selected function by name.
OpenAI Chat Completion Models: GPT-4o GPT-4o mini GPT-4 and GPT-4 Turbo GPT-3.5 Turbo
Represents an embedding vector returned by embedding endpoint.
List of multiple embedding responses.
OpenAI Embeddings Models: Embeddings.
Creates an embedding vector representing the input text.
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Log probability information for the choice.
Message content tokens with log probability information.
The most likely tokens and their log probability, at this token position.
Usage statistics for the completion request.
Breakdown of tokens used in a completion
Thrown on 4xx client errors, such as 401 - Incorrect API key provided, 401 - You must be a member of an organization to use the API, 429 - Rate limit reached for requests, 429 - You exceeded your current quota , please check your plan and billing details.
Common value constants for OpenAI api.
Enumeration of OpenAI API response headers.
Turn audio into text or text into audio.
Request to generates audio from the input text.
The format to audio in.
Builder for the SpeechRequest.
The voice to use for synthesis.
The Transcription Object represents a verbose json transcription response returned by model, based on the provided input.
Segment of the transcribed text and its corresponding details.
Extracted word and it corresponding timestamps.
Request to transcribe an audio file to text.
 
 
The format of the transcript and translation outputs, in one of these options: json, text, srt, verbose_json, or vtt.
Request to translate an audio file to English.
 
TTS is an AI model that converts text to natural sounding spoken text.
Whisper is a general-purpose speech recognition model.
 
OpenAI audio speech client implementation for backed by OpenAiAudioApi.
Options for OpenAI text to audio - speech synthesis.
 
Configuration properties for OpenAI audio speech.
Audio speech metadata implementation for OpenAI.
OpenAI audio transcription client implementation for backed by OpenAiAudioApi.
 
 
 
Audio transcription metadata implementation for OpenAI.
 
An implementation of BindingsPropertiesProcessor that detects Bindings of type: "openai".
ChatModel and StreamingChatModel implementation for OpenAI backed by OpenAiApi.
 
 
 
 
Open AI Embedding Model implementation.
 
 
 
OpenAI Image API.
 
OpenAI Image API model.
 
 
 
OpenAiImageModel is a class that implements the ImageModel interface.
OpenAI Image API options.
 
OpenAI Image autoconfiguration properties.
OpenAI Moderation API.
 
 
 
 
 
 
 
OpenAiModerationModel is a class that implements the ModerationModel interface.
OpenAI Moderation API options.
 
OpenAI Moderation autoconfiguration properties.
RateLimit implementation for OpenAI.
Utility used to extract known HTTP response headers for the OpenAI API.
The OpenAiRuntimeHints class is responsible for registering runtime hints for OpenAI API classes.
Helper class to support Streaming function calling.
Usage implementation for OpenAI.
 
 
 
 
 
Integration of Oracle database 23ai as a Vector Store.
 
 
 
 
Deprecated.
Use the StructuredOutputConverter instead.
Groups the parsed PDF pages into Documents.
The ParagraphManager class is responsible for managing the paragraphs and hierarchy of a PDF document.
Represents a document paragraph metadata and hierarchy.
Uses the PDF catalog (e.g.
 
Utility methods for String parsing.
Common configuration builder for the PagePdfDocumentReader and the ParagraphPdfDocumentReader.
 
Re-implement the PDFLayoutTextStripperByArea on top of the PDFLayoutTextStripper instead the original PDFTextStripper.
The PdfReaderRuntimeHints class is responsible for registering runtime hints for PDFBox resources.
Converts Filter.Expression into PgVector metadata filter expression format.
 
Uses the "vector_store" table to store the Spring AI vector data.
 
Defaults to CosineDistance.
By default, pgvector performs exact nearest neighbor search, which provides perfect recall.
 
 
Converts Filter.Expression into Pinecone metadata filter expression format.
A VectorStore implementation backed by Pinecone, a cloud-based vector database.
Configuration class for the PineconeVectorStore.
 
 
 
Auto-configuration class for PostgresMlEmbeddingModel.
PostgresML EmbeddingModel
 
 
 
Configuration properties for Postgres ML.
PrefillToken
Converts Filter.Expression into test string format.
The Prompt class represents a prompt used in AI model requests.
Memory is retrieved added into the prompt's system text.
 
Abstract Data Type (ADT) modeling metadata gathered by the AI during request processing.
Abstract Data Type (ADT) modeling filter metadata for all prompts sent during an AI request.
 
 
 
 
 
 
Qdrant vectorStore implementation.
Deprecated, for removal: This API element is subject to removal in a future version.
since 1.0.0 in favor of QdrantVectorStore.
 
 
 
Represents an access token for the QianFan API.
Single class implementation of the QianFan Chat Completion API and Embedding API.
Represents a chat completion response returned by model, based on the provided input.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Message comprising the conversation.
The role of the author of this message.
Creates a model response for the given chat conversation.
An object specifying the format that the model must output.
QianFan Chat Completion Models: QianFan Model.
Represents an embedding vector returned by embedding endpoint.
List of multiple embedding responses.
QianFan Embeddings Models: Embeddings.
Creates an embedding vector representing the input text.
Usage statistics for the completion request.
QianFanAuthenticator is a class that authenticates and requests access token for the QianFan API.
 
 
ChatModel and StreamingChatModel implementation for QianFan backed by QianFanApi.
QianFanChatOptions represents the options for performing chat completion using the QianFan API.
 
 
 
The ApiUtils class provides utility methods for working with API requests and responses.
QianFan Embedding Client implementation.
This class represents the options for QianFan embedding.
 
 
QianFan Image API.
 
QianFan Image API model.
 
 
QianFanImageModel is a class that implements the ImageModel interface.
QianFan Image API options.
 
QianFan Image autoconfiguration properties.
The QianFanRuntimeHints class is responsible for registering runtime hints for QianFan API classes.
Usage implementation for QianFan.
 
Context for the question is retrieved from a Vector Store and added to the prompt's user text.
 
A random ID generator that returns a UUID.
Abstract Data Type (ADT) encapsulating metadata from an AI provider's API rate limits granted to the API key in use and the API key's current balance.
Converts Filter.Expression into Redis search filter expression format.
The RedisVectorStore is for managing and querying vector data in a Redis database.
 
 
Configuration for the Redis vector store.
 
 
 
 
Deprecated.
since 1.0.0 M3 please use CallAroundAdvisor or StreamAroundAdvisor instead.
Service that helps caching remote Resources on the local file system.
Miscellaneous Resource utility methods.
Represents a Model response that includes the entire response along withe specified response entity type.
Interface representing metadata associated with an AI model's response.
Interface representing metadata associated with the results of an AI model.
RetryUtils is a utility class for configuring and handling retry operations.
 
A CallAroundAdvisor and StreamAroundAdvisor that filters out the response if the user input contains any of the sensitive words.
 
 
Similarity search request builder.
 
https://www.elastic.co/guide/en/elasticsearch/reference/master/dense-vector.html max_inner_product is currently not supported because the distance value is not normalized and would not comply with the requirement of being between 0 and 1
A simple logger advisor that logs the request and response messages.
SimpleVectorStore is a simple implementation of the VectorStore interface.
 
The Speech class represents the result of speech synthesis from an AI model.
The SpeechMessage class represents a single text message to be converted to speech by the OpenAI TTS API.
The SpeechModel interface provides a way to interact with the OpenAI Text-to-Speech (TTS) API.
The SpeechPrompt class represents a request to the OpenAI Text-to-Speech (TTS) API.
Creates a new instance of SpeechResponse with the given speech result.
 
Types of Spring AI constructs which can be observed.
 
 
Exponential Backoff properties.
 
Represents the StabilityAI API.
 
 
 
 
 
 
 
Represents metadata associated with the image generation process in the StabilityAI framework.
StabilityAiImageModel is a class that implements the ImageModel interface.
StabilityAiImageOptions is an interface that extends ImageOptions.
 
 
 
The StreamAroundAdvisorChain is used to delegate the call to the next StreamAroundAdvisor in the chain.
StreamDetails
Helper class to support streaming function calling.
 
 
The StreamingModel interface provides a generic API for invoking an AI models with streaming response.
The StreamingSpeechModel interface provides a way to interact with the OpenAI Text-to-Speech (TTS) API using a streaming approach, allowing you to receive the generated audio in a real-time fashion.
StreamResponse
Converts the (raw) LLM output into a structured responses of type.
Enum representing different styles for images.
Title extractor with adjacent sharing that uses generative to extract 'section_summary', 'prev_section_summary', 'next_section_summary' metadata fields.
 
A message of the type 'system' passed as input.
 
An implementation of BindingsPropertiesProcessor that detects Bindings of type: "genai".
 
 
 
 
A document reader that leverages Apache Tika to extract text from a variety of document formats, such as PDF, DOC/DOCX, PPT/PPTX, and HTML.
Java client for the Bedrock Titan chat model.
Titan models version.
TitanChatRequest encapsulates the request parameters for the Titan chat model.
 
Titan request text generation configuration.
TitanChatResponse encapsulates the response parameters for the Titan chat model.
The reason the response finished being generated.
Titan response result.
Titan chat model streaming response.
Java client for the Bedrock Titan Embedding model.
Titan Embedding model ids.
Titan Embedding request parameters.
TitanEmbeddingRequest builder.
Titan Embedding response.
Token
Token count based strategy implementation for BatchingStrategy.
Estimates the number of tokens in a given text or message.
 
Helper class that reuses the AbstractToolCallSupport to implement the function call handling logic on the client side.
Helper used to provide only the function definition, without the actual function call implementation.
Represents the context for tool execution in a function calling scenario.
The ToolResponseMessage class represents a message with a function content in a chat application.
 
Utilities to prepare and process traces for observability.
An implementation of the AbstractEmbeddingModel that uses ONNX-based Transformer models for text embeddings.
 
 
 
 
Configurations for the HuggingFaceTokenizer used to convert sentences into tokens.
Root of the hierarchy of Model access exceptions that are considered transient - where a previously failed operation might be able to succeed when the operation is retried without any intervention by application-level functionality.
A utility class that provides methods for resolving types and classes related to functions.
 
 
Converts Filter.Expression into Typesense metadata filter expression format.
 
 
 
 
 
 
Abstract Data Type (ADT) encapsulating metadata on the usage of an AI provider's API per AI request.
A message of the type 'user' passed as input Messages with the user role are from the end-user or developer.
The VectorStore interface defines the operations for managing and querying documents in a vector database.
Memory is retrieved from a VectorStore added into the prompt's system text.
 
Collection of attribute keys used in vector store observations (spans, metrics, events).
Auto-configuration for Spring AI vector store observations.
Utilities to process the query content in observations for vector store operations.
Context used to store metadata for vector store operations.
 
 
 
Documented conventions for vector store observations.
High-cardinality observation key names for vector store operations.
Low-cardinality observation key names for vector store operations.
Collection of event names used in vector store observations.
Configuration properties for vector store observations.
Collection of systems providing vector store functionality.
An ObservationFilter to include the Vector Store search response content in the observation.
Handler for including the query response content in the observation as a span event.
Types of similarity metrics used in vector store operations.
Auto-configuration for Vertex AI Gemini Chat.
VertexAiEmbeddingConnectionDetails represents the details of a connection to the Vertex AI embedding service.
 
Configuration properties for Vertex AI Embedding.
 
Utility class for constructing parameter objects for Vertex AI embedding requests.
 
 
 
 
 
Auto-configuration for Vertex AI Gemini Chat.
 
 
 
 
 
 
 
Configuration properties for Vertex AI Gemini Chat.
Configuration properties for Vertex AI Gemini Chat.
 
 
The VertexAiGeminiRuntimeHints class is responsible for registering runtime hints for Vertex AI Gemini API classes.
Implementation of the Vertex AI Multimodal Embedding Model.
VertexAI Embedding Models: - Text embeddings - Multimodal embeddings
Class representing the options for Vertex AI Multimodal Embedding.
 
Configuration properties for Vertex AI Gemini Chat.
Vertex AI API client for the Generative Language model.
A list of floats representing the embedding.
Message generation request body.
The response from the model.
Content filtering metadata associated with processing a single request.
Reasons why content may have been blocked.
The base unit of structured text.
A collection of source attributions for a piece of content.
A citation to a source for a portion of a specific response.
All of the structured input text passed to the model as a prompt.
An input/output example used to instruct the Model.
Information about a Generative Language Model.
API error response.
Error details.
 
 
 
 
 
 
 
 
A class representing a Vertex AI Text Embedding Model.
VertexAI Embedding Models: - Text embeddings - Multimodal embeddings
 
 
 
Configuration properties for Vertex AI Gemini Chat.
 
The VertexRuntimeHints class is responsible for registering runtime hints for Vertex AI API classes.
Java Client for the watsonx.ai API.
WatsonX.ai autoconfiguration class.
ChatModel implementation for watsonx.ai.
Helper class for creating watsonx.ai options.
 
Chat properties for Watsonx.AI Chat.
Java class for Watsonx.ai Chat Request object.
 
Java class for Watsonx.ai Chat Response object.
Java class for Watsonx.ai Chat Results object.
WatsonX.ai connection autoconfiguration properties.
EmbeddingModel implementation for Watsonx.ai.
The configuration information for the embedding requests.
Watsonx.ai Embedding autoconfiguration properties.
Java class for Watsonx.ai Embedding Request object.
 
Java class for Watsonx.ai Embedding Response object.
Java class for Watsonx.ai Embedding Results object.
The WatsonxAiRuntimeHints class is responsible for registering runtime hints for Watsonx AI API classes.
An implementation of BindingsPropertiesProcessor that detects Bindings of type: "weaviate".
 
Converts Filter.Expression into Weaviate metadata filter expression format.
A VectorStore implementation backed by Weaviate vector database.
Configuration class for the WeaviateVectorStore.
 
https://weaviate.io/developers/weaviate/concepts/replication-architecture/consistency#tunable-consistency-strategies
 
 
 
 
Single class implementation of the ZhiPuAI Chat Completion API and ZhiPuAI Embedding API.
Represents a chat completion response returned by model, based on the provided input.
Chat completion choice.
Represents a streamed chunk of a chat completion response returned by model, based on the provided input.
Chat completion choice.
The reason the model stopped generating tokens.
Message comprising the conversation.
The function definition.
An array of content parts with a defined type.
 
The role of the author of this message.
The relevant tool call.
Creates a model response for the given chat conversation.
An object specifying the format that the model must output.
Helper factory that creates a tool_choice of type 'none', 'auto' or selected function by name.
ZhiPuAI Chat Completion Models: ZhiPuAI Model.
Represents an embedding vector returned by embedding endpoint.
List of multiple embedding responses.
ZhiPuAI Embeddings Models: Embeddings.
Creates an embedding vector representing the input text.
Represents a tool the model may call.
Function definition.
Create a tool of type 'function' and the given function definition.
Log probability information for the choice.
Message content tokens with log probability information.
The most likely tokens and their log probability, at this token position.
Usage statistics for the completion request.
 
ChatModel and StreamingChatModel implementation for ZhiPuAI backed by ZhiPuAiApi.
ZhiPuAiChatOptions represents the options for the ZhiPuAiChat model.
 
 
 
ZhiPuAI Embedding Model implementation.
The ZhiPuAiEmbeddingOptions class represents the options for ZhiPuAI embedding.
 
 
ZhiPuAI Image API.
 
ZhiPuAI Image API model.
 
 
ZhiPuAiImageModel is a class that implements the ImageModel interface.
ZhiPuAiImageOptions represents the options for image generation using ZhiPuAI image model.
 
 
The ZhiPuAiRuntimeHints class is responsible for registering runtime hints for ZhiPu AI API classes.
Helper class to support Streaming function calling.
Usage implementation for ZhiPuAI.
Common value constants for ZhiPu api.