Class TextGenerationInferenceApi
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionGenerate tokens if `stream == false` or a stream of token if `stream == true` Generate tokens if `stream == false` or a stream of token if `stream == true`org.springframework.http.ResponseEntity<GenerateResponse>
Generate tokens if `stream == false` or a stream of token if `stream == true` Generate tokens if `stream == false` or a stream of token if `stream == true`generate
(GenerateRequest body) Generate tokens Generate tokensGenerate a stream of token using Server-Sent Events Generate a stream of token using Server-Sent Eventsorg.springframework.http.ResponseEntity<StreamResponse>
Generate a stream of token using Server-Sent Events Generate a stream of token using Server-Sent Eventsorg.springframework.http.ResponseEntity<GenerateResponse>
Generate tokens Generate tokensText Generation Inference endpoint info Text Generation Inference endpoint infoorg.springframework.http.ResponseEntity<Info>
Text Generation Inference endpoint info Text Generation Inference endpoint infovoid
health()
Health check method Health check methodorg.springframework.http.ResponseEntity<Void>
Health check method Health check methodmetrics()
Prometheus metrics scrape endpoint Prometheus metrics scrape endpointorg.springframework.http.ResponseEntity<String>
Prometheus metrics scrape endpoint Prometheus metrics scrape endpointvoid
setApiClient
(ApiClient apiClient)
-
Constructor Details
-
TextGenerationInferenceApi
public TextGenerationInferenceApi() -
TextGenerationInferenceApi
-
-
Method Details
-
getApiClient
-
setApiClient
-
compatGenerate
public GenerateResponse compatGenerate(CompatGenerateRequest body) throws org.springframework.web.client.RestClientException Generate tokens if `stream == false` or a stream of token if `stream == true` Generate tokens if `stream == false` or a stream of token if `stream == true`200 - Generated Text
422 - Input validation error
424 - Generation Error
429 - Model is overloaded
500 - Incomplete generation
- Parameters:
body
- (required)- Returns:
- GenerateResponse
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
compatGenerateWithHttpInfo
public org.springframework.http.ResponseEntity<GenerateResponse> compatGenerateWithHttpInfo(CompatGenerateRequest body) throws org.springframework.web.client.RestClientException Generate tokens if `stream == false` or a stream of token if `stream == true` Generate tokens if `stream == false` or a stream of token if `stream == true`200 - Generated Text
422 - Input validation error
424 - Generation Error
429 - Model is overloaded
500 - Incomplete generation
- Parameters:
body
- (required)- Returns:
- ResponseEntity<GenerateResponse>
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
generate
public GenerateResponse generate(GenerateRequest body) throws org.springframework.web.client.RestClientException Generate tokens Generate tokens200 - Generated Text
422 - Input validation error
424 - Generation Error
429 - Model is overloaded
500 - Incomplete generation
- Parameters:
body
- (required)- Returns:
- GenerateResponse
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
generateWithHttpInfo
public org.springframework.http.ResponseEntity<GenerateResponse> generateWithHttpInfo(GenerateRequest body) throws org.springframework.web.client.RestClientException Generate tokens Generate tokens200 - Generated Text
422 - Input validation error
424 - Generation Error
429 - Model is overloaded
500 - Incomplete generation
- Parameters:
body
- (required)- Returns:
- ResponseEntity<GenerateResponse>
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
generateStream
public StreamResponse generateStream(GenerateRequest body) throws org.springframework.web.client.RestClientException Generate a stream of token using Server-Sent Events Generate a stream of token using Server-Sent Events200 - Generated Text
422 - Input validation error
424 - Generation Error
429 - Model is overloaded
500 - Incomplete generation
- Parameters:
body
- (required)- Returns:
- StreamResponse
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
generateStreamWithHttpInfo
public org.springframework.http.ResponseEntity<StreamResponse> generateStreamWithHttpInfo(GenerateRequest body) throws org.springframework.web.client.RestClientException Generate a stream of token using Server-Sent Events Generate a stream of token using Server-Sent Events200 - Generated Text
422 - Input validation error
424 - Generation Error
429 - Model is overloaded
500 - Incomplete generation
- Parameters:
body
- (required)- Returns:
- ResponseEntity<StreamResponse>
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
getModelInfo
Text Generation Inference endpoint info Text Generation Inference endpoint info200 - Served model info
- Returns:
- Info
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
getModelInfoWithHttpInfo
public org.springframework.http.ResponseEntity<Info> getModelInfoWithHttpInfo() throws org.springframework.web.client.RestClientExceptionText Generation Inference endpoint info Text Generation Inference endpoint info200 - Served model info
- Returns:
- ResponseEntity<Info>
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
health
public void health() throws org.springframework.web.client.RestClientExceptionHealth check method Health check method200 - Everything is working fine
503 - Text generation inference is down
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
healthWithHttpInfo
public org.springframework.http.ResponseEntity<Void> healthWithHttpInfo() throws org.springframework.web.client.RestClientExceptionHealth check method Health check method200 - Everything is working fine
503 - Text generation inference is down
- Returns:
- ResponseEntity<Void>
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
metrics
Prometheus metrics scrape endpoint Prometheus metrics scrape endpoint200 - Prometheus Metrics
- Returns:
- String
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-
metricsWithHttpInfo
public org.springframework.http.ResponseEntity<String> metricsWithHttpInfo() throws org.springframework.web.client.RestClientExceptionPrometheus metrics scrape endpoint Prometheus metrics scrape endpoint200 - Prometheus Metrics
- Returns:
- ResponseEntity<String>
- Throws:
org.springframework.web.client.RestClientException
- if an error occurs while attempting to invoke the API
-