@Generated(value="com.amazonaws:aws-java-sdk-code-generator") public class TextInferenceConfig extends Object implements Serializable, Cloneable, StructuredPojo
Configuration settings for text generation using a language model via the RetrieveAndGenerate operation. Includes parameters like temperature, top-p, maximum token count, and stop sequences.
 The valid range of maxTokens depends on the accepted values for your chosen model's inference
 parameters. To see the inference parameters for your model, see Inference parameters for foundation
 models.
 
| Constructor and Description | 
|---|
| TextInferenceConfig() | 
| Modifier and Type | Method and Description | 
|---|---|
| TextInferenceConfig | clone() | 
| boolean | equals(Object obj) | 
| Integer | getMaxTokens()
 The maximum number of tokens to generate in the output text. | 
| List<String> | getStopSequences()
 A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. | 
| Float | getTemperature()
 Controls the random-ness of text generated by the language model, influencing how much the model sticks to the
 most predictable next words versus exploring more surprising options. | 
| Float | getTopP()
 A probability distribution threshold which controls what the model considers for the set of possible next tokens. | 
| int | hashCode() | 
| void | marshall(ProtocolMarshaller protocolMarshaller)Marshalls this structured data using the given  ProtocolMarshaller. | 
| void | setMaxTokens(Integer maxTokens)
 The maximum number of tokens to generate in the output text. | 
| void | setStopSequences(Collection<String> stopSequences)
 A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. | 
| void | setTemperature(Float temperature)
 Controls the random-ness of text generated by the language model, influencing how much the model sticks to the
 most predictable next words versus exploring more surprising options. | 
| void | setTopP(Float topP)
 A probability distribution threshold which controls what the model considers for the set of possible next tokens. | 
| String | toString()Returns a string representation of this object. | 
| TextInferenceConfig | withMaxTokens(Integer maxTokens)
 The maximum number of tokens to generate in the output text. | 
| TextInferenceConfig | withStopSequences(Collection<String> stopSequences)
 A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. | 
| TextInferenceConfig | withStopSequences(String... stopSequences)
 A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. | 
| TextInferenceConfig | withTemperature(Float temperature)
 Controls the random-ness of text generated by the language model, influencing how much the model sticks to the
 most predictable next words versus exploring more surprising options. | 
| TextInferenceConfig | withTopP(Float topP)
 A probability distribution threshold which controls what the model considers for the set of possible next tokens. | 
public void setMaxTokens(Integer maxTokens)
The maximum number of tokens to generate in the output text. Do not use the minimum of 0 or the maximum of 65536. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
maxTokens - The maximum number of tokens to generate in the output text. Do not use the minimum of 0 or the maximum of
        65536. The limit values described here are arbitary values, for actual values consult the limits defined
        by your specific model.public Integer getMaxTokens()
The maximum number of tokens to generate in the output text. Do not use the minimum of 0 or the maximum of 65536. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
public TextInferenceConfig withMaxTokens(Integer maxTokens)
The maximum number of tokens to generate in the output text. Do not use the minimum of 0 or the maximum of 65536. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
maxTokens - The maximum number of tokens to generate in the output text. Do not use the minimum of 0 or the maximum of
        65536. The limit values described here are arbitary values, for actual values consult the limits defined
        by your specific model.public List<String> getStopSequences()
A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
public void setStopSequences(Collection<String> stopSequences)
A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
stopSequences - A list of sequences of characters that, if generated, will cause the model to stop generating further
        tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are
        arbitary values, for actual values consult the limits defined by your specific model.public TextInferenceConfig withStopSequences(String... stopSequences)
A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
 NOTE: This method appends the values to the existing list (if any). Use
 setStopSequences(java.util.Collection) or withStopSequences(java.util.Collection) if you want
 to override the existing values.
 
stopSequences - A list of sequences of characters that, if generated, will cause the model to stop generating further
        tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are
        arbitary values, for actual values consult the limits defined by your specific model.public TextInferenceConfig withStopSequences(Collection<String> stopSequences)
A list of sequences of characters that, if generated, will cause the model to stop generating further tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are arbitary values, for actual values consult the limits defined by your specific model.
stopSequences - A list of sequences of characters that, if generated, will cause the model to stop generating further
        tokens. Do not use a minimum length of 1 or a maximum length of 1000. The limit values described here are
        arbitary values, for actual values consult the limits defined by your specific model.public void setTemperature(Float temperature)
Controls the random-ness of text generated by the language model, influencing how much the model sticks to the most predictable next words versus exploring more surprising options. A lower temperature value (e.g. 0.2 or 0.3) makes model outputs more deterministic or predictable, while a higher temperature (e.g. 0.8 or 0.9) makes the outputs more creative or unpredictable.
temperature - Controls the random-ness of text generated by the language model, influencing how much the model sticks to
        the most predictable next words versus exploring more surprising options. A lower temperature value (e.g.
        0.2 or 0.3) makes model outputs more deterministic or predictable, while a higher temperature (e.g. 0.8 or
        0.9) makes the outputs more creative or unpredictable.public Float getTemperature()
Controls the random-ness of text generated by the language model, influencing how much the model sticks to the most predictable next words versus exploring more surprising options. A lower temperature value (e.g. 0.2 or 0.3) makes model outputs more deterministic or predictable, while a higher temperature (e.g. 0.8 or 0.9) makes the outputs more creative or unpredictable.
public TextInferenceConfig withTemperature(Float temperature)
Controls the random-ness of text generated by the language model, influencing how much the model sticks to the most predictable next words versus exploring more surprising options. A lower temperature value (e.g. 0.2 or 0.3) makes model outputs more deterministic or predictable, while a higher temperature (e.g. 0.8 or 0.9) makes the outputs more creative or unpredictable.
temperature - Controls the random-ness of text generated by the language model, influencing how much the model sticks to
        the most predictable next words versus exploring more surprising options. A lower temperature value (e.g.
        0.2 or 0.3) makes model outputs more deterministic or predictable, while a higher temperature (e.g. 0.8 or
        0.9) makes the outputs more creative or unpredictable.public void setTopP(Float topP)
A probability distribution threshold which controls what the model considers for the set of possible next tokens. The model will only consider the top p% of the probability distribution when generating the next token.
topP - A probability distribution threshold which controls what the model considers for the set of possible next
        tokens. The model will only consider the top p% of the probability distribution when generating the next
        token.public Float getTopP()
A probability distribution threshold which controls what the model considers for the set of possible next tokens. The model will only consider the top p% of the probability distribution when generating the next token.
public TextInferenceConfig withTopP(Float topP)
A probability distribution threshold which controls what the model considers for the set of possible next tokens. The model will only consider the top p% of the probability distribution when generating the next token.
topP - A probability distribution threshold which controls what the model considers for the set of possible next
        tokens. The model will only consider the top p% of the probability distribution when generating the next
        token.public String toString()
toString in class ObjectObject.toString()public TextInferenceConfig clone()
public void marshall(ProtocolMarshaller protocolMarshaller)
StructuredPojoProtocolMarshaller.marshall in interface StructuredPojoprotocolMarshaller - Implementation of ProtocolMarshaller used to marshall this object's data.