Scan Settings

The following parameters define how your model is scanned.

Field Name	Description	Values
Temperature	The Temperature value controls the balance between the predictability and creativity of the generated text in the response. Lower temperature value returns more returns predictable and conservative responses whereas higher temperature value returns less predictable and creative response.	Range: 0 to 1 Default value - 0.7
Top K	This configuration controls the randomness and diversity of the generated text.	Any integer value. Default value is 10
Maximum Tokens	Maximum number of generated tokens by Target LLM.	Any integer value. Default value is 512
Top P	This configuration controls the randomness and diversity of the generated text. It employs a technique called nucleus sampling or top-p sampling.	Range: 0 to 1 Default value is 0.95.
Repetition Penalty	This configuration reduces the likelihood of repetitive text in text generation. If set to 1, there is no penalty. Higher values increasingly discourage the repetition of tokens.	Float value between 1 and 2. Default value is 1.03
Response Timeout	Timeout, in seconds, for a single inference request sent to the Target LLM.	Any integer value Default value is 120.
Retry Attempts	Number of times a single inference request is attempted.	Default value is 5.