Scan Settings 

The following parameters define how your model is scanned.

Field Name  Description Values 

Temperature

The Temperature value controls the balance between the predictability and creativity of the generated text in the response. 

Lower temperature value returns more returns predictable and conservative responses whereas higher temperature value returns less predictable and creative response. 

Range: 0 to 1

Default value - 0.7

Top K

This configuration controls the randomness and diversity of the generated text.

Any integer value.

Default value is 10

Maximum Tokens

Maximum number of generated tokens by Target LLM.

Any integer value.

Default value is 512

Top P

This configuration controls the randomness and diversity of the generated text. It employs a technique called nucleus sampling or top-p sampling.

Range: 0 to 1

Default value is 0.95.

Repetition Penalty

This configuration reduces the likelihood of repetitive text in text generation.

If set to 1, there is no penalty. Higher values increasingly discourage the repetition of tokens.

Float value between 1 and 2.

Default value is 1.03

Response Timeout

Timeout, in seconds, for a single inference request sent to the Target LLM.

Any integer value

Default value is 120.

Retry Attempts

Number of times a single inference request is attempted.

Default value is 5.