Optional
api
The API key to use.
Default: process.env.MISTRAL_API_KEY
Optional
cache
Optional
callback
Use callbacks instead.
Optional
callbacks
Optional
endpoint
Override the default endpoint.
Optional
max
The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.
Optional
max
The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
Optional
max
The maximum number of tokens to generate in the completion. The token count of your prompt plus max_tokens cannot exceed the model's context length.
Optional
metadata
Optional
model
The name of the model to use.
Default: "mistral-small"
Optional
on
Custom handler for failed attempts. It takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.
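The retry limit and failed-attempt handler described above can be illustrated with a minimal sketch. The names `backoffDelayMs` and `retryWithBackoff`, the 1000 ms base delay, and the 30000 ms cap are all assumptions made for illustration; the reference does not state the actual delay schedule the library uses.

```typescript
// Hypothetical helpers for illustration only; not the library's implementation.
type FailedAttemptHandler = (error: unknown) => void;

// Delay before retry number `attempt` (1-based): baseMs * 2^(attempt - 1), capped at capMs.
// The base and cap values are assumed, not taken from the reference.
function backoffDelayMs(attempt: number, baseMs = 1000, capMs = 30000): number {
  return Math.min(capMs, baseMs * Math.pow(2, attempt - 1));
}

// Calls `fn`, retrying up to `maxRetries` times with exponential backoff.
async function retryWithBackoff<T>(
  fn: () => Promise<T>,
  maxRetries = 6,
  onFailedAttempt: FailedAttemptHandler = () => {},
  baseMs = 1000,
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (error) {
      // The handler may rethrow to signal that the error is not retryable.
      onFailedAttempt(error);
      // Give up once the retry budget is exhausted.
      if (attempt >= maxRetries) throw error;
      const delay = backoffDelayMs(attempt + 1, baseMs);
      await new Promise<void>((resolve) => setTimeout(resolve, delay));
    }
  }
}
```

A handler that rethrows on, say, an authentication error stops the loop immediately, while any other error is retried until the budget runs out.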
Optional
random
The seed to use for random sampling. If set, repeated calls with the same inputs will generate deterministic results.
Optional
safe
Whether to inject a safety prompt before all conversations.
Default: false
Optional
streaming
Whether or not to stream the response.
Default: false
Optional
tags
Optional
temperature
What sampling temperature to use, between 0.0 and 2.0. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
Default: 0.7
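To make the effect of temperature concrete, here is a small sketch of temperature-scaled softmax over a list of logits. The function name `softmaxWithTemperature` and the example logits are hypothetical; this shows the general technique, not how the Mistral service implements sampling.

```typescript
// Illustrative only: converts logits to probabilities after dividing by the temperature.
function softmaxWithTemperature(logits: number[], temperature: number): number[] {
  if (temperature <= 0) {
    throw new Error("temperature must be greater than 0 for this sketch");
  }
  const scaled = logits.map((logit) => logit / temperature);
  // Subtract the maximum before exponentiating for numerical stability.
  const max = Math.max(...scaled);
  const exps = scaled.map((value) => Math.exp(value - max));
  const sum = exps.reduce((acc, value) => acc + value, 0);
  return exps.map((value) => value / sum);
}

// Lower temperature concentrates probability on the highest logit;
// higher temperature flattens the distribution.
const focused = softmaxWithTemperature([2, 1, 0], 0.2);
const diffuse = softmaxWithTemperature([2, 1, 0], 0.8);
```

With these inputs, `focused[0]` is larger than `diffuse[0]`, which matches the description of lower values being more deterministic.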
Optional
topP
Nucleus sampling, where the model considers only the tokens within the top_p probability mass. For example, 0.1 means only the tokens comprising the top 10% probability mass are considered. Should be between 0 and 1.
Default: 1
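The nucleus-sampling rule above can be sketched as a filter over a probability distribution. The function name `topPFilter` and the sample probabilities are hypothetical, and the sketch only shows which tokens survive the cutoff under a common formulation of the technique; renormalization and the actual sampling step are omitted.

```typescript
// Illustrative only: returns the indices of the tokens kept by a top-p cutoff.
function topPFilter(probs: number[], topP: number): number[] {
  // Token indices ordered from most to least probable.
  const order = probs.map((_, index) => index).sort((a, b) => probs[b] - probs[a]);
  const kept: number[] = [];
  let cumulative = 0;
  for (const index of order) {
    kept.push(index);
    cumulative += probs[index];
    // Stop once the kept tokens cover at least topP of the probability mass.
    if (cumulative >= topP) break;
  }
  return kept;
}

const probs = [0.05, 0.5, 0.15, 0.3];
const narrow = topPFilter(probs, 0.1); // only the single most likely token
const wide = topPFilter(probs, 1); // every token is kept
```

A value of 1 keeps every token, which is why it behaves as "no filtering".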
Optional
verbose
Input to chat model class.