Ctrl K

Qwen2 1.5B Instruct (Free)

Qwen/Qwen2-1.5B-Instruct

2024-12-19T18:37:26.051Z

Qwen2-1.5B-Instruct is an instruction-tuned large language model in the Qwen2 series, with a parameter size of 1.5B. This model is based on the Transformer architecture and employs techniques such as the SwiGLU activation function, attention QKV bias, and group query attention. It excels in language understanding, generation, multilingual capabilities, coding, mathematics, and reasoning across multiple benchmark tests, surpassing most open-source models. Compared to Qwen1.5-1.8B-Chat, Qwen2-1.5B-Instruct shows significant performance improvements in tests such as MMLU, HumanEval, GSM8K, C-Eval, and IFEval, despite having slightly fewer parameters.

32K

Providers Supporting This Model

Qwen/Qwen2-1.5B-Instruct

Maximum Context Length

32K

Maximum Output Length

Input Price

Output Price

Model Parameters

Randomness

temperature

This setting affects the diversity of the model's responses. Lower values lead to more predictable and typical responses, while higher values encourage more diverse and less common responses. When set to 0, the model always gives the same response to a given input. View Documentation

Type

FLOAT

Default Value

1.00

Range

0.00 ~ 2.00

Nucleus Sampling

top_p

This setting limits the model's selection to a certain proportion of the most likely vocabulary: only selecting those top words whose cumulative probability reaches P. Lower values make the model's responses more predictable, while the default setting allows the model to choose from the entire range of vocabulary. View Documentation

Type

FLOAT

Default Value

1.00

Range

0.00 ~ 1.00

Topic Freshness

presence_penalty

This setting aims to control the reuse of vocabulary based on its frequency in the input. It attempts to use less of those words that appear more frequently in the input, with usage frequency proportional to occurrence frequency. Vocabulary penalties increase with frequency of occurrence. Negative values encourage vocabulary reuse. View Documentation

Type

FLOAT

Default Value

0.00

Range

-2.00 ~ 2.00

Frequency Penalty

frequency_penalty

This setting adjusts the frequency at which the model reuses specific vocabulary that has already appeared in the input. Higher values reduce the likelihood of such repetition, while negative values have the opposite effect. Vocabulary penalties do not increase with frequency of occurrence. Negative values encourage vocabulary reuse. View Documentation

Type

FLOAT

Default Value

0.00

Range

-2.00 ~ 2.00

Single Response Limit

max_tokens

This setting defines the maximum length that the model can generate in a single response. Setting a higher value allows the model to produce longer replies, while a lower value restricts the length of the response, making it more concise. Adjusting this value appropriately based on different application scenarios can help achieve the desired response length and level of detail. View Documentation

Type

INT

Default Value

Related Models

Qwen2 1.5B Instruct (Free)

Providers Supporting This Model

Model Parameters

Related Models

Hunyuan A52B Instruct

DeepSeek V2.5

DeepSeek V2 Chat

QwQ 32B Preview

Qwen2.5 7B Instruct (Free)