SmallAi
Back to Discovery
OpenAI

GPT-4o mini

gpt-4o-mini
GPT-4o mini is the latest model released by OpenAI after GPT-4 Omni, supporting both image and text input and output. As their most advanced small model, it is significantly cheaper than other recent cutting-edge models, costing over 60% less than GPT-3.5 Turbo. It maintains state-of-the-art intelligence while offering remarkable cost-effectiveness. GPT-4o mini scored 82% on the MMLU test and currently ranks higher than GPT-4 in chat preferences.
128K

Providers Supporting This Model

OpenAI
OpenAIOpenAI
OpenAIgpt-4o-mini
Maximum Context Length
128K
Maximum Output Length
16K
Input Price
$0.15
Output Price
$0.60
MoonshotAIMoonshotAI
OpenAIgpt-4o-mini
Maximum Context Length
128K
Maximum Output Length
16K
Input Price
--
Output Price
--

Model Parameters

Randomnesstemperature

This setting affects the diversity of the model's responses. Lower values lead to more predictable and typical responses, while higher values encourage more diverse and uncommon responses. When set to 0, the model always gives the same response to a given input. View Documentation

Type
FLOAT
Default Value
1.00
Range
0.00 ~ 2.00
Nucleus Samplingtop_p

This setting limits the model's selection to a certain proportion of the most likely words: only selecting those top words whose cumulative probability reaches P. Lower values make the model's responses more predictable, while the default setting allows the model to choose from the entire range of vocabulary. View Documentation

Type
FLOAT
Default Value
1.00
Range
0.00 ~ 1.00
Topic Freshnesspresence_penalty

This setting aims to control the repetition of words based on their frequency in the input. It attempts to use less frequently used words that appear more in the input, with usage frequency proportional to appearance frequency. Word penalties increase with frequency. Negative values encourage word repetition. View Documentation

Type
FLOAT
Default Value
0.00
Range
-2.00 ~ 2.00
Frequency Penaltyfrequency_penalty

This setting adjusts the frequency of specific words that have already appeared in the input. Higher values reduce the likelihood of such repetitions, while negative values have the opposite effect. Word penalties do not increase with frequency. Negative values encourage word repetition. View Documentation

Type
FLOAT
Default Value
0.00
Range
-2.00 ~ 2.00
Single Response Limitmax_tokens

This setting defines the maximum length the model can generate in a single response. Higher values allow the model to generate longer responses, while lower values limit the length, making it more concise. Adjusting this value appropriately can help achieve the desired response length and detail based on different application scenarios. View Documentation

Type
INT
Default Value
--
Range
0 ~ 16K
Reasoning Intensityreasoning_effort

This setting controls the intensity of reasoning the model performs before generating an answer. Low intensity prioritizes response speed and saves tokens, while high intensity provides more complete reasoning but consumes more tokens and slows down response speed. The default value is medium, balancing reasoning accuracy and response speed. View Documentation

Type
STRING
Default Value
--
Range
low ~ high
Encountering issues during the process? Contact customer service via WeChat: SmallAi2024