What is GPT-5.4 Mini?

A smaller, faster, and more affordable variant of GPT-5.4. Retains the 1M context window and most capabilities at a fraction of the cost. Ideal for high-volume applications where speed and cost matter.

How much does GPT-5.4 Mini cost?

GPT-5.4 Mini costs $1/1M tokens for input and $6/1M tokens for output on CallMissed. 1 credit = ₹1 = $0.01 USD.

How do I use GPT-5.4 Mini via API?

Send a POST request to POST /v1/chat/completions with model "openai/gpt-5.4-mini" and your API key. CallMissed uses the OpenAI-compatible format — just change the base URL and model field.

What is the context window of GPT-5.4 Mini?

GPT-5.4 Mini supports a 1M token context window with up to 16K output tokens.

सभी मॉडल पर वापस जाएं

LLM चैटfastaffordable

GPT-5.4 Mini

द्वारा OpenAI · रिलीज़ March 2026

GPT-5.4 का छोटा, तेज़ और अधिक affordable variant। 1M context window और अधिकांश capabilities fraction of cost पर। speed और cost महत्वपूर्ण high-volume applications के लिए ideal।

LLM चैट

GPT-5.4 Mini

द्वारा संचालित OpenAI · Transformer (proprietary, distilled)

कॉन्टेक्स्ट विंडो

पैरामीटर

Undisclosed

अधिकतम आउटपुट

16K

श्रेणी

LLM चैट

अवलोकन

GPT-5.4 Mini GPT-5.4 का distilled variant है, high-volume production workloads के लिए जहाँ speed और cost critical हैं। काफ़ी छोटा होते हुए भी 1M token context window retain करता है — massive documents और codebases fraction of cost पर process करने देता है।

Fast inference के लिए optimized — real-time chat, content summarization, classification और low latency workflows के लिए suitable। $1.00/M input और $6.00/M output पर GPT-5.4 की तुलना में 6x सस्ते output tokens — strong general capabilities चाहने वाले cost-sensitive deployments के लिए go-to choice।

GPT-5.4 Mini standard benchmarks पर good performance maintain करता है, सबसे complex reasoning tasks पर कुछ capability trade करता है। Summarization, extraction, classification और conversational AI जैसे straightforward tasks में excel — जहाँ GPT-5.4 या Pro की full power unnecessary।

प्राइसिंग

मेट्रिक	कीमत
इनपुट /1M tokens	₹100.0000
आउटपुट /1M tokens	₹600.0000

1 क्रेडिट = ₹1 = $0.01 USD। कीमतें प्रोवाइडर से दिखाई गई हैं; CallMissed ~35% मार्कअप के साथ पास-थ्रू करता है।

मुख्य बातें

Output tokens पर GPT-5.4 से 6x सस्ता
1M token context window retained
Real-time applications के लिए fast inference
Standard benchmarks पर strong performance

बेंचमार्क

बेंचमार्क	स्कोर	नोट्स
MMLU-Pro	80.1%	व्यावसायिक ज्ञान
HumanEval	88.5%	कोड जनरेशन
MATH-500	88.7%	प्रतियोगिता गणित
GPQA Diamond	68.2%	स्नातकोत्तर-स्तर विज्ञान
SWE-bench Verified	58.3%	सॉफ़्टवेयर इंजीनियरिंग

तकनीकी विवरण

GPT-5.4 से distilled — छोटे size पर core capabilities retain
Context window: full GPT-5.4 से retained 1,000,000 tokens
Fast inference और low latency के लिए optimized
GPT-5.4 की तुलना में 6x सस्ते output tokens
Structured outputs, function calling और JSON mode सपोर्ट
Instruction following के लिए RLHF के साथ post-trained
OpenAI API और CallMissed unified gateway के ज़रिए उपलब्ध

ताकतें

1M context window retain करते हुए GPT-5.4 से 6x सस्ता
Real-time और high-volume workloads के लिए optimized fast inference
Straightforward tasks के लिए strong general-purpose performance
Production deployments के लिए cost, speed और capability का अच्छा balance

सीमाएं

GPT-5.4 और Pro की तुलना में complex reasoning पर reduced performance
Deep planning चाहने वाले multi-step agentic tasks में कम सक्षम
Proprietary — self-hosting या fine-tuning options नहीं

उपयोग के मामले

High-volume chatContent summarizationClassification tasksरियल-टाइम ऐप

API उदाहरण

curl https://api.callmissed.com/v1/chat/completions \
  -H "Authorization: Bearer cm_YOUR_KEY" \
  -d '{"model": "openai/gpt-5.4-mini", "messages": [{"role": "user", "content": "Summarize this article"}]}'

एंडपॉइंट: POST /v1/chat/completions · मॉडल ID: openai/gpt-5.4-mini

GPT-5.4 Mini अभी आज़माएं

साइनअप पर 1000 फ्री API क्रेडिट पाएं। कोई क्रेडिट कार्ड ज़रूरी नहीं।

फ्री शुरू करें डॉक्स पढ़ें