OpenAI releases 'GPT 4.1' API, next-generation AI model with 1 million token support and practical performance

OpenAI announced the APIs for ' GPT-4.1 ', ' GPT-4.1 mini ', and ' GPT-4.1 nano ' on April 14, 2025. The GPT-4.1 series supports input of up to 1 million tokens, and its coding and instruction tracking capabilities have been significantly improved compared to the GPT-4o series, making it a low-latency, high-cost-performance model.
Introducing GPT-4.1 in the API | OpenAI
GPT-4.1 (and -mini and -nano) are now available in the API!
— Sam Altman (@sama) April 14, 2025
These models are great at coding, instruction following, and long context (1 million tokens).
benchmarks are strong, but we focused on real-world utility, and developers seem very happy.
GPT-4.1 family is API-only.
OpenAI's announcement of GPT-4.1 can be seen in the following movie.
GPT 4.1 in the API - YouTube
A feature of GPT-4.1 is that it supports a very long context window of up to 1 million tokens. This makes it possible to process hundreds of pages of documents and large code bases at once and make appropriate inferences while retaining the content. OpenAI says that this ability to process long texts is particularly important in many practical fields, such as law, finance, and customer support.
In terms of coding, GPT-4.1 achieved a score of 54.6% in

In

Evolution has also been seen in the understanding of visual information, scoring highly in visual question-answering tasks that include diagrams and mathematical formulas. In particular, the GPT-4.1 mini model is lighter than GPT-4o, yet is increasingly outperforming it in image processing capabilities, maintaining high performance while reducing costs and latency.
In addition, response speed and cost performance have also been improved. GPT-4.1 is up to 26% cheaper than previous models, and the reduction rate due to prompt cache has been expanded to 75%. Lightweight 'mini' and 'nano' models are also available at the same time, and can be used for applications that require low latency and cost optimization.

These characteristics make GPT-4.1 a practical AI model that enables the automation of complex tasks and the construction of intelligent agents beyond previous limits, and is optimized for use via APIs. It is expected that future releases will continue in this direction and meet even more diverse development needs.
At the time of writing, the GPT-4.1 series is available on the OpenAI API platform and is not available for use with ChatGPT. However, ChatGPT's GPT-4o is gradually incorporating improvements introduced in GPT-4.1, such as instruction following and code generation, and further integration is planned for the future.
The API prices are as follows:
Model | input | Cached Input | output | Blended Price |
---|---|---|---|---|
GPT-4.1 | $2.00 (about 290 yen) | $0.50 (approx. 72 yen) | $8.00 (about 1,100 yen) | $1.84 (about 260 yen) |
GPT-4.1-mini | $0.40 (about 57 yen) | 0.10 dollars (about 14 yen) | $1.60 (about 230 yen) | $0.42 (about 60 yen) |
GPT-4.1-nano | 0.10 dollars (about 14 yen) | 0.025 dollars (approximately 3.6 yen) | $0.40 (about 53 yen) | 0.12 dollars (about 17 yen) |
In addition, with the advent of GPT-4.1, the previous 'GPT-4.5 Preview' model has been decided to be discontinued on the API on July 14, 2025, and developers are being given a transition period. GPT-4.5 was initially provided as a research preview that used a lot of computational resources, and since it has been confirmed that GPT-4.1 is superior in many aspects, it will be gradually replaced.
Related Posts:
in Software, Web Service, Posted by log1i_yk