Changelog
- Released gpt-4o-2024-11-20, our newest model in the gpt-4o series.
- Released Predicted Outputs, which greatly reduces latency for model responses where much of the response is known ahead of time. This is most common when regenerating the content of documents and code files with only minor changes.
- Added five new voice types in the Realtime API and Chat Completions API.
-
Released new
gpt-4o-audio-preview
model for chat completions, which supports both audio inputs and outputs. Uses the same underlying model as the Realtime API.
Released several new features at OpenAI DevDay in San Francisco:
Realtime API: Build fast speech-to-speech experiences into your applications using a WebSockets interface.
Model distillation: Platform for fine-tuning cost-efficient models with your outputs from a large frontier model.
Image fine-tuning: Fine-tune GPT-4o with images and text to improve vision capabilities.
Evals: Create and run custom evaluations to measure model performance on specific tasks.
Prompt caching: Discounts and faster processing times on recently seen input tokens.
Generate in playground: Easily generate prompts, function definitions, and structured output schemas in the playground using the Generate button.
-
Released
new
omni-moderation-latest
moderation model, which supports both images and text (for some categories), supports two new text-only harm categories, and has more accurate scores.
- Released o1-preview and o1-mini, new large language models trained with reinforcement learning to perform complex reasoning tasks.
- Assistants API now supports including file search results used by the file search tool, and customizing ranking behavior.
-
GA release for
gpt-4o-2024-08-06
fine-tuning—all API users can now fine-tune the latest GPT-4o model.
-
Released dynamic model for
chatgpt-4o-latest
—this model will point to the latest GPT-4o model used by ChatGPT.
- Launched Structured Outputs—model outputs now reliabilty adhere to developer supplied JSON Schemas.
- Released gpt-4o-2024-08-06, our newest model in the gpt-4o series.
- Launched Admin and Audit Log APIs, allowing customers to programmatically administer their organization and monitor changes using the audit logs. Audit logging must be enabled within settings.
- Launched self-serve SSO configuration, allowing Enterprise customers on custom and unlimited billing to set up authentication against their desired IDP.
- Launched fine-tuning for GPT-4o mini, enabling even higher performance for specific use cases.
- Released GPT-4o mini, our affordable an intelligent small model for fast, lightweight tasks.
- Released Uploads to upload large files in multiple parts.
-
Parallel function calling
can be disabled in Chat Completions and the Assistants API by passing
parallel_tool_calls=false
. - .NET SDK launched in Beta.
- Added support for file search customizations .
- Added support for archiving projects . Only organization owners can access this functionality.
- Added support for setting cost limits on a per-project basis for pay as you go customers.
- Released GPT-4o in the API. GPT-4o is our fastest and most affordable flagship model.
- Added support for image inputs to the Assistants API.
- Added support for fine-tuned models to the Batch API .
-
Added
stream_options: {"include_usage": true}
parameter to the Chat Completions and Completions APIs. Setting this gives developers access to usage stats when using streaming.
- Added a new endpoint to delete a message from a thread in the Assistants API.
-
Added a new
function calling option
tool_choice: "required"
to the Chat Completions and Assistants APIs. - Added a guide for the Batch API and Batch API support for embeddings models
- Introduced a series of updates to the Assistants API , including a new file search tool allowing up to 10,000 files per assistant, new token controls, and support for tool choice.
- Introduced project based hierarchy for organizing work by projects, including the ability to create API keys and manage rate and cost limits on a per-project basis (cost limits available only for Enterprise customers).
- Released Batch API
- Released GPT-4 Turbo with Vision in general availability in the API
- Added support for seed in the fine-tuning API
- Added support for checkpoints in the fine-tuning API
- Added support for adding Messages when creating a Run in the Assistants API
- Added support for filtering Messages by run_id in the Assistants API
- Added support for temperature and assistant message creation in the Assistants API
- Added support for streaming in the Assistants API
-
Added
timestamp_granularities
parameter to the Audio API
- Released embedding V3 models and an updated GPT-4 Turbo preview
-
Added
dimensions
parameter to the Embeddings API
-
Added
additional_instructions
parameter to run creation in the Assistants API
-
Added
logprobs
andtop_logprobs
parameters to the Chat Completions API
- Changed function parameters argument on a tool call to be optional
- Released OpenAI Deno SDK
- Released GPT-4 Turbo Preview , updated GPT-3.5 Turbo, GPT-4 Turbo with Vision, Assistants API, DALL·E 3 in the API, and text-to-speech API
-
Deprecated the Chat Completions
functions
parameter in favor oftools
- Released OpenAI Python SDK V1.0
-
Added
encoding_format
parameter to the Embeddings API - Added
max_tokens
to the Moderation models
- Added function calling support to the Fine-tuning API