All Topics

Technology

Art

OpenAI's Harmony Response Format for gpt-oss Models

meetpateltech

9mo ago· 3 min readenCode

95/100

Golden Brown

Bagelometer↗

Master baker tier. Every paragraph earns its place on the tray.

Score95Typehow-toSentimentneutral

Summary

The article introduces OpenAI's Harmony response format, designed for use with the gpt-oss open-weight model series. It explains the format's purpose in structuring conversations, generating reasoning output, and handling function calls. The guide is aimed at developers building their own inference solutions, while noting that API users or providers like Ollama need not worry about the format.

Key quotes

· 3 pulled

The gpt-oss models were trained on the harmony response format for defining conversation structures, generating reasoning output and structuring function calls.

If you are not using gpt-oss directly but through an API or a provider like Ollama, you will not have to be concerned about this as your inference solution will handle the formatting.

The format is designed to mimic the harmony response format for defining conversation structures.

Snippet from the RSS feed

Renderer for the harmony response format to be used with gpt-oss - openai/harmony

You might also wanna read

Running Gemma 4 on a 2016 Xeon Server with No GPU: A Technical Walkthrough

The article describes running Gemma 4 (a 25B-parameter Mixture-of-Experts model) on a severely outdated server with a 2016 Intel Xeon E5-262

point.free·1h ago

NVIDIA Announces "Hack for Impact" London Event for Autonomous AI Agent Development

NVIDIA is hosting a "Hack for Impact" event in London, challenging participants to build autonomous agentic applications using open-source m

luma.com·3h ago

Four practical steps to control Azure Foundry token costs for agentic AI workloads

This article provides practical guidance on controlling token costs in Microsoft Azure Foundry, particularly for agentic AI workloads where

purplefrogsystems.com·4h ago

MerLean-Prover: A Recursive Agent Harness for Lean 4 Theorem Proving Outperforms Baselines

MerLean-Prover is an end-to-end Lean4 theorem prover that replaces 'sorry' declarations with kernel-checkable proofs using three agent types

arxiv.org·5h ago

Why small pull request policies can backfire on software quality

The article critiques a common software engineering policy that limits pull requests (PRs) to small sizes (e.g., 500 lines, few files). Whil

apenwarr.ca·7h ago

How Anthropic contains Claude's expanding access across its products

Anthropic describes how it has evolved its approach to granting Claude, its AI assistant, increasingly broad access to internal systems over

anthropic.com·8h ago