All Categories

Large Language Models

50 articles

How we monitor internal coding agents for misalignment
Large Language Models1d ago

How we monitor internal coding agents for misalignment

How OpenAI uses chain-of-thought monitoring to study misalignment in internal coding agents—analyzing real-world deployments to detect risks and strengthen AI s...

OpenAI Blog
OpenAI to acquire Astral
Large Language Models1d ago

OpenAI to acquire Astral

Accelerates Codex growth to power the next generation of Python developer tools

OpenAI Blog
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
Large Language Models1d ago

Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training

A developer successfully replicated David Ng's RYS method on consumer AMD GPUs, discovering that transformers possess discrete "reasoning circuits" consisting o...

transformersreasoningamd gpus
Hacker News
OpenAI Japan announces Japan Teen Safety Blueprint to put teen safety first
Large Language Models3d ago

OpenAI Japan announces Japan Teen Safety Blueprint to put teen safety first

OpenAI Japan announces the Japan Teen Safety Blueprint, introducing stronger age protections, parental controls, and well-being safeguards for teens using gener...

OpenAI Blog
Introducing GPT-5.4 mini and nano
Large Language Models3d ago

Introducing GPT-5.4 mini and nano

GPT-5.4 mini and nano are smaller, faster versions of GPT-5.4 optimized for coding, tool use, multimodal reasoning, and high-volume API and sub-agent workloads.

OpenAI Blog
Equipping workers with insights about compensation
Large Language Models3d ago

Equipping workers with insights about compensation

New research shows Americans send nearly 3 million daily messages to ChatGPT asking about compensation and earnings, helping close the wage information gap.

OpenAI Blog
Why Codex Security Doesn’t Include a SAST Report
Large Language Models4d ago

Why Codex Security Doesn’t Include a SAST Report

A deep dive into why Codex Security doesn’t rely on traditional SAST, instead using AI-driven constraint reasoning and validation to find real vulnerabilities w...

OpenAI Blog
Rakuten fixes issues twice as fast with Codex
Large Language ModelsMar 11, 2026

Rakuten fixes issues twice as fast with Codex

Rakuten uses Codex, the coding agent from OpenAI, to ship software faster and safer, reducing MTTR 50%, automating CI/CD reviews, and delivering full-stack buil...

OpenAI Blog
Designing AI agents to resist prompt injection
Large Language ModelsMar 11, 2026

Designing AI agents to resist prompt injection

How ChatGPT defends against prompt injection and social engineering by constraining risky actions and protecting sensitive data in agent workflows.

OpenAI Blog
Wayfair boosts catalog accuracy and support speed with OpenAI
Large Language ModelsMar 11, 2026

Wayfair boosts catalog accuracy and support speed with OpenAI

Wayfair uses OpenAI models to improve ecommerce support and product catalog accuracy, automating ticket triage and enhancing millions of product attributes at s...

OpenAI Blog
From model to agent: Equipping the Responses API with a computer environment
Large Language ModelsMar 11, 2026

From model to agent: Equipping the Responses API with a computer environment

How OpenAI built an agent runtime using the Responses API, shell tool, and hosted containers to run secure, scalable agents with files, tools, and state.

OpenAI Blog
Improving instruction hierarchy in frontier LLMs
Large Language ModelsMar 10, 2026

Improving instruction hierarchy in frontier LLMs

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.

OpenAI Blog
New ways to learn math and science in ChatGPT
Large Language ModelsMar 10, 2026

New ways to learn math and science in ChatGPT

ChatGPT introduces interactive visual explanations for math and science, helping students explore formulas, variables, and concepts in real time.

OpenAI Blog
OpenAI to acquire Promptfoo
Large Language ModelsMar 9, 2026

OpenAI to acquire Promptfoo

OpenAI is acquiring Promptfoo, an AI security platform that helps enterprises identify and remediate vulnerabilities in AI systems during development.

OpenAI Blog
How Descript enables multilingual video dubbing at scale
Large Language ModelsMar 6, 2026

How Descript enables multilingual video dubbing at scale

Descript uses OpenAI models to scale multilingual video dubbing, optimizing translations for both meaning and timing so dubbed speech sounds natural across lang...

OpenAI Blog
Codex Security: now in research preview
Large Language ModelsMar 6, 2026

Codex Security: now in research preview

Codex Security is an AI application security agent that analyzes project context to detect, validate, and patch complex vulnerabilities with higher confidence a...

OpenAI Blog
How Balyasny Asset Management built an AI research engine for investing
Large Language ModelsMar 6, 2026

How Balyasny Asset Management built an AI research engine for investing

See how Balyasny built an AI research system with GPT-5.4, rigorous model evaluation, and agent workflows to transform investment analysis at scale.

OpenAI Blog
Introducing GPT-5.4
Large Language ModelsMar 5, 2026

Introducing GPT-5.4

Introducing GPT-5.4, OpenAI’s most most capable and efficient frontier model for professional work, with state-of-the-art coding, computer use, tool search, and...

OpenAI Blog
GPT-5.4 Thinking System Card
Large Language ModelsMar 5, 2026

GPT-5.4 Thinking System Card

OpenAI Blog
Reasoning models struggle to control their chains of thought, and that’s good
Large Language ModelsMar 5, 2026

Reasoning models struggle to control their chains of thought, and that’s good

OpenAI introduces CoT-Control and finds reasoning models struggle to control their chains of thought, reinforcing monitorability as an AI safety safeguard.

OpenAI Blog
Ensuring AI use in education leads to opportunity
Large Language ModelsMar 5, 2026

Ensuring AI use in education leads to opportunity

OpenAI shares new tools, certifications, and measurement resources to help schools and universities close AI capability gaps and expand opportunity.

OpenAI Blog
Introducing ChatGPT for Excel and new financial data integrations
Large Language ModelsMar 5, 2026

Introducing ChatGPT for Excel and new financial data integrations

OpenAI introduces ChatGPT for Excel and new financial app integrations, powered by GPT-5.4 to accelerate modeling, research, and analysis in regulated environme...

OpenAI Blog
Introducing the Adoption news channel
Large Language ModelsMar 5, 2026

Introducing the Adoption news channel

Practical insights and frameworks to turn AI progress into business advantage

OpenAI Blog
The five AI value models driving business reinvention
Large Language ModelsMar 5, 2026

The five AI value models driving business reinvention

Five AI value models show how leaders can sequence AI from workforce fluency to process reinvention and build durable business advantage.

OpenAI Blog
Extending single-minus amplitudes to gravitons
Large Language ModelsMar 4, 2026

Extending single-minus amplitudes to gravitons

A new preprint extends single-minus amplitudes to gravitons, with GPT-5.2 Pro helping derive and verify nonzero graviton tree amplitudes in quantum gravity.

OpenAI Blog
How Axios uses AI to help deliver high-impact local journalism
Large Language ModelsMar 4, 2026

How Axios uses AI to help deliver high-impact local journalism

Axios COO Allison Murphy explains how the company uses AI to support local reporters, streamline newsroom workflows, and deliver high-impact local journalism at...

OpenAI Blog
Understanding AI and learning outcomes
Large Language ModelsMar 4, 2026

Understanding AI and learning outcomes

OpenAI introduces the Learning Outcomes Measurement Suite to assess AI’s impact on student learning across diverse educational environments over time.

OpenAI Blog
GPT-5.3 Instant: Smoother, more useful everyday conversations
Large Language ModelsMar 3, 2026

GPT-5.3 Instant: Smoother, more useful everyday conversations

OpenAI Blog
GPT-5.3 Instant System Card
Large Language ModelsMar 3, 2026

GPT-5.3 Instant System Card

OpenAI Blog
Our agreement with the Department of War
Large Language ModelsFeb 28, 2026

Our agreement with the Department of War

Details on OpenAI’s contract with the Department of War, outlining safety red lines, legal protections, and how AI systems will be deployed in classified enviro...

OpenAI Blog
Joint Statement from OpenAI and Microsoft
Large Language ModelsFeb 27, 2026

Joint Statement from OpenAI and Microsoft

Microsoft and OpenAI continue to work closely across research, engineering, and product development, building on years of deep collaboration and shared success.

OpenAI Blog
Introducing the Stateful Runtime Environment for Agents in Amazon Bedrock
Large Language ModelsFeb 27, 2026

Introducing the Stateful Runtime Environment for Agents in Amazon Bedrock

Stateful Runtime for Agents in Amazon Bedrock brings persistent orchestration, memory, and secure execution to multi-step AI workflows powered by OpenAI.

OpenAI Blog
OpenAI and Amazon announce strategic partnership
Large Language ModelsFeb 27, 2026

OpenAI and Amazon announce strategic partnership

OpenAI and Amazon announce a strategic partnership bringing OpenAI’s Frontier platform to AWS, expanding AI infrastructure, custom models, and enterprise AI age...

OpenAI Blog
Scaling AI for everyone
Large Language ModelsFeb 27, 2026

Scaling AI for everyone

Today we’re announcing $110B in new investment at a $730B pre money valuation. This includes $30B from SoftBank, $30B from NVIDIA, and $50B from Amazon.

OpenAI Blog
An update on our mental health-related work
Large Language ModelsFeb 27, 2026

An update on our mental health-related work

OpenAI shares updates on its mental health safety work, including parental controls, trusted contacts, improved distress detection, and recent litigation develo...

OpenAI Blog
Pacific Northwest National Laboratory and OpenAI partner to accelerate federal permitting
Large Language ModelsFeb 26, 2026

Pacific Northwest National Laboratory and OpenAI partner to accelerate federal permitting

OpenAI and Pacific Northwest National Laboratory introduce DraftNEPABench, a new benchmark evaluating how AI coding agents can accelerate federal permitting—sho...

OpenAI Blog
OpenAI Codex and Figma launch seamless code-to-design experience
Large Language ModelsFeb 26, 2026

OpenAI Codex and Figma launch seamless code-to-design experience

OpenAI and Figma launch a new Codex integration that connects code and design, enabling teams to move between implementation and the Figma canvas to iterate and...

OpenAI Blog
Disrupting malicious uses of AI | February 2026
Large Language ModelsFeb 25, 2026

Disrupting malicious uses of AI | February 2026

Our latest threat report examines how malicious actors combine AI models with websites and social platforms—and what it means for detection and defense.

OpenAI Blog
Arvind KC appointed Chief People Officer
Large Language ModelsFeb 24, 2026

Arvind KC appointed Chief People Officer

OpenAI appoints Arvind KC as Chief People Officer to help scale the company, strengthen its culture, and lead how work evolves in the age of AI.

OpenAI Blog
Why we no longer evaluate SWE-bench Verified
Large Language ModelsFeb 23, 2026

Why we no longer evaluate SWE-bench Verified

SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWE...

OpenAI Blog
OpenAI announces Frontier Alliance Partners
Large Language ModelsFeb 23, 2026

OpenAI announces Frontier Alliance Partners

OpenAI announces Frontier Alliance Partners to help enterprises move from AI pilots to production with secure, scalable agent deployments.

OpenAI Blog
Our First Proof submissions
Large Language ModelsFeb 20, 2026

Our First Proof submissions

We share our AI model’s proof attempts for the First Proof math challenge, testing research-grade reasoning on expert-level problems.

OpenAI Blog
Advancing independent research on AI alignment
Large Language ModelsFeb 19, 2026

Advancing independent research on AI alignment

OpenAI commits $7.5M to The Alignment Project to fund independent AI alignment research, strengthening global efforts to address AGI safety and security risks.

OpenAI Blog
Introducing OpenAI for India
Large Language ModelsFeb 18, 2026

Introducing OpenAI for India

OpenAI for India expands AI access across the country—building local infrastructure, powering enterprises, and advancing workforce skills.

OpenAI Blog
Introducing EVMbench
Large Language ModelsFeb 18, 2026

Introducing EVMbench

OpenAI and Paradigm introduce EVMbench, a benchmark evaluating AI agents’ ability to detect, patch, and exploit high-severity smart contract vulnerabilities.

OpenAI Blog
GPT-5.2 derives a new result in theoretical physics
Large Language ModelsFeb 13, 2026

GPT-5.2 derives a new result in theoretical physics

A new preprint shows GPT-5.2 proposing a new formula for a gluon amplitude, later formally proved and verified by OpenAI and academic collaborators.

OpenAI Blog
Introducing Lockdown Mode and Elevated Risk labels in ChatGPT
Large Language ModelsFeb 13, 2026

Introducing Lockdown Mode and Elevated Risk labels in ChatGPT

Introducing Lockdown Mode and Elevated Risk labels in ChatGPT to help organizations defend against prompt injection and AI-driven data exfiltration.

OpenAI Blog
Scaling social science research
Large Language ModelsFeb 13, 2026

Scaling social science research

GABRIEL is a new open-source toolkit from OpenAI that uses GPT to turn qualitative text and images into quantitative data, helping social scientists analyze res...

OpenAI Blog
Beyond rate limits: scaling access to Codex and Sora
Large Language ModelsFeb 13, 2026

Beyond rate limits: scaling access to Codex and Sora

How OpenAI built a real-time access system combining rate limits, usage tracking, and credits to power continuous access to Sora and Codex.

OpenAI Blog
Introducing GPT-5.3-Codex-Spark
Large Language ModelsFeb 12, 2026

Introducing GPT-5.3-Codex-Spark

Introducing GPT-5.3-Codex-Spark—our first real-time coding model. 15x faster generation, 128k context, now in research preview for ChatGPT Pro users.

OpenAI Blog