Community

Community, Language, Reasoning, Research, Responsible AI, Safety & Alignment, Video generation

Language models can explain neurons in language models

We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a dataset

April 19, 2024

Community, Responsible AI, Safety & Alignment, Video generation

Frontier AI regulation: Managing emerging risks to public safety

April 19, 2024

Community, Reasoning, Research, Responsible AI, Safety & Alignment, Video generation

Improving mathematical reasoning with process supervision

We've trained a model to achieve a new state-of-the-art in mathematical problem solving by rewarding each correct step of reasoning (“process supervision”) instead of simply

April 19, 2024

Community, Responsible AI, Safety & Alignment, Video generation

Confidence-Building Measures for Artificial Intelligence: Workshop proceedings

April 19, 2024

