Bruno Pedro

Found at “Learning to Reason with LLMs | OpenAI” on 2024-09-12 19:28:17 +02:00.

We are introducing OpenAI o1, a new large language model trained with reinforcement learning to perform complex reasoning.