AI Gets Agents: ChatGPT Now Has Deep Research with Agentic Capabilities
Today, OpenAI has announced a new ChatGPT feature called "Deep Research," which is capable of performing complex, multi-step research processes entirely on its own. Using so-called agents, which are autonomous bots working on top of the AI model, this feature searches the web and curates all needed information. This agentic behavior was trained on real-world browser usage, accompanied by Python code execution. Deep search, like OpenAI's o1 and o3 models, uses reinforcement learning, which steps back to "think" and creates a chain of thought before delivering users an answer to their question. Depending on the topic, deep research can take 5 to 30 minutes to search the web, crawl through data, and compile it in a reader-friendly manner.
Regarding benchmarks of its performance, OpenAI put out a lot of interesting comparisons and evaluations. Compared to all previous models, deep research gives these models additional context to help AI with more information. Thus, in evaluation benchmarks like Humanity's Last exam, deep research scored 26.6%, whereas o1 and o3-mini scored 9.1 and 13%, respectively. Other evaluations showed a modest improvement, while concrete comparisons were made in UX, business, and medical research. Turning the deep research feature on delivered more information every time, and you can see it for yourself here.
Regarding benchmarks of its performance, OpenAI put out a lot of interesting comparisons and evaluations. Compared to all previous models, deep research gives these models additional context to help AI with more information. Thus, in evaluation benchmarks like Humanity's Last exam, deep research scored 26.6%, whereas o1 and o3-mini scored 9.1 and 13%, respectively. Other evaluations showed a modest improvement, while concrete comparisons were made in UX, business, and medical research. Turning the deep research feature on delivered more information every time, and you can see it for yourself here.