What Is a Bar Diagram for Solving Model Math Problems

Nous Research just released Nomos 1, an open-source AI that ranks second on the notoriously brutal Putnam math exam

Nous Research's open-source Nomos 1 AI model scored 87/120 on the notoriously difficult Putnam math competition, ranking ...

IEEE

Self-Reflection in Large Language Model Agents: Effects on Problem-Solving Performance

Abstract: In this study, we investigated the effects of self-reflection in large language models (LLMs) on problem-solving performance. We instructed nine popular LLMs to answer a series of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Nous Research just released Nomos 1, an open-source AI that ranks second on the notoriously brutal Putnam math exam

Self-Reflection in Large Language Model Agents: Effects on Problem-Solving Performance

Trending now