H
u
m
a
n
-
l
e
v
e
l
r
e
a
s
o
n
i
n
g
i
s
t
h
e
h
o
l
y
g
r
a
i
l
o
f
A
I
.

O
u
r
m
i
s
s
i
o
n
i
s
t
o
b
r
i
n
g
h
u
m
a
n
-
l
e
v
e
l
r
e
a
s
o
n
i
n
g
t
o
c
o
m
p
l
e
x
e
n
t
e
r
p
r
i
s
e
a
p
p
l
i
c
a
t
i
o
n
s
a
n
d
w
o
r
k
f
l
o
w
s
t
h
a
t
d
e
m
a
n
d
t
h
e
h
i
g
h
e
s
t
p
r
e
c
i
s
i
o
n
a
n
d
a
c
c
u
r
a
c
y
.

The first accurate neurosymbolic engine

Actual reasoning,
clear results.

When it comes to reasoning, Reasoner delivers clarity where others fall short.

Here’s an example: The surgeon riddle

T
h
e
s
u
r
g
e
o
n
,
w
h
o
i
s
t
h
e
b
o
y
'
s
f
a
t
h
e
r
,
s
a
y
s
,
"
I
c
a
n
'
t
o
p
e
r
a
t
e
o
n
t
h
i
s
b
o
y
,
h
e
'
s
m
y
s
o
n
!
"
W
h
o
i
s
t
h
e
s
u
r
g
e
o
n
t
o
t
h
e
b
o
y
?

Thinking
Thought for 23 seconds

T
h
e
s
u
r
g
e
o
n
i
s
t
h
e
b
o
y
'
s
g
r
a
n
d
f
a
t
h
e
r
.

H
e
r
e
'
s
t
h
e
r
e
a
s
o
n
i
n
g
:

T
h
e
s
u
r
g
e
o
n
i
s
d
e
s
c
r
i
b
e
d
a
s
t
h
e
b
o
y
'
s
f
a
t
h
e
r
.

W
h
e
n
t
h
e
s
u
r
g
e
o
n
s
a
y
s
,
"
I
c
a
n
'
t
o
p
e
r
a
t
e
o
n
t
h
i
s
b
o
y
,
h
e
'
s
m
y
s
o
n
!
"
,
i
t
i
m
p
l
i
e
s
t
h
a
t
t
h
e
b
o
y
i
s
h
i
s
s
o
n
.
T
h
e
r
e
f
o
r
e
,
t
h
e
b
o
y
'
s
f
a
t
h
e
r
(
t
h
e
s
u
r
g
e
o
n
)
r
e
f
e
r
s
t
o
t
h
e
b
o
y
a
s
"
m
y
s
o
n
.
"

T
h
i
s
m
e
a
n
s
t
h
a
t
t
h
e
s
u
r
g
e
o
n
i
s
b
o
t
h
t
h
e
b
o
y
'
s
f
a
t
h
e
r
a
n
d
r
e
f
e
r
s
t
o
t
h
e
b
o
y
'
s
f
a
t
h
e
r
a
s
"
m
y
s
o
n
,
"
i
n
d
i
c
a
t
i
n
g
t
h
a
t
t
h
e
s
u
r
g
e
o
n
i
s
a
c
t
u
a
l
l
y
t
h
e
b
o
y
'
s
g
r
a
n
d
f
a
t
h
e
r
.

Answer: His grandfather—the surgeon is the boy’s grandfather.
Reasoning
Reasoned
Data Analysis
Reasoning
Reasoned
Data Analysis

Benchmark-beating reasoning

A data-driven analysis of Reasoner’s engine’s performance using Google’s FRAMES benchmark shows Reasoner excels at factuality, retrieval accuracy, and complex reasoning.

77.3%


ACCURACY

92.8%


accuracy achieved

847


tasks correct

913


Total Reasoning Tasks

1,647


Total Sources Used

Outperforming in every category

Reasoner's engine delivers unmatched results across all types of reasoning.
Assessment
Type
TOTAL
ASSESSMENTS
REASONER %
Raw Score
OPENAI o1 %
Raw Score
Numerical Reasoning 121 93.5 71.1
Tabular Reasoning 151 87.5 78.1
Multiple Constraints 400 93.5 79.3
POST PROCESSING 71 94.4 73.2
TEMPORAL Reasoning 170 94.7 78.2
Total 913 92.8 77.3
Reasoner
For applications that demand the highest precision and accuracy.

Knowledge
Consistency

Reasoner’s surpasses o1 on all reasoning assessments

End-to-End
Traceability

Every decision explained, every action auditable

Low Cost to
Operate

More value, less complexity

supported by

Get access to the private beta today.