Evaluating Reasoning Systems

Conrad Bock; Michael Gruninger; Donald E. Libes; Joshua Lubell; Eswaran Subrahmanian

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

Evaluating Reasoning Systems

Published

May 1, 2006

Author(s)

Conrad Bock, Michael Gruninger, Donald E. Libes, Joshua Lubell, Eswaran Subrahmanian

Abstract

A review of the literature on evaluating reasoning systems reveals that it is a very broad area with wide variation in depth and breadth of research on metrics and tests. Consolidation is hampered by nonstandard terminology, differing methodologies, scattered application domains, unpublished algorithmic details, and the effects of domain content and context on the choice of metric and tests. The field of information metrology, which applies to reasoning as a kind of information processing, is still emerging from ad hoc experience in evaluating narrow kinds of information systems. This report begins to bring order to the area by categorizing reasoning systems according to their capabilities. The characteristics of each category can be used as a basis for evaluating and testing reasoning systems claiming to be in that category. Capabilities are analyzed along several dimensions, including representation languages, inference, and user and software interfaces. The report groups representation languages by their relation to first-order logic, and model-theoretic properties, such as soundness and completeness. Inference procedures are divided into deduction, induction, abduction, and analogical reasoning. Capabilities of user and software interfaces are described as they apply to reasoning systems. The report introduces information metrology, model theory, and inference to facilitate understanding of the reasoning categories presented. It concludes with recommendations for future work.

Citation

NIST Interagency/Internal Report (NISTIR) - 7310

Report Number

7310

NIST Pub Series

NIST Interagency/Internal Report (NISTIR)

Pub Type

NIST Pubs

Download Paper

https://doi.org/10.6028/NIST.IR.7310

Local Download

Keywords

reasoning categories, reasoning systems, software metrics

Citation

Bock, C. , Gruninger, M. , Libes, D. , Lubell, J. and Subrahmanian, E. (2006), Evaluating Reasoning Systems, NIST Interagency/Internal Report (NISTIR), National Institute of Standards and Technology, Gaithersburg, MD, [online], https://doi.org/10.6028/NIST.IR.7310, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=822613 (Accessed July 31, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created April 30, 2006, Updated October 12, 2021

Was this page helpful?

Evaluating Reasoning Systems

Author(s)

Abstract

Download Paper

Keywords

Citation

Additional citation formats

Issues