Reseña: Using automated theorem provers to teach knowledge representation in first-order logic

Se ha publicado un artículo de aplicación del razonamiento automático a la enseñanza, usando Prover9, titulado Using automated theorem provers to teach knowledge representation in first-order logic.

Sus autores son Angelo Kyrilov y David Noelle (de la University of California, Merced).

Su resumen es

Undergraduate students of artificial intelligence often struggle with representing knowledge as logical sentences. This is a skill that seems to require extensive practice to obtain, suggesting a teaching strategy that involves the assignment of numerous exercises involving the formulation of some bit of knowledge, communicated using a natural language such as English, as a sentence in some logic. The number of such exercises needed to master this skill is far too large to allow typical artificial intelligence course teaching teams to provide prompt feedback on student efforts. Thus, an automated assessment system for such exercises is needed to ensure that students receive an adequate amount of practice, with the rapid delivery of feedback allowing students to identify errors in their understanding and correct them. This paper describes an automated grading system for knowledge representation exercises using first-order logic. A resolution theorem prover, Prover9, is used to check if a student-submitted formula is logically equivalent to a solution provided by the instructor. This system has been used by students enrolled in undergraduate artificial intelligence classes for several years. Use of this teaching tool resulted in a statistically significant improvement on first-order logic knowledge representation questions appearing on the course final examination. This article explains how this system works, provides an analysis of changes in student learning outcomes, and explores potential enhancements of this system, including the possibility of providing rich formative feedback by replacing the resolution theorem prover with a tableaux-based method.

El trabajo se ha presentado en el TTL2015 (Fourth International Conference on Tools for Teaching Logic)

El sistema y la metodología presentada en el artículo es análoga a la expuesta en el trabajo KRRT: Knowledge Representation and Reasoning Tutor System.

Reseña: Formal verification of programs computing the floating-point average

Se ha publicado un artículo de razonamiento formalizado en Coq sobre la aritmética titulado Formal verification of programs computing the floating-point average.

Sus autora es Silvie Boldo (del grupo Toccata (Formally Verified Programs, Certified Tools and Numerical Computations) en el LRI (Laboratoire de Recherche en Informatique) de la Universidad Paris-Sur).

Su resumen es

The most well-known feature of floating-point arithmetic is the limited precision, which creates round-off errors and inaccuracies. Another important issue is the limited range, which creates underflow and overflow, even if this topic is dismissed most of the time. This article shows a very simple example: the average of two floating-point numbers. As we want to take exceptional behaviors into account, we cannot use the naive formula (x+y)/2. Based on hints given by Sterbenz, we first write an accurate program and formally prove its properties. An interesting fact is that Sterbenz did not give this program, but only specified it. We prove this specification and include a new property: a precise certified error bound. We also present and formally prove a new algorithm that computes the correct rounding of the average of two floating-point numbers. It is more accurate than the previous one and is correct whatever the inputs.

El trabajo se presentará en el ICFEM 2015 (The 17th International Conference on Formal Engineering Methods).

El código de las correspondientes teorías en Coq se encuentra aquí.

Reseña: A synthetic proof of Pappus’ theorem in Tarski’s geometry

Se ha publicado un artículo de razonamiento formalizado en Coq sobre geometría titulado A synthetic proof of Pappus’ theorem in Tarski’s geometry

Sus autores son Gabriel Braun y Julien Narboux (del Équipe Informatique Géométrique et Graphique en la Universidad de Estrasburgo, Francia).

Su resumen es

In this paper, we report on the formalization of a synthetic proof of Pappus’ theorem. We provide two versions of the theorem: the first one is proved in neutral geometry (without assuming the parallel postulate), the second (usual) version is proved in Euclidean geometry. The proof that we formalize is the one presented by Hilbert in The Foundations of Geometry which has been detailed by Schwabhäuser, Szmielew and Tarski in part I of Metamathematische Methoden in der Geometrie. We highlight the steps which are still missing in this later version. The proofs are checked formally using the Coq proof assistant. Our proofs are based on Tarski’s axiom system for geometry without any continuity axiom. This theorem is an important milestone toward obtaining the arithmetization of geometry which will allow us to provide a connection between analytic and synthetic geometry.

El código de las correspondientes teorías en Coq se encuentra aquí.

Reseña: Parallel postulates and decidability of intersection of lines: a mechanized study within Tarski’s system of geometry

Se ha publicado un artículo de razonamiento formalizado en Coq sobre geometría titulado Parallel postulates and decidability of intersection of lines: a mechanized study within Tarski’s system of geometry.

Sus autores son Pierre Boutry, Julien Narboux y Pascal Schreck (del Équipe Informatique Géométrique et Graphique en la Universidad de Estrasburgo, Francia)

Su resumen es

In this paper we focus on the formalization of the proof of equivalence between different versions of Euclid’s 5 th postulate. This postulate is of historical importance because for centuries many mathematicians believed that this statement was rather a theorem which could be derived from the first four of Euclid’s postulates and history is rich of incorrect proofs of Euclid’s 5 th postulate. These proofs are incorrect because they assume more or less implicitly a statement which is equivalent to Euclid’s 5 th postulate and whose validity is taken for granted. Even though these proofs are incorrect the attempt was not pointless because the flawed proof can be turned into a proof that the unjustified statement implies the parallel postulate. In this paper we provide formal proofs verified using the Coq proof assistant that 10 different statements are equivalent to Euclid’s 5 th postulate. We work in the context of Tarski’s neutral geometry without continuity nor Archimedes’ axiom. The formalization provide a clarification of the hypotheses used for the proofs. Following Beeson, we study the impact of the choice of a particular version of the parallel postulate on the decidability issues.

El código de las correspondientes teorías en Coq se encuentra aquí.

Reseña: A formalisation of metric spaces in HOL Light

Se ha publicado un artículo de razonamiento formalizado en HOL Light sobre topología titulado A formalisation of metric spaces in HOL Light.

Su autor es Marco Maggesi (de la Univ. de Florencia en Italia).

Su resumen es

We present a computer formalisation of metric spaces in the HOL Light theorem prover. Basic results of the theory of complete metric spaces are proved. A simple decision procedure for the theory of metric space is implemented.

El trabajo se ha presentado en CICM 2015 (the 8th Conference on Intelligent Computer Mathematics).