Comparing Coq (pCUIC) with HoTT and CubicalTT for common verification tasks

palmskog · February 3, 2020, 4:41pm

Homotopy type theory (HoTT) and cubical type theory (CubicalTT) have been used to formalize foundations of mathematics, and they have inspired extensions to Coq such as SProp.

However, to my knowledge there are few demonstrated applications of HoTT and CubicalTT to regular verification tasks such as program verification (examples include SQL rewriting, patch theory, encoding the pi-calculus, and probabilistic programming), and the general extent to which univalence increases the productivity of proof engineers is unknown.

Based on a discussion with @spitters on Coq’s Gitter, I would like to raise the issue of finding relevant benchmarks and case studies for comparing verification using vanilla Coq (pCUIC), HoTT for Coq, and Cubical Agda.

Interesting hard measures to compare include:

- lines of spec code
- lines of proof script code
- effort to complete, in person-hours

Softer measures such as reusability, comprehensibility, and extensibility could also be considered.

Here are some candidates for benchmarks:

- correctness of abstract data type implementations and algorithms from Okasaki’s book Purely Functional Data Structures (several of these have already been done for HOL4 in CakeML)
- definitions and (category) theory from the book Algebra of Programming by Bird and de Moor (some of this has already been done in plain Agda)
- metatheory and examples for some powerful modal logic, such as the modal μ-calculus, e.g., as presented by Bradfield and Stirling

The comparison approach could be similar to a code golfing competition, i.e., the goal is to get definitions and proofs that are as elegant and short as possible.

The literature contains many comparisons of different systems for specific verification tasks that can provide inspiration, e.g., correctness of a graph algorithm by Tarjan in Coq, Why3, and Isabelle/HOL.

spitters · February 3, 2020, 5:15pm

One motivation in our guarded HoTT project and the related project by Møgelberg is to find a type theory that is more suitable for modelling projects like Iris, and also for coinduction. We are not there yet, but type theories such as guarded cubical type theory are conjectured to have good computational properties, as opposed to the (extensional) guarded dependent type theory which was used before.

I recently had two students implement queues (as in Sec. 5.2 of Okasaki) in a cubical setting using higher inductive types (HITs). This looks quite natural. I’ll see whether it can be made available.
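
For readers unfamiliar with the example, here is a minimal sketch of a batched queue in the style of Okasaki's Sec. 5.2, written in Python purely for illustration. It is not the students' cubical development, and all names (`BatchedQueue`, `snoc`, `same_queue`, and so on) are hypothetical. The point it tries to show is that several distinct (front, back) pairs represent the same abstract queue, which is the kind of identification a HIT or quotient can build into the type itself.

```python
from dataclasses import dataclass
from typing import Generic, List, Tuple, TypeVar

T = TypeVar("T")


@dataclass(frozen=True)
class BatchedQueue(Generic[T]):
    """Queue as two sequences; `back` holds the newest element first."""
    front: Tuple[T, ...] = ()
    back: Tuple[T, ...] = ()


def _check(front: Tuple[T, ...], back: Tuple[T, ...]) -> BatchedQueue:
    # Invariant: front is empty only if the whole queue is empty.
    if not front:
        return BatchedQueue(tuple(reversed(back)), ())
    return BatchedQueue(front, back)


def snoc(q: BatchedQueue, x: T) -> BatchedQueue:
    """Add x at the rear of the queue."""
    return _check(q.front, (x,) + q.back)


def head(q: BatchedQueue) -> T:
    if not q.front:
        raise IndexError("head of empty queue")
    return q.front[0]


def tail(q: BatchedQueue) -> BatchedQueue:
    if not q.front:
        raise IndexError("tail of empty queue")
    return _check(q.front[1:], q.back)


def to_list(q: BatchedQueue) -> List[T]:
    """The abstract queue that a representation denotes."""
    return list(q.front) + list(reversed(q.back))


def same_queue(a: BatchedQueue, b: BatchedQueue) -> bool:
    """Equality up to representation (assumed here to be the intended quotient)."""
    return to_list(a) == to_list(b)
```

In this sketch, `BatchedQueue((1, 2, 3), ())` and `BatchedQueue((1,), (3, 2))` are structurally different values but satisfy `same_queue`. A plain-inductive formalization has to carry that relation around as a separate setoid-style equivalence, whereas a HIT can make the two representations equal outright.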

spitters · February 4, 2020, 9:36am

Queues

palmskog · February 4, 2020, 9:42am

Nice @spitters, I may be out of my depth here, but would it be “fair” for comparison purposes (and at all possible) to capture the higher inductive type approach for the batched queue in vanilla Coq using private inductives? Or is an alternate vanilla Coq version using only “plain inductive types” the most reasonable?

spitters · February 12, 2020, 3:37pm

Yes, I think a similar approach would work in Coq HoTT, encoding HITs as private inductives.
