1 Introduction

In this paper we develop a formal framework that unifies Darwiche and Pearl’s iterated belief revision (Darwiche and Pearl 1997) and refined verisimilitude as defined by Zwart (2001). Both belief revision and verisimilitude profit from the established unification. On the one hand, the unification provides for an answer to the epistemic problem of refined verisimilitude; on the other hand, it shows that belief revision behaves properly under the addition of truth, in the following sense. Revising false information of knowledge bases with true information leads to more verisimilar knowledge basis. It even turns out that revising with better information leads to more verisimilar theories.

To our knowledge until today no attempt to combine Belief Revision and Verisimilitude has been successful in the way demanded by Laudan. Laudan (1981, p. 32) rightly observes:

None of the proponents of realism has yet articulated a coherent account of approximate truth which entails that approximately true theories will, across the range where we can test them, be successful predictors.

In this paper we show that a formal framework in terms of preference and similarity relations on possible worlds helps to establish a coherent answer to Laudan’s challenge. According to our framework, more verisimilar theories provide for more successful, i.e. more verisimilar, predictions and explanations. This ‘downward path’ from more general theories towards concrete predictions had been established already by the definition of refined verisimilitude given in Zwart (2001). Moreover, in the present framework, revising with even possibly false, but more verisimilar observations ends up with more verisimilar theories, provided that the preference relation regarding other possible empirical evidence is similar to the real verisimilitude order of this evidence. This version of the ‘upward path’ going from the concrete towards the more general theory, does not provide a certain method to come closer to the truth, as we are not familiar with the verisimilitude order of possible empirical evidence. It does, however, formulate and satisfy a welcome condition for a fruitful combination of verisimilitude and belief revision.

The formal framework used to establish our results is built on finite Boolean algebras (i.e. Lindenbaum algebras of propositional logic), the atoms of which correspond to models (possible worlds). We consider linear preorders on the collection of these worlds. They play the role of preference relations in the definition of (iterated) belief revision in the AGM style (Sect. 5). In the definition of (refined) verisimilitude in Sect. 6, we shall use these linear preorders as similarity functions.

The rest of the paper is structured as follows. In the next section we will first introduce the main ideas behind research into belief revision and verisimilitude. Moreover, we describe an example of Sven Ove Hansson which nicely illustrates the intentions of our exercise. We will return to this example in Sect. 7 to illustrate the mechanisms of our framework. Section 3 is mainly dedicated to the introduction of propositional logic in the form of Boolean algebra, the basic framework of our work. In Sect. 4 we introduce the main formal apparatus of the paper: preferences and preference orders (a dual of epistemic entrenchment). In the subsequent Sects. 5 and 6 we show how (iterated) belief revision and (refined) verisimilitude are defined in our framework. Especially, the new definitions of refined verisimilitude as previously defined in Zwart (2001) turn out to be useful. Finally in the penultimate Sect. 7, we show how neatly existing belief revision and refined verisimilitude fit together and successfully fulfill Laudan’s challenge. We end in Sect. 8 with the conclusions and describe prospects of future research.

2 Informal Exposition of Verisimilitude and Belief Revision

Investigations into verisimilitude started within the school of scientific realism in philosophy of science. In 1963, Popper proposed a formal definition of the idea that a given (possibly false) theory can be more similar to “the true theory” than another, competing (possibly false) theory. The research project of verisimilitude really got off the ground only when it was discovered that Popper’s definition in fact failed to compare any two nonequivalent false theories—which had precisely been the main aim of the definition.

The main subject of the verisimilitude project is formulated in relation to a formal language that is assumed to include a (usually) complete empirical truth τ. If τ is complete, any synthetic sentence is either a consequence of τ, or implies \(\lnot \tau. \) Within the context of classical, non-modal logic, empirical incompleteness of τ implies that the underlying language comprises non-referring propositional variables. Given a language, verisimilitude investigations concern two questions. The first question reads: How are we to define the similarity between an arbitrary theory in the language and its true theory τ? Answering this question about the definition of verisimilitude, we may assume to be familiar with τ. The second question about verisimilitude reads: when confronted with two different (scientific) theories, how are we to find out which of the two theories is more verisimilar or closer to the truth τ? This epistemic question of verisimilitude is the more practical one. Obviously, when we formulate the answer to the epistemic question, we may not assume that we know the true theory τ. When we have to decide whether the mechanics of Descartes rather than that of Newton is more verisimilar we do not know the full truth, although, perhaps, we know some elements in it, e.g. some very reliable observations made by different researchers. Until today, no generally accepted solution to the epistemic problem of verisimilitude exists. In the present paper we will show that belief revision provides an appropriate answer to the epistemic question of the refined verisimilitude.

The study of belief revision is rooted in the holistic or coherentist tradition of Duhem (1954) and Quine and Ullian (1978). Its main objective has been to answer the question how to revise an existing body of knowledge K when confronted with new evidence \(\varphi\) that may contradict K. In Alchourrón et al. (1985), formulated postulates that turned out to be an important part of the answer to the revision question. The AGM-postulates put constraints on how to revise K into a new knowledge set \(K * \varphi,\) but do not uniquely determine one revision operator. Alchourrón et al. (1985, p. 88) claim that their postulates

… exhaust what can be said about revisions and contraction in logical and set-theoretical terms only. This means that we must seek further information about the epistemic status of the elements of a knowledge state to solve the uniqueness problem.

The problem with the AGM postulates is that they only provide one-step revisions. After a revision of K by \(\varphi\) has taken place, the AGM-postulates fail to give indications how one should come to a new entrenchment relation \(\leqslant_{K\ast\varphi}.\) They fail therefore to indicate how to execute a next revision of \(K\ast\varphi\) by ψ. This drawback of the AGM approach was readily remarked and in 1954, Darwiche and Pearl added four postulates for iterated revisions.

For our purposes iterated revisions are unavoidable. The gist of combination of belief revision and verisimilitude is the answer to the question how theories behave under revisions on the long run, and then one-step revisions are insufficient. What we want is to show that iterated truthful revisions of theories far from the truth eventually end up with theories that are much closer to the truth.

For the purposes mentioned, we prefer to use the formulation of belief revision in terms of preferences as presented by Grove (1988). This approach is dual to entrenchment: \(\varphi\) is preferred to \(\psi (\varphi < \psi)\) iff \(\neg\psi\) is better entrenched than \(\neg\varphi (\neg\varphi <^{\mathsf{e}} \neg\psi)\).

Let us consider a simple example, due to Sven Ove Hansson (private conversation), to illustrate how the result of belief revision depends on the preference.

Example 1

(Hansson) Let the present body of knowledge κ be \(P_0 \wedge \neg P_1.\) How are we to revise κ when confronted with new evidence \(\varphi = P_0\leftrightarrow P_1\)? Here the idea of preference (or its dual, epistemic entrenchment) comes in. If (i) \(\neg P_0 \wedge \neg P_1\) is preferred to P 0P 1, i.e. \(\neg P_0 \wedge \neg P_1 < P_0 \wedge P_1, \) then \(\kappa * \varphi = \neg P_0 \wedge \neg P_1; \) but if (ii) \(P_0 \wedge P_1 < \neg P_0 \wedge \neg P_1\) then \(\kappa * \varphi = P_0 \wedge P_1. \) And if (iii) P 0 ∧ P 1 and \(\neg P_0 \wedge \neg P_1\) are equally preferred, then \(\kappa * \varphi = P_0 \leftrightarrow P_1. \) Now if the truth is supposed to be P 0P 1, then revision (ii) is better than (i), since it leads us closer to the truth.

The example concerns only a one-step revision, but is readily expanded to iterated revisions. To handle iterated revisions, one should indicate how existing preference relations should be updated after a revision. It turned out that the most practical way to formulate the combination of iterated revisions and verisimilitude was in terms of preference relations. The reason of cause being that iterated revisions and verisimilitude all come down to the comparison of orders and the revisions of orders into new ones.

3 Formal Preliminaries

As our logical basis, we take classical (two-valued) propositional logic over a finite collection of propositional variables \(\mathsf{PVAR} = \{P_0,\ldots,P_{n-1}\}\). On the collection \(\mathsf{FORM}\) of formulae, we have the usual entailment relation \(\vdash\) and the logical equivalence relation ≡. In the sequel, language L refers to the triple \( \langle \mathsf{FORM}, \vdash, \equiv\rangle.\)

As is well known, \(\mathsf{FORM}\) is (modulo logical equivalence) isomorphic to the Boolean algebra \({\mathsf{BA}}(n)\) over n generators. Recall that \({\mathsf{BA}}(n)\) is finite but large, having \(2^{2^{n}}\) elements. Since \(\mathsf{FORM}\) is finite (modulo logical equivalence), every knowledge base or theory, i.e. every collection of formulae, is equivalent with a single formula: the conjunction of its (finitely many non-equivalent) elements. We will use this property throughout this paper, representing theories cq. knowledge bases by single formulae.

The collection \({\mathsf{ATOM}}\) of atoms of \({\mathsf{BA}}(n)\) contains the 2n formulae of the form \((\neg)P_0 \wedge \ldots \wedge (\neg)P_{n-1}:\) they are the minimal elements of \({\mathsf{BA}}(n) - \{\bot\}\). (NB: observe that, in general, an atom is not an atomic formula!) Atoms are interesting formulae, as they can be seen as models for propositional logic: \(\varphi\) holds in the model corresponding to atom α iff \(\alpha \vdash \varphi. \) As is well known, every formula in \({\mathsf{BA}}(n)\) can be written uniquely as a disjunction of atoms: \(\varphi = \bigvee{\mathsf{atom}}(\varphi), \) where \({\mathsf{atom}}(\varphi) = \{\alpha \in {\mathsf{ATOM}} \mid \varphi \vdash \alpha\}.\) Some properties of atoms are:

$$ \begin{aligned} &\varphi \vdash \psi \,\hbox{iff}\,{\mathsf{atom}}(\varphi) \subseteq {\mathsf{atom}}(\psi)\\ &{\mathsf{atom}}(\top) = {\mathsf{ATOM}}\\ &{\mathsf{atom}}(\bot)= \emptyset\\ &{\mathsf{atom}}(\neg\varphi) = {\mathsf{ATOM}} - {\mathsf{atom}}(\varphi)\\ &{\mathsf{atom}}(\varphi \wedge \psi) ={\mathsf{atom}}(\varphi) \cap {\mathsf{atom}}(\psi)\\ &{\mathsf{atom}}(\varphi \vee \psi) = {\mathsf{atom}}(\varphi) \cup{\mathsf{atom}}(\psi) \end{aligned} $$

4 Preferences

Some situations are preferred to others: this simple consideration is the basis for preferences. We identify situations with models, hence with atoms, so preference can be modeled as an order relation \(\leqslant \) on atoms.Footnote 1

The preference orders considered in this paper are total preorders on \({\mathsf{ATOM}}, \) i.e. they are reflexive, transitive and connected:

$$ \begin{aligned} &\forall \alpha (\alpha\,\leqslant \,\alpha) \\ &\forall \alpha \beta \gamma (\alpha\, \leqslant\, \beta \;\&\; \beta \,\leqslant \,\gamma \Rightarrow \alpha\, \leqslant\, \gamma) \\ &\forall \alpha \beta (\alpha\, \leqslant \,\beta \,\hbox{or}\, \alpha = \beta \,\hbox{or}\, \beta \,\leqslant\, \alpha ) \end{aligned} $$

Given a preference order \(\leqslant \), the strict preference order < is defined by α < β iff \(\beta \,\not\leqslant\, \alpha, \) and the preference equivalence ≅ by α ≅ β iff \(\alpha\, \leqslant\, \beta \;\&\; \beta \,\leqslant\, \alpha. \)

Preference orders on a finite domain can be represented by preference functions \({p : {\mathsf{ATOM}} \rightarrow \mathbb{N}. }\) The idea is that p(α) indicates the degree of preference of α: α < p β, i.e. p prefers α to β, whenever p(α) < p(β). Conversely, the preference function \(p\leqslant \) of a preference order \(\leqslant \) can be defined by

$$ p_{\leqslant}\,(\alpha) = \max \{ k \mid k=0 \hbox{ or }\exists \beta_1, \ldots, \beta_k \; (\beta_1 < \cdots < \beta_k < \alpha) \} $$

The collection of preference functions is called \(\mathsf{PREF}.\) Different preference functions may lead to the same preference order, e.g. p and λα.p(α) + 1. There is, however, a one-to-one correspondence between preference orders and normal preference functions \(p \in \mathsf{NPREF}\) that have an initial segment [0, n] as their range. Every preference function can be normalized by the mapping \(\mathsf{norm}, \) defined by

$$ {\mathsf{norm}}(p)(\alpha) = \# \{ k \mid \exists \beta \in {\mathsf{ATOM}} \; p(\beta) = k < p(\alpha) \} $$

where #X denotes the number of elements of X. So a preference order and its corresponding normal preference function are two sides of the same coin, and we shall use these two representations interchangeably in this paper.

A preference function p on atoms can be extended to a mapping \({ p : \mathsf{FORM} \rightarrow \mathbb{N}}\) on all formulae by

$$ \begin{aligned} p(\varphi) &= \min\{p(\alpha) \mid \alpha \in {\mathsf{atom}}(\varphi) \} \quad \hbox{if}\, \varphi \not\equiv \bot \\ p(\bot) & = \max(p) + 1 \end{aligned} $$

where \(\max(p) = \max \{ p(\alpha) \mid \alpha \in {\mathsf{ATOM}} \}\) (and similarly for min(p), to be used later). A preference order \(\leqslant \) on atoms is extended to formulae by

$$ \varphi\, \leqslant\, \psi \,\hbox{iff } \forall \alpha \in {\mathsf{atom}}(\psi) \;\exists \beta \in {\mathsf{atom}}(\varphi) \; \beta\, \leqslant \,\alpha $$

alternatively, it can be obtained from \({p :\mathsf{FORM} \rightarrow \mathbb{N}}\) by \(\varphi \,\leqslant_p\, \psi \Leftrightarrow p(\varphi) \,\leqslant\, p(\psi). \) A preference order on \(\mathsf{FORM}\) has the following properties:

  • \(\leqslant \) is a total preorder;

  • \(\leqslant \) subsumes \(\dashv\) (the inverse of \(\vdash\)): \(\varphi \vdash \psi\) implies \(\psi \,\leqslant\, \varphi; \)

  • \(\leqslant \) is disjunctive: \(\varphi \,\leqslant\, \varphi \vee \psi\) or \(\psi \,\leqslant\, \varphi \vee \psi; \)

  • \(\leqslant \)-maximal formulae are inconsistent: if \(\psi \,\leqslant\, \varphi\) for all ψ, then \(\varphi \vdash \bot. \)

The disjunctivity property of \(\leqslant \) is a consequence of the use of the minimum operator in the definition of \(p(\varphi). \) It is not hard to see that every relation on formulae that satisfies these properties is fully determined by its behaviour on atoms.

A formula \(\varphi\) can also be seen as a preference: atoms in \({\mathsf{atom}}(\varphi)\) are preferred to atoms not in \({\mathsf{atom}}(\varphi). \) Going in the other direction, we can reduce a preference to a formula consisting of the most preferred atoms. So we define \(\mathsf{form} : \mathsf{PREF} \rightarrow \mathsf{FORM}\) and \(\mathsf{pref} : \mathsf{FORM} \rightarrow \mathsf{PREF}\) by

$$ \begin{aligned} {\mathsf{form}}(p) & = \bigvee \{ \alpha \mid p(\alpha) = \min(p) \} \\ {\mathsf{pref}}(\varphi) & = \lambda \alpha. (\hbox{if }\alpha \vdash \varphi \hbox{ then } 0 \hbox{ else } 1) \end{aligned} $$

So \(\mathsf{pref}(\varphi)(\alpha) = 0\) if \(\alpha \vdash \varphi, \) and \(\mathsf{pref}(\varphi)(\alpha) = 1\) if \(\alpha \not\vdash \varphi. \) We have that \(\mathsf{form}\) and \(\mathsf{pref}\) are antimonotonic in the following sense:

$$ \begin{aligned} p \,\leqslant\, q &\Rightarrow {\mathsf{form}}(q) \vdash {\mathsf{form}}(p) \quad (p,q \in {\mathsf{NPREF}}) \\ \varphi \vdash \psi & \Rightarrow {\mathsf{pref}}(\psi) \,\leqslant\, {\mathsf{pref}}(\varphi) \end{aligned} $$

Now that we know what preferences are, the question remains how we may obtain them. One way to obtain a preference is via a distance function \({d : {\mathsf{ATOM}}^2 \rightarrow \mathbb{N}}\) satisfying

$$ \begin{array}{ll} d(\alpha,\beta) = 0 \hbox{ iff } \alpha = \beta &\hbox{(identity)} \\ d(\alpha,\beta) = d(\beta,\alpha) & \hbox{(symmetry)}\\ d(\alpha,\beta) + d(\beta,\gamma) \,\leqslant\, d(\alpha,\gamma) & \hbox{(triangle inequality)} \end{array} $$

Such a distance function can be generalized to all formulae by

$$ d(\varphi,\psi) = \min\{d(\alpha,\beta) \mid \alpha \in {\mathsf{atom}}(\varphi), \quad\beta \in {\mathsf{atom}}(\psi) \} $$

observe that this generalization \({d : \mathsf{FORM}^2 \rightarrow \mathbb{N}}\) is only a weak distance function, satisfying symmetry and the triangle inequality but not the identity property.

Given a weak distance function d defined on formulae and a formula \(\varphi, \) we can define a preference function \(\mathsf{pref}(d, \varphi): \)

$$ {\mathsf{pref}}(d,\varphi)(\alpha) = d(\alpha,\varphi) $$

so the degree of preference of α corresponds with the distance from \(\varphi. \) We have

$$ \varphi \vdash \psi \;\Rightarrow \; {\mathsf{pref}}(d,\varphi) \geqslant\, {\mathsf{pref}}(d,\psi) $$

Observe that the unary function \(\mathsf{pref}\) defined earlier is a special case of the binary \(\mathsf{pref}, \) based on the trivial distance function d defined by d(α,α) = 0 and d(α,β) = 1 if α ≠ β.

A particular example of a distance function on atoms is the Hamming distance d H , which counts the number of propositional variables on which the two atoms differ:

$$ d_H(\alpha,\beta) = \# \{ P \in {\mathsf{PVAR}} \mid \alpha \vdash P \;{\mathsf{xor}}\; \beta \vdash P \} $$

We shall use this distance function when discussing verisimilitude.

5 Belief Revision

Traditionally (see Alchourrón et al. (1985)), belief revision is defined in terms of an epistemic entrenchment order \(\leqslant^{\mathsf{e}}. \) The idea is that the degree of epistemic entrenchment determines which of two beliefs \(\varphi\) and ψ has to be given up when their combination has become untenable: in a situation where \(\varphi \wedge\psi\) is inconsistent and \(\varphi <^{\mathsf{e}} \psi\) (i.e. ψ is more entrenched that \(\varphi\)), it is preferred to give up \(\varphi\) and retain ψ. The knowledge set K associated with \(\leqslant^{\mathsf{e}}\) is defined as the collection of the non-\(\leqslant^{\mathsf{e}}\)-minimal formulae.

In the case of a finite logic, \(\leqslant^{\mathsf{e}}\) can be defined straightforwardly in terms of the dual atoms (i.e. negations of atoms). This approach has been dualized by Grove (1988), replacing dual atoms by atoms, and \(\leqslant^{\mathsf{e}}\) by its dual: a preference order \(\leqslant \) as described in Sect. 4 that satisfies \(\varphi \,\leqslant^{\mathsf{e}} \psi\) iff \(\neg\varphi \, \leqslant\, \neg\psi. \) Since the use of atoms is (in our eyes) more intuitive than dual atoms, we adopt Grove’s representation of belief revision based on preference order and preference functions. Since our logic is finite, the knowledge set K can be replaced by its conjunction \(\kappa = \bigwedge K. \) When \(\leqslant \) equals \(\leqslant \) p , the preference order generated by preference function p, then the conjunction κ = κ p of the knowledge set associated with \(\leqslant \) p equals \(\mathsf{form}(p), \) the disjunction of the most preferred atoms (i.e. with minimal p-value).

The theory of belief revision usually proceeds by defining belief revision as an operation on formulae: \(\kappa * \varphi\) is the revision of κ with evidence \(\varphi. \) When working with preferences, the revision of \(\kappa = \kappa_p = \mathsf{form}(p)\) with \(\varphi\) can be defined as the disjunction of all atoms in \({\mathsf{atom}}(\varphi)\) that are most preferred by p:

$$ \kappa_p * \varphi = \bigvee \{ \alpha \mid \alpha \vdash \varphi \;\&\; p(\alpha) = p(\varphi) \} $$
(1)

(recall that \(p(\varphi)\) is defined as the minimum of the preference of the atoms in \({\mathsf{atom}}(\varphi)\)). When κ p and \(\varphi\) are consistent, \({\mathsf{atom}}(\kappa_p)\) and \({\mathsf{atom}}(\varphi)\) have one or more atoms α in common: they satisfy p(α) = 0 and \(\alpha \vdash \varphi, \) so \(p(\varphi) = \min \{p(\alpha) \mid \alpha \vdash \varphi \} = 0. \) In this case we have a conservative revision \(\kappa_p * \varphi = \kappa_p \wedge \varphi. \) However, when \(\kappa_p \wedge \varphi \vdash \bot, \) we have \(\kappa_p * \varphi = \bigvee \{ \alpha \mid \alpha \vdash \varphi \;\&\; p(\alpha) = k \}\) for some k > 0, which we call a Popperian revision.

Observe that, in (1), the revision operator * is in fact an operation on a preference p and a formula \(\varphi, \) although the notation \(\kappa_p * \varphi\) suggest that it is a binary operation on formulae. As a consequence, belief revision in this form cannot be iterated: for the proper definition of \((\kappa_p * \varphi) * \psi, \) the preference function associated with \(\kappa_p * \varphi\) is required.

To deal with this, we shall generalize (1) in two steps. The first is to lift revision of formula κ with formula \(\varphi\) as defined in (1) to a revision of preference p with \(\varphi. \) This leads to the preference \(p * \varphi, \) which should satisfy \(\kappa_{p * \varphi} = \kappa_p * \varphi. \) Now iterated belief revision \((p * \varphi) * \psi\) is possible. This idea is pursued by Darwiche and Pearl (1997), where the following postulates on iterated revision are put forward:

$$ \begin{aligned} \varphi \vdash \psi \;&\Rightarrow\; (p * \psi) * \varphi = p * \varphi \\ \varphi \vdash \neg\psi \;&\Rightarrow\; (p * \psi) * \varphi = p * \varphi \\ K_{p * \varphi} \vdash \psi \;&\Rightarrow\; K_{(p * \psi) * \varphi} \vdash \psi \\ K_{p * \varphi} \not\vdash \neg\psi \;&\Rightarrow\; K_{(p * \psi) * \varphi} \not\vdash \neg\psi \end{aligned} $$

We go one step further, and lift revision to a revision operator \(* : \mathsf{PREF}^2 \rightarrow \mathsf{PREF}\) on preference functions. A related approach is described by Nayak (1994), where the revision of an entrenchment order with another entrenchment order is defined. We consider some candidates for the definition of preference revision:

$$ \begin{aligned} (p *_1 q)(\alpha) &= p(\alpha) + (\max(p) + 1)\cdot q(\alpha) \\ \\ (p *_2 q) (\alpha) & = 0 \quad \hbox{ if } q(\alpha) = 0 \;\&\; p(\alpha)=p({\mathsf{form}}(q)) \\ & = p(\alpha) + 1 \quad \hbox{ elsewhere} \\ \\ (p *_3 q)(\alpha) & = 0 \quad \hbox{ if } q(\alpha) = 0 \;\&\; p(\alpha)=p({\mathsf{form}}(q)) \\ & = q(\alpha) \quad \hbox{ if } q(\alpha) > 0 \;\&\; p(\alpha) = 0 \\ & = p(\alpha) \quad \hbox{elsewhere} \\ \\ (p *_4 q)(\alpha)& = p(\alpha) -p({\mathsf{form}}(q)) \quad \hbox{ if } q(\alpha) = 0 \\ & = p(\alpha) + q(\alpha) \quad \hbox{ elsewhere} \end{aligned} $$

For the first two candidates, there is an alternative definition possible in terms of preference orders:

$$ \begin{array}{ll} \alpha \,\leqslant_{p *_1 q} \beta \quad &\hbox{iff}\quad\alpha <_q \beta \hbox{ or } (\alpha \cong_q \beta \;\& \; \alpha\, \leqslant_p\, \beta) \\ \\ \alpha \,\leqslant_{p *_2 q}\, \beta & \hbox{iff}\quad\alpha \,\leqslant_p \,\beta \hbox{ or } (q(\alpha)=0 \;\&\; p(\alpha)=p({\mathsf{form}}(q))) \end{array} $$

Observe that \(\leqslant_{p *_1q}\) is the lexicographical combination of \(\leqslant_{p} \) and \(\leqslant_{q} \); p *2 q can be paraphrased as ‘follow p as much as possible’.

Traditional belief revision as defined in (1) can be defined in terms of preference revision by \(\kappa_p * \varphi = \mathsf{form}(p) * \varphi = \mathsf{form}(p *_i \mathsf{pref}(\varphi))\,(i = 1,2,3,4)\), where the result does not depend on i: we have in all cases \(\kappa_p * \varphi = \bigvee \{ \alpha \mid \alpha \vdash \varphi \;\&\; p(\alpha)=p(\varphi) \}.\)

Preference revision with a formula is obtained by \(p *_i \varphi = p *_i \mathsf{pref}(\varphi) (i \in \{1,2,3,4\})\). This leads to

$$ \begin{array}{lll} (p *_1 \varphi)(\alpha) &= p(\alpha) & \hbox{if } \alpha \vdash \varphi \\ &= p(\alpha) + \max(p) + 1 & \hbox{if } \alpha \not\vdash \varphi\end{array} $$
$$\begin{array}{lll} (p *_2 \varphi)(\alpha) &= 0 & \hbox{if } \alpha \vdash \varphi \hbox{ and } p(\alpha) = p(\varphi) \\ &= p(\alpha) + 1 & \hbox{if } \alpha \not\vdash \varphi \hbox{ or } p(\alpha) \neq p(\varphi) \end{array} $$

and similar for *3 and *4. In all cases, the Darwiche-Pearl postulates hold.

6 Verisimilitude

The starting point for verisimilitude is the strongest empirically true theory τ expressible in L . Normally, τ is assumed to be complete, in the sense that any contingent sentence is either a consequence of τ, when it is empirically true, or implies \( \lnot \tau\) when it is false. To stay as general as possible, however, our formal framework does allow for the degenerated case in which the truth is incomplete and \(\tau\not\in {\mathsf{ATOM}}.\) Note that in such cases, language L is inadequate in the sense that it comprises propositional variables which nature does not verify nor falsify. After fixing L and its strongest empirical truth τ, we set out to obtain a relation \(\leqslant^{\mathsf{v}}\) (and its strict version \(<^{\mathsf{v}}\)) of verisimilitude, where \(\varphi <^{\mathsf{v}} \psi\) expresses that \(\varphi\) is more verisimilar (‘closer to the truth’) than ψ.

As for similarity between atoms, we shall obtain a verisimilitude order on \(\mathsf{FORM}\) from a total preorder on atoms. So let some similarity function \({ t : {\mathsf{ATOM}} \rightarrow \mathbb{N}}\) be given. We assume that the t-minimal atoms are precisely the atoms of τ, i.e. \({\mathsf{atom}}(\tau) = \{ \alpha \mid t(\alpha) = 0 \}\). A natural example of a similarity based on τ is t = λ α. d H (α, τ) where d H is the Hamming distance between atoms, introduced at the end of Sect. 4. So t(α) = 0 if \(\alpha \in {\mathsf{atom}}(\tau); \) in general, t(α) is the minimal number of changes in α (i.e. adding or removing a negation sign) required to obtain an atom in τ.

Given t and \(\tau = \bigvee \{ \alpha \mid t(\alpha) = 0 \}\), we formulate some requirements on the verisimilitude relation \((\leqslant^{\mathsf{v}}) = (\leqslant^{\mathsf{v}}_t)\) associated with t:

$$ \leqslant^{{\mathsf{v}}} \hbox{ is a preorder, i.e. reflexive and transitive} $$
(2)
$$ \varphi \vdash \psi \vee {\tau}\Rightarrow\; \varphi\, \leqslant^{\mathsf{v}}\, \psi \hbox{ (content condition)} \\ $$
(3)
$$ t(\alpha) \,\leqslant\, t(\beta) \;\Rightarrow\; \alpha\, \leqslant^{\mathsf{v}}\, \beta \hbox{ for atoms } \alpha,\beta \hbox{ (likeness condition)} \\ $$
(4)
$$ \leqslant^{\mathsf{v}} \hbox{ commutes with } \vdash \hbox{ and with} \dashv\hbox{, i.e.} \quad \quad (\leqslant^{\mathsf{v}} \cdot \vdash) = (\vdash \cdot \leqslant^{\mathsf{v}}) \hbox{ and } (\leqslant^{\mathsf{v}} \cdot \dashv) = (\dashv\cdot \leqslant^{\mathsf{v}}) \\ $$
(5)
$$ \varphi \,\leqslant^{\mathsf{v}}\, \psi \;\Leftrightarrow\; \neg\psi \,\leqslant^{\mathsf{v}}\, \neg\varphi \hbox{ (contraposition)} $$
(6)

Here \(\dashv\) is the inverse of \(\vdash,\) and \(\cdot\) denotes composition of relations: \(x R\cdot S y\) iff there is a z with xRz and zSy.

Requirement (2) is a rather natural requirement for a ordering relation. The content condition (3) and the likeness condition (4) have been discussed extensively in Zwart (2001), Zwart and Franssen (2007). Niiniluoto (1987, p. 233) introduces the truth content criterion M7, which comes down to the content condition (3) restricted to false sentences ψ. In addition, Niiniluoto introduces the similarity condition M6, being very similar to (4): if the similarity between atom α and the truth is larger than that of β and the truth, the former is more verisimilar. This natural condition lies at the heart of likeness approaches, such as Niiniluoto’s minsum measure in Niiniluoto (1987).

The commutation requirement (5) is related to Laudan’s challenge, as quoted in the Introduction. To see this, we formulate the following consequence of (5), where T and T′ are theories (denoted as formulae):

if \(T \leqslant^{\mathsf{v}} T^{\prime}, \) then (i) for every consequence \(\varphi^{\prime}\) of T′ there is a consequence \(\varphi\) of T with \(\varphi \,\leqslant^{\mathsf{v}}\, \varphi^{\prime}, \) and (ii) for every consequence \(\varphi\) of T there is a consequence \(\varphi^{\prime}\) of T′ with \(\varphi \,\leqslant^{\mathsf{v}}\, \varphi^{\prime}. \)

In slogan: the better theory has the better consequences. As (5) is a necessary condition for a successful acceptance of Laudan’s challenge, it is a desirable property for verisimilitude. In our framework of a Boolean algebra, empirical predictions may be represented by the weakest nontrivial consequences of a theory, viz. its dual atoms \(\delta = (\neg)P_0 \vee \dots (\neg)P_{n-1}. \) This enables us to show that the better/worse theory simply has the better/worse consequences, even in the sense that the better of false theories has the better dual atoms, i.e. empirical consequences. Something similar is out of reach of other content definitions of verisimilitude such as those of Miller’s (1978) and Kuipers’ symmetric difference definition (Kuipers 2000, p. 151): in their definitions, all false atoms (i.e. atoms not in \({\mathsf{atom}}(\tau)\)) are incomparable regarding their similarity with the truth.

Finally, let us consider contraposition (6). In Zwart and Franssen (2007), it turned out that contraposition defines an important watershed in the verisimilitude literature. It forces \(\lnot {\tau}\) to be the worst sentence if τ is the truth, such that τ and \( \lnot \tau \) become the lower and upper limit of all sentences in the language. By doing so, contraposition demarcates the differences between likeness and content definitions of verisimilitude, since according to likeness definitions this upper limit is τ*, which is the atom of the language with maximal distance from τ. Consequently, contraposition (6) is a desirable property for content definitions of verisimilitude. Here, it turns out to be one of the reasons for refined verisimilitude to fit closely to the framework of belief revision.

In a first attempt to satisfy these properties, we define two order relations. The first is an attempt to realize the content condition (3), the second aims to satisfy the likeness condition (4).

Definition 1

(content and likeness order) The content order \(\sqsubseteq\) is defined by:

$$ \varphi \sqsubseteq \psi \;\Leftrightarrow\; \varphi \vdash \psi \vee {\tau} $$

and the likeness order \(\preccurlyeq\) by:

$$ \varphi \preccurlyeq \psi \;\Leftrightarrow\; \exists t{\hbox{-}}{\hbox{increasing bijection }} f : {\mathsf{atom}}(\varphi \wedge \neg\psi) \rightarrow {\mathsf{atom}}(\psi \wedge \neg\varphi) $$
(7)

Here t-increasing means: if \(\alpha \in {\mathsf{dom}}(f)\) then \(t(\alpha)\,\leqslant\, t(f(\alpha))\). We call the f in (7) a witness for \(\varphi \preccurlyeq \psi. \)

Remark

The original definition of the likeness order in Zwart (2001) reads

$$ \varphi \preccurlyeq \psi \;\Leftrightarrow\; \exists t\hbox{ -increasing bijection } g : {\mathsf{atom}}(\varphi) \rightarrow {\mathsf{atom}}(\psi) $$
(8)

where t is defined by the Hamming distance from the formula τ, i.e. t(α) = d H (τ, α). Apart from the specific choice of t, both definitions of the likeness order are equivalent. For if f satisfies (7) then g defined by

$$ \begin{aligned}g(\alpha) &= f(\alpha) \quad \hbox{ if } \alpha \in {\mathsf{atom}}(\varphi\wedge \neg\psi) \\ &= \alpha \quad\quad\quad \hbox{if } \alpha\in {\mathsf{atom}}(\varphi \wedge \psi) \end{aligned} $$

satisfies (8); and if g satisfies (8) then f, defined by

$$\begin{array}{ll}f(\alpha)= g^n(\alpha) &\quad\hbox{ if } \alpha \in{\mathsf{atom}}(\varphi \wedge \neg \psi), \\&\quad\hbox{where } n = 1 + \max \{ k \mid g^k(\alpha)\in {\mathsf{atom}}(\varphi) \} \end{array} $$

satisfies (7). Here g n denotes n-times iterated application of g: g 1(α) = g(α), g 2(α) = g(g(α)), etc. In the sequel, we shall use (7) or (8) as the definition of \(\preccurlyeq, \) whatever suits best.

Lemma 1

(properties of \(\sqsubseteq\) and \(\preccurlyeq\)) \(\sqsubseteq\) and \(\preccurlyeq\) are preorders, satisfy contraposition, and commute with \(\vdash\) and \(\dashv.\) Moreover, \(\sqsubseteq\) satisfies the content condition and \(\preccurlyeq\) the likeness condition, but not the other way round.

Proof

See the Appendix. \(\square\)

So neither \(\sqsubseteq\) nor \(\preccurlyeq\) satisfies all requirements (2–6). Let us try to combine both candidates. The next lemma says that the order of composition is irrelevant.

Lemma 2

(commuting relations) \(\sqsubseteq\) and \(\preccurlyeq\) commute, i.e. \((\sqsubseteq \cdot \preccurlyeq) = (\preccurlyeq\cdot \sqsubseteq).\)

Proof

See the Appendix. \(\square\)

Lemma 2 enables us to combine \(\sqsubseteq\) and \(\preccurlyeq\) into a stronger overarching verisimilitude concept.

Definition 2

(refined verisimilitude) Refined verisimilitude is defined by \(\leqslant^{\mathsf{rv}} = (\preccurlyeq \cdot \sqsubseteq), \) i.e.

$$ \varphi \,\leqslant^{\mathsf{rv}}\, \chi \;\Leftrightarrow\; \hbox{ there is a } \psi \hbox{ with } \varphi \preccurlyeq \psi \sqsubseteq \chi $$

Since both \(\sqsubseteq\) and \(\preccurlyeq\) are reflexive, they are both subsumed in \(\leqslant^{\mathsf{rv}}. \) Moreover, this notion of verisimilitude has all desired properties:

Theorem 1

(properties of refined verisimilitude) \(\leqslant^{\mathsf{rv}}\) is a preorder that satisfies the content and the likeness condition, contraposition, and commutes with \(\vdash\) and \(\dashv.\)

Proof

\(\sqsubseteq\) and \(\preccurlyeq\) are reflexive, so we have \((\sqsubseteq) \cup (\preccurlyeq) \subseteq (\leqslant^{\mathsf{rv}}). \) As a consequence, \(\leqslant^{\mathsf{rv}}\) inherits reflexivity and the content condition from \(\sqsubseteq, \) and the likeness condition from \(\preccurlyeq. \) Transitivity, contraposition and commuting with \(\vdash\) and \(\dashv\) follow from Lemma 2 and the corresponding properties of \(\sqsubseteq\) and \(\preccurlyeq.\)\(\square\)

We give two alternative definitions of \(\leqslant^{\mathsf{rv}}.\)

Lemma 3

(alternative definitions of refined verisimilitude)

  1. 1.

    \(\varphi \,\leqslant^{\mathsf{rv}}_t\, \psi\)iff there is at-increasing injection\(f : {\mathsf{atom}}(\varphi \wedge \neg\psi \wedge \neg{\tau)} \rightarrow {\mathsf{atom}}(\psi \wedge \neg\varphi \wedge \neg{\tau)}. \)

  2. 2.

    \(\leqslant^{\mathsf{rv}}_t\) is the least relation \(\leqslant^{\prime} \) satisfying

    $$ \varphi \vdash {\tau } \Rightarrow\; \varphi \hbox{ is } \leqslant^{\prime}\hbox{-minimal} ( \hbox{i.e.}\, \varphi \,\leqslant^{\prime} \psi\hbox{ for all } \psi) $$
    (9)
    $$ \hbox{for all atoms } \alpha,\beta: t(\alpha) \,\leqslant\, t(\beta) \;\Rightarrow \; \alpha \,\leqslant^{\prime} \beta $$
    (10)
    $$ \hbox{if }\varphi \,\leqslant^{\prime}\, \psi, \varphi^{\prime} \,\leqslant^{\prime}\, \psi \hbox{ and } \psi \wedge \psi^{\prime} \vdash \bot \hbox{ then } \varphi \vee \varphi^{\prime} \,\leqslant^{\prime}\, \psi \vee \psi^{\prime} $$
    (11)

Proof

See the Appendix. \(\square\)

7 Combining Belief Revision and Verisimilitude

The basic idea of the combination of belief revision and verisimilitude comes down to the following. Revising a knowledge base or theory by replacing unsuccessful content, which is less verisimilar, with content that is more verisimilar, and therefore more successful, must end up with a new theory that is more verisimilar. The problem is of course that in scientific practice the measure of verisimilitude of data and theories is unknown. Intuitively, an important adequacy constraint therefore reads:

$$ \begin{aligned} &\hbox{If one's preference order is similar to the verisimilitude order, } \\ &\hbox{then the theory after revision must be more verisimilar than } \\ &\hbox{before the revision}. \end{aligned} $$
(12)

Let us spell out what (12) comes down to in our framework. Suppose that we have an objective similarity t and a subjective preference p: t is the basis for the truth \(\tau = \mathsf{form}(t)\) and for the verisimilitude relation \((\leqslant^{\mathsf{rv}}) = (\leqslant^{\mathsf{rv}}_t), \) while p is subjected to revision with \(\varphi. \) Now the question is: when is the revision \(\mathsf{form}(p * \varphi)\) more verisimilar than \(\mathsf{form}(p), \) i.e. when do we have

$$ {\mathsf{form}}(p * \varphi) \,\leqslant^{\mathsf{rv}}_t\, {\mathsf{form}}(p) $$
(13)

In other words: when is the revision \(p*\varphi\) a step in the right direction? The conditions for (13) spell out the similarity condition of (12). By Lemma 3, these conditions come down to the existence of a t-increasing injective function f with

$$ \begin{aligned} {\mathsf{dom}}(f) & = {\mathsf{atom}}({\mathsf{form}}(p * \varphi) \wedge \neg{\mathsf{form}}(p) \wedge \neg\tau) \\ & = \{ \alpha \mid \alpha \vdash \varphi \;\&\; p(\alpha) = p(\varphi) > 0 \;\&\; t(\alpha) > 0 \} \\ {\mathsf{rg}}(f) & = {\mathsf{atom}}(\neg {\mathsf{form}}(p * \varphi) \wedge {\mathsf{form}}(p) \wedge \neg\tau) \\ & = \{ \alpha \mid p(\alpha) = 0 \;\&\; t(\alpha) > 0 \;\&\; (\alpha \not\vdash \varphi \hbox{ or } p(\varphi) > 0) \} \end{aligned} $$

When the revision is conservative (i.e. \(\mathsf{form}(p) \wedge \varphi \not\vdash \bot\) and \(p(\varphi) = 0\)), then \({\mathsf{dom}}(f) = \emptyset\) and (13) is obvious. Let us consider the three more interesting cases of Popperian revision where \(p(\varphi) > 0.\)

  1. 1.

    If theory \(\mathsf{form}(p)\) and evidence \(\varphi\) are false (i.e. \(\tau \vdash \neg\mathsf{form}(p)\) and \(\tau \vdash \neg\varphi\)), then according to Lemma 3 the similarity condition of (12) simply reduces to the existence of a t-increasing injection \(f : {\mathsf{atom}}(\mathsf{form}(p*\varphi)) \rightarrow {\mathsf{atom}}(\mathsf{form}(p)). \)

  2. 2.

    If theory \(\mathsf{form}(p)\) is false but evidence \(\varphi\) is true (i.e. \(\tau \vdash \neg\mathsf{form}(p)\) and \(\tau \vdash\varphi\)), then according to Lemma 3 the similarity condition of (12) reduces to the existence of a t-increasing injection \(f : {\mathsf{atom}}(\mathsf{form}(p*\varphi) \wedge \neg\tau) \rightarrow {\mathsf{atom}}(\mathsf{form}(p)). \)

  3. 3.

    If, on the contrary, theory \(\mathsf{form}(p)\) is true and evidence \(\varphi\) is false (i.e. \(\tau \vdash \mathsf{form}(p)\) and \(\tau \vdash \neg\varphi\)), then according to Lemma 3 the similarity condition of (12) equals to the existence of a t-increasing injection \(f : {\mathsf{atom}}(\mathsf{form}(p*\varphi)) \rightarrow {\mathsf{atom}}(\mathsf{form}(p) \wedge \neg\tau). \)

The previous cases spell out the exact sense in which one’s preference order should be similar, in the sense of having resemblance or likeness, to the verisimilitude order to guarantee improvement of verisimilitude after revision. Perhaps the second case fits best the situation of actual scientific research, where the theory used is probably false but the observations may be (approximately) true due to careful measurements. To illustrate this case let us reconsider Hansson’s example of Sect. 1.

Example 2

(Hansson’s example reconsidered) In Hansson’s example the theory (cq. body of knowledge) \(\mathsf{form}(p) = P_0 \wedge \neg P_1\) was confronted with new evidence \(\varphi = P_0\leftrightarrow P_1\) under the assumption that the truth τ equals the atom P 0P 1. Additionally, the various preference orders p 1p 2p 3 corresponding with (i), (ii), (iii) of Example 1, are defined by

 

P0P1

\(P_0 \wedge \neg P_1\)

\(\neg P_0 \wedge P_1\)

\(\neg P_0 \wedge \neg P_1\)

t

0

1

1

2

p 1

1

0

2 or 3

2

p 2

2

0

2 or 3

1

p 3

1

0

2 or 3

1

\(\varphi\)

0

1

1

0

It follows that \(\mathsf{form}(p_i) = P_0 \wedge \neg P_1\) for i = 1, 2, 3, and \( \mathsf{form}(p_1*\varphi ) = P_0 \wedge P_1, \mathsf{form}(p_2*\varphi ) = \neg P_0 \wedge \neg P_1\) and finally \( \mathsf{form}(p_3*\varphi ) = P_0 \leftrightarrow P_1. \) The injection \(f : {\mathsf{atom}}(\mathsf{form}(p_1*\varphi) \wedge \neg\tau) \rightarrow {\mathsf{atom}}(\mathsf{form}(p_1))\) is an empty function and hence t-increasing, so \(\mathsf{form}(p_1 * \varphi) \,\leqslant^{\mathsf{rv}}\, \mathsf{form}(p_1), \) i.e. the revision is at least as verisimilar as the original theory. For i = 2, 3, there is no t-increasing injection \(f : {\mathsf{atom}}(\mathsf{form}(p_i*\varphi) \wedge \neg\tau) \rightarrow {\mathsf{atom}}(\mathsf{form}(p_i)), \) since \(t(\neg P_0 \wedge \neg P_1) = 2 > 1 = t(P_0 \wedge \neg P_1) \).

Returning to the start of the present section, one might ask what can be said more in general about conditions for (13) in terms of the domain and the range of the injective function f? The injectivity of f can be guaranteed by e.g.

$$ \#{\mathsf{atom}}(\varphi \wedge \neg\tau) \leqslant \#{\mathsf{atom}}({\mathsf{form}}(p) \wedge \neg\tau), $$
(14)

and the t-increasingness of f follows from e.g.

$$ \forall \alpha (\alpha \vdash \varphi \wedge \neg\tau \;\Rightarrow\; t(\alpha) \,\leqslant t({\mathsf{form}}(p) \wedge \neg\tau)) $$
(15)

We may summarize the conjunction of (14) and (15) in the following way.

The revision \(p \mapsto p * \varphi\) is a step in the good direction (\(\mathsf{form}(p * \varphi) \leqslant^{\mathsf{rv}}_t \mathsf{form}(p)\)) when the number of \( \neg \tau\)-models of \(\varphi\) is at most the number of \( \neg \tau\)-models of \(\mathsf{form}(p), \) and they are t-preferred to them.

We observe that the conclusion holds trivially when \(\varphi\) has no models of \( \neg \tau, \) i.e. when \(\varphi \vdash \tau. \) But when \(\varphi \not\vdash \tau, \) (13) requires that there are only few \( \neg \tau\)-models for \(\varphi\) and that they are preferred to the \( \neg \tau\)-models of \(\mathsf{form}(p). \)

With similar reasoning, we have the following.

The revision \(p \mapsto p * \varphi\) is a step in the wrong direction (\(\mathsf{form}(p) <^{\mathsf{rv}}_t \mathsf{form}(p * \varphi)\)) when the number of models of \(\mathsf{form}(p)\) is smaller than the number of \( \neg \tau\)-models of \(\varphi, \) and they are t-preferred to them.

Finally, we look at the situation that t = p, so \(\mathsf{form}(p) = \tau. \) In that case \(\mathsf{form}(t * \varphi)\,\leqslant^{\mathsf{rv}}_t \tau\) iff the revision is conservative, i.e. \(\varphi \wedge \tau \not\vdash \bot\) and hence \(\mathsf{form}(t * \varphi) = \tau \wedge \varphi. \)

8 Discussion and Conclusions

In the foregoing we have seen that (iterated) AGM belief revision and refined verisimilitude fit reasonably well together. What does this mean philosophically? From the viewpoint of belief revision this means that the epistemic entrenchment approach is in a good position to be extended with considerations of truth and even verisimilitude. For refined verisimilitude our results imply that it has found an established and well-studied answer to its accompanying epistemic question. Our approach even puts forward a formal underpinning of the intuitively plausible idea that the better theory has the better consequences and doing so it successfully accepted Laudan’s challenge. What is the reason for the two approaches to fit so well in contrast to other attempts to combine verisimilitude and belief revision? The answer to this question is connected to the way refined verisimilitude and AGM belief revision are constructed. Content definitions of verisimilitude are defined in terms of logical content and truth-value and the same holds true AGM-belief revisions. In both approaches strength prevails over the ordering of worlds. The described match between belief revision and refined verisimilitude provides an important external argument in favor of the refined content approach.Footnote 2

Future research in this direction has to address at least the following three issues. Firstly we would like to explore more extensively the differences between the four rules of iterated belief revision (see Sect. 5), and the different ways in which they approach the truth. Secondly we would like to investigate whether our framework can be extended to languages with infinitely many formulae. Thirdly, we would like to address the question of the completeness of the empirical truth (in other words: whether τ is an atom or not). We observe that our results do not require the truth to be complete. If the truth is incomplete, some sentences do not acquire a definite truth-value. We would like to find out whether this lack of truth-value of some sentences together with the application of iterative belief revision to refined verisimilitude could assist in deciding between the appropriateness of languages for some area of research.