Misplaced Pages

FKG inequality

Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
(Redirected from Harris inequality) Correlation inequality

In mathematics, the Fortuin–Kasteleyn–Ginibre (FKG) inequality is a correlation inequality, a fundamental tool in statistical mechanics and probabilistic combinatorics (especially random graphs and the probabilistic method), due to Cees M. Fortuin, Pieter W. Kasteleyn, and Jean Ginibre (1971). Informally, it says that in many random systems, increasing events are positively correlated, while an increasing and a decreasing event are negatively correlated. It was obtained by studying the random cluster model.

An earlier version, for the special case of i.i.d. variables, called Harris inequality, is due to Theodore Edward Harris (1960), see below. One generalization of the FKG inequality is the Holley inequality (1974) below, and an even further generalization is the Ahlswede–Daykin "four functions" theorem (1978). Furthermore, it has the same conclusion as the Griffiths inequalities, but the hypotheses are different.

The inequality

Let X {\displaystyle X} be a finite distributive lattice, and μ a nonnegative function on it, that is assumed to satisfy the (FKG) lattice condition (sometimes a function satisfying this condition is called log supermodular) i.e.,

μ ( x y ) μ ( x y ) μ ( x ) μ ( y ) {\displaystyle \mu (x\wedge y)\mu (x\vee y)\geq \mu (x)\mu (y)}

for all x, y in the lattice X {\displaystyle X} .

The FKG inequality then says that for any two monotonically increasing functions ƒ and g on X {\displaystyle X} , the following positive correlation inequality holds:

( x X f ( x ) g ( x ) μ ( x ) ) ( x X μ ( x ) ) ( x X f ( x ) μ ( x ) ) ( x X g ( x ) μ ( x ) ) . {\displaystyle \left(\sum _{x\in X}f(x)g(x)\mu (x)\right)\left(\sum _{x\in X}\mu (x)\right)\geq \left(\sum _{x\in X}f(x)\mu (x)\right)\left(\sum _{x\in X}g(x)\mu (x)\right).}

The same inequality (positive correlation) is true when both ƒ and g are decreasing. If one is increasing and the other is decreasing, then they are negatively correlated and the above inequality is reversed.

Similar statements hold more generally, when X {\displaystyle X} is not necessarily finite, not even countable. In that case, μ has to be a finite measure, and the lattice condition has to be defined using cylinder events; see, e.g., Section 2.2 of Grimmett (1999).

For proofs, see Fortuin, Kasteleyn & Ginibre (1971) or the Ahlswede–Daykin inequality (1978). Also, a rough sketch is given below, due to Holley (1974), using a Markov chain coupling argument.

Variations on terminology

The lattice condition for μ is also called multivariate total positivity, and sometimes the strong FKG condition; the term (multiplicative) FKG condition is also used in older literature.

The property of μ that increasing functions are positively correlated is also called having positive associations, or the weak FKG condition.

Thus, the FKG theorem can be rephrased as "the strong FKG condition implies the weak FKG condition".


Probabilistic Version

Let X {\displaystyle X} be a random variable and let g , f {\displaystyle g,f} be real-valued and nondecreasing functions, then

E [ f ( X ) g ( X ) ] E [ f ( X ) ] E [ g ( X ) ] {\displaystyle E\geq EE}

Proof: Let X 1 , X 2 {\displaystyle X_{1},X_{2}} be independent copies of X {\displaystyle X} , and note that from our hypothesis we have that

( g ( X 1 ) g ( X 2 ) ) ( f ( X 1 ) f ( X 2 ) ) 0. {\displaystyle (g(X_{1})-g(X_{2}))(f(X_{1})-f(X_{2}))\geq 0.}

Taking expected value and factoring concludes the proof.

A special case: the Harris inequality

If the lattice X {\displaystyle X} is totally ordered, then the lattice condition is satisfied trivially for any measure μ. In case the measure μ is uniform, the FKG inequality is Chebyshev's sum inequality: if the two increasing functions take on values a 1 a 2 a n {\displaystyle a_{1}\leq a_{2}\leq \cdots \leq a_{n}} and b 1 b 2 b n {\displaystyle b_{1}\leq b_{2}\leq \cdots \leq b_{n}} , then

a 1 b 1 + + a n b n n a 1 + + a n n b 1 + + b n n . {\displaystyle {\frac {a_{1}b_{1}+\cdots +a_{n}b_{n}}{n}}\geq {\frac {a_{1}+\cdots +a_{n}}{n}}\;{\frac {b_{1}+\cdots +b_{n}}{n}}.}

More generally, for any probability measure μ on R {\displaystyle \mathbb {R} } and increasing functions ƒ and g,

R f ( x ) g ( x ) d μ ( x ) R f ( x ) d μ ( x ) R g ( x ) d μ ( x ) , {\displaystyle \int _{\mathbb {R} }f(x)g(x)\,d\mu (x)\geq \int _{\mathbb {R} }f(x)\,d\mu (x)\,\int _{\mathbb {R} }g(x)\,d\mu (x),}

which follows immediately from

R R [ f ( x ) f ( y ) ] [ g ( x ) g ( y ) ] d μ ( x ) d μ ( y ) 0. {\displaystyle \int _{\mathbb {R} }\int _{\mathbb {R} }\,d\mu (x)\,d\mu (y)\geq 0.}

The lattice condition is trivially satisfied also when the lattice is the product of totally ordered lattices, X = X 1 × × X n {\displaystyle X=X_{1}\times \cdots \times X_{n}} , and μ = μ 1 μ n {\displaystyle \mu =\mu _{1}\otimes \cdots \otimes \mu _{n}} is a product measure. Often all the factors (both the lattices and the measures) are identical, i.e., μ is the probability distribution of i.i.d. random variables.

The FKG inequality for the case of a product measure is known also as the Harris inequality after Harris (Harris 1960), who found and used it in his study of percolation in the plane. A proof of the Harris inequality that uses the above double integral trick on R {\displaystyle \mathbb {R} } can be found, e.g., in Section 2.2 of Grimmett (1999).

Simple examples

A typical example is the following. Color each hexagon of the infinite honeycomb lattice black with probability p {\displaystyle p} and white with probability 1 p {\displaystyle 1-p} , independently of each other. Let a, b, c, d be four hexagons, not necessarily distinct. Let a b {\displaystyle a\leftrightarrow b} and c d {\displaystyle c\leftrightarrow d} be the events that there is a black path from a to b, and a black path from c to d, respectively. Then the Harris inequality says that these events are positively correlated: Pr ( a b ,   c d ) Pr ( a b ) Pr ( c d ) {\displaystyle \Pr(a\leftrightarrow b,\ c\leftrightarrow d)\geq \Pr(a\leftrightarrow b)\Pr(c\leftrightarrow d)} . In other words, assuming the presence of one path can only increase the probability of the other.

Similarly, if we randomly color the hexagons inside an n × n {\displaystyle n\times n} rhombus-shaped hex board, then the events that there is black crossing from the left side of the board to the right side is positively correlated with having a black crossing from the top side to the bottom. On the other hand, having a left-to-right black crossing is negatively correlated with having a top-to-bottom white crossing, since the first is an increasing event (in the amount of blackness), while the second is decreasing. In fact, in any coloring of the hex board exactly one of these two events happen — this is why hex is a well-defined game.

In the Erdős–Rényi random graph, the existence of a Hamiltonian cycle is negatively correlated with the 3-colorability of the graph, since the first is an increasing event, while the latter is decreasing.

Examples from statistical mechanics

In statistical mechanics, the usual source of measures that satisfy the lattice condition (and hence the FKG inequality) is the following:

If S {\displaystyle S} is an ordered set (such as { 1 , + 1 } {\displaystyle \{-1,+1\}} ), and Γ {\displaystyle \Gamma } is a finite or infinite graph, then the set S Γ {\displaystyle S^{\Gamma }} of S {\displaystyle S} -valued configurations is a poset that is a distributive lattice.

Now, if Φ {\displaystyle \Phi } is a submodular potential (i.e., a family of functions

Φ Λ : S Λ R { } , {\displaystyle \Phi _{\Lambda }:S^{\Lambda }\longrightarrow \mathbb {R} \cup \{\infty \},}

one for each finite Λ Γ {\displaystyle \Lambda \subset \Gamma } , such that each Φ Λ {\displaystyle \Phi _{\Lambda }} is submodular), then one defines the corresponding Hamiltonians as

H Λ ( φ ) := Δ Λ Φ Δ ( φ ) . {\displaystyle H_{\Lambda }(\varphi ):=\sum _{\Delta \cap \Lambda \not =\emptyset }\Phi _{\Delta }(\varphi ).}

If μ is an extremal Gibbs measure for this Hamiltonian on the set of configurations φ {\displaystyle \varphi } , then it is easy to show that μ satisfies the lattice condition, see Sheffield (2005).

A key example is the Ising model on a graph Γ {\displaystyle \Gamma } . Let S = { 1 , + 1 } {\displaystyle S=\{-1,+1\}} , called spins, and β [ 0 , ] {\displaystyle \beta \in } . Take the following potential:

Φ Λ ( φ ) = { β 1 { φ ( x ) φ ( y ) } if  Λ = { x , y }  is a pair of adjacent vertices of  Γ ; 0 otherwise. {\displaystyle \Phi _{\Lambda }(\varphi )={\begin{cases}\beta 1_{\{\varphi (x)\not =\varphi (y)\}}&{\text{if }}\Lambda =\{x,y\}{\text{ is a pair of adjacent vertices of }}\Gamma ;\\0&{\text{otherwise.}}\end{cases}}}

Submodularity is easy to check; intuitively, taking the min or the max of two configurations tends to decrease the number of disagreeing spins. Then, depending on the graph Γ {\displaystyle \Gamma } and the value of β {\displaystyle \beta } , there could be one or more extremal Gibbs measures, see, e.g., Georgii, Häggström & Maes (2001) and Lyons (2000).

A generalization: the Holley inequality

The Holley inequality, due to Richard Holley (1974), states that the expectations

f i = x X f ( x ) μ i ( x ) x X μ i ( x ) {\displaystyle \langle f\rangle _{i}={\frac {\sum _{x\in X}f(x)\mu _{i}(x)}{\sum _{x\in X}\mu _{i}(x)}}}

of a monotonically increasing function ƒ on a finite distributive lattice X {\displaystyle X} with respect to two positive functions μ1, μ2 on the lattice satisfy the condition

f 1 f 2 , {\displaystyle \langle f\rangle _{1}\geq \langle f\rangle _{2},}

provided the functions satisfy the Holley condition (criterion)

μ 2 ( x y ) μ 1 ( x y ) μ 1 ( x ) μ 2 ( y ) {\displaystyle \mu _{2}(x\wedge y)\mu _{1}(x\vee y)\geq \mu _{1}(x)\mu _{2}(y)}

for all x, y in the lattice.

To recover the FKG inequality: If μ satisfies the lattice condition and ƒ and g are increasing functions on X {\displaystyle X} , then μ1(x) = g(x)μ(x) and μ2(x) = μ(x) will satisfy the lattice-type condition of the Holley inequality. Then the Holley inequality states that

f g μ g μ = f 1 f 2 = f μ , {\displaystyle {\frac {\langle fg\rangle _{\mu }}{\langle g\rangle _{\mu }}}=\langle f\rangle _{1}\geq \langle f\rangle _{2}=\langle f\rangle _{\mu },}

which is just the FKG inequality.

As for FKG, the Holley inequality follows from the Ahlswede–Daykin inequality.

Weakening the lattice condition: monotonicity

Consider the usual case of X {\displaystyle X} being a product R V {\displaystyle \mathbb {R} ^{V}} for some finite set V {\displaystyle V} . The lattice condition on μ is easily seen to imply the following monotonicity, which has the virtue that it is often easier to check than the lattice condition:

Whenever one fixes a vertex v V {\displaystyle v\in V} and two configurations φ and ψ outside v such that φ ( w ) ψ ( w ) {\displaystyle \varphi (w)\geq \psi (w)} for all w v {\displaystyle w\not =v} , the μ-conditional distribution of φ(v) given { φ ( w ) : w v } {\displaystyle \{\varphi (w):w\not =v\}} stochastically dominates the μ-conditional distribution of ψ(v) given { ψ ( w ) : w v } {\displaystyle \{\psi (w):w\not =v\}} .

Now, if μ satisfies this monotonicity property, that is already enough for the FKG inequality (positive associations) to hold.

Here is a rough sketch of the proof, due to Holley (1974): starting from any initial configuration on V {\displaystyle V} , one can run a simple Markov chain (the Metropolis algorithm) that uses independent Uniform random variables to update the configuration in each step, such that the chain has a unique stationary measure, the given μ. The monotonicity of μ implies that the configuration at each step is a monotone function of independent variables, hence the product measure version of Harris implies that it has positive associations. Therefore, the limiting stationary measure μ also has this property.

The monotonicity property has a natural version for two measures, saying that μ1 conditionally pointwise dominates μ2. It is again easy to see that if μ1 and μ2 satisfy the lattice-type condition of the Holley inequality, then μ1 conditionally pointwise dominates μ2. On the other hand, a Markov chain coupling argument similar to the above, but now without invoking the Harris inequality, shows that conditional pointwise domination, in fact, implies stochastically domination. Stochastic domination is equivalent to saying that f 1 f 2 {\displaystyle \langle f\rangle _{1}\geq \langle f\rangle _{2}} for all increasing ƒ, thus we get a proof of the Holley inequality. (And thus also a proof of the FKG inequality, without using the Harris inequality.)

See Holley (1974) and Georgii, Häggström & Maes (2001) for details.

See also

References

Categories:
FKG inequality Add topic