MINMAX : Editorial

admin · January 31, 2013, 12:42pm

PROBLEM LINKS

Practice

Contest

Problem Information

Problem Name: Wormtongue’s Mind

Author’s Name: Min-Max Expression

Problem Code: MINMAX

Alphabet: H

Difficulty:

Medium

Pre-requisites:

Probability, Calculus (integration of polynomials)

Problem:

Given an expression consisting of min and max operations over N independent U[0, 1] random variables x₁, x₂, …, x_N, find its expected value.

Solution:

Distributions

It turns out that this problem, though it looks hard, can be solved by figuring out the probability distribution of the expression.

Lets try to find the CDF (Cumulative Distribution Function) of expressions recursively. Note that we consider x ∈ [0, 1] only.

For an expression of the form “x” (i.e. just a uniform random variable X), F_X(x) := Prob {X <= x} = x

For an expression of the form “max(expr1, expr2)” (i.e. something that looks like X = max(X₁, X₂) where X₁ and X₂ are expressions),

F_X(x) := Prob {X <= x}

= Prob {max(X₁, X₂) <= x}

= Prob {X₁ <= x and X₂ <= x}.

Now, since X₁ and X₂ consist of independent random variables, we get that

Prob {X₁ <= x and X₂ <= x}

= Prob {X₁ <= x} * Prob {X₂ <= x}

= F_X₁(x) * F_X₂(x)

Similarly for an expression of the form “min(expr1, expr2)”. Let X = min(X₁, X₂)$. Now,

F_X(x) = Prob{min(X₁, X₂) <= x} = Prob {X₁ <= x or X₂ <= x}. In terms of sets, this becomes

Prob( {X₁ <= x} ∪ {X₂ <= x}).

Finally, using Prob(A ∪ B) = Prob(A) + Prob(B) - Prob(A ∩ B), along with (as in the case of max) the fact that the random variables are independent, we get

F_X(x) = F_X₁(x) + F_X₂(x) - F_X₁(x) * F_X₂(x).

Thus, we notice that in all cases, the distribution turns out to be some polynomial in x. From here, finding the expected value can be got by integration.

Integration

Recall that ∫ xⁿ = ¹⁄_n+1 xⁿ⁺¹ (ignoring constants of integration etc).

Also, E[X] = ∫₀¹ x f_X(x) dx, where f_X(x) is the probability density function of X, and is dF_X/dx. Thus, if F_X = ∑_i=0^k c_i xⁱ, then, xf_X = ∑i=1^k i c_i xⁱ, and hence the integral (with limits from 0 to 1) would be ∑_i=1^k ⁱ⁄_i+1 c_i xⁱ.

Alternately, once you have the CDFs, then you can also use E[X] = ∫₀^∞ Prob(X>=x) dx, which holds whenever X is a non-negative random variable. In this case, this is

E[X] = ∫₀¹(1-F_X(x))dx

= 1 - ∫₀¹ F_X dx

Note on Implementation:

You are given the input in the form of a pre-order traversal of the expression tree. It would be good to actually build a tree out of this, and store “cdfs” related to each node (which corresponds to an “expression”)

Also, there is heavy use of Polynomials. Hence, it is also advised to use a Polynomial class imbued with operations of “+”, “-” and “*”. Finally, with the given constraints (N <= |S|/2 where S is the input string length), we get that degree of polynomial is linear in number of random variables, and hence O(N^2) per polynomial multiplication is good enough. Polynomials can be stored using an array of 64-bit integers that store the coefficients.

Finally, due to precision requirements, using a double (even for the final calculation) is not good enough (atleast by this approach of calculating polynomials etc.). Hence it was specified to use long double and long long datatypes. In Java, BigDecimal solution passes.