MOU2H - Editorial

darkshadows · August 11, 2014, 3:13pm

PROBLEM LINK:

[Practice][111]
[Contest][222]

Author: Vitalij Kozhukhivskij
Tester: Praveen Dhinwa and Hiroto Sekido
Editorialist: Lalit Kundu

DIFFICULTY:

Medium

PREREQUISITES:

Dynamic Programming

PROBLEM:

You have a sequence of N elements H₁, H₂, …, H_N. A climb is defined by a nonempty sequence (p₁, p₁+1), (p₂, p₂+1), …, (p_s, p_s+1), where p_k + 1 ≤ p_{k+1} for k = 1, 2, …, s − 1.
Two climbs, say (p₁, p₁+1), (p₂, p₂+1), …, (p_s, p_s+1) and (q₁, q₁+1), (q₂, q₂+1), …, (q_t, q_t+1) are different if and only if

s ≠ t, or
there exists at least one k such that 1 ≤ k ≤ min(s, t) and H_{p_k+1} – H_{p_k} ≠ H_{q_k+1} – H_{q_k}.

After reading the problem carefully and thinking it over for a while, you will realise that what is actually asked for is the number of distinct non-empty subsequences of the difference sequence H₂ − H₁, H₃ − H₂, …, H_N − H_{N−1}.
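As a minimal sketch of this reduction, the difference sequence could be built as follows. The function name `heightDifferences` and the 0-indexed `std::vector` input are illustrative assumptions, not taken from the author's or tester's code.

```cpp
#include <cstddef>
#include <vector>

// Builds d[i] = h[i + 1] - h[i] for a 0-indexed height array h.
// N heights give N - 1 differences; fewer than two heights give none.
std::vector<long long> heightDifferences(const std::vector<long long>& h) {
    std::vector<long long> d;
    if (h.size() < 2) return d;
    d.reserve(h.size() - 1);
    for (std::size_t i = 0; i + 1 < h.size(); ++i) {
        d.push_back(h[i + 1] - h[i]);
    }
    return d;
}
```

For heights 1, 3, 6, 4 this yields the differences 2, 3, -2, and the rest of the problem works on that sequence only.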

EXPLANATION:

So our problem is reduced to: given a sequence A₁, A₂, …, A_N (from here on, N denotes the length of this sequence), find the number of distinct subsequences. A subsequence is a sequence that can be derived from A by deleting some elements without changing the order of the remaining elements. Two subsequences are distinct if their lengths differ or if they differ in at least one corresponding position.

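dp[i] = number of distinct subsequences ending with A[i] that were not already counted at an earlier occurrence of the same value.
sum[i] = dp[0] + dp[1] + … + dp[i]. So sum[N] counts every distinct subsequence including the empty one, and sum[N] - 1 will be our answer, since a climb must be nonempty.
last[x] = last position at which the value x occurred in the array A so far (0 if it has not occurred yet).

The empty sequence has exactly one subsequence (the empty one), so dp[0] = sum[0] = 1.

for i = 1 to N:
    dp[i] = sum[i-1]
    if last[A[i]] > 0:
        dp[i] = dp[i] - sum[last[A[i]] - 1]
    sum[i] = sum[i-1] + dp[i]
    last[A[i]] = i
print sum[N] - 1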

Initially, we assume A[i] can be appended to every distinct subsequence of the first i − 1 elements (including the empty one), which gives sum[i-1]. This may violate the condition that the counted subsequences need to be distinct. Remember that last[A[i]] gives the last position at which the value A[i] appeared until now. The only subsequences we overcount are those that the previous occurrence of A[i] was already appended to, and there are sum[last[A[i]] - 1] of them, so we subtract those.

Using std::map in C++ to store last would have timed out; this was intentional. Also take care with the modulo operations: the subtraction can go negative, so add the modulus back before reducing.
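The recurrence can be sketched in C++ as below. This is an illustration under stated assumptions, not the author's or tester's code: the name `countDistinctSubsequences` is made up, the modulus is passed in as `mod` (use the one from the problem statement), and `last` is a flat array indexed by value shifted by the minimum so that negative differences work without a map.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Counts the distinct non-empty subsequences of a, modulo mod.
long long countDistinctSubsequences(const std::vector<int>& a, long long mod) {
    assert(mod > 1);
    const std::size_t n = a.size();
    if (n == 0) return 0;

    // Values may be negative, so shift by the minimum to index a flat array.
    const long long lo = *std::min_element(a.begin(), a.end());
    const long long hi = *std::max_element(a.begin(), a.end());
    std::vector<std::size_t> last(static_cast<std::size_t>(hi - lo) + 1, 0);  // 0 = not seen yet

    std::vector<long long> sum(n + 1, 0);
    sum[0] = 1;  // dp[0] = 1: the empty subsequence
    for (std::size_t i = 1; i <= n; ++i) {
        std::size_t& prev = last[static_cast<std::size_t>(a[i - 1] - lo)];
        long long dp = sum[i - 1];  // append A[i] to everything seen so far
        if (prev > 0) {
            // Remove what the previous occurrence of this value already produced.
            dp = (dp - sum[prev - 1] + mod) % mod;
        }
        sum[i] = (sum[i - 1] + dp) % mod;
        prev = i;
    }
    return (sum[n] - 1 + mod) % mod;  // drop the empty subsequence
}
```

For A = 1, 2, 1 the distinct non-empty subsequences are 1, 2, 11, 12, 21 and 121, so the function returns 6. Adding `mod` before taking the remainder keeps every intermediate value non-negative.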

ALTERNATIVE SOLUTION:

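If all elements in A are distinct, the number of subsequences (counting the empty one) is 2^N. Let dp[i] be the answer for the prefix A_1 to A_i. Appending A[i] to every subsequence of the first i − 1 elements doubles the count. If the value A[i] already occurred, let j < i be its most recent previous occurrence: every subsequence of A_1 to A_{j-1} followed by this value was already produced by A[j] and is now produced again by A[i]. There are dp[j-1] such duplicates, so we subtract them.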

This pseudocode makes it clearer:

dp[0] = 1 // the empty prefix has one subsequence: the empty one
for i = 1 to N:
    dp[i] = dp[i-1] * 2
    if A[i] last occurred at index j < i:
        dp[i] = dp[i] - dp[j-1]
print dp[N] - 1 // exclude the empty subsequence, since a climb is nonempty
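A hedged C++ sketch of this doubling recurrence follows. The function name `countByDoubling` and the `mod` parameter are assumptions for illustration, and `std::unordered_map` is used only to keep the example short; for the largest inputs a flat array indexed by shifted value is likely to be faster.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Counts the distinct non-empty subsequences of a, modulo mod,
// using dp[i] = 2 * dp[i-1] - dp[j-1], where j is the previous occurrence of A[i].
long long countByDoubling(const std::vector<long long>& a, long long mod) {
    assert(mod > 1);
    const std::size_t n = a.size();
    std::vector<long long> dp(n + 1, 0);
    dp[0] = 1;  // the empty prefix has one subsequence: the empty one

    std::unordered_map<long long, std::size_t> last;  // value -> last 1-based index
    last.reserve(n);
    for (std::size_t i = 1; i <= n; ++i) {
        dp[i] = dp[i - 1] * 2 % mod;
        const auto it = last.find(a[i - 1]);
        if (it != last.end()) {
            // Subtract the duplicates created by the previous occurrence.
            dp[i] = (dp[i] - dp[it->second - 1] + mod) % mod;
        }
        last[a[i - 1]] = i;
    }
    return (dp[n] - 1 + mod) % mod;  // exclude the empty subsequence
}
```

For A = 3, 3, 3 the table is dp = 1, 2, 3, 4, so the result is 3 (the subsequences 3, 33 and 333).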

AUTHOR’S AND TESTER’S SOLUTIONS:

[Author’s solution][131]
[Tester’s solution][132]

Note: Editorial inspired by a post on Stack Overflow.
[111]: http://www.codechef.com/problems/MOU2H
[222]: http://www.codechef.com/AUG14/problems/MOU2H
[131]: http://www.codechef.com/download/Solutions/2014/August/Setter/MOU2H.cpp
[132]: http://www.codechef.com/download/Solutions/2014/August/Tester/MOU2H.cpp

grebnesieh · August 11, 2014, 8:48pm

I used the same algorithm in Python.

http://www.codechef.com/viewsolution/4533860
http://www.codechef.com/viewsolution/4542578

and several other variations, all of them gave a TLE. Any suggestions as to what I can do to reduce the runtime so that it passes?

I can’t find a single Python AC out of all the submissions for this problem.

xellos0 · August 11, 2014, 8:59pm

Nah, getting Python to do a serious O(N) algorithm with N=10^6 in less than several seconds is quite hard (at least in my experience). Moral lesson: when getting TLE in Python, ditch it and write the same in C++.

grebnesieh · August 11, 2014, 11:21pm

I did try C++, but I never really use it for competitive programming and kept getting segmentation faults. I really need to practice with that.

Anyways, shouldn’t they raise the Python time limit so as to allow the submissions with acceptable time complexity? :\

incognito_103 · August 14, 2014, 2:41am

In the tester’s solution, could you please explain in a bit more detail what this line does?

int mem[8100000], *used = mem + 4001000;

I mean this kind of code is kinda new to me.

placibo · August 18, 2014, 11:57am

Thanks for the post. The solution is interesting.

freeman92 · August 18, 2014, 1:21pm

@incognito_103 >> In the above line, mem is declared as an int array of size 8100000, and used is a pointer to int that points to element 4001000 of mem. The purpose is that used can then be indexed with negative indices while still referring to valid locations in the mem array. (In the above problem the difference between two adjacent heights can be as low as -4*10^6 and as high as 4*10^6, so we need an array of at least 8*10^6 + 1 elements to cover every index from -4*10^6 to 4*10^6.)

Illustrative example (with a smaller array):

Say we declare an array named mem with size 3, and an int pointer to position 1 of mem (i.e. int* ptr = mem + 1).

mem array:

0   1   2
----------
|  |  |  |
----------
    ^
    |
    ptr

So ptr[0] refers to location 1 of the mem array, ptr[-1] refers to location 0, and ptr[1] refers to location 2.

Accessing ptr[i] with i < -1 or i > 1 is out of bounds. In C++ this is undefined behaviour: it may crash, or it may silently read or corrupt other memory.
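A small sketch of the same trick, assuming a hypothetical value range of [-5, 5]. The names `LIMIT` and `markSeen` are made up for this illustration and do not appear in the tester's solution.

```cpp
#include <cassert>

const int LIMIT = 5;        // hypothetical bound: values lie in [-LIMIT, LIMIT]
int mem[2 * LIMIT + 1];     // global array, zero-initialised, 2 * LIMIT + 1 slots
int* used = mem + LIMIT;    // used[0] is mem[LIMIT], so used[-LIMIT] is mem[0]

// Records that value was seen at position pos (1-based).
// Returns the previously recorded position, or 0 if there was none.
int markSeen(int value, int pos) {
    assert(-LIMIT <= value && value <= LIMIT);  // stay inside mem
    const int previous = used[value];
    used[value] = pos;
    return previous;
}
```

Calling markSeen(-5, 1) writes to mem[0] through used[-5], and markSeen(5, 2) writes to mem[10] through used[5]; any value outside [-5, 5] would fall outside mem, which is why the assertion is there.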

freeman92 · August 18, 2014, 1:22pm

@incognito_103 >> see below for answer (i wasn’t able to fit the answer in comment field).

tripshock · August 29, 2014, 9:19pm

In C++, using std::map may cause TLE, but using std::unordered_map will not. That is what you should be using anyways, since there is no need to maintain the elements in a sorted order.