2364. Count Number of Bad Pairs - Detailed Explanation
Problem Statement
You are given a 0-indexed integer array nums
. We define a pair of indices (i, j)
to be a bad pair if i < j
and j - i != nums[j] - nums[i]
. In other words, the difference between the indices is not equal to the difference between the corresponding values. The task is to count the total number of bad pairs in the array.
Example 1:
- Input:
nums = [4, 1, 3, 3]
- Output:
5
- Explanation: There are 6 possible pairs of indices
(i, j)
withi < j
for this array. Out of these, 5 pairs are bad:(0, 1)
is a bad pair because1 - 0 ≠ 1 - 4
(i.e., index difference 1 vs value difference -3).(0, 2)
is a bad pair because2 - 0 ≠ 3 - 4
(2 vs -1).(0, 3)
is a bad pair because3 - 0 ≠ 3 - 4
(3 vs -1).(1, 2)
is a bad pair because2 - 1 ≠ 3 - 1
(1 vs 2).(2, 3)
is a bad pair because3 - 2 ≠ 3 - 3
(1 vs 0).
Only one pair (which is(1, 3)
) is not a bad pair in this case. Therefore, the total number of bad pairs is 5.
Example 2:
- Input:
nums = [1, 2, 3, 4, 5]
- Output:
0
- Explanation: There are no bad pairs because for every pair
(i, j)
, the conditionj - i = nums[j] - nums[i]
holds. The array is strictly increasing by 1 at each step, so the difference in indices equals the difference in values for any pair. Hence, all pairs are "good" and the count of bad pairs is 0.
Example 3:
- Input:
nums = [2, 2, 2]
- Output:
3
- Explanation: The array has 3 elements, so there are 3 possible pairs:
(0,1)
,(0,2)
, and(1,2)
. All of these pairs turn out to be bad. For instance, for pair(0,1)
, we have1 - 0 = 1
but2 - 2 = 0
for the values, which do not match. Similarly, every pair of distinct indices has index difference 1 or 2, while the value differences are 0 (because all values are equal). Thus, all 3 pairs are bad.
Constraints:
1 <= nums.length <= 100,000
1 <= nums[i] <= 1,000,000,000
The result can be large (in the order of tens of billions for the largest arrays), so make sure to use a data type (like 64-bit integer) that can handle large counts.
Hints
-
Hint 1: Try to understand the condition for a "bad pair." If a pair is not bad (i.e., a "good" pair), it satisfies
j - i = nums[j] - nums[i]
. See if this equation can be rearranged or transformed into a simpler form. This might reveal a pattern or a property that you can exploit. -
Hint 2: Think about the total number of pairs possible in an array of length
n
. If you can efficiently count how many pairs are good, you can subtract that from the total to get the number of bad pairs. Counting good pairs might be easier if you find a suitable way to categorize or group indices that form good pairs. -
Hint 3: Consider using a data structure like a hash map (dictionary) to count occurrences of a certain value related to each index. The equation from Hint 1 can guide you on what value to use. By counting these occurrences, you can determine how many pairs share that property (hence, are good pairs). Avoid brute force comparisons for each pair since that would be too slow for large inputs.
Multiple Approaches
Brute Force Approach
A straightforward way to solve the problem is to check every possible pair of indices and count those that satisfy the "bad pair" condition. This means using two nested loops: the outer loop goes through each index i
, and the inner loop checks every index j > i
. For each pair (i, j)
, we verify if j - i
is not equal to nums[j] - nums[i]
. If they are not equal, we increment a bad pair counter.
Step-by-step (Brute Force):
- Initialize a counter
bad_count = 0
. - Loop
i
from0
ton-1
(wheren
is the length ofnums
). - For each
i
, loopj
fromi+1
ton-1
. - Check the condition: if
j - i != nums[j] - nums[i]
, then this is a bad pair. Incrementbad_count
. - After checking all pairs,
bad_count
will hold the total number of bad pairs.
Example (Brute Force):
Consider nums = [3, 1, 2]
. The array length n = 3
, so total pairs to check = 3:
- Pair
(0, 1)
:- Index difference =
1 - 0 = 1
- Value difference =
nums[1] - nums[0] = 1 - 3 = -2
- These are not equal (1 ≠ -2), so this is a bad pair.
- Index difference =
- Pair
(0, 2)
:- Index difference =
2 - 0 = 2
- Value difference =
nums[2] - nums[0] = 2 - 3 = -1
- 2 ≠ -1, so this is a bad pair.
- Index difference =
- Pair
(1, 2)
:- Index difference =
2 - 1 = 1
- Value difference =
nums[2] - nums[1] = 2 - 1 = 1
- 1 = 1, so this pair is not bad (it's a good pair).
- Index difference =
Out of these, 2 pairs are bad. The brute force approach would correctly count bad_count = 2
for this example.
Analysis of Brute Force: This approach is easy to understand, but it is extremely inefficient for large arrays. It checks every pair of indices, resulting in \mathcal{O}(n^2) time complexity. For n = 100,000
, the number of pairs is on the order of 10^{10}, which is far too many to check one by one. Therefore, the brute force method will time out for large inputs and is not feasible within the problem constraints.
Optimized Approach (Using Math and Hash Map)
To optimize, we need to reduce the pair checking. A key observation comes from analyzing the condition for good pairs (the opposite of bad pairs). A pair (i, j)
is good if:
j - i = nums[j] - nums[i].
We can rearrange this equation:
j - i = nums[j] - nums[i]
=> j + nums[i] = i + nums[j]
=> j - nums[j] = i - nums[i]
This rearrangement shows that for a pair to be good, the value of i - nums[i]
must be the same for both indices. In other words, if we define a new value for each index k
as diff[k] = k - nums[k]
, then any two indices with the same diff
value form a good pair.
Why this works: If diff[i] = diff[j]
(meaning i - nums[i] = j - nums[j]
), then rearranging gives j - i = nums[j] - nums[i]
, which is exactly the condition for a good pair. Conversely, if diff[i]
is different from diff[j]
, then j - i
will not equal nums[j] - nums[i]
, meaning (i, j)
is a bad pair.
Strategy: Using this insight, we can count good pairs efficiently:
-
Compute
diff[i] = i - nums[i]
for each indexi
. -
Use a hash map (or dictionary) to count how many times each
diff
value occurs. -
For each unique
diff
value with frequencyf
in the map:- The number of good pairs contributed by this
diff
value is the number of ways to choose 2 indices out off
. This is given by the combination formula C(f, 2) = \frac{f \times (f-1)}{2} (which is 0 if f is less than 2).
- The number of good pairs contributed by this
-
Sum up the counts of good pairs for all
diff
values. -
Calculate total pairs in the array: total pairs = \frac{n \times (n-1)}{2}, where
n
is the length ofnums
. -
Finally, bad pairs = total pairs - good pairs.
This avoids explicitly checking every pair. We group indices by their diff
values and count pair combinations within each group.
Step-by-step (Optimized):
-
Create an empty hash map
freq
to count frequencies ofdiff
values. -
Loop through the array indices
i = 0
ton-1
:- Compute
diff = i - nums[i]
. - Update
freq[diff] += 1
(increment the count for that diff value).
- Compute
-
Initialize
good_count = 0
. -
For each entry
(value, count)
infreq
:- If
count
is at least 2, add \frac{count \times (count-1)}{2} togood_count
. - (If
count
is 0 or 1, it contributes 0 good pairs.)
- If
-
Compute
total_pairs = frac\{n \times (n-1)}{2}
. -
Compute
bad_count = total_pairs - good_count
. -
Return or output
bad_count
.
Example (Optimized):
Revisiting the Example 1 array nums = [4, 1, 3, 3]
:
- Calculate
diff
for each index:- Index 0:
diff[0] = 0 - 4 = -4
- Index 1:
diff[1] = 1 - 1 = 0
- Index 2:
diff[2] = 2 - 3 = -1
- Index 3:
diff[3] = 3 - 3 = 0
- Index 0:
- Frequency map of
diff
values:
{-4: 1, 0: 2, -1: 1}
Here,-4
occurs once,0
occurs twice, and-1
occurs once. - Count good pairs from these frequencies:
- The value
-4
with frequency 1 gives 0 good pairs (need at least 2 indices to form a pair). - The value
0
with frequency 2 gives C(2, 2) = 1 good pair. (Those two indices withdiff = 0
form a good pair; indeed, these are indices 1 and 3, which we saw was the only good pair.) - The value
-1
with frequency 1 gives 0 good pairs. - Total
good_count = 1
.
- The value
- Total pairs in array of length 4: C(4, 2) = \frac{4 \times 3}{2} = 6.
- Bad pairs = total pairs - good pairs = 6 - 1 = 5, which matches the output.
This optimized method computes the answer in a single pass (to build the map) plus a second pass over the map entries, which is much more efficient than checking every pair.
Additional Optimization: Instead of computing total pairs and subtracting, we could directly count bad pairs in one pass. For each index as we iterate, determine how many previous indices do not have the same diff
value. For example, at index i
, the number of pairs (j, i)
(with j < i
) that are bad is i
(total previous indices) minus the number of previous indices with the same diff
as i
. We can maintain a running count of seen diff
values and accumulate bad pairs on the fly. However, whether you count good pairs or bad pairs first, the efficiency is similar. The main idea is using the hash map to avoid the double loop.
Code Implementations
Python Implementation
Explanation: The Python function uses a single loop to build the frequency map and count good pairs on the fly. For each index i
, if a diff
value has appeared before, it means there are some previous indices with the same diff
. Each such previous index forms a good pair with i
. We add the number of those previous occurrences (freq[diff]
) to the good_count
. Then we update the frequency of that diff
. Finally, we compute total pairs and subtract good_count
to get bad_count
. The example usage runs the function on the sample test cases to illustrate the expected results.
Java Implementation
Explanation: The Java implementation follows the same logic. We use a HashMap<Long, Long>
to store the frequency of each diff
value (Long
is used for both key and value because indices and nums
values can lead to diff
values that exceed the range of 32-bit int). We iterate through the array, updating goodCount
whenever we find a diff
that has been seen before. We use long
for goodCount
, totalPairs
, and badCount
to handle large values. The main method tests the function on some example inputs. Running this main method will print out the results for the sample arrays.
Complexity Analysis
-
Brute Force Approach:
-
Time Complexity: O(n^2). We have a nested loop that potentially checks every pair of
n
elements. Forn = 100,000
, this is on the order of 10^{10} operations, which is not feasible in a reasonable time. -
Space Complexity: O(1) (constant extra space), aside from using a few variables for counting. The brute force method doesn't use additional data structures that grow with input size (it only iterates through the input).
-
-
Optimized Approach (Hash Map Counting):
- Time Complexity: O(n) on average. We make a single pass through the array to compute frequencies and another pass (over the map) to calculate the result. Hash map operations (insert and lookup) are amortized constant time on average. In the worst case, hash collisions could degrade performance, but with a good hash function and distribution (which we have for simple arithmetic differences), it remains efficient. Overall, the time complexity is linear in
n
, which is suitable for n up to 100,000. - Space Complexity: O(n). In the worst case, all
diff
values could be distinct (for example, if the array is strictly increasing by more than 1 each time, likenums = [0, 2, 4, 6, ...]
). In that scenario, the hash map would storen
entries. However, each entry is just a count of a particulardiff
. The space used is proportional ton
in the worst case. The input array itself uses O(n) space, but typically we don't count that since it's given. The additional space for the map is at most linear in the input size.
- Time Complexity: O(n) on average. We make a single pass through the array to compute frequencies and another pass (over the map) to calculate the result. Hash map operations (insert and lookup) are amortized constant time on average. In the worst case, hash collisions could degrade performance, but with a good hash function and distribution (which we have for simple arithmetic differences), it remains efficient. Overall, the time complexity is linear in
Note on Arithmetic Overflow: When computing the total number of pairs or adding up pair counts, the numbers can become large. For example, with n = 100,000
, total pairs = 100000 \times 99999 / 2 ≈ 5 \times 10^{9} (5 billion). This fits within a 64-bit integer range (which goes up to about 9.22 \times 10^{18} for signed 64-bit), but not within 32-bit. So, in languages like Java or C++, we use a larger integer type (long
in Java) to store such values. Python's integer type can handle big integers automatically.
Common Mistakes and Edge Cases
Common Mistakes:
-
Using the Wrong Formula: It's easy to mix up the condition. Remember that the condition given is for a bad pair (
j - i != nums[j] - nums[i]
). It's often easier to focus on the complementary "good pair" condition (j - i == nums[j] - nums[i]
) when devising a solution. Some might incorrectly set up the equation or grouping. The correct transformation is to comparei - nums[i]
(or equivalentlynums[i] - i
) across indices. -
Double Counting or Order Issues: When counting pairs, ensure each pair is counted only once. A mistake would be to count both
(i, j)
and(j, i)
separately. By definition,i < j
ensures each unordered pair is counted once. Also, when using combinations or the hash map method, be careful not to count pairs wherei
andj
are the same index (which isn't a valid pair). -
Not Using Long for Large Counts: In languages with fixed-size integer types, forgetting to use a 64-bit integer for the result can cause overflow errors or incorrect results for large inputs. Always check the range of values. In this problem, the count of pairs can exceed the 32-bit integer range.
-
Inefficient Approaches on Large Input: Trying a brute force solution or any approach that is worse than O(n log n) will likely time out for the upper constraint. Sometimes new problem solvers implement triple nested loops by mistake or other overly complex logic. It's important to simplify the condition and use mathematics to avoid unnecessary loops.
Edge Cases to Consider:
-
Small Arrays: The smallest array length is 1. In this case, there are no pairs at all, so the result should be 0. Ensure the code handles
n=1
(or generally cases where no pair exists) correctly. -
All Pairs Good: It's possible that no pair is bad (as in Example 2, where the output was 0). For instance, any array that forms an arithmetic progression with difference 1 (like
[5, 6, 7, 8]
) will havej - i == nums[j] - nums[i]
for all pairs. The solution should correctly return 0 for such cases. -
All Pairs Bad: It's also possible that every possible pair is bad. We saw an example with
nums = [2, 2, 2]
where this happened. Another scenario is if the array values are all equal or if they form an arithmetic progression with a difference of 0 (constant array) — in such cases, for anyi < j
,j - i
will be positive whilenums[j] - nums[i] = 0
, so the pair is bad. The solution should handle this and return the total number of pairs for such input. -
Large Values and Maximum Length: The solution should handle the upper limits:
n = 100,000
and values up to 10^9. The efficient approach described uses O(n) time and O(n) space, which will work for these limits. Just ensure that any arithmetic (like computing total pairs or intermediate sums) is done in a safe way (avoiding overflow in languages like Java/C++ as discussed).
Alternative Variations of the Problem
This problem is about counting index pairs that satisfy (or do not satisfy) a certain condition. There are several related problems and variations where a similar counting strategy or insight is useful:
-
Number of Good Pairs (LeetCode 1512): Instead of a custom difference condition, this simpler problem asks you to count pairs
(i, j)
wherenums[i] == nums[j]
(withi < j
). The direct solution uses a similar idea of counting frequencies of each value and summing up combinations C(f, 2) for each frequency. It's a good warm-up problem for understanding the combination counting technique. -
K-diff Pairs in an Array (LeetCode 532): Here, you need to count unique pairs
(i, j)
such that the absolute difference betweennums[i]
andnums[j]
isk
. While the condition is different, it still involves examining differences. Common solutions involve sorting and using two pointers or using a set/hash map to find complements (e.g., for each numberx
, check ifx+k
exists). This problem helps practice counting pairs under a condition without double counting. -
Subarray Sum Equals K (LeetCode 560): This problem is about subarrays (contiguous segments) rather than pairs of indices, but it uses a very similar pattern of using a hash map to accumulate counts of something (in that case, prefix sum values). To count subarrays that sum to
k
, you count how often a certain prefix sum appears. The approach of using a running count and a hash map for frequencies is analogous to what we did for good pairs usingdiff
. It shows how transforming a problem (sums or differences) and using counting can solve counting problems efficiently. -
Count Nice Pairs in an Array (LeetCode 1814): This is a more complex variation where the condition involves the values and their reversed digits. A pair of indices is considered "nice" if
nums[i] + rev(nums[j]) == nums[j] + rev(nums[i])
. It turns out you can reduce this condition to something likenums[i] - rev(nums[i]) == nums[j] - rev(nums[j])
. This again leads to grouping by a transformed value (herenums[i] - rev(nums[i])
) similarly to how we grouped byi - nums[i]
. The solution uses a hash map count of those transformed values to count pairs, much like the approach for bad pairs. It's a great example of applying the same counting principle to a different problem. -
Pairs of Songs With Total Durations Divisible by 60 (LeetCode 1010): Although this is about music and sums, it's another pairs counting problem. You are given song durations and need to count how many pairs sum up to a multiple of 60. A common solution involves using the remainders of durations mod 60 and counting complementary remainder pairs. This shows another scenario of counting with a hash map (or array of counts) by categorizing elements (in this case, by remainder).
GET YOUR FREE
Coding Questions Catalog
