4. Median of Two Sorted Arrays - Detailed Explanation
Problem Statement
You are given two sorted arrays, nums1
and nums2
, of sizes m and n respectively. The task is to find the median of the two sorted arrays. The overall run time complexity should be O(log(m+n)). The median is defined as the middle element when the two arrays are combined in sorted order. If the total number of elements is odd, the median is the middle element; if even, it is the average of the two middle elements.
Examples
Example 1
- Input: nums1 = [1, 3], nums2 = [2]
- Output: 2.0
- Explanation:
Combined sorted array: [1, 2, 3]. The middle element is 2.
Example 2
- Input: nums1 = [1, 2], nums2 = [3, 4]
- Output: 2.5
- Explanation:
Combined sorted array: [1, 2, 3, 4]. The two middle elements are 2 and 3, so the median is (2 + 3) / 2 = 2.5.
Approaches
There are several ways to solve this problem:
1. Merging Approach
- Idea:
Merge the two arrays into one sorted array and then compute the median. - Time Complexity: O(m+n)
- Drawback:
Although simple, merging the entire arrays is not optimal if we only need the median.
2. Binary Search Partition (Optimal)
- Idea:
Use binary search on the smaller array to partition both arrays into two halves such that:- The left half contains the first half of the combined sorted elements.
- The right half contains the second half.
- If the total number of elements is odd, the median is the minimum element of the right half.
- If even, the median is the average of the maximum element from the left half and the minimum element from the right half.
- Time Complexity: O(log(min(m, n)))
- Pros:
Efficient and meets the logarithmic time requirement.
Step-by-Step Walkthrough of the Binary Search Partition Approach
-
Ensure the Smaller Array is Used for Binary Search:
LetA = nums1
andB = nums2
. IfA
is longer thanB
, swap them so that binary search is performed on the smaller array. -
Determine the Total and Half Lengths:
Letm
be the length ofA
andn
be the length ofB
.
Total elements = m + n
Half = (m + n) // 2 -
Binary Search Setup:
Initialize two pointers,left
andright
, to perform binary search overA
. -
Partitioning the Arrays:
- Choose an index
i
fromA
(using binary search), and calculatej = half - i
forB
. - Define:
Aleft = A[i-1]
ifi > 0
, else-∞
Aright = A[i]
ifi < m
, else+∞
Bleft = B[j-1]
ifj > 0
, else-∞
Bright = B[j]
ifj < n
, else+∞
- Choose an index
-
Check for Correct Partition:
The partition is valid if:Aleft <= Bright
andBleft <= Aright
- If valid:
- If the total number of elements is odd, the median is
min(Aright, Bright)
. - If even, the median is
(max(Aleft, Bleft) + min(Aright, Bright)) / 2
.
- If the total number of elements is odd, the median is
- If not valid:
- If
Aleft > Bright
, move theright
pointer left. - Otherwise, move the
left
pointer right.
- If
-
Return the Result:
Once the correct partition is found, compute and return the median.
Python Code
Java Code
Explanation
-
Input Adjustment:
The solution first ensures that the binary search is performed on the smaller array (swapping if necessary). This helps minimize the number of iterations. -
Binary Search Setup:
We set the binary search boundaries (left
andright
) for the smaller array. We then calculate the partition indices (i
fornums1
andj
fornums2
) such that the left partition holds half of the total elements. -
Partitioning and Edge Handling:
- If
i == 0
orj == 0
, there are no elements on the left, so we treat the left value asInteger.MIN_VALUE
. - Similarly, if
i == m
orj == n
, there are no elements on the right, so we treat the right value asInteger.MAX_VALUE
.
- If
-
Correct Partition Check:
We check if the current partition is valid by ensuring that the maximum element on the left side of both arrays is less than or equal to the minimum element on the right side. -
Median Calculation:
- If the total number of elements is odd, the median is the maximum element of the left partitions.
- If even, the median is the average of the maximum element on the left and the minimum element on the right.
-
Output:
The main method tests the solution with sample test cases.
Complexity Analysis
-
Time Complexity:
O(log(min(m, n))) — Binary search is performed on the smaller array. -
Space Complexity:
O(1) — Only constant extra space is used.
Common Mistakes and Edge Cases
- Not Handling Edge Cases:
- When one of the arrays is empty.
- When the partition indices are at the boundaries (i.e., no elements on one side, requiring use of negative or positive infinity).
- Incorrect Partition Adjustment:
Failing to correctly adjust the binary search boundaries can lead to an infinite loop or incorrect partitioning.
Related Problems
-
Median of Two Sorted Arrays II (if available):
Variations on finding medians in sorted data structures. -
Kth Largest Element in an Array:
Involves selection algorithms and partitioning. -
Merge Sorted Arrays:
Combining two sorted arrays efficiently.
GET YOUR FREE
Coding Questions Catalog
