863. All Nodes Distance K in Binary Tree - Detailed Explanation
Problem Statement
Given the root of a binary tree, a target node target
, and an integer k
, the task is to find all nodes in the tree that are distance k
from the target node. The distance between two nodes is defined by the number of edges in the shortest path connecting them. You can think of this as all nodes that can be reached by taking exactly k
steps from the target node. The result should be returned as a list of node values (in any order).
- Distance
0
means just the target node itself. - Distance
1
means the immediate neighbors of the target (its parent or its direct children). - Distance
k
means nodes that you reach via exactlyk
parent or child pointers from the target.
Examples:
-
Example 1:
Input:root = [3,5,1,6,2,0,8,null,null,7,4]
,target = 5
,k = 2
Output:[7, 4, 1]
Explanation: Starting from node with value 5, the nodes at distance 2 are:- 5 → 2 → 7 (two steps: from 5 down to 2, then 2 down to 7)
- 5 → 2 → 4 (two steps: from 5 down to 2, then 2 down to 4)
- 5 → 3 → 1 (two steps: from 5 up to its parent 3, then 3 down to 1)
These nodes have values 7, 4, and 1.
-
Example 2:
Input:root = [1,2,3]
,target = 1
,k = 1
Output:[2, 3]
Explanation: The target is the root node (value 1). Nodes at distance 1 are its direct children with values 2 and 3. -
Example 3 (Edge Case):
Input:root = [1]
,target = 1
,k = 3
Output:[]
(empty list)
Explanation: There is only one node in the tree. Any distancek > 0
will yield no nodes. In this case, no node exists at distance 3 from the target.
Constraints:
- The number of nodes in the tree is in the range [1, 500]. (The tree is non-empty in given cases.)
- Node values are unique integers in the range 0 <= Node.val <= 500 .
target
is guaranteed to be a value of a node present in the tree.- 0 <= k <= 1000 . (Note:
k
can be larger than the height of the tree, in which case the output will be empty, as seen in Example 3.)
The output list can be returned in any order – the focus is just on which node values are at the required distance.
Hints to Guide the Thought Process
-
Think of the Tree as a Graph: In a binary tree, each node has up to two children, and (except for the root) one parent. If we consider parent-child connections as undirected edges, the tree becomes an undirected graph. Finding nodes at distance
k
from the target is then like a graph problem: all nodes at exactlyk
edges away from a starting node. This hint suggests that algorithms like BFS (breadth-first search) on graphs might be applicable. -
Traversing Both Downwards and Upwards: Normally, a tree traversal (DFS or BFS) goes downward (from parent to children). Here, because we need to find nodes that could be above the target (ancestors) as well as below (descendants), we need a way to move up from a node to its parent. Consider how you might move upwards in a tree – perhaps by storing parent pointers or using recursion to backtrack.
-
Use a Parent Map: One effective strategy is to perform a traversal of the tree to record each node’s parent in a hash map or dictionary. This way, given any node (including the target), you can quickly find its parent and treat that as an adjacent node in the graph sense. For example, for each node, store
parent[node] = its_parent
. This will allow you to move from a node to its parent in O(1) time during the search. -
Avoid Revisiting Nodes: When exploring in an undirected graph, you must be careful not to go in circles. In this tree graph, if you go from a child to its parent, you don’t want to immediately go back down to the child. Use a visited set to mark nodes you have already seen. This ensures you don’t revisit nodes and get stuck in an infinite loop between child and parent.
-
Two Main Approaches – BFS or DFS: Both breadth-first search (BFS) and depth-first search (DFS) can find nodes at distance
k
, but in different ways. BFS will naturally find nodes by increasing distance (level by level), which fits this problem well. DFS can be used with careful accounting of distances or by splitting the problem (e.g., find target node, then explore). Think about which might be easier to implement for you:- BFS: starting from the target, expand outward step by step until you reach distance
k
. - DFS: you could recursively search for target and then explore in all directions for nodes at the required distance.
- BFS: starting from the target, expand outward step by step until you reach distance
-
Start with Simpler Cases: Imagine simpler versions:
- What if
target
was the root? (Then the answer is just all nodes at depthk
from the root.) - What if
k = 0
? (Then the answer is just the target node itself.) - These can often be handled easily and can build intuition for the general case.
- What if
By considering these hints, you can formulate a plan to solve the problem by transforming it into a graph traversal task and then deciding between BFS or DFS to collect the nodes at the desired distance.
Approach Explanation
There are two common approaches to solve this problem: using Depth-First Search (DFS) or Breadth-First Search (BFS). In both approaches, a key step is to allow traversal upwards to parent nodes, which is not directly possible in a standard tree traversal. We achieve this by first constructing a graph representation of the tree, or at least a parent reference map, so that from any node we can move to its children or its parent. After that, we either perform a BFS or DFS starting from the target node to find all nodes at distance k
.
Depth-First Search (DFS) Approach
Idea: Use DFS to traverse the tree and prepare for the search, and then use DFS again to find all nodes at the required distance. This approach usually consists of two phases:
- Parent Mapping (DFS traversal): First, do a DFS traversal of the entire tree to record each node’s parent in a map or dictionary. This can be done with a simple recursive function. For example, when visiting node
curr
, if it has a left child, recordparent[left_child] = curr
and if it has a right child, recordparent[right_child] = curr
. Continue this for all nodes. After this step, you have aparent
map that allows you to move from any node to its parent in O(1) time. - DFS from Target: Now, perform a DFS starting from the target node to find all nodes at distance
k
. In this DFS, you need to explore in three directions from any given node: left child, right child, and parent (using the map from step 1). Use a parameter in the DFS function to keep track of the current distance from the target.-
If the current distance is equal to
k
, record the node’s value as part of the result. -
If the current distance is less than
k
, continue DFS deeper into all unvisited neighbors (children or parent). -
If the current distance exceeds
k
or the node isNone
or already visited, backtrack (prune that path).
-
- Mark Visited: It’s crucial to mark nodes as visited when you traverse them in this search to avoid going in circles (for example, going from a child to parent and back to child). This can be done with a set of visited nodes. Once you traverse a node, add it to the set. In the DFS recursion, if you encounter a node that’s already in the visited set, you return immediately without further processing.
- Collect Results: Any time your DFS reaches exactly distance
k
(and not beyond), add that node’s value to the result list. At the end of the DFS process, the result list will contain all nodes that are exactlyk
away from the target.
Step-by-Step (DFS Approach):
-
Build Parent Map (DFS1): Traverse the tree with DFS to map each node to its parent. For example, start with
dfs(node=root, parent=null)
. In the DFS:- If
node
is not null, setparent_map[node] = parent
. - Recurse
dfs(node.left, parent=node)
anddfs(node.right, parent=node)
to fill the map for children.
- If
-
Initialize Structures: Prepare an empty result list
result = []
and a visited setvisited = set()
. -
DFS from Target (DFS2): Define a recursive function
dfs_from_target(node, dist)
that explores from the currentnode
which isdist
edges away from the original target:-
If
node
isnull
or already invisited
, return (stop exploring this path). -
Mark
node
as visited by adding it to the set. -
If
dist == k
, appendnode.val
toresult
(we found a node at exact distance). -
If
dist < k
, calldfs_from_target
on all neighbors:node.left
,node.right
, andparent_map[node]
, withdist+1
.
-
-
Start DFS2 at Target: Call
dfs_from_target(target_node, 0)
to start exploring from the target (distance 0). -
Result: After the DFS completes,
result
will contain all node values at distancek
. Return this list.
Why it works: The first DFS builds the necessary parent links so that the second DFS can move in all directions from the target. The second DFS then exhaustively searches all paths from the target, but the visited set ensures it doesn’t repeat paths or traverse a node more than once. This effectively explores the tree as an undirected graph (each connection from parent to child can be seen as bidirectional for the purpose of finding all nodes within k
steps). By stopping the search when the distance exceeds k
, we limit our traversal to only relevant nodes.
Breadth-First Search (BFS) Approach
Idea: Convert the problem into a graph BFS problem. We treat the binary tree like an undirected graph by adding edges between parent and child. Then, we perform a BFS starting from the target node and go outwards level by level until we reach distance k
. BFS naturally finds nodes in order of increasing distance from the start node, which fits perfectly for this problem.
Graph Construction: We need to be able to get the neighbors of any given node (its left child, right child, and parent). There are two ways to achieve this:
- Explicit Graph (Adjacency List): Create an adjacency list (a dictionary or map) where each node maps to a list of its neighbors. You can populate this by a DFS or BFS traversal of the tree:
-
For each node, if it has a left child, add an undirected edge: add left child to the node’s neighbor list and the node to the left child’s neighbor list. Do the same for the right child.
-
This yields a graph structure where each connection (parent-child in the tree) is represented twice (child as neighbor of parent, parent as neighbor of child).
-
- Parent Map (Implicit Graph): Alternatively, similar to the DFS approach, store parent pointers in a map. You can then get neighbors of a node by looking at its children (from its
left
andright
pointers) and its parent (from the map). This avoids storing a full adjacency list, since the tree already gives child pointers.
Using either representation, the neighbor-finding operation will be the same: given a node, its neighbors for the BFS are [node.left, node.right, parent_map[node]]
(ignoring any None
or visited neighbors).
BFS from Target: Once the graph connections are ready, do the following:
-
Initialize BFS: Create a queue for BFS. Start by enqueueing the target node with distance 0. Also initialize a visited set and add the target to it (marking it visited immediately).
-
Level-by-Level Traversal: Perform BFS by dequeuing nodes. For each node dequeued with current distance
d
:- If
d == k
, you have reached nodes at the target distance. You can collect this node’s value (and continue to see if other nodes in the queue are also at distancek
). Important: Once you reach distancek
, you know that all further nodes popped from the queue (in this BFS process) will be distancek
or greater. You can choose to stop the BFS here or simply skip adding further neighbors. - If
d < k
, enqueue all unvisited neighbors of this node with distanced+1
. Neighbors include the left child, right child, and parent (if they exist and haven’t been visited). Mark each neighbor as visited when you enqueue it.
- If
-
Stopping Condition: Continue the BFS until you either reach distance
k
or the queue is empty. A common pattern is to use a loop that runsk
times, each time expanding one level of neighbors. Alternatively, you check distance in the queue:- You can use a counter or include the distance in the queue tuples. For example, push
(node, dist)
in the queue. When you pop from queue, ifdist == k
, record the value. - Another approach: perform BFS level by level. Increment a distance counter each time you finish exploring one level of the tree. When the counter reaches
k
, stop and collect all nodes currently in the queue (they will all bek
distance away).
- You can use a counter or include the distance in the queue tuples. For example, push
-
Result Collection: All nodes that have distance exactly
k
from the target will have been reached whend == k
. Collect those node values into a result list. If the BFS finishes (queue empties) before reachingk
(which means the tree does not have nodes that far away), then the result list remains empty (as in Example 3).
Step-by-Step (BFS Approach):
-
Build Parent Map: (If not building full adjacency list) Do a DFS or BFS on the tree to fill a map
parent[node] = parent_node
for each node (similar to DFS approach step 1). -
Initialize Queue: Use a queue to start BFS from the target. For example,
queue = deque()
, enqueuetarget
with distance 0 (you can store a tuple(target_node, 0)
). -
Initialize Visited:
visited = { target_node }
to mark the target as visited. This prevents immediate backtracking to it. -
BFS Loop: Set
current_distance = 0
. While the queue is not empty andcurrent_distance < k
:-
Determine the number of nodes at the current level:
level_size = len(queue)
. -
For each node in this level, dequeue it. Enqueue all its unvisited neighbors (children + parent) into the queue. Mark them visited.
-
After processing all nodes of this level, increment
current_distance += 1
. Now the queue contains all nodes atcurrent_distance
from target. -
If
current_distance == k
, you can break out of the loop or proceed to collect results.
-
-
Collect Results: The queue now contains all nodes that are exactly
k
distance from target (ifk
levels were fully processed). Extract their values into a list. If the queue became empty before reachingk
, the result is empty. -
Return the Result List.
Why it works: BFS is an ideal fit because it explores neighbors in waves. The first wave (distance 1) explores all nodes 1 step away, the second wave (distance 2) explores all nodes 2 steps away, and so on. By the time BFS has taken k
steps, it has reached all nodes at distance k
. Using the parent map allows BFS to go upward from the target to its parent, effectively exploring the tree in all directions. The visited set ensures each node is processed only once, preventing infinite loops in the undirected graph formed by the tree connections.
Comparison of DFS vs BFS approaches: Both approaches require an initial traversal to record parent pointers (or build adjacency). The main difference lies in how the distance-k
nodes are collected:
-
DFS uses recursion or an explicit stack to depth-first search all paths up to length
k
. It may be slightly more complex to implement because you need to manage the distance count in recursive calls and stop exactly atk
. -
BFS uses a queue to naturally traverse layer by layer. It can be a bit more straightforward to stop at exactly
k
by using level counts. -
Either way, the time complexity and overall work done are similar. It often comes down to which paradigm you’re more comfortable with. For clarity, many prefer BFS for this particular problem, as it aligns with the “find nodes by distance” requirement.
Code Implementation
Below are implementations in Python and Java for this problem. Both implementations follow the approach of first mapping parent pointers (to allow upward traversal), and then using BFS from the target node to collect all nodes at distance k
. We include some test cases to verify the solutions.
Python Implementation
Explanation: In the Python code above, populate_parents
does a DFS to fill the parent
dictionary. Then a BFS is performed using a queue of (node, distance)
pairs. When distance == k
, the node’s value is added to the result. Neighbors (left, right, parent) are enqueued with distance + 1 when distance < k
. A visited set prevents revisiting nodes. The helper build_tree
is used for constructing test trees from level-order lists for demonstration purposes. The test cases correspond to the examples discussed: they print out the list of nodes at distance k
from the target for each scenario.
Java Implementation
Explanation: The Java solution uses a similar strategy. We first define a TreeNode
class for the tree structure. The buildParentMap
method records each node’s parent in a HashMap. The distanceK
method then performs BFS: it uses a Queue<TreeNode>
to traverse outward from the target, and a HashSet
to mark visited nodes. We iterate level by level until we reach the desired distance k
. Once currentDistance == k
, we collect all node values remaining in the queue (these are exactly k
distance away) and break out of the loop. The main
method sets up the example trees and prints outputs for verification.
Note: In both implementations, the target is provided as a TreeNode
reference (as per the problem signature). In a real LeetCode scenario, you would already have the target
node given. In our test code, we manually identify the target node (e.g., target1 = root1.left
for value 5 in Example 1). If the target were given by value only, you would first need to search the tree for the node with that value.
Complexity Analysis
Both the DFS and BFS approaches have similar complexity characteristics:
-
Time Complexity: O(N) where N is the number of nodes in the tree. We perform a DFS to build the parent map which takes O(N), and then a BFS/DFS from the target which in the worst case visits all nodes once (also O(N)). Thus the overall time is linear in the size of the tree. Each node is processed a constant number of times (once when building parent links, and at most once again in the search phase).
-
Space Complexity: O(N). Storing the parent pointers (or an adjacency list) uses O(N) space. The BFS queue or DFS recursion stack in the worst case can also hold O(N) nodes (for example, if the tree is skewed or k is near N). The visited set will also grow to O(N) in the worst case. Therefore, the auxiliary space usage grows linearly with the number of nodes. (Note: The space complexity excludes the output list. The output list size is at most N in the worst case, if k is 0 or if all nodes are at the same distance from the target in some scenario.)
Between the two approaches, the constant factors might differ (BFS uses a queue, DFS uses recursion and possibly multiple function calls), but both are efficient for N up to 500 (per constraints) and scale linearly.
Common Mistakes and Edge Cases
Common Mistakes:
-
Not marking visited nodes: A very frequent mistake is to forget to mark nodes as visited when traversing in the “graph” of the tree. If you don’t mark visited nodes, you could go from a child to its parent, and then back to the child in an endless loop. Always use a visited set (or modify the tree structure if allowed) to ensure you don’t revisit nodes.
-
Incorrect Parent Mapping: Make sure that you correctly map each node to its parent. A mistake here can lead to wrong results or null pointer errors. For example, if you forget to map the root’s children to the root as parent, then when BFS/DFS tries to go upward from those children, it won’t know how. Also ensure not to override the parent map incorrectly if using global structures.
-
Off-by-One Errors in Distance: When using DFS, be careful to add the node’s value to result exactly when the distance equals k, and not continue searching beyond that. Similarly for BFS, one must stop expanding after reaching the k-th level. It’s easy to accidentally collect nodes at distance < k or > k if the loop conditions are wrong. A good practice is to test the code with small trees for various k (including 0 and larger than tree height).
-
Not Handling k = 0 Properly: If k = 0, the expected answer is just [target’s value]. Make sure your code handles this (in BFS approach, the initial check
if currentDistance == k
will catch this immediately and return the target; in DFS, you should handle the base case where k = 0 by adding the target if not already). -
Assuming target is the value, not the node: In some implementations, people confuse the target node’s value with the target node. The function is given a target node (or in problems like this, the target value along with the root to locate it). If you only have the value, you need to find the node first. In our approach, we relied on having the actual node to start from.
-
Tree Traversal Direction: If implementing the DFS approach without an explicit parent map (by recursively searching upward), some get confused with how to backtrack correctly. Using a parent map simplifies the logic by turning the problem into an undirected graph traversal.
-
Modifying the Tree Structure: A pitfall is trying to modify the tree (like adding parent pointers to TreeNode class or altering pointers) and not restoring it. It’s usually safer to use external maps or data structures rather than altering the given tree nodes, especially in an interview or contest setting.
Edge Cases to Consider:
-
Target is the Root: Then all nodes at distance k are simply the nodes in the k-th level of the tree (descendants). The algorithm still works – parent of root is null and is never enqueued/visited.
-
Target is a Leaf: If the target is a leaf node and k > 0, the search will go upwards to its parent and then possibly down other branches. For example, in Example 3 with a single node tree (leaf which is also root), any k > 0 returns no nodes. If the tree had more nodes and target is a leaf, the result will come from exploring upward and then down the other sibling side.
-
k = 0: Should return [target] (the target itself).
-
k larger than tree height: Returns an empty list, because you cannot go more steps than the tree’s height from the target. For instance, if the longest path from the target has length 3 and k = 4, no node exists at distance 4.
-
Tree with Single Node: If the tree has just one node (which is also the target):
- k = 0 returns [target’s value].
- any k > 0 returns [] (no other nodes to reach).
-
All Nodes Same Distance (Uncommon): A contrived scenario: consider a target that is exactly in the middle of the tree’s height, and k equals that height. Then the only nodes at distance k might be leaves on the opposite end. The algorithm should still find them correctly. (This is just to illustrate the algorithm handles multiple branches.)
(Note: The problem constraints ensure the tree isn’t empty and the target is always in the tree, so we don’t have to handle cases of a null root or invalid target input in this problem. But it’s good to be aware of these in a general sense.)
Alternative Approaches and Related Problems
This problem can be approached in more than one way, and it is related to several other tree distance problems:
-
Alternative DFS (single traversal) Approach: Instead of explicitly building a parent map and then doing a separate search, one could write a single DFS that starts at the root and searches for the target and, on the way back up, computes distances. For example, a recursive function that returns the distance from the current node to the target (if the target is in its subtree). When the recursion finds the target, it can then use that information to start collecting nodes at distance k in that subtree, and also propagate the distance up to parent calls to explore other branches. This is a more complex recursive solution (often done with two recursive functions or by returning distances). It avoids an explicit parent map by using the call stack as you return from the recursion. However, this approach is harder to get right for beginners and is less straightforward than the parent map + BFS/DFS approach.
-
“Burning Tree” or Infection analogy: A known variation of this problem is the Burning Tree problem (also phrased as the time to burn a binary tree from a starting node). In that scenario, a fire starts at the target node and spreads to adjacent nodes (parent and children) each minute. The nodes that catch fire at minute
k
are exactly the nodes at distancek
from the start. This is essentially the same problem framed in a time context. LeetCode has a related problem “Amount of Time for Binary Tree to Be Infected” which asks for the time to burn the whole tree from the start node – solving that involves finding the farthest node distance. The approach to that problem is very similar: you find all distances using BFS or DFS with parent pointers. If you understood this problem, you can apply it to find the maximum distance (instead of all nodes at a specific distance). -
Closest Leaf in a Binary Tree (LeetCode 742): In that problem, you’re given a target node and asked to find the closest leaf to that node. The approach is almost identical to this one: you perform a BFS from the target using parent pointers to find the nearest leaf node. Solving the distance K problem makes it easy to understand how to do that – it’s essentially distance K where you stop when you find any leaf (which is a distance D, and that D is the answer).
-
Distance Between Two Nodes in a Binary Tree: A related concept is finding the distance between any two nodes. One way to do that is to find the distance from the target to every other node (like we do here), but that’s not efficient for arbitrary pairs. A better approach for two arbitrary nodes uses the Lowest Common Ancestor (LCA) – the distance between node A and B is
dist(root, A) + dist(root, B) - 2*dist(root, LCA(A,B))
. However, when one of the nodes is fixed (like our target), doing a BFS/DFS from that node to find the distance to others (like we did) is perfectly fine. -
Nodes at Distance K from Root: A simpler variant is when the target is always the root. Then it reduces to just collecting all nodes at depth K in the tree (which can be done with a simple BFS or DFS without needing a parent map). This is a good warm-up to understand the concept of “nodes at distance K” before tackling the general target case.
-
General Graph Problems: This problem is a specific instance of finding all vertices at a certain distance in an unweighted graph. Recognizing that the tree can be treated as an undirected graph helps connect this to graph theory. If you encounter problems of finding distances in grids or networks, a BFS approach is typically used. Practicing this problem helps reinforce using BFS with a visited set on graph-like data.
-
Other Related LeetCode Problems: Some other problems that use similar tree-to-graph conversion or distance concepts include:
-
“Binary Tree Vertical Order Traversal” – while not exactly distance k, it involves traversing a tree with additional constraints which sometimes is tackled with BFS and storing additional info (like column index).
-
“K Distance from a Node” on GeeksforGeeks – which is essentially the same as this problem, often solved with a recursive approach or BFS.
-
Path-related problems in trees (like Path Sum, Lowest Common Ancestor, etc.) which require traversing up and down the tree, though usually not in an undirected way like we did here.
-
GET YOUR FREE
Coding Questions Catalog
