616. Add Bold Tag in String - Detailed Explanation
Problem Statement
Description:
Given a string s
and an array of strings dict
, add a pair of bold tags <b>
and </b>
to wrap substrings in s
that match any string in dict
. If two such substrings overlap or are consecutive, you must merge them into one continuous bold section.
Examples:
-
Example 1:
Input:s = "abcxyz123" dict = ["abc", "123"]
Output:
"<b>abc</b>xyz<b>123</b>"
Explanation:
The substring "abc" (indices 0-2) and "123" (indices 6-8) are found in the dictionary. They are separately bolded since there is no overlap. -
Example 2:
Input:s = "aaabbcc" dict = ["aaa", "aab", "bc"]
Output:
"<b>aaabbc</b>c"
Explanation:
"aaa" and "aab" match overlapping regions, and "bc" matches part of the string immediately after. The intervals merge into one bold section covering indices 0–5. -
Example 3:
Input:s = "abcdef" dict = ["gh", "ij"]
Output:
"abcdef"
Explanation:
None of the words in the dictionary appear ins
, so no bold tags are added.
Constraints:
- 1 ≤ s.length ≤ 1000
- 1 ≤ dict.length ≤ 100
- 1 ≤ dict[i].length ≤ 100
s
anddict[i]
consist of lowercase English letters.
Hints Before Diving into the Solution
-
Hint 1:
Think about using an auxiliary array (or list) of booleans to mark which indices ins
should be wrapped in bold tags. For each dictionary word, mark all positions that are part of a match. -
Hint 2:
After marking the indices, consider how you can merge consecutive or overlapping marked intervals. How would you know where one bold section starts and ends?
Approach 1: Brute Force / Marking Method
Idea:
-
Marking the Bold Positions:
Create an arraybold
of the same length ass
and initialize all values toFalse
.
For each word indict
, scans
and mark indices asTrue
if they belong to a matching substring. -
Merging Intervals:
Traverse thebold
array. When you encounter aTrue
value that starts a sequence (or continues from a previous bold region), mark the start index. When you hit aFalse
after a series ofTrue
values, mark the end index of that bold section. Insert<b>
before the start and</b>
after the end.
Detailed Steps:
-
Step 1:
For each word indict
, iterate throughs
using a sliding window to check ifs[i:i+len(word)]
equals the word. If yes, mark all positions fromi
toi+len(word)-1
asTrue
inbold
. -
Step 2:
Iterate throughbold
and whenever you see a transition fromFalse
toTrue
, insert a<b>
tag. When transitioning fromTrue
toFalse
, insert a</b>
tag.
Visual Example (Using Example 1):
- s: "a b c x y z 1 2 3"
- Indices: 0 1 2 3 4 5 6 7 8
- Marking "abc": Mark indices 0, 1, 2 as
True
. - Marking "123": Mark indices 6, 7, 8 as
True
. - Bold Array:
[T, T, T, F, F, F, T, T, T]
- Insert Tags:
- Start bold at index 0, end at index 2 → wrap "abc"
- Start bold at index 6, end at index 8 → wrap "123"
Approach 2: Using Interval Merging
Idea:
-
Finding Intervals:
Instead of marking every index, find the start and end indices for each matching substring and store them as intervals. -
Merge Overlapping Intervals:
Sort the intervals by their start index. Merge intervals if they overlap or are consecutive. -
Build the Final String:
Use the merged intervals to add<b>
and</b>
tags to the corresponding substrings ins
.
Detailed Steps:
-
Step 1:
For each word indict
, iterate overs
and find intervals[i, j]
wheres[i:j]
matches the word. -
Step 2:
Sort the intervals and merge any intervals that overlap or touch. -
Step 3:
Build the resulting string by iterating overs
and inserting tags at the boundaries defined by the merged intervals.
Visual Example (Using Example 2):
- s: "aaabbcc"
- Intervals Found:
- For "aaa": [0, 2]
- For "aab": [1, 3]
- For "bc": [4, 5]
- Merge Intervals:
- [0, 2] and [1, 3] merge to [0, 3]
- [0, 3] and [4, 5] are consecutive (if adjacent characters are to be merged) or may be merged based on problem specifics (in this example, they merge into [0, 5]).
- Insert Tags:
Wrap indices 0–5 with<b>
and</b>
, then append the rest of the string.
Python Code
Java Code
Complexity Analysis
-
Time Complexity:
- In the marking approach, for each word in
dict
, we usefind
on strings
. In the worst case, if there arem
words andn
characters ins
, and each word has lengthk
, the worst-case complexity can be approximated as O(m * n * k). - The process of traversing the
bold
array to build the result is O(n).
- In the marking approach, for each word in
-
Space Complexity:
- An extra boolean array of size
n
is used, so the space complexity is O(n). - The space for the output string is also O(n).
- An extra boolean array of size
Step-by-Step Walkthrough & Visual Example
Consider s = "abcxyz123"
and dict = ["abc", "123"]
:
-
Marking:
- For word
"abc"
:- Find
"abc"
at index0
. Mark indices0, 1, 2
as bold.
- Find
- For word
"123"
:- Find
"123"
at index6
. Mark indices6, 7, 8
as bold.
- Find
- Bold array after processing:
[T, T, T, F, F, F, T, T, T]
- For word
-
Building the Result String:
- Start at index
0
:bold[0]
isTrue
→ Insert<b>
- Append characters
a
,b
,c
while bold remainsTrue
.
- Append characters
- At index
3
:bold[3]
isFalse
→ Insert</b>
and then appendx
,y
,z
. - At index
6
: Bold begins again → Insert<b>
- Append characters
1
,2
,3
while bold remainsTrue
.
- Append characters
- End of string → Insert
</b>
- Final result:
"<b>abc</b>xyz<b>123</b>"
- Start at index
Common Mistakes
-
Not Merging Overlapping Intervals:
Forgetting to combine overlapping or consecutive matches can result in multiple unnecessary tags. -
Incorrect Index Handling:
Off-by-one errors while marking indices may either omit necessary characters or include extra ones. -
Ignoring Edge Cases:
Not considering cases where no dictionary word is found, or whendict
is empty.
Edge Cases & Alternative Variations
-
Edge Case 1:
Input:s = ""
(empty string) ordict = []
Expected: Returns
as-is without modifications. -
Edge Case 2:
When multiple dictionary words overlap in complicated patterns.
Solution: Ensure your merging logic correctly covers consecutive or overlapping intervals. -
Alternative Variation:
Instead of bold tags, you might be asked to wrap matching substrings with other markers (like<em>
for emphasis). The logic remains the same.
Related Problems for Further Practice
- Merge Intervals: Practice merging overlapping intervals in various contexts.
- Substring Search: Work on problems that involve searching for substrings within a given string.
- Longest Common Prefix: Solve problems related to finding common patterns among strings.
- Text Justification: Explore problems involving formatting text with specific rules.
- Implement Trie (Prefix Tree): Optimize substring matching by pre-processing dictionary words.
GET YOUR FREE
Coding Questions Catalog
