10. Regular Expression Matching - Detailed Explanation
Problem Statement
Description:
Given an input string s
and a pattern p
, implement regular expression matching with support for the special characters .
and *
. The matching must cover the entire string (not just a substring). The rules are:
.
(dot) matches any single character. For example,"a"
matches"."
because.
can represent"a"
(or any other single char).*
(star) matches zero or more of the preceding character. It means the character immediately before*
can repeat any number of times (including 0 times). For example,"bo*"
can match"b"
,"bo"
,"boo"
,"booo"
, etc., because*
allows'o'
to appear 0 or more times.
The goal is to return a boolean indicating whether s
matches p
exactly according to these rules.
Examples:
-
Input: s =
"aa"
, p ="a"
– Output: false.
Explanation: The pattern"a"
is a single character and cannot match the two-character string"aa"
(it would only match a single"a"
). -
Input: s =
"aa"
, p ="a*"
– Output: true.
Explanation:'*'
means "zero or more of the preceding element". Here the preceding element is'a'
. So"a*"
can represent""
(zero a's),"a"
,"aa"
,"aaa"
, etc. By repeating'a'
once,"a*"
becomes"aa"
, which matches the string. -
Input: s =
"ab"
, p =".*"
– Output: true.
Explanation:".*"
means "zero or more of.
". The.
can match any single character, and*
allows any number of that. So".*"
can match any string, including"ab"
(here,".*"
can expand to two characters, matching"ab"
). -
Input: s =
"aab"
, p ="c*a*b"
– Output: true.
Explanation: The pattern can be read as"c*"
followed by"a*"
followed by"b"
."c*"
can match zero'c'
characters (since the string doesn’t havec
),"a*"
can match"aa"
(two'a'
characters), and then"b"
matches"b"
. All characters ins
are matched by the pattern in order, so it returns true. -
Input: s =
"mississippi"
, p ="mis*is*p*."
– Output: false.
Explanation: The pattern"mis*is*p*."
is trying to match"mississippi"
, but it fails. For instance,"mis*"
can match"mis"
(thes*
part can match"s"
), and"is*"
can match"iss"
or"is"
, etc. Ultimately, no matter how the*
wildcards are expanded, the pattern cannot fully cover"mississippi"
correctly, so the result is false.
Constraints:
-
1 <= s.length <= 20
-
1 <= p.length <= 20
-
s
contains only lowercase English letters. -
p
contains only lowercase English letters,'.'
, and'*'
. -
The pattern
p
is guaranteed to be well-formed, meaning every'*'
has a valid preceding character (no standalone*
at the start or double**
in pattern).
Hints
-
Think Recursively: Try to break the matching problem into smaller subproblems. Consider matching the first character of the string and pattern and then recursively solving the rest. How would
.
or*
affect this process? For example, if the pattern’s second character is*
, you have two options: use the*
(consume a character ofs
if it matches the preceding pattern char) or skip the*
(treat the preceding char as repeating zero times). This hint suggests a branching (backtracking) approach for handling*
. -
Simple Case First: If there were no
*
in the pattern, the problem would be simpler – you’d just check each character (with.
matching anything) and ensure the lengths match. Use this as a sub-solution. Once you handle direct character matches and.
, incorporate the*
logic. -
Dynamic Programming: Consider using DP to store results of subproblems (e.g., whether
s[i:]
matchesp[j:]
). There are overlapping subproblems in the recursive solution (the same substring and subpattern might be evaluated multiple times), so caching those results can help. Think of a DP tabledp[i][j]
meaning: Does the substrings[0..i-1]
match the subpatternp[0..j-1]
? If you can derive a recurrence fordp[i][j]
based on smaller indices, you’re on the right track. -
Base Cases: What happens when the string is empty or the pattern is empty? An empty pattern only matches an empty string. A pattern like
"x*"
can match an empty string becausex*
can repeatx
zero times. These cases are important to handle explicitly.
Multiple Approaches
There are several ways to solve this problem. We’ll discuss a brute-force backtracking approach and then an optimized dynamic programming approach. The brute-force solution is conceptually simpler but inefficient for larger inputs, while the DP solution is optimal for this problem.
Brute-Force Approach (Recursion & Backtracking)
Idea: Try all possibilities using recursion. We attempt to match the string and pattern from the start, character by character, handling .
and *
with special rules. Whenever we encounter a *
in the pattern, we branch out into two possibilities :
- Use the
*
: If the preceding character of the pattern matches the current character of the string (or is.
which matches anything), consume one character from the string and stay on the same pattern index (since*
can represent more of that character). This allows the*
to cover one more character of the string. - Skip the
*
: Ignore the"<char>*"
in the pattern (treat it as matching zero occurrences of that char). This means we move past the*
in the pattern (effectively deleting that part of the pattern), without consuming any character from the string.
We recursively try both options if applicable. If at any point characters don’t match (and the pattern char isn’t .
or the pattern is exhausted while string remains), that path is a dead end (returns false). If we reach the end of both string and pattern simultaneously, we have a match (return true).
Why it’s inefficient: In the worst case, the branching can explode exponentially. For example, a pattern like "a*a*a*..."
and a string of all 'a'
can cause many recursive calls by either skipping or using each *
in different ways. Without any optimization, the same subproblems get solved repeatedly. For instance, you might recompute whether "issippi"
matches "s*p*."
many times in a naive recursion. This redundant work leads to an exponential time complexity in the worst case. A pure backtracking solution may time out for larger strings. (It can pass within the given constraints since lengths are ≤ 20, but it’s not efficient or reliable for bigger inputs.) In summary, the brute-force approach explores a huge decision tree of matches and has exponential time complexity, so we need to optimize.
Optimal Dynamic Programming Approach (Bottom-Up)
Idea: Use dynamic programming to avoid recomputation of subproblems. We create a DP table where dp[i][j]
is a boolean indicating whether the first i
characters of s
(i.e. s[0..i-1]
) can be matched by the first j
characters of p
(p[0..j-1]
). The final answer will be dp[n][m]
where n = len(s)
and m = len(p)
. We fill this table iteratively using the rules of matching. Dynamic programming works here because the problem exhibits optimal substructure (a solution can be built from solutions of subproblems) and overlapping subproblems (the same substrings get checked multiple times in brute force).
DP Table Setup:
-
We use dimensions
(n+1) x (m+1)
because we include the case of the empty string and empty pattern (index 0 represents "no characters").dp[0][0]
(empty string vs empty pattern) istrue
– that's our base case. -
The first row
dp[0][j]
(empty string vs some pattern) needs initialization. An empty string can only match a pattern that is capable of representing an empty sequence. This would be patterns like"a*"
or"x*y*z*"
, etc., where each*
can eliminate the preceding character. Specifically, fordp[0][j]
(withj > 0
),dp[0][j]
is true if and only if the pattern up toj
ends in a*
that can represent zero occurrences. In practice, we can initialize: for any pattern likep[0..j-1]
that is something like"A*B*C*..."
(even length, every second char is*
),dp[0][j] = true
. Typically, we setdp[0][j] = dp[0][j-2]
ifp[j-1] == '*'
(meaning we drop thatchar*
pair). For example, ifp = "c*"
thendp[0][2]
should be true becausec*
can match an empty string by taking 0 occurrences of'c'
. -
The first column
dp[i][0]
(non-empty string vs empty pattern) will be allfalse
fori > 0
because a non-empty string cannot match an empty pattern.
Transition Rules: We fill the table row by row (or recursively compute states) using these rules for each position (i, j)
:
-
Direct Character Match or
.
: If the current pattern characterp[j-1]
is a letter or.
(not*
), then it matches the current string characters[i-1]
if and only if either they are the same letter or the pattern char is.
. In that case, the result fordp[i][j]
depends on the previous statedp[i-1][j-1]
(the prefixes without these characters). In formula:if p[j-1] == s[i-1] or p[j-1] == '.': dp[i][j] = dp[i-1][j-1]
This means if the current characters match, we have a match up to
i,j
if and only if the prefixes up toi-1,j-1
matched. -
*
Wildcard: If the current pattern character is*
(atp[j-1]
), then we have to consider it as representing two scenarios:-
Zero occurrences of the preceding character: We can "drop" the
<char>*
pair from the pattern. This means we look two pattern characters back and see if that matches the currenti
characters of string:dp[i][j] = dp[i][j-2]
(we ignore thex*
and see ifdp[i][j-2]
was true). For example, if pattern is"b*"
and we want to see if it matches"abc"
up to some point, theb*
could match nothing and we rely on whatever was before it. -
One or more occurrences of the preceding character: The
*
can match at least one occurrence of the char before it. This requires that the preceding pattern characterp[j-2]
matches the current string characters[i-1]
(or is.
), and if so, we can consume that character froms
and still have the patternj
stay at the same position (because*
can handle more of this char). In terms of DP, ifp[j-2]
matchess[i-1]
, then we look atdp[i-1][j]
.dp[i-1][j]
represents that the string up to i-1 can be matched by the pattern up to this*
. If that was true, and the current char matches, thendp[i][j]
can also be true by extending that match with this one character. In formula:
if p[j-1] == '*': dp[i][j] = dp[i][j-2] // treat x* as empty or ( (p[j-2] == s[i-1] or p[j-2] == '.') and dp[i-1][j] )
This encapsulates the two possibilities: ignore the
x*
or use it to match this character ofs
(and maybe more). -
By filling out the table with these rules, we eventually compute the answer. The dynamic programming ensures each subproblem (combination of i
and j
) is solved once, giving an overall polynomial time solution.
Step-by-Step Walkthrough of DP Solution
To illustrate the DP approach, let’s walk through a concrete example and build the DP table.
We’ll work through the example where:
- s = "aab"
- p = "cab"
Our goal is to fill a DP table where:
- dp[i][j] is true if the first i characters of s (i.e. s[0..i‑1]) match the first j characters of p (i.e. p[0..j‑1]).
- We create a table with dimensions (n+1) x (m+1), where n = len(s) = 3 and m = len(p) = 5.
- The extra row and column (index 0) represent the empty string/pattern.
Step 1. Table Layout
We label the rows and columns as follows:
-
Rows (i):
- i = 0: Empty string
""
- i = 1:
"a"
- i = 2:
"aa"
- i = 3:
"aab"
- i = 0: Empty string
-
Columns (j):
- j = 0: Empty pattern
""
- j = 1: Pattern
"c"
- j = 2: Pattern
"c*"
- j = 3: Pattern
"c*a"
- j = 4: Pattern
"c*a*"
- j = 5: Pattern
"c*a*b"
- j = 0: Empty pattern
Step 2. Base Case Initialization
-
dp[0][0] = True
(Empty string matches empty pattern.) -
First Row (i = 0): Matching Empty String with Pattern
We must decide whether an empty string can match a non-empty pattern. Only patterns that can “disappear” (via*
) may match an empty string.- dp[0][1]: Pattern =
"c"
"c"
cannot match empty. dp[0][1] = False.
- dp[0][2]: Pattern =
"c*"
- Here,
*
means zero or more'c'
. Taking zero occurrences,"c*"
becomes empty.
dp[0][2] = dp[0][0] = True.
- Here,
- dp[0][3]: Pattern =
"c*a"
- Although
"c*"
can vanish, the trailing"a"
must match something. dp[0][3] = False.
- Although
- dp[0][4]: Pattern =
"c*a*"
- Now both
"c*"
and"a*"
can vanish.
dp[0][4] = dp[0][2] = True.
- Now both
- dp[0][5]: Pattern =
"c*a*b"
- The ending
"b"
cannot vanish. dp[0][5] = False.
- The ending
- dp[0][1]: Pattern =
After initialization, the first row (i = 0) is:
dp[0] = [T, F, T, F, T, F]
Step 3. Fill the Table Row by Row
Now, we fill in the rest of the table using the recurrence rules.
Recall the rules:
-
For a direct match or a dot (
.
):
If p[j‑1] equals s[i‑1] or is.
, then:
dp[i][j] = dp[i‑1][j‑1]
-
For a star (
*
):
Let the star be at p[j‑1] and its preceding character be p[j‑2]. Then:dp[i][j] = dp[i][j‑2] // Treat "x*" as matching zero occurrences OR (if (p[j‑2] == s[i‑1] or p[j‑2] == '.') then dp[i‑1][j]) // Use "x*" to match one or more occurrence
Row i = 1 (s = "a")
s[0] = 'a'
-
dp[1][0]:
Non-empty string vs. empty pattern → False. -
dp[1][1]: Pattern
"c"
vs."a"
'c'
does not match'a'
→ False.
-
dp[1][2]: Pattern
"c*"
vs."a"
- Two choices:
- Skip
"c*"
: dp[1][0] = False. - Use
"c*"
: Check if'c'
matches'a'
(it does not).
- Skip
- So, dp[1][2] = False.
- Two choices:
-
dp[1][3]: Pattern
"c*a"
vs."a"
- Last char in pattern is
'a'
which matches'a'
. - dp[1][3] = dp[0][2] (what was the result for empty string vs.
"c*"
?) - From above, dp[0][2] = True.
- Thus, dp[1][3] = True.
- Last char in pattern is
-
dp[1][4]: Pattern
"c*a*"
vs."a"
- Here p[3] is
'*'
. Two cases:- Skip
"a*"
: dp[1][4] = dp[1][2] = False. - Use
"a*"
: Check if preceding char'a'
(p[2]) matches'a'
(s[0]). It does.
Then, dp[1][4] = dp[0][4].
From the first row, dp[0][4] = True.
- Skip
- So, dp[1][4] = True.
- Here p[3] is
-
dp[1][5]: Pattern
"c*a*b"
vs."a"
- Last char in pattern is
'b'
but s[0] is'a'
. - dp[1][5] = False.
- Last char in pattern is
Now row i = 1 becomes:
dp[1] = [F, F, F, T, T, F]
Row i = 2 (s = "aa")
s = "aa", s[1] = 'a'
-
dp[2][0]: → False.
-
dp[2][1]: Pattern
"c"
vs."aa"
'c'
does not match'a'
→ False.
-
dp[2][2]: Pattern
"c*"
vs."aa"
- Choices:
- Skip
"c*"
: dp[2][0] = False. - Use
"c*"
: Check if'c'
matches s[1] ='a'
→ No.
- Skip
- So, dp[2][2] = False.
- Choices:
-
dp[2][3]: Pattern
"c*a"
vs."aa"
- Last char
'a'
should match s[1] ='a'
. - Then, dp[2][3] = dp[1][2] (check the match for
"a"
vs. pattern"c*"
). - From row i = 1, dp[1][2] = False.
- So, dp[2][3] = False.
- Last char
-
dp[2][4]: Pattern
"c*a*"
vs."aa"
- p[3] is
'*'
→ two cases:- Skip
"a*"
: dp[2][4] = dp[2][2] = False. - Use
"a*"
: Check if preceding char'a'
(p[2]) matches s[1] ='a'
→ Yes.
Then, dp[2][4] = dp[1][4].
From row i = 1, dp[1][4] = True.
- Skip
- Thus, dp[2][4] = True.
- p[3] is
-
dp[2][5]: Pattern
"c*a*b"
vs."aa"
- Last char in pattern is
'b'
, which does not match s[1] ='a'
. - So, dp[2][5] = False.
- Last char in pattern is
Now row i = 2 becomes:
dp[2] = [F, F, F, F, T, F]
Row i = 3 (s = "aab")
s = "aab", s[2] = 'b'
-
dp[3][0]: → False.
-
dp[3][1]: Pattern
"c"
vs."aab"
'c'
does not match'b'
→ False.
-
dp[3][2]: Pattern
"c*"
vs."aab"
- Two cases:
- Skip
"c*"
: dp[3][2] = dp[3][0] = False. - Use
"c*"
: Check if'c'
matches s[2] ='b'
→ No.
- Skip
- So, dp[3][2] = False.
- Two cases:
-
dp[3][3]: Pattern
"c*a"
vs."aab"
- Last char in pattern is
'a'
, but s[2] is'b'
→ False.
- Last char in pattern is
-
dp[3][4]: Pattern
"c*a*"
vs."aab"
- p[3] is
'*'
.- Skip
"a*"
: dp[3][4] = dp[3][2] = False. - Use
"a*"
: Check if preceding char'a'
(p[2]) matches s[2] ='b'
→ No.
- Skip
- So, dp[3][4] = False.
- p[3] is
-
dp[3][5]: Pattern
"c*a*b"
vs."aab"
- Last char
'b'
in pattern should match s[2] ='b'
→ It does. - Then, dp[3][5] = dp[2][4] (we check if
"aa"
matches"c*a*"
). - From row i = 2, dp[2][4] = True.
- So, dp[3][5] = True.
- Last char
Now row i = 3 becomes:
dp[3] = [F, F, F, F, F, T]
Step 4. Final DP Table
Putting it all together, the complete DP table is:
j = 0<br>(Empty) | j = 1<br>"c" | j = 2<br>"c*" | j = 3<br>"c*a" | j = 4<br>"ca" | j = 5<br>"cab" | |
---|---|---|---|---|---|---|
i = 0<br>"" | T | F | T | F | T | F |
i = 1<br>"a" | F | F | F | T | T | F |
i = 2<br>"aa" | F | F | F | F | T | F |
i = 3<br>"aab" | F | F | F | F | F | T |
The final answer is given by dp[3][5], which is True. This means that the string "aab"
matches the pattern "c*a*b"
.
Summary
- We started by initializing dp[0][0] as true and filling out the first row using the rule for
*
. - Then, we filled each row using two rules: one for direct character (or dot) matches and one for handling
*
. - The final result dp[3][5] = True confirms that
"aab"
matches"c*a*b"
.
This step-by-step walkthrough shows how the DP table is built and used to solve the Regular Expression Matching problem.
Python Solution
Java Solution
Below is a Java implementation of the dynamic programming solution. We define a Solution
class with an isMatch
method that returns true/false for a given string s
and pattern p
. The main
method includes some test cases to demonstrate the usage and to verify the outputs against the examples discussed.
Explanation: The code uses a 2D boolean array dp
. We initialize dp[0][0] = true
(empty matches empty). Then we handle patterns that could match empty string (lines setting dp[0][j]
). After that, we iterate through each character of s
(i
) and p
(j
) and fill the table according to the rules:
-
If the pattern char is a letter or
.
, we setdp[i][j] = dp[i-1][j-1]
if they match (i.e., pattern char equals string char or is.
). -
If the pattern char is
*
, we consider it as zero occurrence (dp[i][j-2]
) or as one occurrence (if preceding pattern char matches current string char, then checkdp[i-1][j]
). We use anif
to avoid ArrayIndex issues and to combine the logic.
Finally, dp[n][m]
is returned, giving whether the full string matches the full pattern.
The main
method prints out results for some test inputs. For the given examples, it will output: false
, true
, true
, true
, false
(each on a new line), matching the expected results discussed earlier.
Complexity Analysis
-
Brute-Force Backtracking: The time complexity of the naive recursive solution is exponential in the worst case. Each
*
in the pattern can lead to branching (use it 0 times, 1 time, 2 times, ... until the string is exhausted or match fails). In the worst configuration, this can result in a huge number of possibilities. For example, with a pattern like"a*a*a*"
and string"aaaa"
, the number of ways to apply*
grows very fast. Although the input sizes here are small (≤ 20), exponential algorithms are not scalable. The space complexity of the recursion (without memo) is the recursion depth, which is O(n + m) for the call stack in the worst case (in each step you at least consume one character or one pattern element). If memoization is added to recursion, the complexity improves to O(n*m), which is essentially what the DP does. -
Dynamic Programming: The DP solution runs in O(n * m) time, where
n
is the length of the string andm
is the length of the pattern. This is because we fill an (n+1) by (m+1) table and each cell computation is O(1) work. Given the constraints (both up to 20), the DP at most does around 21*21 ≈ 441 iterations, which is very fast. The space complexity is also O(n * m) for the DP table. In this problem, that’s at most 21x21 booleans, which is negligible. We could optimize space by noticing that to computedp[i][*]
we only need the previous rowdp[i-1][*]
and the current row so far, but due to the dependency onj-2
(for the*
case) it’s a bit tricky to do a single-row optimization cleanly. Given the small size, a full 2D table is fine.
The DP approach is dramatically more efficient than brute force for larger strings/patterns. Even if the lengths were, say, 1000 each, DP would handle it (10^6 operations) whereas a naive recursion could never finish. In summary, DP brings the complexity down from exponential to polynomial (quadratic in this case).
Common Mistakes
-
Misinterpreting
*
: A very frequent mistake is to think'*'
works like "match any sequence of characters" (as it does in wildcard matching problems) – but here*
is not free-standing. It only applies to the character immediately to its left. For example, in the pattern"ab*"
, the*
applies to'b'
only, meaning it can match""
,"b"
,"bb"
, etc., after an'a'
. It does not mean "any substring aftera
". Confusing these can lead to wrong logic. -
Forgetting to account for zero occurrences with
*
: When implementing, one might handle the "one or more occurrence" case of*
but forget the "zero occurrence" case (or vice versa). Both possibilities must be checked. If you only handle one, the solution will fail on cases where the other is needed. For example, pattern"c*"
vs string"abc"
requires skipping thec
entirely (zero occurrence), whereas pattern"a*"
vs"aaa"
requires consuming multiple occurrences. -
Improper initialization of DP table: Not setting up
dp[0][j]
correctly for patterns like"x*y*"
can cause the algorithm to miss valid matches for empty string cases. Ensure that patterns consisting solely of pairs like"a*"
,"b*"
,"c*d*"
etc., are marked as matching the empty string. Another initialization pitfall is forgetting to setdp[0][0] = true
. -
Off-by-one errors in indices: Implementing the DP can be tricky with indices since
dp[i][j]
represents lengthsi
andj
. It’s easy to make mistakes like checkingp[j]
ors[i]
instead of the previous index. Many bugs arise from not aligning the DP indices with string indices properly. A robust approach is to leti
andj
loop from 1 to length, and uses[i-1]
andp[j-1]
when referring to the actual characters. -
Not considering entire string match: Some might write a recursion that checks if at least part of the string can match the pattern (or vice versa) and then mistakenly return true too early. Remember that the problem asks for a full match of the whole string to the whole pattern, not a substring match. This means your algorithm should only return true if both the string and pattern are fully consumed in the matching process.
Edge Cases
Consider the following edge cases while testing or reasoning about your solution:
-
Empty string or empty pattern:
s = ""
andp = ""
should return true (trivially, both are empty).s = ""
andp = "a*"
should return true because"a*"
can represent zero'a'
characters (empty string). In general, an empty string with a pattern that's effectively empty (like"b*c*"
which can drop both) is true.s = "abc"
,p = ""
should return false, since a non-empty string cannot match an empty pattern.
-
Pattern with consecutive
*
(or pattern starting with*
): By problem guarantee, patterns like"**"
or"*a"
won’t appear (the pattern is well-formed). If they did, it would be invalid input. But patterns like"a*b*"
(which have multiple*
in sequence but each applies to a different char) are valid and should be handled correctly by the DP (they can match empty or multiple characters as specified). -
Pattern
".*"
matching anything:".*"
should match absolutely any string (including empty string). It’s good to tests = ""
withp = ".*"
(should be true) and a few random strings with".*"
just to be sure the implementation doesn’t have a quirk that fails this universal match case. -
Single-character edge cases:
s = "a"
,p = ".*"
(should be true,.*
can match "a").s = "a"
,p = "."
(should be true, single wildcard matches single char).s = "a"
,p = "a*"
(should be true, becausea*
can be "a").s = "a"
,p = "b*"
(should be false,b*
can match "" or "b" or "bb"... but not "a").
-
Longer strings and patterns: with multiple
*
such ass = "abba"
vsp = ".*a*.*"
. This is just to ensure combinations of.
and*
together are handled. The DP should correctly identify which parts match.
Overall, ensure your solution handles the empty scenarios and doesn’t incorrectly allow a partial match.
Alternative Variations & Related Problems
This regex matching problem (with .
and *
) is a classic and is often compared to a similar problem:
-
Wildcard Matching (LeetCode #44): This is another problem where you have to match a string to a pattern, but the pattern uses
?
to match any single character and*
to match any sequence of characters (of any length). It’s a different interpretation of*
. The dynamic programming solution for wildcard matching is similar in spirit, but the transition rules differ because*
in that problem can span multiple different characters, not just repeats of one character. If you found regex matching tricky, you might want to practice wildcard matching as well once you’re comfortable, since it reinforces DP for pattern matching. -
Simpler Regex Variants: Sometimes interview or practice problems restrict the regex features further (for example, only
*
without.
or vice versa). Those can often be solved with simpler logic or a subset of this DP approach. For instance, matching with*
only (where*
means "repeat the previous char") can be handled similarly but without the dot-case. -
String Pattern Matching: Although not exactly a regex, problems like checking if a string matches a pattern of lowercase/uppercase (like ABBA pattern etc.) or other custom pattern matching can be solved with backtracking or DP. They are good practice for careful character-by-character logic.
-
Regular Expressions in real libraries: The problem we solved here is a simplified version of regex. Real regex engines support far more complex patterns (character classes, plus, question mark, braces, etc.). Understanding how this simplified engine works gives some insight into how you might implement or evaluate a full regex. (There’s a famous article “Regular Expression Matching Can Be Simple And Fast” that delves into implementing full regex matching with NFAs/DFA, but that’s beyond the scope of this problem.)
GET YOUR FREE
Coding Questions Catalog
