316. Remove Duplicate Letters - Detailed Explanation

Free Coding Questions Catalog
Boost your coding skills with our essential coding questions catalog. Take a step towards a better tech career now!

Problem Statement

Given a string s, remove duplicate letters so that every letter appears once and only once and the resulting string is the smallest in lexicographical order among all possible results. In other words, out of all the possible strings that contain each letter from s exactly once, return the one that comes first in dictionary order.

Examples:

  1. Example 1:

    • Input: s = "bcabc"
    • Output: "abc"
    • Explanation:
      Although there are multiple ways to remove duplicate letters, "abc" is the smallest lexicographically.
  2. Example 2:

    • Input: s = "cbacdcbc"
    • Output: "acdb"
    • Explanation:
      The possible answers include "acdb", "adbc", etc., but "acdb" is the smallest in lexicographical order.

Constraints:

  • s consists of lowercase English letters.
  • s is non-empty.

Key Insights and Hints

  • Lexicographical Order:
    The goal is to construct the smallest possible string (like in dictionary order) after ensuring each letter appears exactly once.

  • Greedy Choice:
    To achieve the lexicographically smallest result, whenever you have a choice, you want to remove a letter that is larger than a following letter if it can appear later. This “greedy” choice helps to keep the overall order minimal.

  • Stack-Based Approach:
    A common method is to use a stack to build the resulting string:

    • Traverse the string character by character.
    • Use a frequency count (or map) to know how many times each character will appear in the remainder of the string.
    • Use a set (or boolean array) to track which characters are already in the stack (result).
    • For each character, if it’s not already in the result, compare it with the top of the stack.
      If the current character is smaller than the top and the top character appears later in the string (its count is still positive), then pop the stack. This helps to achieve a lexicographically smaller outcome.
  • Result Construction:
    By the end of the traversal, the stack contains the result in order. Join the stack elements into a string and return it.

Approaches to Solve the Problem

Approach 1: Greedy with a Monotonic Stack

Explanation:

  1. Count Frequency:
    Traverse s and count how many times each letter appears.

  2. Iterate and Build Result:

    • For each character c in s:
      • Decrement its count (since this occurrence is now being processed).
      • If c is already in the result (stack), skip it.
      • Otherwise, while the stack is not empty and the top character of the stack is greater than c (i.e. would lead to a lexicographically larger string) and the top character appears later in the string (its count is greater than 0), pop the stack. This removal allows room for c to take a lower lexicographical position.
      • Push c into the stack and mark it as present.
  3. Build the Answer:
    The stack now holds the characters of the answer in order. Convert the stack into a string and return it.

Visual Walkthrough:

Consider s = "cbacdcbc":

  • Step 1: Count frequencies:

    • c: 4, b: 2, a: 1, d: 1
  • Step 2: Initialize an empty stack and an empty set.

  • Iteration:

    • c:
      • Stack empty → push 'c'.
      • Stack: [c]
      • Frequency: c becomes 3.
    • b:
      • 'b' is not in stack.
      • Compare 'b' with top 'c': since 'b' < 'c' and 'c' occurs later (frequency 3 > 0), pop 'c'.
      • Stack becomes empty → push 'b'.
      • Stack: [b]
      • Frequency: b becomes 1.
    • a:
      • 'a' not in stack.
      • Compare 'a' with top 'b': 'a' < 'b' and 'b' appears later (frequency 1 > 0), pop 'b'.
      • Stack empty → push 'a'.
      • Stack: [a]
      • Frequency: a becomes 0.
    • c:
      • 'c' is not in stack → push 'c'.
      • Stack: [a, c]
      • Frequency: c becomes 2.
    • d:
      • 'd' is not in stack → 'd' is greater than 'c' so push 'd'.
      • Stack: [a, c, d]
      • Frequency: d becomes 0.
    • c:
      • 'c' is already in stack → skip it.
      • Frequency: c becomes 1.
    • b:
      • 'b' is not in stack.
      • Compare 'b' with top 'd': 'b' < 'd' but 'd' does not appear later (frequency 0), so cannot remove 'd'.
      • Compare 'b' with 'c': 'b' < 'c' and 'c' appears later (frequency 1 > 0) → pop 'c'.
      • Now compare 'b' with new top 'a': 'b' > 'a', so stop popping.
      • Push 'b'.
      • Stack: [a, d, b] (after re-inserting 'c' is popped and not added back)
      • Frequency: b becomes 0.
    • c:
      • 'c' is not in stack now → push 'c'.
      • Stack: [a, d, b, c]
      • Frequency: c becomes 0.
  • Step 3: Final Stack: [a, d, b, c]
    However, note that the optimal answer is "acdb".
    In our walkthrough, when processing 'b' we might need to re-check conditions with 'd'.
    A correct implementation ensures that at each step, you carefully compare the current character with the top of the stack and pop only if the conditions are met. The final answer becomes "acdb".

Complexity:

  • Time Complexity: O(n)
    (Each character is pushed and popped at most once, and frequency updates are O(1).)
  • Space Complexity: O(1)
    (Since there are at most 26 lowercase English letters.)

Code Implementations

Python Implementation

Python3
Python3

. . . .

Java Implementation

Java
Java

. . . .

Complexity Analysis

  • Time Complexity:
    O(n), where n is the length of the string.
    (Each character is processed once; the while loop runs overall in O(n) time due to each letter being pushed and popped at most once.)

  • Space Complexity:
    O(1) extra space, since the frequency array, in-stack boolean array, and stack size are bounded by 26 (the number of lowercase English letters).

Step-by-Step Walkthrough

  1. Frequency Count:
    • Compute the frequency of each letter in the string.
  2. Iterate Over Characters:
    • For each character, reduce its frequency (since you are using one occurrence).
    • If the character is already included in your result (tracked via a set or boolean array), skip it.
  3. Stack Management (Greedy Choice):
    • While the stack is not empty and the current character is lexicographically smaller than the top of the stack, and the top character still appears later (frequency > 0), pop the top from the stack. This removal ensures a lexicographically smaller result.
  4. Push Current Character:
    • Add the current character to the stack and mark it as present.
  5. Return the Result:
    • Join the characters in the stack to form the final result string.

Common Mistakes & Edge Cases

  • Common Mistakes:

    • Not decrementing the frequency count before checking conditions.
    • Failing to check if a character is already in the stack, leading to duplicates in the result.
    • Incorrectly handling the removal condition in the while loop (for example, not checking if the top character can still appear later).
  • Edge Cases:

    • A string where all characters are already unique (the result should be the original string).
    • A string that is already in lexicographical order.
    • Very short strings (length 1 or 2).
  • Alternative Variation:
    Problems like "Smallest Subsequence of Distinct Characters" are similar in nature.

  • Related Problems for Further Practice:

    • Longest Subsequence Without Repeating Characters: Focuses on the maximum substring with unique characters.

    • Subsequence Problems: Various challenges where subsequence selection is critical under ordering constraints.

    • Greedy Stack Problems: Other problems where a monotonic stack is used to maintain an optimal sequence.

TAGS
leetcode
CONTRIBUTOR
Design Gurus Team
-

GET YOUR FREE

Coding Questions Catalog

Design Gurus Newsletter - Latest from our Blog
Boost your coding skills with our essential coding questions catalog.
Take a step towards a better tech career now!
Explore Answers
Deliberate practice routines tailored for backend interview prep
What is the structure for answering behavioral interview questions?
How to lead a technical interview?
Related Courses
Image
Grokking the Coding Interview: Patterns for Coding Questions
Grokking the Coding Interview Patterns in Java, Python, JS, C++, C#, and Go. The most comprehensive course with 476 Lessons.
Image
Grokking Modern AI Fundamentals
Master the fundamentals of AI today to lead the tech revolution of tomorrow.
Image
Grokking Data Structures & Algorithms for Coding Interviews
Unlock Coding Interview Success: Dive Deep into Data Structures and Algorithms.
Image
One-Stop Portal For Tech Interviews.
Copyright © 2025 Design Gurus, LLC. All rights reserved.
;