721. Accounts Merge - Detailed Explanation

Free Coding Questions Catalog
Boost your coding skills with our essential coding questions catalog. Take a step towards a better tech career now!

Problem Statement

You are given a list of accounts where each account is represented as a list of strings. The first string is a name, and the remaining strings are email addresses. Two accounts definitely belong to the same person if there is any common email between them. Even if two accounts have the same name, they may not belong to the same person unless they share an email.

Your Task:
Merge the accounts so that each account contains all the unique emails of a single person, then return the merged account in the following format:

  • The first element is the name.

  • The rest of the elements are the unique email addresses in sorted lexicographical order.

Example:

  • Input:
    [
      ["John", "johnsmith@mail.com", "john_newyork@mail.com"],
      ["John", "johnsmith@mail.com", "john00@mail.com"],
      ["Mary", "mary@mail.com"],
      ["John", "johnnybravo@mail.com"]
    ]
    
  • Output:
    [
      ["John", "john00@mail.com", "john_newyork@mail.com", "johnsmith@mail.com"],
      ["Mary", "mary@mail.com"],
      ["John", "johnnybravo@mail.com"]
    ]
    

Explanation:
The first two accounts belong to the same person (John) because they share the email "johnsmith@mail.com". They are merged into one account with all unique emails sorted in lexicographic order. The other two accounts do not share any common emails with any other accounts, so they remain separate.

Hints

  • Graph/Union-Find Idea:
    Think of each email as a node in a graph. Two emails are connected if they appear in the same account. The problem then reduces to finding all connected components (subgraphs) in the graph. A common way to solve this is to use Union-Find (also known as Disjoint Set Union, DSU).

  • Mapping Emails to Names:
    While grouping emails, it is essential to remember which name they belong to. For every email you process, store a mapping from the email to the corresponding account name.

  • Merging Process:
    Once you have grouped all emails using union-find (or DFS), sort each group of emails, prepend the name, and that forms one merged account.

Approach Explanation (Using Union-Find)

Step 1: Initialize Data Structures

  • Union-Find Structure:
    Create a union-find data structure to efficiently join connected emails.

  • Email-to-Name Mapping:
    For every email encountered, map it to the name from the account.

Step 2: Process Each Account

  • For each account, iterate through its emails:
    • For every pair of emails in the same account, use the union operation to join them.
    • This ensures that all emails in the same account are connected in the union-find structure.

Step 3: Group Emails by Root Parent

  • After processing all accounts, iterate through every email.
  • Use the union-find "find" operation to determine the root parent of each email.
  • Group emails by their root parent in a dictionary.

Step 4: Construct the Final Output

  • For each group (each unique root), sort the list of emails lexicographically.
  • Retrieve the name (using the mapping) and form the merged account as a list (with the name as the first element).

Python Code Implementation

Python3
Python3

. . . .

Java Code

Java
Java

. . . .

Complexity Analysis

  • Time Complexity:

    • Let ( N ) be the total number of emails across all accounts.

    • The union-find operations (find and union) take nearly O(α(N)) time (where α is the inverse Ackermann function, which grows very slowly and is nearly constant in practice).

    • Grouping and sorting each group of emails will take additional time. If ( k ) is the average number of emails in a group, sorting costs O(( k \log k )).

    • Overall, the algorithm runs in roughly O(N log N) time in the worst case.

  • Space Complexity:

    • The union-find data structure and the email-to-name mapping both require O(N) space.
    • The groups dictionary also uses O(N) space.

Common Mistakes and Edge Cases

  • Not Using Path Compression:
    Failing to compress paths in the union-find structure can lead to slower operations, especially for deep trees.

  • Incorrect Email-to-Name Mapping:
    Ensure that each email is correctly mapped to its owner. The name is taken from the first occurrence, and all emails in the same group must have the same name.

  • Handling Duplicate Emails:
    An account may list the same email more than once. Make sure union operations and grouping do not get affected by duplicates.

  • Edge Case – No Accounts:
    If the input is empty, return an empty list.

  • Accounts Merge is related to problems that involve connected components and graph traversal, such as:
    • Graph Connected Components using DFS/BFS

    • Number of Islands (finding connected regions in a grid)

    • Union-Find based clustering problems

TAGS
leetcode
CONTRIBUTOR
Design Gurus Team
-

GET YOUR FREE

Coding Questions Catalog

Design Gurus Newsletter - Latest from our Blog
Boost your coding skills with our essential coding questions catalog.
Take a step towards a better tech career now!
Explore Answers
How many rounds of interview does Uber have?
Which country is best for software engineering?
Is MongoDB still in demand?
Related Courses
Image
Grokking the Coding Interview: Patterns for Coding Questions
Grokking the Coding Interview Patterns in Java, Python, JS, C++, C#, and Go. The most comprehensive course with 476 Lessons.
Image
Grokking Modern AI Fundamentals
Master the fundamentals of AI today to lead the tech revolution of tomorrow.
Image
Grokking Data Structures & Algorithms for Coding Interviews
Unlock Coding Interview Success: Dive Deep into Data Structures and Algorithms.
Image
One-Stop Portal For Tech Interviews.
Copyright © 2025 Design Gurus, LLC. All rights reserved.
;