Topological Sorting Explained: Sorting Dependency Chains

type

status

date

summary

Kahn’s Algorithm (BFS-Based)

Kahn’s Algorithm uses a queue to process nodes with zero in-degree (no prerequisites). It works on graphs structured as { prerequisite: [dependents] }. It works by repeatedly remove nodes with in-degree = 0 (no prerequisites), and decrement their children’s in-degrees.

Figure credit: https://leetcode.com/problems/course-schedule-ii/editorial/

Steps

Initialization

Compute in-degree for every node.

Push all nodes with in-degree == 0 into a queue.

Process queue

Pop a node → append to result.

For each child, decrement its in-degree; if it hits zero, enqueue it.

Output:

Return topo_order if it includes all nodes.

Otherwise, detect a cycle.

Time Complexity:

Initialization: Calculating num_parents (in-degrees) takes O(V+E), where V is the number of nodes, and E is the number of edges.

Processing Nodes: Each node is processed once, and each edge is visited once, contributing O(V+E).

Overall Time Complexity: O(V+E).

Space Complexity:

num_parents and ready_queue: Require O(V) space.

Graph Representation: Requires O(V+E) space for adjacency list.

Overall Space Complexity: O(V+E).

DFS-Based Topological Sort

The DFS-based topological sort works by performing a post-order traversal. It works with either graphs structured with forward edges as graph ={dependent: prerequisites} or graph = { prerequisite: [dependents] }. It ensures a node is added to the resulting list only after all its dependent nodes (when traversing forward) or prerequisite nodes (when traversing backward) have been fully visited and processed. By collecting nodes upon the return of all recursive calls, this method guarantees that every node appears after all its dependencies have been listed, yielding the topological order.

`prereq -> dependents` (forward edges, like in Kahn's algorithm).

1. When dfs(A) is called, it immediately calls dfs(B), dfs(C), etc., where A is the prerequisite, B and C are its dependents.

2. The node A is appended to the topo_order list last, after all its dependent nodes (B, C, etc.) have finished processing.

3. This means the node that has the most prerequisites satisfied earliest is at the end of the list. That is, the list is built in reverse topological order, and we must call topo_order.reverse() at the end.

Time Complexity:

DFS Traversal: O(V+E), where V is the number of vertices and E is the number of edges.

Each edge is processed once, and each node is visited once.

Space Complexity:

Graph storage: O(V+E) for the adjacency list.

Recursion stack: O(V) in the worst case.

visited and visiting dictionaries: O(V).

`dependent -> prerequisites`

A node can only be added to the final ordering after all of its prerequisites nodes have been visited and processed (i.e., added to the list). By inserting a node into the result list only after the recursive calls to all of its prerequisites/neighbors return, we ensure all prerequisites come before the node itself.

Steps:

Traversal: Use a recursive Depth-First Search (DFS) to explore the graph.

Cycle Detection: Employ two sets (visiting and visited) to detect cycles during traversal:

visiting: Nodes currently in the recursion stack (a back-edge to a visiting node means a cycle).

visited: Nodes completely processed (all children visited).

Ordering: A node is appended to the topo_order list only after the DFS returns from all of its prerequisites/neighbors.

Comparison

Both methods are equally efficient in terms of theoretical complexity, offering a linear-time solution proportional to the size of the graph.

ㅤ	Kahn’s Algorithm (BFS)	DFS-Based Approach
Technique	Iterative: In-degree tracking + queue	Recursive: Postorder DFS + reverse (with `visiting` cycle check)
Graph orientation	`prereq → dependent`	Either `prereq → dependent` or `dependent → prereq`
Cycle detection	If not all nodes removed	Hitting a gray (`visiting`) node
Time / Space	`O(V+E)` / `O(V+E)`	`O(V+E)` / `O(V+E)`

Example: Course Schedule II

The problem gives an array prerequisites where prerequisites[i] = [ai, bi] indicates that you must take course bi first if you want to take course ai. And it asks for the ordering of courses you should take to finish all courses.

DFS Solution

Each course is a node; prerequisites form edges and build the graph prereq -> dependents.

Detect cycles and build the order using post-order DFS.

Kahn’s Algorithm (BFS)

Track in-degrees and process the courses with no any prerequisition first.

Endnote

Topological sorting elegantly transforms complex dependency graphs into linear execution sequences. Whether using Kahn’s queue-driven approach or DFS’s recursive depth, both methods reveal the underlying structure of directed acyclic graphs and empower solutions in scheduling, compilation, and course planning.

Topological Sorting Explained: Sorting Dependency Chains

Kahn’s Algorithm (BFS-Based)

DFS-Based Topological Sort

`prereq -> dependents` (forward edges, like in Kahn's algorithm).

`dependent -> prerequisites`

Comparison

Example: Course Schedule II

DFS Solution

Kahn’s Algorithm (BFS)

Endnote

Relate Posts

Depth-First Search: Exploring Deep Before Wide

Breadth-First Search: Level-Order Exploration

Topological Sorting Explained: Sorting Dependency Chains

Kahn’s Algorithm (BFS-Based)

DFS-Based Topological Sort

prereq -> dependents (forward edges, like in Kahn's algorithm).

dependent -> prerequisites

Comparison

Example: Course Schedule II

DFS Solution

Kahn’s Algorithm (BFS)

Endnote

Relate Posts

Depth-First Search: Exploring Deep Before Wide

Breadth-First Search: Level-Order Exploration

`prereq -> dependents` (forward edges, like in Kahn's algorithm).

`dependent -> prerequisites`