algorithms(bf-to-astar): add Dijkstra

Signed-off-by: Matej Focko <me@mfocko.xyz>
2025-04-15 17:33:40 +02:00 · 2024-01-03 12:50:29 +01:00
commit 5164a31ce2
parent 6a857d8ccd
4 changed files with 397 additions and 0 deletions
--- a/algorithms/11-paths/2024-01-01-bf-to-astar/02-dijkstra.md
+++ b/algorithms/11-paths/2024-01-01-bf-to-astar/02-dijkstra.md
@@ -0,0 +1,318 @@
+---
+id: dijkstra
+slug: /paths/bf-to-astar/dijkstra
+title: Dijkstra's algorithm
+description: |
+  Moving from Bellman-Ford to Dijkstra's algorithm.
+tags:
+- cpp
+- dynamic programming
+- greedy
+- dijkstra
+last_update:
+  date: 2024-01-03
+---
+
+## Intro
+
+Let's rewind to the small argument from the previous post: we can safely bound
+the number of iterations in which relaxations are done.
+
+We have said that, assuming the worst-case scenario (a bad order of
+relaxations), we **need** at most $\vert V \vert - 1$ iterations over all edges.
+We've used that to our advantage to _bound_ the iterations instead of relying
+on the `do-while` loop, which was a risk given the possibility of an infinite
+loop (when negative loops are present in the graph).
+
+:::tip
+
+We could have combined both: a _boolean flag_ denoting that some relaxation has
+happened and the upper bound on iterations. For some graphs that would result
+in faster termination.
+
+With only the upper bound, we keep trying to relax edges even when no further
+relaxation is possible.
+
+:::
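
The combination from the tip can be sketched roughly as follows. This is only an
illustration on a hypothetical edge list (`edge`, `UNREACHABLE` and
`bellman_ford_early_exit` are names made up here), not the grid-based
implementation from the previous post.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical edge-list representation, not the grid graph from this series.
struct edge {
  std::size_t from;
  std::size_t to;
  int weight;
};

constexpr int UNREACHABLE = std::numeric_limits<int>::max();

// Bellman-Ford bounded by |V| - 1 rounds that also stops as soon as a whole
// round passes without any relaxation.
auto bellman_ford_early_exit(std::size_t vertex_count,
                             const std::vector<edge>& edges,
                             std::size_t source) -> std::vector<int> {
  assert(source < vertex_count);

  std::vector<int> distances(vertex_count, UNREACHABLE);
  distances[source] = 0;

  // upper bound: at most |V| - 1 rounds over all edges
  for (std::size_t round = 0; round + 1 < vertex_count; ++round) {
    bool relaxed = false;

    for (const auto& [from, to, weight] : edges) {
      // skip edges that start in a vertex we have not reached yet
      if (distances[from] != UNREACHABLE &&
          distances[from] + weight < distances[to]) {
        distances[to] = distances[from] + weight;
        relaxed = true;
      }
    }

    // boolean flag: nothing changed, so no later round can change anything
    if (!relaxed) {
      break;
    }
  }

  return distances;
}
```

With a favourable order of edges the loop ends after two rounds (one that
relaxes, one that confirms nothing changed) instead of all
$\vert V \vert - 1$.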
+
+Now the question arises, could we leverage this somehow in a different way? What
+if we used it to improve the algorithm instead of just bounding the iterations?
+Would that be even possible?
+
+**Yes, it would!** And that's when _Dijkstra's algorithm_ comes in.
+
+## Dijkstra's algorithm
+
+I'll start with a well-known meme about Dijkstra's algorithm:
+![Dijkstra's algorithm meme](/img/algorithms/paths/bf-to-astar/dijkstra-meme.jpg)
+
+And then follow up on that with the actual backstory from Dijkstra himself:
+> What is the shortest way to travel from Rotterdam to Groningen, in general:
+> from given city to given city. It is the algorithm for the shortest path,
+> which I designed in about twenty minutes. One morning I was shopping in
+> Amsterdam with my young fiancée, and tired, we sat down on the café terrace to
+> drink a cup of coffee and I was just thinking about whether I could do this,
+> and I then designed the algorithm for the shortest path. As I said, it was
+> a twenty-minute invention. In fact, it was published in '59, three years
+> later. The publication is still readable, it is, in fact, quite nice. One of
+> the reasons that it is so nice was that I designed it without pencil and
+> paper. I learned later that one of the advantages of designing without pencil
+> and paper is that you are almost forced to avoid all avoidable complexities.
+> Eventually, that algorithm became to my great amazement, one of the
+> cornerstones of my fame.
+>
+> — Edsger Dijkstra, in an interview with Philip L. Frana, Communications of the
+> ACM, 2001
+
+:::caution Precondition
+
+Like our own naïve algorithm, Dijkstra's algorithm has a precondition: the graph
+must contain _no edges with negative weights_. This precondition is required
+because of the nature of the algorithm, which relies on monotonically
+non-decreasing changes in the costs of shortest paths.
+
+:::
+
+## Short description
+
+Let's have a brief look at the pseudocode taken from Wikipedia:
+```
+function Dijkstra(Graph, source):
+    for each vertex v in Graph.Vertices:
+        dist[v] ← INFINITY
+        prev[v] ← UNDEFINED
+        add v to Q
+    dist[source] ← 0
+
+    while Q is not empty:
+        u ← vertex in Q with min dist[u]
+        remove u from Q
+
+        for each neighbor v of u still in Q:
+            alt ← dist[u] + Graph.Edges(u, v)
+            if alt < dist[v]:
+                dist[v] ← alt
+                prev[v] ← u
+
+    return dist[], prev[]
+```
+
+Dijkstra's algorithm always extends paths from a vertex to which it already
+knows a shortest path. Relaxing the outgoing edges of that vertex either yields
+the shortest path to a neighbor, or at least some path to it that a later
+relaxation may still improve.
+
+Given that we **always** need to choose the _cheapest_ vertex, we can use a min
+heap to our advantage, which further improves the time complexity of the
+algorithm.
+
+## Used techniques
+
+This algorithm leverages the _dynamic programming_ technique, which has already
+been mentioned with regard to the _Bellman-Ford_ algorithm, and also the
+_greedy_ technique. Let's talk about them both!
+
+The _dynamic programming_ technique comes from the fact that we are continuously
+building on top of the shortest paths that we have found so far. We slowly build
+the shortest paths from the given source vertex to all other vertices that are
+reachable.
+
+The _greedy_ technique is utilized in such a way that Dijkstra's algorithm
+always improves the shortest paths from the vertex that is the closest, i.e. it
+tries extending the shortest path to some vertex by appending an edge. Such an
+extended path may (or may not) be the shortest path to another vertex.
+
+:::tip Greedy algorithms
+
+_Greedy algorithms_ are algorithms that choose the best available action
+**at the moment**.
+
+:::
+
+The reason why the algorithm requires no edges with negative weights comes from
+the fact that it's greedy. By requiring non-negative weights in the graph, we
+are guaranteed that at the moment we process the outgoing edges of a vertex, we
+already have a shortest path to that vertex. For a neighbor reached over such
+an edge this means that either we have found its shortest path, or a cheaper
+path leads through some other vertex that has a higher cost, but whose outgoing
+edge compensates for it.
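
To make the requirement tangible, here is a minimal sketch (not the grid
implementation from this series) that runs a `visited`-based Dijkstra on a tiny
hypothetical adjacency list. The names `adjacency_t`, `dijkstra_with_visited`
and `UNREACHABLE` are made up for this illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <limits>
#include <queue>
#include <utility>
#include <vector>

// Hypothetical adjacency list: adjacency[u] holds pairs (v, weight of u → v).
using adjacency_t = std::vector<std::vector<std::pair<std::size_t, int>>>;

constexpr int UNREACHABLE = std::numeric_limits<int>::max();

auto dijkstra_with_visited(const adjacency_t& adjacency, std::size_t source)
    -> std::vector<int> {
  assert(source < adjacency.size());

  std::vector<int> distances(adjacency.size(), UNREACHABLE);
  std::vector<bool> visited(adjacency.size(), false);
  distances[source] = 0;

  // min heap of (cost, vertex) pairs
  using item_t = std::pair<int, std::size_t>;
  std::priority_queue<item_t, std::vector<item_t>, std::greater<>> queue;
  queue.emplace(0, source);

  while (!queue.empty()) {
    const auto u = queue.top().second;
    queue.pop();

    // greedy step: the first time we pop a vertex, we consider it final
    if (visited[u]) {
      continue;
    }
    visited[u] = true;

    for (const auto& [v, weight] : adjacency[u]) {
      if (distances[u] + weight < distances[v]) {
        distances[v] = distances[u] + weight;
        queue.emplace(distances[v], v);
      }
    }
  }

  return distances;
}
```

Take the edges `0 → 1` (cost 2), `0 → 2` (cost 5), `2 → 1` (cost -4) and
`1 → 3` (cost 1). The sketch reports the cost `3` for the vertex `3`, even
though the path `0 → 2 → 1 → 3` costs only `2`: the vertex `1` is finalized with
the cost `2` before the negative edge lowers it to `1`, and its outgoing edge
is never processed again.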
+
+## Implementation
+
+First, we need some priority queue wrappers. C++ itself offers functions that
+can be used for maintaining max heaps. They also come in a generalized version
+that accepts any ordering; in our case we need the reverse ordering, because we
+need a min heap.
+```cpp
+using pqueue_item_t = std::pair<int, vertex_t>;
+using pqueue_t = std::vector<pqueue_item_t>;
+
+auto pushq(pqueue_t& q, pqueue_item_t v) -> void {
+  q.push_back(v);
+  std::push_heap(q.begin(), q.end(), std::greater<>{});
+}
+
+auto popq(pqueue_t& q) -> std::optional<pqueue_item_t> {
+  if (q.empty()) {
+    return {};
+  }
+
+  std::pop_heap(q.begin(), q.end(), std::greater<>{});
+  pqueue_item_t top = q.back();
+  q.pop_back();
+
+  return std::make_optional(top);
+}
+```
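
A quick way to convince ourselves that the wrappers behave like a min heap is
to push a few items and drain them. The sketch below repeats the two wrappers to
stay self-contained; `vertex_t` is **assumed** to be a pair of coordinates (the
real alias lives in `graph.hpp`) and `drain_costs` is a hypothetical helper.

```cpp
#include <algorithm>
#include <cassert>
#include <functional>
#include <optional>
#include <utility>
#include <vector>

// Assumption: ‹vertex_t› is a pair of coordinates, as suggested by ‹main.cpp›.
using vertex_t = std::pair<int, int>;
using pqueue_item_t = std::pair<int, vertex_t>;
using pqueue_t = std::vector<pqueue_item_t>;

auto pushq(pqueue_t& q, pqueue_item_t v) -> void {
  q.push_back(v);
  std::push_heap(q.begin(), q.end(), std::greater<>{});
}

auto popq(pqueue_t& q) -> std::optional<pqueue_item_t> {
  if (q.empty()) {
    return {};
  }

  std::pop_heap(q.begin(), q.end(), std::greater<>{});
  pqueue_item_t top = q.back();
  q.pop_back();

  return std::make_optional(top);
}

// Hypothetical helper: pops everything and returns the costs in pop order.
auto drain_costs(pqueue_t q) -> std::vector<int> {
  // the wrappers keep the vector a valid heap under ‹std::greater›
  assert(std::is_heap(q.begin(), q.end(), std::greater<>{}));

  std::vector<int> costs;
  while (auto item = popq(q)) {
    costs.push_back(item->first);
  }
  return costs;
}
```

Pairs are compared lexicographically, so the cost (the first member) decides
the order and the vertex only breaks ties. Pushing items with the costs `5`, `1`
and `3` and draining the queue yields them as `1`, `3`, `5`.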
+
+And now we can finally move to the actual implementation of Dijkstra's
+algorithm:
+```cpp
+auto dijkstra(const graph& g, const vertex_t& source)
+    -> std::vector<std::vector<int>> {
+  // make sure that ‹source› exists
+  assert(g.has(source));
+
+  // initialize the distances
+  std::vector<std::vector<int>> distances(
+      g.height(), std::vector(g.width(), graph::unreachable()));
+
+  // initialize the visited
+  std::vector<std::vector<bool>> visited(g.height(),
+                                         std::vector(g.width(), false));
+
+  // ‹source› destination denotes the beginning where the cost is 0
+  auto [sx, sy] = source;
+  distances[sy][sx] = 0;
+
+  pqueue_t priority_queue{std::make_pair(0, source)};
+  std::optional<pqueue_item_t> item{};
+  while ((item = popq(priority_queue))) {
+    auto [cost, u] = *item;
+    auto [x, y] = u;
+
+    // we have already found the shortest path
+    if (visited[y][x]) {
+      continue;
+    }
+    visited[y][x] = true;
+
+    for (const auto& [dx, dy] : DIRECTIONS) {
+      auto v = std::make_pair(x + dx, y + dy);
+      auto cost = g.cost(u, v);
+
+      // if we can move to the cell and it's better, relax¹ it and update queue
+      if (cost != graph::unreachable() &&
+          distances[y][x] + cost < distances[y + dy][x + dx]) {
+        distances[y + dy][x + dx] = distances[y][x] + cost;
+        pushq(priority_queue, std::make_pair(distances[y + dy][x + dx], v));
+      }
+    }
+  }
+
+  return distances;
+}
+```
+
+## Time complexity
+
+The time complexity of Dijkstra's algorithm differs based on the backing data
+structure.
+
+The original implementation doesn't leverage the heap which results in
+repetitive _look up_ of the “closest” vertex, hence we get the following
+worst-case time complexity in the _Bachmann-Landau_ notation:
+$$
+\Theta(\vert V \vert^2)
+$$
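
For comparison, a heap-less variant can be sketched as below. It is a rough
illustration on a hypothetical adjacency matrix (`NO_EDGE`, `UNREACHABLE` and
`dijkstra_array` are made-up names) and assumes non-negative weights; the linear
scan for the “closest” vertex, repeated once per vertex, is where the
quadratic bound comes from.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// Assumption: weights are non-negative, so -1 can mark a missing edge.
constexpr int NO_EDGE = -1;
constexpr int UNREACHABLE = std::numeric_limits<int>::max();

auto dijkstra_array(const std::vector<std::vector<int>>& weights,
                    std::size_t source) -> std::vector<int> {
  const auto n = weights.size();
  assert(source < n);

  std::vector<int> distances(n, UNREACHABLE);
  std::vector<bool> done(n, false);
  distances[source] = 0;

  for (std::size_t round = 0; round < n; ++round) {
    // linear scan for the cheapest reachable vertex that is not done yet
    std::size_t u = n;
    for (std::size_t candidate = 0; candidate < n; ++candidate) {
      if (!done[candidate] && distances[candidate] != UNREACHABLE &&
          (u == n || distances[candidate] < distances[u])) {
        u = candidate;
      }
    }

    // everything that is left cannot be reached from the source
    if (u == n) {
      break;
    }
    done[u] = true;

    // relax all outgoing edges of ‹u›
    for (std::size_t v = 0; v < n; ++v) {
      const int weight = weights[u][v];
      if (weight != NO_EDGE && distances[u] + weight < distances[v]) {
        distances[v] = distances[u] + weight;
      }
    }
  }

  return distances;
}
```

Both the scan and the relaxation loop over all $\vert V \vert$ vertices, and
the outer loop runs at most $\vert V \vert$ times.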
+
+Let's turn our attention to the backing data structure. We always want the
+“cheapest” vertex, which is why we can use a min heap. If we use a Fibonacci
+heap, we can achieve the following amortized time complexity:
+$$
+\mathcal{O}(\vert E \vert + \vert V \vert \cdot \log{\vert V \vert})
+$$
+
+:::tip Fibonacci heap
+
+A Fibonacci heap provides $\Theta(1)$ **amortized** insertion and key decrease,
+and $\mathcal{O}(\log{n})$ **amortized** removal of the top (either min or
+max).
+
+:::
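
Our implementation does not use a Fibonacci heap. It keeps a binary heap in a
`std::vector` and, instead of decreasing a key, pushes a fresh entry on every
successful relaxation. As a rough estimate under that reading of the code: each
vertex has its outgoing edges processed at most once (thanks to `visited`), so
there are at most $\vert E \vert + 1$ pushes and pops, each costing
$\mathcal{O}(\log{\vert E \vert})$:
$$
\mathcal{O}(\vert E \vert \cdot \log{\vert E \vert})
= \mathcal{O}(\vert E \vert \cdot \log{\vert V \vert})
$$

The equality holds because $\vert E \vert \leq \vert V \vert^2$ in a graph
without parallel edges, hence
$\log{\vert E \vert} \leq 2 \cdot \log{\vert V \vert}$.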
+
+## Running the Dijkstra
+
+Let's run our code:
+```
+Normal cost: 1
+Vortex cost: 5
+Graph:
+#############
+#..#..*.*.**#
+##***.....**#
+#..########.#
+#...###...#.#
+#..#...##.#.#
+#..#.*.#..#.#
+#D...#....#.#
+########*.*.#
+#S..........#
+#############
+[Finite BF] Cost: 22
+[Bellman-Ford] Cost: 22
+[Dijkstra] Cost: 22
+```
+
+OK, so it seems to be working just fine. Now the question arises:
+
+> What happens when we have negative weights in our graph?
+
+## Busting the myth about looping Dijkstra
+
+One of the very common misconceptions about Dijkstra's algorithm is that it
+loops infinitely when you have negative weights or loops in the graph. Well, if
+we use our _propelling vortices_, we have not only negative weights, but also
+negative loops. Let's run our code! Our first naïve approach was actually
+looping:
+```
+Normal cost: 1
+Vortex cost: -1
+Graph:
+#############
+#..#..*.*.**#
+##***.....**#
+#..########.#
+#...###...#.#
+#..#...##.#.#
+#..#.*.#..#.#
+#D...#....#.#
+########*.*.#
+#S..........#
+#############
+[Finite BF] Cost: -240
+[Bellman-Ford] Found a negative loop
+[Bellman-Ford] Cost: -240
+[Dijkstra] Cost: 14
+```
+
+Well, it definitely doesn't loop. How much sense `14` makes is a different
+matter.
+
+:::info Variations
+
+There are multiple variations of Dijkstra's algorithm. You **can** implement it
+in such a way that it loops infinitely with negative weights or loops, but that
+can be countered. In our case we keep track of the vertices that already have
+a shortest path established via `visited`; that's why even multiple entries
+for one vertex in the heap are not an issue.
+
+:::
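
One such variation can be sketched like this: there is no `visited`, every
improvement simply re-queues the vertex. The sketch uses a hypothetical
adjacency list, and `max_pops` is a made-up safety limit that only exists so
that the example terminates on a negative loop.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <limits>
#include <optional>
#include <queue>
#include <utility>
#include <vector>

// Hypothetical adjacency list: adjacency[u] holds pairs (v, weight of u → v).
using adjacency_t = std::vector<std::vector<std::pair<std::size_t, int>>>;

constexpr int UNREACHABLE = std::numeric_limits<int>::max();

// Returns no value once ‹max_pops› is exhausted, which is what happens when
// a negative loop keeps lowering the costs forever.
auto dijkstra_requeue(const adjacency_t& adjacency, std::size_t source,
                      std::size_t max_pops)
    -> std::optional<std::vector<int>> {
  assert(source < adjacency.size());

  std::vector<int> distances(adjacency.size(), UNREACHABLE);
  distances[source] = 0;

  using item_t = std::pair<int, std::size_t>;
  std::priority_queue<item_t, std::vector<item_t>, std::greater<>> queue;
  queue.emplace(0, source);

  for (std::size_t pops = 0; !queue.empty(); ++pops) {
    if (pops == max_pops) {
      return std::nullopt;
    }

    const auto [cost, u] = queue.top();
    queue.pop();

    // stale entry: a cheaper path to ‹u› has been found in the meantime
    if (cost > distances[u]) {
      continue;
    }

    for (const auto& [v, weight] : adjacency[u]) {
      if (cost + weight < distances[v]) {
        distances[v] = cost + weight;
        queue.emplace(distances[v], v);
      }
    }
  }

  return distances;
}
```

With the edges `0 → 1` (cost 2), `0 → 2` (cost 5), `2 → 1` (cost -4) and
`1 → 3` (cost 1) this variant re-processes the vertex `1` and ends with the
costs `0`, `1`, `5` and `2`. On a two-vertex negative loop, `0 → 1` (cost 1) and
`1 → 0` (cost -2), the queue never empties and only the limit stops it.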
+
+## Summary
+
+Now we have an algorithm for finding the shortest path that is faster than our
+original naïve brute-force or Bellman-Ford. However, we need to keep in mind its
+requirement of no negative weights for correct functioning.
+
+You can also see how we used our thought process of figuring out the worst-case
+time complexity for the naïve or Bellman-Ford algorithm to improve the original
+path-finding algorithms.
--- a/static/files/algorithms/paths/bf-to-astar/dijkstra.hpp
+++ b/static/files/algorithms/paths/bf-to-astar/dijkstra.hpp
@@ -1,4 +1,80 @@
 #ifndef _DIJKSTRA_HPP
 #define _DIJKSTRA_HPP

+#include <algorithm>
+#include <cassert>
+#include <functional>
+#include <optional>
+#include <utility>
+#include <vector>
+
+#include "graph.hpp"
+
+namespace {
+using pqueue_item_t = std::pair<int, vertex_t>;
+using pqueue_t = std::vector<pqueue_item_t>;
+
+auto pushq(pqueue_t& q, pqueue_item_t v) -> void {
+  q.push_back(v);
+  std::push_heap(q.begin(), q.end(), std::greater<>{});
+}
+
+auto popq(pqueue_t& q) -> std::optional<pqueue_item_t> {
+  if (q.empty()) {
+    return {};
+  }
+
+  std::pop_heap(q.begin(), q.end(), std::greater<>{});
+  pqueue_item_t top = q.back();
+  q.pop_back();
+
+  return std::make_optional(top);
+}
+}  // namespace
+
+auto dijkstra(const graph& g, const vertex_t& source)
+    -> std::vector<std::vector<int>> {
+  // make sure that ‹source› exists
+  assert(g.has(source));
+
+  // initialize the distances
+  std::vector<std::vector<int>> distances(
+      g.height(), std::vector(g.width(), graph::unreachable()));
+
+  // initialize the visited
+  std::vector<std::vector<bool>> visited(g.height(),
+                                         std::vector(g.width(), false));
+
+  // ‹source› destination denotes the beginning where the cost is 0
+  auto [sx, sy] = source;
+  distances[sy][sx] = 0;
+
+  pqueue_t priority_queue{std::make_pair(0, source)};
+  std::optional<pqueue_item_t> item{};
+  while ((item = popq(priority_queue))) {
+    auto [cost, u] = *item;
+    auto [x, y] = u;
+
+    // we have already found the shortest path
+    if (visited[y][x]) {
+      continue;
+    }
+    visited[y][x] = true;
+
+    for (const auto& [dx, dy] : DIRECTIONS) {
+      auto v = std::make_pair(x + dx, y + dy);
+      auto cost = g.cost(u, v);
+
+      // if we can move to the cell and it's better, relax¹ it and update queue
+      if (cost != graph::unreachable() &&
+          distances[y][x] + cost < distances[y + dy][x + dx]) {
+        distances[y + dy][x + dx] = distances[y][x] + cost;
+        pushq(priority_queue, std::make_pair(distances[y + dy][x + dx], v));
+      }
+    }
+  }
+
+  return distances;
+}
+
 #endif /* _DIJKSTRA_HPP */
--- a/static/files/algorithms/paths/bf-to-astar/main.cpp
+++ b/static/files/algorithms/paths/bf-to-astar/main.cpp
@@ -37,5 +37,8 @@ auto main() -> int {
  auto distances = bellman_ford(g, std::make_pair(1, 9));
  std::cout << "[Bellman-Ford] Cost: " << distances[7][1] << "\n";

+  distances = dijkstra(g, std::make_pair(1, 9));
+  std::cout << "[Dijkstra] Cost: " << distances[7][1] << "\n";
+
  return 0;
 }
--- a/static/img/algorithms/paths/bf-to-astar/dijkstra-meme.jpg
+++ b/static/img/algorithms/paths/bf-to-astar/dijkstra-meme.jpg