mirror of
https://github.com/mfocko/blog.git
synced 2024-11-21 12:33:57 +01:00
algorithms(bf-to-astar): add Dijkstra
Signed-off-by: Matej Focko <me@mfocko.xyz>
This commit is contained in:
parent
6a857d8ccd
commit
5164a31ce2
4 changed files with 397 additions and 0 deletions
318
algorithms/11-paths/2024-01-01-bf-to-astar/02-dijkstra.md
Normal file
318
algorithms/11-paths/2024-01-01-bf-to-astar/02-dijkstra.md
Normal file
|
@ -0,0 +1,318 @@
|
|||
---
|
||||
id: dijkstra
|
||||
slug: /paths/bf-to-astar/dijkstra
|
||||
title: Dijkstra's algorithm
|
||||
description: |
|
||||
Moving from Bellman-Ford into the Dijsktra's algorithm.
|
||||
tags:
|
||||
- cpp
|
||||
- dynamic programming
|
||||
- greedy
|
||||
- dijkstra
|
||||
last_update:
|
||||
date: 2024-01-03
|
||||
---
|
||||
|
||||
## Intro
|
||||
|
||||
Let's rewind back to the small argument in the previous post about the fact that
|
||||
we can safely bound the amount of iterations with relaxations being done.
|
||||
|
||||
We have said that assuming the worst-case scenario (bad order of relaxations) we
|
||||
**need** at most $\vert V \vert - 1$ iterations over all edges. We've used that
|
||||
to our advantage to _bound_ the iterations instead of the `do-while` loop that
|
||||
was a risk given the possibility of the infinite loop (when negative loops are
|
||||
present in the graph).
|
||||
|
||||
:::tip
|
||||
|
||||
We could've possibly used both _boolean flag_ to denote that some relaxation has
|
||||
happened and the upper bound of iterations, for some graphs that would result in
|
||||
faster termination.
|
||||
|
||||
Using only the upper bound we try to relax edges even though we can't.
|
||||
|
||||
:::
|
||||
|
||||
Now the question arises, could we leverage this somehow in a different way? What
|
||||
if we used it to improve the algorithm instead of just bounding the iterations?
|
||||
Would that be even possible?
|
||||
|
||||
**Yes, it would!** And that's when _Dijkstra's algorithm_ comes in.
|
||||
|
||||
## Dijkstra's algorithm
|
||||
|
||||
I'll start with a well-known meme about Dijkstra's algorithm:
|
||||
![Dijkstra's algorithm meme](/img/algorithms/paths/bf-to-astar/dijkstra-meme.jpg)
|
||||
|
||||
And then follow up on that with the actual backstory from Dijkstra himself:
|
||||
> What is the shortest way to travel from Rotterdam to Groningen, in general:
|
||||
> from given city to given city. It is the algorithm for the shortest path,
|
||||
> which I designed in about twenty minutes. One morning I was shopping in
|
||||
> Amsterdam with my young fiancée, and tired, we sat down on the café terrace to
|
||||
> drink a cup of coffee and I was just thinking about whether I could do this,
|
||||
> and I then designed the algorithm for the shortest path. As I said, it was
|
||||
> a twenty-minute invention. In fact, it was published in '59, three years
|
||||
> later. The publication is still readable, it is, in fact, quite nice. One of
|
||||
> the reasons that it is so nice was that I designed it without pencil and
|
||||
> paper. I learned later that one of the advantages of designing without pencil
|
||||
> and paper is that you are almost forced to avoid all avoidable complexities.
|
||||
> Eventually, that algorithm became to my great amazement, one of the
|
||||
> cornerstones of my fame.
|
||||
>
|
||||
> — Edsger Dijkstra, in an interview with Philip L. Frana, Communications of the
|
||||
> ACM, 2001
|
||||
|
||||
:::caution Precondition
|
||||
|
||||
As our own naïve algorithm, Dijkstra's algorithm has a precondition that places
|
||||
a requirement of _no edges with negative weights_ in the graph. This
|
||||
precondition is required because of the nature of the algorithm that requires
|
||||
monotonically non-decreasing changes in the costs of shortest paths.
|
||||
|
||||
:::
|
||||
|
||||
## Short description
|
||||
|
||||
Let's have a brief look at the pseudocode taken from the Wikipedia:
|
||||
```
|
||||
function Dijkstra(Graph, source):
|
||||
for each vertex v in Graph.Vertices:
|
||||
dist[v] ← INFINITY
|
||||
prev[v] ← UNDEFINED
|
||||
add v to Q
|
||||
dist[source] ← 0
|
||||
|
||||
while Q is not empty:
|
||||
u ← vertex in Q with min dist[u]
|
||||
remove u from Q
|
||||
|
||||
for each neighbor v of u still in Q:
|
||||
alt ← dist[u] + Graph.Edges(u, v)
|
||||
if alt < dist[v]:
|
||||
dist[v] ← alt
|
||||
prev[v] ← u
|
||||
|
||||
return dist[], prev[]
|
||||
```
|
||||
|
||||
Dijkstra's algorithm works in such way that it always tries to find the shortest
|
||||
paths from a vertex to which it already has a shortest path. This may result in
|
||||
finding the shortest path to another vertex, or at least some path, till further
|
||||
relaxation.
|
||||
|
||||
Given that we need to **always** choose the _cheapest_ vertex, we can use a min
|
||||
heap to our advantage which can further improve the time complexity of the
|
||||
algorithm.
|
||||
|
||||
## Used techniques
|
||||
|
||||
This algorithm leverages the _dynamic programming_ technique that has already
|
||||
been mentioned with regards to the _Bellman-Ford_ algorithm and also _greedy_
|
||||
technique. Let's talk about them both!
|
||||
|
||||
_Dynamic programming_ technique comes from the fact that we are continuously
|
||||
building on top of the shortest paths that we have found so far. We slowly build
|
||||
the shortest paths from the given source vertex to all other vertices that are
|
||||
reachable.
|
||||
|
||||
_Greedy_ technique is utilized in such way that Dijkstra's algorithm always
|
||||
improves the shortest paths from the vertex that is the closest, i.e. it tries
|
||||
extending the shortest path to some vertex by appending an edge, such extended
|
||||
path may (or may not) be the shortest path to another vertex.
|
||||
|
||||
:::tip Greedy algorithms
|
||||
|
||||
_Greedy algorithms_ are algorithms that choose the most optimal action
|
||||
**at the moment**.
|
||||
|
||||
:::
|
||||
|
||||
The reason why the algorithm requires no edges with negative weights comes from
|
||||
the fact that it's greedy. By laying the requirement of non-negative weights in
|
||||
the graph we are guaranteed that at any given moment of processing outgoing
|
||||
edges from a vertex, we already have a shortest path to the given vertex. This
|
||||
means that either this is the shortest path, or there is some other vertex that
|
||||
may have a higher cost, but the outgoing edge compensates for it.
|
||||
|
||||
## Implementation
|
||||
|
||||
Firstly we need to have some priority queue wrappers. C++ itself offers
|
||||
functions that can be used for maintaining max heaps. They also have generalized
|
||||
version with any ordering, in our case we need reverse ordering, because we need
|
||||
the min heap.
|
||||
```cpp
|
||||
using pqueue_item_t = std::pair<int, vertex_t>;
|
||||
using pqueue_t = std::vector<pqueue_item_t>;
|
||||
|
||||
auto pushq(pqueue_t& q, pqueue_item_t v) -> void {
|
||||
q.push_back(v);
|
||||
std::push_heap(q.begin(), q.end(), std::greater<>{});
|
||||
}
|
||||
|
||||
auto popq(pqueue_t& q) -> std::optional<pqueue_item_t> {
|
||||
if (q.empty()) {
|
||||
return {};
|
||||
}
|
||||
|
||||
std::pop_heap(q.begin(), q.end(), std::greater<>{});
|
||||
pqueue_item_t top = q.back();
|
||||
q.pop_back();
|
||||
|
||||
return std::make_optional(top);
|
||||
}
|
||||
```
|
||||
|
||||
And now we can finally move to the actual implementation of the Dijkstra's
|
||||
algorithm:
|
||||
```cpp
|
||||
auto dijkstra(const graph& g, const vertex_t& source)
|
||||
-> std::vector<std::vector<int>> {
|
||||
// make sure that ‹source› exists
|
||||
assert(g.has(source));
|
||||
|
||||
// initialize the distances
|
||||
std::vector<std::vector<int>> distances(
|
||||
g.height(), std::vector(g.width(), graph::unreachable()));
|
||||
|
||||
// initialize the visited
|
||||
std::vector<std::vector<bool>> visited(g.height(),
|
||||
std::vector(g.width(), false));
|
||||
|
||||
// ‹source› destination denotes the beginning where the cost is 0
|
||||
auto [sx, sy] = source;
|
||||
distances[sy][sx] = 0;
|
||||
|
||||
pqueue_t priority_queue{std::make_pair(0, source)};
|
||||
std::optional<pqueue_item_t> item{};
|
||||
while ((item = popq(priority_queue))) {
|
||||
auto [cost, u] = *item;
|
||||
auto [x, y] = u;
|
||||
|
||||
// we have already found the shortest path
|
||||
if (visited[y][x]) {
|
||||
continue;
|
||||
}
|
||||
visited[y][x] = true;
|
||||
|
||||
for (const auto& [dx, dy] : DIRECTIONS) {
|
||||
auto v = std::make_pair(x + dx, y + dy);
|
||||
auto cost = g.cost(u, v);
|
||||
|
||||
// if we can move to the cell and it's better, relax¹ it and update queue
|
||||
if (cost != graph::unreachable() &&
|
||||
distances[y][x] + cost < distances[y + dy][x + dx]) {
|
||||
distances[y + dy][x + dx] = distances[y][x] + cost;
|
||||
pushq(priority_queue, std::make_pair(distances[y + dy][x + dx], v));
|
||||
}
|
||||
}
|
||||
}
|
||||
|
||||
return distances;
|
||||
}
|
||||
```
|
||||
|
||||
## Time complexity
|
||||
|
||||
The time complexity of Dijkstra's algorithm differs based on the backing data
|
||||
structure.
|
||||
|
||||
The original implementation doesn't leverage the heap which results in
|
||||
repetitive _look up_ of the “closest” vertex, hence we get the following
|
||||
worst-case time complexity in the _Bachmann-Landau_ notation:
|
||||
$$
|
||||
\Theta(\vert V \vert^2)
|
||||
$$
|
||||
|
||||
If we turn our attention to the backing data structure, we always want the
|
||||
“cheapest” vertex, that's why we can use the min heap, given that we use
|
||||
Fibonacci heap we can achieve the following amortized time complexity:
|
||||
$$
|
||||
\mathcal{O}(\vert E \vert + \vert V \vert \cdot \log{\vert V \vert})
|
||||
$$
|
||||
|
||||
:::tip Fibonacci heap
|
||||
|
||||
Fibonacci heap is known as the heap that provides $\Theta(1)$ **amortized**
|
||||
insertion and $\mathcal{O}(\log{n})$ **amortized** removal of the top (either
|
||||
min or max).
|
||||
|
||||
:::
|
||||
|
||||
## Running the Dijkstra
|
||||
|
||||
Let's run our code:
|
||||
```
|
||||
Normal cost: 1
|
||||
Vortex cost: 5
|
||||
Graph:
|
||||
#############
|
||||
#..#..*.*.**#
|
||||
##***.....**#
|
||||
#..########.#
|
||||
#...###...#.#
|
||||
#..#...##.#.#
|
||||
#..#.*.#..#.#
|
||||
#D...#....#.#
|
||||
########*.*.#
|
||||
#S..........#
|
||||
#############
|
||||
[Finite BF] Cost: 22
|
||||
[Bellman-Ford] Cost: 22
|
||||
[Dijkstra] Cost: 22
|
||||
```
|
||||
|
||||
OK, so it seems to be working just fine. Now the question arises:
|
||||
|
||||
> What happens when we have negative weights in our graph?
|
||||
|
||||
## Busting the myth about looping Dijkstra
|
||||
|
||||
One of the very common misconception about Dijkstra's algorithm is that it loops
|
||||
infinitely when you have negative weights or loops in the graph. Well, if we use
|
||||
our _propelling vortices_, not only we have the negative weights, but also the
|
||||
negative loops. Let's run our code! Our first naïve approach was actually
|
||||
looping:
|
||||
```
|
||||
Normal cost: 1
|
||||
Vortex cost: -1
|
||||
Graph:
|
||||
#############
|
||||
#..#..*.*.**#
|
||||
##***.....**#
|
||||
#..########.#
|
||||
#...###...#.#
|
||||
#..#...##.#.#
|
||||
#..#.*.#..#.#
|
||||
#D...#....#.#
|
||||
########*.*.#
|
||||
#S..........#
|
||||
#############
|
||||
[Finite BF] Cost: -240
|
||||
[Bellman-Ford] Found a negative loop
|
||||
[Bellman-Ford] Cost: -240
|
||||
[Dijkstra] Cost: 14
|
||||
```
|
||||
|
||||
Well, it definitely doesn't loop. How much does `14` make sense is a different
|
||||
matter.
|
||||
|
||||
:::info Variations
|
||||
|
||||
There are multiple variations of the Dijkstra's algorithm. You **can** implement
|
||||
it in such way that with negative weights or loops it loops infinitely, but it
|
||||
can be countered. In our case we keep the track of the vertices that already got
|
||||
a shortest path established via the `visited`, that's how even multiple entries
|
||||
for one vertex in the heap are not an issue.
|
||||
|
||||
:::
|
||||
|
||||
## Summary
|
||||
|
||||
Now we have an algorithm for finding the shortest path that is faster than our
|
||||
original naïve brute-force or Bellman-Ford. However we need to keep in mind its
|
||||
requirement of no negative weights for correct functioning.
|
||||
|
||||
You can also see how we used our thought process of figuring out the worst-case
|
||||
time complexity for the naïve or Bellman-Ford algorithm to improve the original
|
||||
path-finding algorithms.
|
|
@ -1,4 +1,80 @@
|
|||
#ifndef _DIJKSTRA_HPP
|
||||
#define _DIJKSTRA_HPP
|
||||
|
||||
#include <algorithm>
|
||||
#include <cassert>
|
||||
#include <functional>
|
||||
#include <optional>
|
||||
#include <utility>
|
||||
#include <vector>
|
||||
|
||||
#include "graph.hpp"
|
||||
|
||||
namespace {
|
||||
using pqueue_item_t = std::pair<int, vertex_t>;
|
||||
using pqueue_t = std::vector<pqueue_item_t>;
|
||||
|
||||
auto pushq(pqueue_t& q, pqueue_item_t v) -> void {
|
||||
q.push_back(v);
|
||||
std::push_heap(q.begin(), q.end(), std::greater<>{});
|
||||
}
|
||||
|
||||
auto popq(pqueue_t& q) -> std::optional<pqueue_item_t> {
|
||||
if (q.empty()) {
|
||||
return {};
|
||||
}
|
||||
|
||||
std::pop_heap(q.begin(), q.end(), std::greater<>{});
|
||||
pqueue_item_t top = q.back();
|
||||
q.pop_back();
|
||||
|
||||
return std::make_optional(top);
|
||||
}
|
||||
} // namespace
|
||||
|
||||
auto dijkstra(const graph& g, const vertex_t& source)
|
||||
-> std::vector<std::vector<int>> {
|
||||
// make sure that ‹source› exists
|
||||
assert(g.has(source));
|
||||
|
||||
// initialize the distances
|
||||
std::vector<std::vector<int>> distances(
|
||||
g.height(), std::vector(g.width(), graph::unreachable()));
|
||||
|
||||
// initialize the visited
|
||||
std::vector<std::vector<bool>> visited(g.height(),
|
||||
std::vector(g.width(), false));
|
||||
|
||||
// ‹source› destination denotes the beginning where the cost is 0
|
||||
auto [sx, sy] = source;
|
||||
distances[sy][sx] = 0;
|
||||
|
||||
pqueue_t priority_queue{std::make_pair(0, source)};
|
||||
std::optional<pqueue_item_t> item{};
|
||||
while ((item = popq(priority_queue))) {
|
||||
auto [cost, u] = *item;
|
||||
auto [x, y] = u;
|
||||
|
||||
// we have already found the shortest path
|
||||
if (visited[y][x]) {
|
||||
continue;
|
||||
}
|
||||
visited[y][x] = true;
|
||||
|
||||
for (const auto& [dx, dy] : DIRECTIONS) {
|
||||
auto v = std::make_pair(x + dx, y + dy);
|
||||
auto cost = g.cost(u, v);
|
||||
|
||||
// if we can move to the cell and it's better, relax¹ it and update queue
|
||||
if (cost != graph::unreachable() &&
|
||||
distances[y][x] + cost < distances[y + dy][x + dx]) {
|
||||
distances[y + dy][x + dx] = distances[y][x] + cost;
|
||||
pushq(priority_queue, std::make_pair(distances[y + dy][x + dx], v));
|
||||
}
|
||||
}
|
||||
}
|
||||
|
||||
return distances;
|
||||
}
|
||||
|
||||
#endif /* _DIJKSTRA_HPP */
|
||||
|
|
|
@ -37,5 +37,8 @@ auto main() -> int {
|
|||
auto distances = bellman_ford(g, std::make_pair(1, 9));
|
||||
std::cout << "[Bellman-Ford] Cost: " << distances[7][1] << "\n";
|
||||
|
||||
distances = dijkstra(g, std::make_pair(1, 9));
|
||||
std::cout << "[Dijkstra] Cost: " << distances[7][1] << "\n";
|
||||
|
||||
return 0;
|
||||
}
|
||||
|
|
BIN
static/img/algorithms/paths/bf-to-astar/dijkstra-meme.jpg
Normal file
BIN
static/img/algorithms/paths/bf-to-astar/dijkstra-meme.jpg
Normal file
Binary file not shown.
After Width: | Height: | Size: 84 KiB |
Loading…
Reference in a new issue