Alex Dowad Computes

Stuff I Have Learned: Don’t use a coverage-guided fuzzer on an uninstrumented binary

2024-07-12T00:00:00+00:00

(Subtitle: Unless You Really Have To)

Coverage-guided fuzzing tools, such as LLVM’s libFuzzer, run a target program on many random inputs, record the path of control flow each time the target program executes (for example, which branch of each if statement is taken), and mutate the input in an effort to find as many unique control-flow paths as possible. It turns out that this heuristic is incredibly effective at guiding the random search to find interesting test cases.

But it only works if the fuzzer can actually trace the path of control flow through the target program! As I was so forcefully reminded today…

Before I go further, let me explain how coverage-guided fuzzers are able to record the path of program execution. Generally, these tools require the target program to be compiled with special options, which tell the compiler to insert some instrumentation code before every instance of certain machine instructions. For example, instrumentation code might be added before every conditional branch instruction.

For clang, the special option needed is -fsanitize=fuzzer. When you compile a C program with that option, the resulting binary will contain code like:

% objdump --disassemble testprogram

...output elided...
1b6995:       e8 36 79 e9 ff          call   4e2d0 <__sanitizer_cov_trace_const_cmp4>
1b699a:       8b 85 24 f7 ff ff       mov    -0x8dc(%rbp),%eax
1b69a0:       83 f8 00                cmp    $0x0,%eax
...more output elided...

Do you see the call to __sanitizer_cov_trace_const_cmp4? clang -fsanitize=fuzzer inserts the definitions of a couple dozen such functions into your binary, and adds function calls before every instance of an instruction which libFuzzer is interested in. The functions record what libFuzzer needs to know, in a place where libFuzzer can find it.

A while ago, I contributed some new functions to a certain open-source library, and also contributed fuzzers to test them. However, while the test driver programs were compiled with clang -fsanitize=address,fuzzer,undefined, the dynamically-linked library (.so file) with the definitions of the target functions was compiled by GCC, without any instrumentation!

Today, more than a year after the fact, I happened to look at my code and realized what was happening. After I adjusted the Makefile to build the dynamic library with clang -fsanitize=fuzzer-no-link (which is the right option for libraries, as opposed to executables), re-built the library and test drivers, and ran one of them for 10 seconds… it found a bug.

After I fixed that bug and ran the same fuzzer for another 10 seconds… it found another bug.

This cycle repeated eleven times. In each case, after fixing one bug, the fuzzer would find the next one, if not within seconds, then at least within a minute. After fixing all eleven bugs, I ran the fuzzer for several hours without finding any more.

Now, get this: the code in which the coverage-guided fuzzer found eleven bugs had passed a test suite with more than 19,000 unit tests! Further, after writing that library code, I had also fuzzed it for about 20 minutes (not knowing that the dynamic library was uninstrumented).

🤦🏻‍♂️

I sure hope I never pull one like that again!

⸻But why did the fuzzer originally seem to work?

Coverage-guided fuzzers, such as those based on libFuzzer, will not crash or print a warning or anything like that if part of the binary code under test is not instrumented. They just won’t be able to tell which way the path of execution is going in the uninstrumented part. Effectively, your “coverage-guided” fuzzer will degenerate into an unguided fuzzer which just throws random inputs at the code under test. This can make the fuzzer orders of magnitude less likely to find obscure bugs.

That’s why this article is subtitled “Unless You Really Have To”: if you have no way of instrumenting a binary (maybe because you don’t have the source code), but need to fuzz it, there’s nothing to say that you can’t use a coverage-guided fuzzer on it; you will just lose the benefit of coverage guidance.

⸻What kind of project has a test suite with 19,000 test cases??

Well, I was actually implementing standard algorithms for processing Unicode text. The Unicode Consortium publishes lists of test cases for unit testing such implementations. Have a look at the data files published by the Unicode Consortium if you’re curious.

Visualizing the Complex Sine and Cosine

2024-07-04T00:00:00+00:00

Mathematically inclined readers probably know the (real) sine and cosine functions like old friends; but are you just as intimate with the complex sine and cosine? This article will take you on a tour of their key properties, using interactive visualizations. (If you need a refresher on how complex numbers work, you could try this article from BetterExplained first.)

Before we start, what are the “complex sine and cosine” functions? One definition uses the power series for the real sine and cosine:

$sin (x) = x - \frac{x ^{3}}{3 !} + \frac{x ^{5}}{5 !} - \frac{x ^{7}}{7 !} + \dots$

$cos (x) = 1 - \frac{x ^{2}}{2 !} + \frac{x ^{4}}{4 !} - \frac{x ^{6}}{6 !} + \dots$

Take the same power series, and allow the value of $x$ to be a complex number. That's it! Or equivalently, you could take the exponential definition of the sine and cosine, and likewise, allow the input parameter $x$ to take complex values.

So what do we get by letting $x$ be complex? Does anything interesting happen? Do these new, generalized versions of the sine and cosine behave like the “normal” versions? Or do the familiar properties of $sin$ and $cos$ totally break down? Will graphing complex $sin$ and $cos$ result in some pretty pictures?^[1]

We will start by listing the most important properties of the real sine and cosine, then go through the list one by one and see what happens to each property when we move into the complex domain. Here is my list:

$sin (x)$ and $cos (x)$ ...

...are periodic, with period 2π.
...have bounded output values (from -1 up to 1).
...satisfy the Pythagorean Identity $sin^{2} (x) + cos^{2} (x) = 1$ (for all $x$ )
...are respectively odd and even ( $sin$ is odd, $cos$ is even)
...are identical when $cos$ is shifted forward (or $sin$ shifted back) by π/2.
...are closed under differentiation ( $\frac{d}{d x} sin (x) = cos (x)$ , $\frac{d}{d x} cos (x) = - sin (x)$ )

So what about $sin (z)$ and $cos (z)$ ?

1. Are they periodic, with period 2π?

Here are 3D plots of the real and imaginary parts of the complex sine. Rotate the view and look at them from different angles.

Drag to rotate, roll mouse wheel (or pinch) to zoom, Ctrl+drag to pan.

What do you conclude? Does it look periodic to you? (Click or tap to reveal the answer.)

2. Do their values range from -1 up to 1?

This one is obvious when you look at a graph. Here is a modular graph of the complex sine; the height of the graph represents the magnitude ( $Re (z)^{2} + Im (z)^{2}$ , also written as $∣ z ∣$ ) of the complex sine at that point. I've drawn a (white-colored) horizontal plane slicing the graph at $∣ z ∣ = 1$ .

Again, drag to rotate, roll mouse wheel (or pinch) to zoom, Ctrl+drag to pan. Try peeking under the white plane at $∣ z ∣ = 1$ to see the familiar shape of the real sine function.

What do you conclude?

3. Does the Pythagorean Identity still work?

For real numbers, the Pythagorean Identity $sin^{2} (x) + cos^{2} (x) = 1$ tells us that the sine and cosine of an angle can be thought of as the side lengths of a right triangle with unit hypotenuse. Like this:

Drag the arrow around the circle. The length of the red line is the sine, the length of the blue line is the cosine. Note that together with the arrow, they form a right triangle.

Using the interactive graph below, you can try to determine empirically if this is still true for the complex sine and cosine. Use your mouse (or finger on a mobile device) to drag the black point ⬤ $z$ all over the complex plane, and see what happens to ⬤ $sin (z)$ , ⬤ $cos (z)$ , ⬤ $sin^{2} (z)$ , ⬤ $cos^{2} (z)$ , and ⬤ $sin^{2} (z) + cos^{2} (z)$ .

What do you conclude? Does it seem that $sin^{2} (z) + cos^{2} (z) = 1$ holds?

4. Is $cos (z)$ even, and $sin (z)$ odd?

First, we need to think about what “even” and “odd” means in this context. For a real function, “even” means that the function is unchanged by reflecting across the y-axis, and “odd” means the function is negated by reflecting across the y-axis:

sin (x)

is odd

cos (x)

is even

Click (or tap) to flip the functions over like a couple of pancakes

For a complex function, the analogous operation is to reflect each point in the input space across the origin; instead of $f (z)$ , we take $f (- z)$ .

If the complex sine is odd, meaning that $sin (- z) = - sin (z)$ , we should be able to take a vertical slice of its graph in any direction (though the slice must go through the origin), and just like the above graph of the real sine, the path it traces out on the left and right sides of the origin should be mirrored and “upside down” relative to each other.

To check this property empirically, let's actually take some slices and see how they look! The 3D graph below plots the real part of the complex sine; move the slider to rotate the graph and take a slice at a different angle.

What do you conclude?

5. Are they identical after a π/2 phase shift?

Here is an overlay of the magnitudes of the complex sine and cosine; move the slider to shift the cosine along the real axis, and try to see at what shift (if any) the graphs coincide exactly. To make things easier to see, I'll only graph on one side of the real axis. Since the magnitudes of the two functions are symmetrical across the real axis, nothing is lost by only showing one side.

And the same for the angles of the complex sine and cosine:

What do you conclude?

6. Do their derivatives still work in the same way?

Algebraically, it's pretty obvious that we still have $\frac{d}{d z} sin (z) = cos (z)$ , $\frac{d}{d z} cos (z) = - sin (z)$ . Just look at the power series definitions (or the exponential definitions) of the two functions. Since the rules for differentiating polynomials and exponentials are the same for real and complex variables, the result follows easily from either set of definitions.

However, I still want you to see this point visually. But what does the graph of a derivative-antiderivative pair look like? Below you can see a few examples of real functions and their derivatives. Click on the pink buttons one by one to highlight some basic features of derivative-antiderivative graphs:

f (x)

f^{'} (x)

The same visual features apply (roughly) to the magnitude of a complex derivative. A visualization will follow soon, but to appreciate it, you need to know a bit more about complex derivatives.

Real derivatives tell you the best linear approximation to a real function near a point. In symbols: $f (x + Δ x) \approx f (x) + f^{'} (x) Δ x$ . The same relationship applies to complex derivatives, but now $f^{'} (z)$ and $Δ z$ are both complex. $f^{'} (z)$ is a complex number which multiplies small deviations $Δ z$ from the input point $z$ , giving a linear approximation to $f (z + Δ z)$ .

But remember, complex multiplication includes both scaling and rotation. So unlike a real derivative, which just scales small deviations from a given input point, a complex derivative can both scale and rotate small deviations from an input point to approximate the corresponding output point.

Here is a little demonstration to build your intuition for how complex derivatives work. The black point ⬤ $z$ , below, represents an input to a complex function. Imagine that our view is zoomed in extremely close, so we are looking at just a tiny piece of the complex plane. The red point ⬤ $f (z)$ is the output of the same function. (For this demonstration, we don't care what the function $f$ actually is; we are just interested in how its value changes for small changes of the input parameter.) Use your mouse cursor (or finger) to move the vector ⬤ $Δ z$ around and watch where ⬤ $f (z + Δ z)$ goes.

First, let's say that $f^{'} (z) = 2$ (pure real), at least at this particular point $z$ :

f^{'} (z) = 2

Now imagine our input point $z$ has moved to some other place on the complex plane. Let's say that here, $f^{'} (z) = i$ (pure imaginary):

f^{'} (z) = i

Finally, see what would happen if $z$ moved to another location on the complex plane where $f^{'} (z)$ is neither pure real nor pure imaginary:

f^{'} (z) = - 0.5 + 1.5 i

After playing with those demonstrations, can you summarize how the magnitude and angle of a complex derivative control how the value of a complex function changes?

So then, let's look at the complex sine and cosine again and see if we can identify visual features analogous to those highlighted in the above real derivative graphs. Move the input point ⬤ $z$ around the below grid, and the corresponding point on the sine and cosine plots will be highlighted. Try points where the magnitude of the cosine is large, those where it's small, and those where it's zero. Also, try points where the magnitude of the sine is large, small, or zero:

|sin(z)|

arg(sin(z))

|cos(z)|

arg(cos(z))

Can you see that both the magnitude and angle of the sine change fastest around points where the magnitude of the cosine is large? And likewise, the magnitude and angle of the cosine change fastest around points where the magnitude of the sine is large?

Digression: Most Complex Functions Don't Have a Derivative

If you have studied basic calculus, you may know several reasons why a real function might not have a derivative (at least at certain points):

The function might be undefined at some points.
The function might be discontinous.
Even if continuous, the function's graph might have sharp corners.

A complex function can fail to have a derivative for the same reasons, but there is another big reason why many, many complex functions do not have a derivative. Think about this: As was shown by the above interactive demonstration of how a complex derivative works, a complex derivative tells you how to scale and/or rotate small deviations from an input point to get the corresponding deviation from its output point. (In symbols: $f (z + Δ z) \approx f (z) + f^{'} (z) Δ z$ .) The demonstration showed that the rate of scaling and/or rotation caused by $f^{'} (z)$ does not depend on the direction of $Δ z$ ; that's how complex multiplication works!

Example: if $f^{'} (z) = i$ , then $Δ z$ will always be rotated by 90 degrees.

But now think about this: there are many, many functions (i.e. ways to map input points to output points) where $f (z + Δ z)$ depends very much on the direction of $Δ z$ . Here's a simple example:

⬤ is

z

, ⬤ is

f (z) = Re (z)

Drag the input point ⬤

z

around and observe how the output ⬤

f (z) = Re (z)

changes.

Try moving ⬤ $z$ around, and it will quickly become obvious that the direction which ⬤ $f (z)$ moves is not simply a rotated and/or scaled version of $Δ z$ ; while $z$ can move in any direction at all, $f (z)$ only moves to the left or right, along the real axis. Since neither the angle between $Δ z$ and $Δ f (z)$ , nor the ratio of their magnitudes, is constant for a given position of $z$ , the relationship between $Δ z$ and $Δ f (z)$ cannot be expressed as a simple complex multiplication by some number $f^{'} (z)$ ... but that's precisely what a “complex derivative” means! And there we have it: $f (z) = Re (z)$ does not have a derivative.

If you ever hear about “analytic functions”, that simply means “functions which have a complex derivative”. Like $sin (z)$ and $cos (z)$ !

Outro

I wanted to show that other trigonometric identities for the sine and cosine still work in the complex case, but couldn't come up with good visualizations. In particular, I found it very hard to “see” $sin (z + w) = sin (z) cos (w) + cos (z) sin (w)$ in the graphs of the complex sine and cosine. Can you see it? If so, please share!

In any case, many trig identities follow directly from the exponential definitions of sine and cosine. Since exponentials have the same basic properties regardless of whether the parameter is real or complex, such trig identities naturally work either way.

One last little fact: If you slice the graph of $sin (z)$ straight along the imaginary axis (going through the origin), you get the graph of $sinh (x)$ , the hyperbolic sine. It's the same for $cos (z)$ ; along the imaginary axis, it gives the graph of the hyperbolic cosine. If this intrigues you, have another look at the 3D graphs above and see if you can pick out the contours of the hyperbolic trig functions.

[1] Answers: “A lot”, “Yes”, “For the most part”, “Not really”, and “You can be the judge of that”. ⏎

A Toy Runge-Kutta Differential Equation Solver

2023-08-22T00:00:00+00:00

This post presents a simple, interactive differential equation solution graphing tool based on the classic Runge-Kutta method (which is really an algorithm).

It graphs solutions to one 1^st-order equation, one 2^nd-order equation, or a system of two 1^st-order equations. The right-hand side of the equation(s) must be entered using syntax similar to expressions in the C, Java, or JavaScript programming languages (syntax help). Since the solution of a differential equation depends on starting conditions, you can set the range of starting conditions which should be graphed and the “time” value at which the starting conditions apply. The tool will draw one line for each set of starting conditions.

Hover over a solution line to see what starting conditions it is based on. Roll your mouse wheel to zoom in and out; hold down the middle mouse button to pan. On mobile, use one finger to pan and pinch with two fingers to zoom.

Click these buttons to see a variety of samples:

Try playing with the parameters for some of the samples; you can get pretty wild pictures!

Graph solutions for:

y'' =

Start time: End time: Time step: t₀: Min y(0): Max y(0): # values to graph: Min y'(0): Max y'(0): # values to graph:

Classic Runge-Kutta, also known as “RK4”, generates four different estimates of the rate of change of each variable at each time step, and takes a weighted sum of those four estimates as a final estimate which is (usually) more accurate than any of the four. Then, we use those estimated derivatives to adjust the values of each variable, bump “time” forward by one step, and repeat until we reach the ending time of the simulation. Here is a simple implementation of RK4 for a system with just one dependent variable:

// Trace out evolution of our system using classic Runge-Kutta (AKA "RK4")
// Store results in a packed array of floats
function rk4trace(y, t, Δt, fn, array, i, Δi, limit) {
  while (i !== limit) {
    const half_Δt = Δt / 2.0;
    const next_t = t + Δt;
    const half_t = t + half_Δt;

    const k_1 = fn(t, y); // Slope at starting point
    const k_2 = fn(half_t, y + (half_Δt * k_1)); // Estimated slope at mid-point
    const k_3 = fn(half_t, y + (half_Δt * k_2)); // Another estimate of slope at mid-point
    const k_4 = fn(next_t, y + (Δt * k_3)); // Estimated slope at endpoint

    const slope = (k_1 + 2*k_2 + 2*k_3 + k_4) / 6.0; // Weighted average of those four slopes

    y += Δt * slope;
    array[i] = y;
    t += Δt;
    i += Δi;
  }
}

// Apply RK4 to find phase lines for a system with one dependent variable
function rk4solve(y_0, t_0, t_start, t_end, Δt, fn) {
  const timeSteps = Math.floor(((t_end - t_start) / Δt) + 1);
  // Packed array of variable values at each time step:
  const array = new Float64Array(timeSteps);

  let t = t_0, y = y_0, i = Math.floor((t_0 - t_start) / Δt);
  array[i] = y_0;

  // Trace out phase line from starting point
  rk4trace(y_0, t_0, Δt, fn, array, i+1, 1, array.length);

  // Trace out phase line in the opposite direction from the starting point
  rk4trace(y_0, t_0, -Δt, fn, array, i-1, -1, -1);

  return array;
}

The above implementation stores the values of the dependent variable in a Float64Array instead of a regular JavaScript Array; this is for speed and memory efficiency. It doesn't store the time value for each entry in the array, since that can be easily rederived from t_start, t_end, and Δt.

Expression Syntax Help for this Tool

Variables	`t` `y` `y'` (for 2^nd-order equations) `z` (for systems of two 1^st-order equations) `x` (alternative name for `t`)
Numbers	`1`, `2`, `-1`, etc `1.1234`... `e` (Euler's number, 2.71828...) `pi` or `π` (3.14159...) If you want any other mathematical constants, drop the author a line.
Arithmetic	`expression + expression` Other binary operators are `-`, ``, `/`, and `^` or `*` for exponentiation `-expression` for negation
Parentheses	`(expression)` Use parentheses to ensure expressions are grouped in the way you want
Functions	`sqrt(expression)` `sin(expression)` (for trig functions, the input parameter is in radians) `cos(expression)` `tan(expression)` `arcsin(expression)` (inverse trig functions) `arccos(expression)` `arctan(expression)` `ln(expression)` or `log(expression)` (natural logarithm) `log10(expression)` `log2(expression)` `sign(expression)` or `sgn(expression)` (0 for zero, 1 for positive numbers, -1 for negative numbers) If you want any other mathematical functions, drop the author a line.

Special thanks to Gilbert Strang for his text “Differential Equations and Linear Algebra”, which inspired me to make this tool.

Visualizing Nelder-Mead Optimization

2022-06-13T00:00:00+00:00

Recently, I ran across a fantastic article, “Why Train When You Can Optimize?”, which introduced me to the Nelder-Mead optimization algorithm. It's a lovely algorithm, and I couldn't wait to create an interactive version. First, though: what does “optimization” mean in this context?

An “optimization” algorithm takes some mathematical function as its input, and tries to find values for the parameters which make the output either as large or as small as possible. If you are like many computer programmers, your first impression might be that you are unlikely to ever use such an algorithm in your own programs. But optimization is a much more general and useful technique than it might seem. The article mentioned above gives a great example: a drawing program which detects when the user is trying to draw a straight line and replaces their jittery line with a perfectly straight one. (If you know other examples of good uses for optimization outside science and engineering, please let me know!)

Obviously, finding inputs for some function which give you the largest or smallest output value can be done without a special algorithm. You could just use brute force: test many inputs and pick the best one. But if the space of possible inputs is large, that could be too slow.

Optimization algorithms typically avoid exhaustively searching the input space by starting at some arbitrary point, then repeatedly searching for a nearby point which is better, until it hits a maximum or minimum and can't find any better point.

Many such algorithms require that you know how to calculate the derivative (slope) of the function at any given point; but Nelder-Mead doesn't need any derivatives, and combined with its general simplicity, this makes it easy to apply.

Now let me show you how Nelder-Mead works. Rather than starting with one test point and iteratively improving it, Nelder-Mead starts with $N + 1$ test points, when the input space has $N$ dimensions. (Or, in other words, when there are $N$ different input variables whose values need to be found.) For example, if your function has two parameters, there will be 3 starting points, which will form a triangle in the 2-dimensional plane of possible inputs:

(From here on, all examples will be 2-dimensional, but the algorithm generalizes naturally to any number of dimensions. Further, we will assume that we are searching for a minimum rather than a maximum.^[1])

Nelder-Mead repeatedly transforms the triangle of test points, replacing the worst point with a better one. This causes the triangle to move across the plane in whichever direction the function's value is dropping, and then contract around a local minimum when it finds one. When the triangle becomes small enough, then the algorithm terminates. Like this:

At each iteration, Nelder-Mead will apply one of four possible transformations to the triangle. Let's see them one by one (try dragging around the points or adjusting the coefficient values if you like):

Reflect. Move the worst point through the middle of the other two.

Expand. Like Reflect, but it moves further.

Contract. Move the worst point towards the middle of the other two. Depending on the situation, Contract can either stop short of the opposite side, or move slightly past it. We will call these variants “Contract Inside” and “Contract Outside”.

Shrink. Shrink the whole triangle towards the best point, maintaining its angles.

The magnitude of each transformation can be tuned by adjusting a coefficient. As shown in the above visualizations, the default coefficient values are 1.0, 2.0, 0.5, and 0.5.

To help get a feel for how these transformations can be used to explore the input space and find a minimum, let's play a game. The below square represents the space of input values for a function $f (x, y)$ . Just as Nelder-Mead only computes the function value at the corners of its triangle, I will only show you its value at those three points. Click on any three points to start, then click any of the five transformation buttons to transform your triangle. Once you contract the triangle to a sufficiently small size (or reach 50 iterations), I'll reveal the contours of the graph. You “win” only if your best point is close enough to a minimum point.

Click three points to start

This game is quite difficult; don't be surprised if you hardly ever “win”. The Nelder-Mead algorithm doesn't always reach a minimum point, either; or at least, not in a reasonable number of iterations. Sometimes it gets close to a minimum point... and then moves very, very slowly towards it.

For that reason, when implementing Nelder-Mead, you need to limit the number of iterations so it doesn't run for too long. Rather than running Nelder-Mead for a huge number of iterations, you will probably get better results by restarting it several times, with different starting points, and then picking the best overall solution found.^[2]

You may be wondering why the algorithm works the way it does. Here are some interesting questions to think about (click to reveal possible answers):

Why does the algorithm use a triangle rather than a single test point?

Why does the size of the triangle change? Why not use a fixed-size triangle which flips, rotates, and slides around the plane?

The final piece of the algorithm, which I haven't described yet, is how it chooses which transformation to use on each iteration. Here is the procedure:

Find the reflection point (the point which Reflect would move the worst point to) and compute the function's value there.
If the reflection point is better than the second-best point, but not better than the best point, then do Reflect.
Otherwise, if the reflection point is better than the best point, then check the expansion point (the one which Expand would move to). If the expansion point is better than the reflection point, do Expand. If not, do Reflect.
Otherwise, if the reflection point is worse than the second-best point but not worse than the worst point, check the outside contraction point. If it's better than the worst point, do Contract Outside. If not, do Shrink.
Finally, if the reflection point is worse than the worst point, check the inside contraction point. If it's better than the worst point, do Contract Inside. If not, do Shrink.

Does that seem to make sense? Perhaps these comments might make it more understandable:

While we expect that our function's graph has some kind of curved surface, Nelder-Mead can't “see” the curve; it only knows the value at its three test points (as you experienced when playing the above game). With only three numbers to work with, the best guess Nelder-Mead can make is that it should move the worst point in the direction of the better two. And in the absence of other information, a reasonable default is to move the worst point just far enough to maintain the size and shape of the triangle (that's what Reflect does). If the default was to move it more or less than that, then the triangle would tend to grow and grow or shrink and shrink, even when there was no reason to do so.

However, that guess isn't always right. Even if the triangle is sitting on a slope, it is possible that Reflect might overshoot the base of the slope and start going up the opposite slope. If the reflection point is worse than the existing points, that indicates that we are going too far and need to back off, perhaps by using Contract Inside or Contract Outside.

On the other hand, if the reflection point is better than all the existing points, that strongly suggests that the triangle really is on a slope and that Reflect really is moving in the right direction. In that case, we can try to go even further in the same direction with Expand. This not only moves the triangle further in a good direction, it also enlarges the triangle, which means the following steps will be bigger. In effect, as long as Nelder-Mead keeps picking good directions and each successive point is better and better than the previous ones, it will “accelerate downhill”. That helps the algorithm to move more quickly towards a minimum and converge in a smaller number of iterations.

As for Shrink, the original paper on the Nelder-Mead algorithm explained that Shrink is necessary for the algorithm to avoid getting stuck in some (rare) situations. One example is below. Try various combinations of Refresh, Expand, Contract Inside, and Contract Outside to see if you can get the triangle to close in on the minimum point (which is marked in red). Then try Shrink and see how it helps.

This last visualization will show all the points which Nelder-Mead considers on each iteration, and why it chooses the move which it does. Click the 'Next' button to move forward. Reflection points will be shown in ⬤ blue, expansion points in ⬤ orange, contraction points in ⬤ pink, and shrink points in ⬤ green.

For a sample implementation of Nelder-Mead optimization in JavaScript, see the latter part of Justin Meiners' math.js.

Special thanks to Justin Meiners for the article which inspired this post, to Peter Collingridge for his helpful post on making SVG elements draggable, to Mike Bostock for D3.js, to John Nelder and Peter Mead for their lovely algorithm, and... to you for reading all the way to the end!

[1] The version of the algorithm which searches for minimum points is all we need anyways, since if we want to find a maximum point, we can just search for a minimum of the negated function $g (x) = - f (x)$ instead. ⏎

[2] In this post by Enrico Schumann, the performance of a single run of Nelder-Mead with N iterations is compared empirically with 4 runs of N/4 iterations each, over varying values of N. The version which restarts 4 times completely dominates the one which doesn't restart. ⏎

[3] To be precise, the right term is “gradient”. ⏎

JPEG Series, Part II: Huffman Coding

2021-05-16T00:00:00+00:00

The previous article in this series explored how JPEG compression converts pixel values to DCT coefficients. A later stage of the compression process uses either a method called "Huffman coding" or another called "arithmetic coding" to store those coefficients in a compact manner. The Huffman coding algorithm is very simple, but powerful and widely used. If you've never learned how it works, I promise this will be interesting.

You have some data to compress. Your data can be viewed as a sequence of values; perhaps Unicode codepoints, pixel color samples, audio amplitude samples, or something comparable. In the context of Huffman coding, each of these values is called a "symbol".

We are going to encode each symbol using a unique, variable-length string of bits. For example, if each symbol is a letter, the letter "a" could be "10111", "b" could be "00011", and so on. We can pick any series of bits we like to represent each symbol, with the restriction that after all the symbols are converted to bitstrings, and all those little bitstrings are smashed together, it must be possible to figure out what the original sequence of symbols was.

That means the bitstrings for each symbol must be prefix-free; none of them can be a prefix of another. Example of a non-prefix-free code: say we choose "1100" to represent the letter "a", "11" to represent "b", and "00" for "c". When decoding text, we find "1100" somewhere. How are we supposed to know whether that was originally a letter "a", or the two letters "bc"? It's impossible to tell, precisely because "11" is a prefix of "1100". Fortunately, Huffman codes are always prefix-free.

Let's encode this sentence with such a code. Click the button below, and the computer will generate a random prefix-free code. Try clicking a number of times and see what the smallest total number of bits required to encode the sentence appears to be.

Generate Random Code

Obviously, the number of bits required to encode a sequence can vary wildly depending on the chosen code. Can you see why some prefix-free codes are more efficient than others? What is the key difference between an efficient code and an inefficient one? (Click to reveal.)

Out of the vast number of prefix-free codes which could be used, we want to find an optimal one; one which will encode our particular data in the smallest number of bits. (There will always be many optimal codes for any sequence, but we just need to find one of them.) At first, it might appear that we need to try millions of possible codes to be sure that we have an optimal one. Fortunately, that is not the case. Just count how many times each symbol appears in the input data, and in an almost trivially simple way, you can find an optimal prefix-free code. It will be easy, and fun!

I could tell you the algorithm right now, but it will be so much more enjoyable to discover it yourself. So I'll take this slow, and reason towards a solution step by deliberate step. If at any point you catch the scent of the solution, stop and think it out before continuing.

The first step is to represent a prefix-free code as a binary tree. Have a look:

Generate Random Code

Please make sure that you clearly see the correspondence between coding tables and binary trees before continuing.

We can see that the number of bits used to encode each symbol equals the number of links between its tree node and the root node, also called the "depth" of the node. Leaf nodes which are closer to the root (smaller depth) have shorter bitstrings.

We will add a weight to each leaf node, which is simply the number of times its symbol appears in the input data:

Generate Random Code

Now the total length in bits of the compressed output will equal weight times depth, summed over all the leaf nodes.

So now our goal is to find a binary tree which minimizes the sum of weight times depth. We don't really have an idea how to do that, though. At least we do know what the leaf nodes of the tree should be:

How are we going to find the right structure for the internal nodes? Well, we could try to do it top-down, meaning we figure out what child nodes the root node should have, then the nodes below those, and so on. Or we could work bottom-up, meaning we figure out which leaf nodes should become children of the same parent node, then find which parent nodes should be children of the same "grandparent" node, until the whole tree is joined together. A third option would be to work both up and down from the middle, but that is just as hopeless as it sounds. These animations may help you understand "top-down" and "bottom-up" tree construction:

Bottom-up

Top-down

Generate Random Examples

To build an optimal tree top-down, we would need a way to partition the symbols into two subsets, such that the total weights of each subset are as close as possible to 50%-50%. That might be tricky. On the other hand, if we can come up a simple criterion to identify two leaf nodes which should be siblings in the tree, we might be able to apply the same criterion repeatedly to build an optimal tree bottom-up. That sounds more promising.

Before we consider that further, take note of an important fact. How many internal nodes, including the root, does it take to connect N leaf nodes together into a binary tree? Watch the above animations again and try to figure it out:

Good. Another one: When building a tree bottom-up, every time we pick two subtrees and join them together as children of a new internal node, what happens to the depth of all the leaves in the combined subtree?

Remember that the depth of each leaf node equals the number of bits required to encode the corresponding symbol. So every time we join two subtrees, we are in a sense "lengthening" the bitstrings for all the symbols in the new subtree. Since we want the most common symbols to have the shortest bitstrings (equivalent: we want their nodes to be closest to the root), they should be the last ones to be joined into the tree.

With that in mind, can you now see what the first step in building an optimal tree bottom-up should be?

Yes! Just like this:

Just another small conceptual leap, and the complete solution will be ours. Here's what we need to figure out: Just now, we took the two lowest-weighted leaf nodes and joined them together. But how should we "weight" the resulting subtree? How will we know when and where to join it into a bigger subtree? More concretely: for the second step, we could either take our new 3-node subtree, and use it as one child of a new 5-node subtree, or we could pick two of the remaining single nodes, and join them into another 3-node subtree. How do we decide which choice is better?

Think about it this way. When we attach a single node into a subtree, the bitstring representation for its symbol is being "lengthened" by one bit. In a sense, it's like the total bit length of the final encoded message is being increased by the weight of the node.

When we attach a subtree into a bigger subtree, the same thing happens to all the leaf nodes in the subtree. All of their bitstrings are growing by one bit, so the final encoded message size is growing by the sum of their weights.

That was a giveaway if there ever was one. So answer now, how should we weight subtrees which contain multiple leaf nodes?

And then what is our algorithm for building an optimal tree bottom-up?

Type some text in the below entry field, and I'll animate the process for you:

Yes, that tree represents an optimal prefix-free code!

That can't be hard to code, can it? (It's not.) One thing, though: Since the symbol set might be large, we need a data structure which allows quick retrieval of the two lowest-weighted subtrees at each step. A minheap fits the bill perfectly. Here's an minimal implementation using JavaScript Arrays:

class Minheap {
  /* `comparator` must return true if first argument is 'larger' than second */
  constructor(comparator) {
    this.heap = [];
    this.compare = comparator;
  }

  get length() {
    return this.heap.length;
  }

  insert(item) {
    let index = this.heap.length;

    while (index > 0) {
      const parentIndex = ((index + 1) >>> 1) - 1;
      if (this.compare(item, this.heap[parentIndex]))
        break;
      this.heap[index] = this.heap[parentIndex];
      index = parentIndex;
    }

    this.heap[index] = item;
  }

  /* Remove and return the smallest item in the heap */
  pop() {
    const result = this.heap[0], item = this.heap.pop();

    /* If the heap is not empty, move items upward to restore the heap property,
     * until we find an appropriate place to put `item` */
    if (this.heap.length) {
      let index = 0;
      while (true) {
        const leftIndex = (index << 1) + 1, rightIndex = leftIndex + 1;
        let childIndex = leftIndex;

        if (rightIndex < this.heap.length) {
          if (this.compare(this.heap[leftIndex], this.heap[rightIndex]))
            childIndex = rightIndex;
        } else if (leftIndex >= this.heap.length) {
          break;
        }

        if (this.compare(item, this.heap[childIndex])) {
          this.heap[index] = this.heap[childIndex];
          index = childIndex;
        } else {
          break;
        }
      }
      this.heap[index] = item;
    }

    return result;
  }
}

It would be fun to animate the minheap operations and show you how they work, but that would have to be a different article.

The rest of the code to build Huffman trees is almost anticlimactic:

/* Count how many times each character appears in a string */
function histogram(string) {
  const histogram = new Map();
  for (const char of string)
    histogram.set(char, (histogram.get(char) || 0) + 1);
  return histogram;
}

function symbols(histogram) {
  return Array.from(histogram).map(([char, count]) => ({ value: char, weight: count }));
}

function huffmanTree(symbols) {
  const heap = new Minheap((a,b) => a.weight > b.weight);
  for (const symbol of symbols)
    heap.insert(symbol);

  while (heap.length > 1) {
    const a = heap.pop(), b = heap.pop();
    heap.insert({ value: [a, b], weight: a.weight + b.weight });
  }

  return heap.pop();
}

Modifying the Basic Algorithm for JPEG

The Huffman codes generated above have two important differences from those used to compress pixel data in JPEG files.

Difference #1: JPEG Huffman tables never use bitstrings which are composed of only 1's. "111" is out. "1111" is forbidden. And you can just forget about "111111".

BUT WHY? Because while sections of Huffman-coded data in a JPEG file must always occupy a whole number of 8-bit bytes, all those variable-length bitstrings will not necessarily add up to a multiple of 8 bits. If there are some extra bits left to fill in the last byte, "1" bits are used as padding. If bitstrings composed of only 1's were used, the padding in the last byte could be mistakenly decoded as an extraneous trailing symbol. By avoiding such bitstrings, it is always possible to recognize the padding.

How can we modify our algorithm to account for that? Can you think of an idea?

That just takes a few more lines of code:

@@ -7,7 +7,9 @@
 }
 
 function symbols(histogram) {
-  return Array.from(histogram).map(([char, count]) => ({ value: char, weight: count }));
+  const sym = Array.from(histogram).map(([char, count]) => ({ value: char, weight: count }));
+  sym.push({ value: "🃏", weight: 0, dummy: true });
+  return sym;
 }
 
 function huffmanTree(symbols) {
@@ -16,8 +18,13 @@
     heap.insert(symbol);
 
   while (heap.length > 1) {
-    const a = heap.pop(), b = heap.pop();
-    heap.insert({ value: [a, b], weight: a.weight + b.weight });
+    let a = heap.pop(), b = heap.pop();
+    if (a.dummy) {
+      /* Dummy must always be on the right-hand side */
+      let temp = a; a = b; b = temp;
+    }
+    const parent = { value: [a, b], weight: a.weight + b.weight, dummy: a.dummy || b.dummy };
+    heap.insert(parent);
   }
 
   return heap.pop();

This is optimal tree construction with a dummy node:

Difference #2: JPEG Huffman codes are always canonical.

In a canonical Huffman code, when the bitstrings are read as binary numbers, shorter bitstrings are always smaller numbers. For example, such a code could not use both "000" and "10", since the former bitstring is longer, but is a smaller binary number. Further, when all the bitstrings used in the code are sorted by their numeric value, each successive bitstring increments by the smallest amount possible while remaining prefix-free. Here's an example, courtesy of Wikipedia:

110

111

Interpreted as numbers, those are zero, two, six, and seven. Why wasn't the second bitstring "01", or one? Because then the first would have been its prefix. Likewise, if the third was "011" (three), "100" (four), or "101" (five), in each case one of the first two would have been a prefix. For the fourth, incrementing by one to "111" didn't create a prefix, so "111" it is. (Hopefully that example gives you the idea; hit me up if you need more!)

But WHY does JPEG use canonical codes? Because their coding tables can be represented in a very compact way^[1], which makes our JPEG files smaller and faster to decode. (Yes, JPEG files must contain not just Huffman-encoded pixel data but also the coding tables which were used.)

So given a symbol set and frequencies, how can we generate a canonical Huffman code? Unfortunately, there is no straightforward way to do it directly by building a binary tree. But we can use our existing method to generate a non-canonical (but optimal) code, and then rewrite the bitstrings to make them canonical while maintaining their length. Remember, it's the length of the bitstrings assigned to each symbol which makes a prefix-free code optimal. The exact bitstrings which are used don't matter; we can shuffle them around and assign different ones with the same length.

The algorithm suggested in the JPEG specification (Appendix K) gets a step ahead of the game by not explicitly building a binary tree with left and right child pointers. It just tracks what the depth of each leaf node would have been had they actually been built into a binary tree. So these depths can be incremented whenever two "subtrees" are "joined together", the leaf nodes for each subtree are kept on a linked list. "Subtrees" are "joined" by concatenating their linked lists. (Libjpeg uses this trick when saving a Huffman-encoded JPEG file.^[2])

Regardless of whether you actually build a binary tree or use the trick from Appendix K, once you know what the lengths of all the bitstrings in an optimal code should be, generating a canonical code is as simple as this:

/* `lengths` is a sorted array of bitstring lengths required for an optimal code
 *
 * In real applications, an array of counts would likely be passed: how many
 * bitstrings must have 1 bit, how many 2 bits, how many 3 bits, and so on
 *
 * Also, in real applications, the returned values would almost certainly
 * not be strings; integers would be more likely */
function makeCanonical(lengths) {
  let result = [], nextCode = 0;
  for (var i = 0; i < lengths.length; i++) {
    if (i > 0 && lengths[i] !== lengths[i-1])
      nextCode <<= 1;
    result.push(nextCode.toString(2).padStart(lengths[i], '0'));
    nextCode++;
  }
  return result;
}

Here is an example. Note that we are not using a dummy, so bitstrings with all 1 bits may be included.

Random Code	Sorted by Bitstring Length	Canonicalized

Generate Random Code

Huffman Coding in Practice

All through this article, ASCII characters have been used as Huffman symbols. But in reality, if you want to compress English text, Huffman coding with each character treated as a separate symbol would be a terrible way to do it. Note two big weaknesses with that approach:

Huffman coding is oblivious to patterns which involve the order of symbols. It only cares about their frequency. But real-life data usually has patterns related to the order of values, which can be exploited to achieve better compression.
Huffman coding always uses at least one bit for each symbol, and usually much more. So in the "ideal" case of a text file which just contains a single ASCII character repeated thousands of times, Huffman coding with one symbol per letter could only compress it to ⅛ of its original size. 8× compression may sound good, but any reasonable compression method should get far greater gains in that ridiculously easy-to-compress case.

So just what am I saying here? Is Huffman coding a bad algorithm?

Not at all! But it is just one piece of a practical compression method; it's not a complete compression method by itself. And to make Huffman coding work to greatest advantage, it may be necessary to find an alternative data representation which is well-suited to such coding. Just taking the most "natural" or intuitive representation and directly applying Huffman coding to it will probably not work well.

As an example, in JPEG, the values which we want to compress are quantized DCT coefficients (see the previous post for details), which have 8 bits of precision each.^[3] We could take the 256 possible coefficient values as 256 Huffman symbols and Huffman-code them directly, but this would be very suboptimal.

In the symbol set which is actually used, each symbol represents either:

Some specific number of successive zero coefficients (0-15 of them), and the number of significant bits in the following non-zero coefficient.
A run of zeroes filling the remainder of a 64-coefficient block.

Note that each symbol only tells us the number of significant bits in the next non-zero coefficient, not what those bits actually are. The actual coefficient value bits are simply inserted into the output data stream uncompressed. This is because the values of non-zero DCT coefficients don't actually repeat very much, so Huffman-coding them wouldn't really help. (See the demonstration in the previous post. Does it look like the coefficients within an 8-by-8 DCT matrix repeat much?) However, since the Huffman symbols tell us the number of significant bits, high-order zero bits can be discarded, which does help significantly.

JPEG files can use "arithmetic coding" as an alternative to Huffman coding (although this is not common). I dare say arithmetic coding is a more intriguing and fascinating algorithm than Huffman coding. So it will not surprise you that the next article in this series will focus on arithmetic coding. See you then!

[1] With a canonical code, only the number of bitstrings used of each possible length needs to be stored; how many are 1 bit long, how many 2 bits long, how many 3 bits long, and so on. The actual bitstrings can be quickly recovered from that. ⏎

[2] But interestingly, libjpeg does not use a minheap when generating a Huffman code. Instead, it uses an array of symbol frequencies, and scans the whole array at each step to find the two lowest-weighted subtrees. ⏎

[3] The JPEG standard actually allows DCT coefficients to be either 8-bit or 12-bit, but 8 bits is almost universally used. Libjpeg can theoretically handle JPEG files with 12-bit coefficients, but it must be specially configured to do so at compile time, and binary distributions are not generally built in that way. ⏎

JPEG Series, Part I: Visualizing the Inverse Discrete Cosine Transform

2021-04-18T00:00:00+00:00

A key step in JPEG image compression is converting 8-by-8-pixel blocks of color values into the frequency domain, so instead of storing color values, we store amplitudes of sinusoidal waveforms. This is a fun little bit of applied math, and you might enjoy seeing how it works.

It all really started in the early 1820's, when Joseph Fourier figured out that any periodic waveform can be broken down into a sum of sinusoids. Kalid Azad has explained this much better than I could over at BetterExplained, and if you are not familiar with the Fourier transform, I recommend you go learn about it from Kalid first. I'll be waiting right here.

Welcome back! Let's apply what you learned to a block of pixel values. How about this block of pixels right here?

Take a horizontal slice, 8 pixels wide, from that block. Take the position of each pixel as an x value and its brightness as a y value. Then the 8 pixels correspond to 8 (x, y) points on a plane, and we could find a combination of sinusoidal waves that would go through those 8 points.

We could, and we will. Let's do it now. Click on any row of the pixel grid below:

The heavy, black waveform is the sum of all the colored sinusoids. The height of the 8 black dots represent the brightness values of the 8 pixels in the selected row. Try comparing several different rows to see that in each case, the height of the dots matches the brightness of the pixels.

You might notice that for darker pixel values, the dots appear below the 'zero line', while for brighter pixels, they appear above it. This is because we subtracted 128 from each brightness value before converting to a waveform, so the range of possible values (0-255) would be centered on the zero line. This is also done when an image is stored in JPEG format.

You might have also noticed that one of the component waves doesn't look like a sinusoid; it's the red one. It is just a flat line. That is the zero frequency component; it represents the average of the 8 values. It shifts the black waveform up or down to just the right height for it to hit all 8 target points.

The legend displays the frequency and amplitude (on a scale of zero to one) of each component wave. Try clicking on the color swatches in the legend to see the component waves more clearly.

Of course, we could do exactly the same with columns of 8 pixels:

Now, you have seen that each row or column of pixel values in this "coffee cup" icon can be converted to a sum of sinusoidal waves. But could that just be a fluke? Can we really do this with any sequence of eight brightness values?

If the answer is obvious, just humour me here. Go back and try dragging any of the black points up or down. The pixel colors and waveform will update as you drag.

Looks cool, doesn't it?

Now... we need to talk.

I have misled you here. The link to BetterExplained above probably tricked you into thinking that these waveforms were derived using a Fourier transform. Not so. This page is about the Discrete Cosine Transform, not the Fourier transform. But it was good for you to understand the idea of the Fourier transform before learning about the DCT.

If you take some time to play with both the Fourier transform demonstration on BetterExplained and the DCT demonstration here, you might recognize some differences between these transforms. Even when given the same input, they produce a different series of component waves. (That's an interesting point; the same sequence of discrete time samples can be broken down into sinusoids in more than one way.)

Do you want to go back and give it a try? Either way, whenever you are ready, click to reveal two major differences:

Other differences which you can't see from the demonstrations are:

While the computation of the Fourier transform uses complex numbers, the DCT only involves real numbers.
The DCT is easier to compute. It's just a simple nested loop which evaluates a cosine function and a couple of adds and multiplies for each input sample.

Now, another important point. Look back at the interactive DCT. As you drag the target points up and down, what is the maximum number of component waves which are required to match the 8 target points?

Right. That is a key point for JPEG image compression. Remember, in a JPEG image, blocks of 8 pixels by 8 pixels are represented as a combination of component waves. The amplitude of each component is called a DCT coefficient. Since each block of input pixels converts to a fixed number of component waves, we just need to store a fixed number of coefficients for each such block. We don't need to store their frequencies, since those are known and are always the same. Nor do we need to store phase shifts, because all are at a constant phase.

Let's move into two dimensions now. We have demonstrated that if we just needed to represent 8 color values in a row, 8 coefficients would be enough. But how many coefficients do we need to represent 8-by-8, or 64, color values? Guess before clicking:

You might ask: Isn't JPEG a lossy format which compresses images into fewer bits? How can converting 64 numbers into 64 other numbers save storage space?

You are right; by itself, applying the DCT to a block of color samples doesn't result in any compression. It's like translating English text to French or Chinese; you are representing the same information in a different form. So it's not surprising that 64 color samples convert to 64 coefficients. However, the DCT is still a key step in achieving image compression. More on this later.

I want to show you examples of 8-by-8 images broken down into 64 two-dimensional waveforms. As in the one-dimensional case, the frequencies and phase of the 64 components will always be the same; only their amplitudes will vary. Before looking at any sample images, though, first let me show you the 64 components of the two-dimensional DCT at a fixed amplitude.

On the left is an 8 by 8 grid. In each cell is a (u, v) pair. (We talk about positions in a DCT coefficient matrix using u, v coordinates; coordinates in the corresponding block of pixels are named x and y.) On the right is a waveform graph. Click on each position in the coefficient matrix to see what the corresponding waveform looks like. You can click and drag on the graph to pivot.

Do you see how the 64 components in the two-dimensional case correspond to the 8 components in the one-dimensional case? Look again at the waveform for coefficient (0, 0), in the top-left corner. This one is very important, since it gives the average value of all 64 color samples in a block. It is called the DC coefficient. The other 63 coefficients represent all the deviations from the average and are called the AC coefficients.

Now you are ready to see the Inverse Discrete Cosine Transform at work. Click on each of the sample images below to see its DCT coefficients. Click on any coefficient to disable it and see what the image looks like with it removed; or Control-click to enable only a single coefficient and see its contribution to the image.

Here's a hint of one interesting thing to look for. If you try disabling various coefficients in the images, which coefficients generally seem to have a smaller effect on the picture? (Those at the top of the matrix, the left, right, bottom, or towards a certain corner?) This has much to do with JPEG compression. More on that towards the end of the post...

If you've made it this far, you might want to know how the DCT and IDCT are calculated. (Don't care about the math? Feel free to skip it.) For simplicity, we'll stick to eight pixel values in one dimension (a row or column). The two-dimensional transforms are very similar.

First, the DCT, which converts pixel values to coefficients:

S_{u} = \frac{1}{2} C_{u} x = 0 \sum 7 s_{x} cos \frac{( 2 x + 1 ) π u}{16}

where

C_{0} = \frac{1}{2}, C_{1 - 7} = 1

That's quite a mouthful in English: to calculate the DCT coefficient u, loop over all eight pixels and sum up: the pixel value times the cosine of: twice the pixel's x coordinate plus one, times π, times u, divided by 16. Halve the total. Further, if u is zero, divide the total again by root 2.

Or if you speak JavaScript:

const coefficients = [];
for (var u = 0; u < 8; u++) {
  var coeff = 0;

  for (var x = 0; x < 8; x++)
    coeff += pixelValues[x] * Math.cos(((2 * x) + 1) * Math.PI * u / 16);

  coeff /= 2;

  if (u === 0)
    coeff /= Math.sqrt(2);

  coefficients.push(coeff);
}

Note that $(2 x + 1) π /16$ ranges from just above zero to just below π; that is, half a complete cycle for the cosine function. So when u is one, the component wave only makes half a cycle as x moves from zero to seven. If u is two, then the input to the cosine function increments twice as fast, and a full cycle is made. Each increment of u increases the frequency of the component wave by 0.5Hz.

On the other hand, when u is zero, the cosine always evaluates to one, and we are essentially just summing up the eight pixel values.

Essentially, coefficient u expresses how well the eight values "fit" a (0.5u)Hz cosine wave. Each value which is positive where the cosine wave is positive (or negative where it is negative) increases the coefficient. Each value which is of opposite sign to the cosine wave at its location on the x-axis decreases the coefficient. If the coefficient value is very positive, that means the sequence of samples closely fits a cosine wave; or if the coefficient value is very negative, that means the samples are close to the opposite of a cosine wave (that is, a cosine wave shifted by 180 degrees).

The Inverse DCT is very similar. Shall we stick to JavaScript this time?

const pixelValues = [];
for (var x = 0; x < 8; x++) {
  var value = coefficients[0] / Math.sqrt(2);

  for (var u = 1; u < 8; u++)
    value += coefficients[u] * Math.cos(((2 * x) + 1) * Math.PI * u / 16);

  value /= 2;

  pixelValues.push(value);
}

Let me share another thing which I find fascinating about the DCT. Actually, no; let me show you and see if you can recognize it yourself.

Earlier I showed you waveform graphs demonstrating how the DCT converts a sequence of discrete color samples to a sum of sinusoid waves. The graphs were bounded tightly around the eight samples on the X-axis. This time let's stretch out the X-axis and let the waves carry on to the left and right. I will draw grey dots at evenly spaced intervals, so you can see if the same pattern repeats itself every eight time units or not.

Look carefully at the pattern created when the waves derived from the DCT are extended to the left and right. Is it simply repeating the same pattern every eight time units? Or...what?

This is part of why the DCT is useful for image compression. When an image is broken down into blocks of pixels, the color of the left edge of a block will often be different from the right edge, and likewise for the top and bottom edges. If we used a discrete Fourier transform on those color samples, the discontinuity between the colors of opposing edges would tend to produce strong high-frequency component waves. (When breaking a waveform down into sinusoids, any sharp "jumps" result in strong high-frequency components.) But since the DCT, in effect, buts the block up with a mirror image of itself on each side, that discontinuity doesn't exist, and the high-frequency components will usually be much weaker.

I still haven't told you why the DCT is useful for image compression. First, understand that not all the information contained in an image is equally important or noticeable to a human viewer. It happens that converting color samples to the frequency domain concentrates the information which is most detectable by our visual system in the coefficients at the top-left of the DCT matrix. Conversely, the information which is least perceptible to our visual system is concentrated in the coefficients at the bottom-right.

In this way, the DCT sets things up for subsequent stages of compression to work their fullest effect. First, quantization. This stage throws away part of the data in the less-significant bits of the DCT coefficients. Since we know that the values of the coefficients towards the bottom-right have less of an effect on what we see, those can be heavily quantized, while retaining more bits of the coefficients toward the top-left. That means we can discard a significant amount of data with little effect on visual quality.

The DCT works synergistically with quantization and the zig-zag ordering of coefficients to make the final entropy coding stage more effective. This stage applies a lossless compression algorithm to the quantized coefficients.

Many images will have smaller coefficient values toward the bottom right of the matrix, and after quantization is applied, these may become zeroes. So the zig-zag ordering of coefficients will tend to produce runs of zeroes toward the end of each block. Those consecutive zeroes can then be represented using an efficient run-length encoding.

Interestingly, both the WebP and AVIF compressed image formats also transform color samples into the frequency domain. Both can use either the Discrete Cosine Transform or a different transform which serves a similar purpose.

The other popular compressed image formats are PNG and GIF. Neither of these transform samples into the frequency domain.

The next post in this series will explore Huffman coding, a lossless compression algorithm which is another key ingredient of JPEG.

Peering into the Linux Kernel with trace

2020-06-04T00:00:00+00:00

Recently, I was working on a patch for a popular open-source project, and discovered that the test suite was failing intermittently. A closer look revealed that the last access time for some files in the project folder were changing unexpectedly, and this was causing a test to fail. (The failing test was not related to my patch.)

Looking at the project code, it seemed impossible for it to be unexpectedly accessing those files during the test in question. Running the test case under strace confirmed that this was not happening. But incontrovertibly, the access times were changing. Could another process on the same machine be reading those files? But why? Could it be a bug in the operating system? Were my tools lying to me?

Faced with a puzzle like this, the inclination might be to shrug one’s shoulders and forget about it, perhaps with a dismissive remark about the general brokenness of most software. (I’ve done that many times.) Anyways, it wasn’t my code which was failing. And yet, it seemed prudent to clear up the mystery, rather than bumbling along and hoping that what I didn’t know wouldn’t hurt me.

This seemed like a good opportunity to try out the BCC tools. This is a powerful suite for examining and monitoring Linux kernel activity in real-time. Support is built in to the kernel (starting from 4.1), so you can immediately investigate when a problem is occurring, without needing to install a special kernel or reboot with special boot parameters.

One of the more than 100 utilities included in the BCC tools is trace. Using this program, one can monitor when any function in the kernel is called, what arguments it receives, what processes are causing those calls, and so on. Having trace is really like having a superpower.

Of course, the argument(s) of interest might not just be integers or strings. They might be pointers to C structs, which might contain pointers to other structs, and so on… but trace still has you covered. If you point it to the appropriate C header files which your kernel was compiled with, it can follow those pointers, pick out fields of interest, and print them at the console. (The header files enable trace to figure out the layout of those structs in memory.)

The invocation of trace which did the job for me turned out to be:

sudo /usr/share/bcc/tools/trace -I/home/alex/Programming/linux/include/linux/path.h -I/home/alex/Programming/linux/include/linux/dcache.h 'touch_atime(struct path *path) "%s", path->dentry->d_name.name'

That says that every time a function called touch_atime (with parameter struct path *path) is called in the kernel, I want to see the string identified by the C expression path->dentry->d_name.name. In response, trace prints out a stream of messages like:

  2135    sublime_text    touch_atime      ld.so.cache
  2076    chrome          touch_atime
  2497    Chrome_ChildIOT touch_atime
  1071    Xorg            touch_atime
  2135    sublime_text    touch_atime      Default.sublime-package
  1566    pulseaudio      touch_atime

As you can see, it very helpfully shows some additional information for each call. From the left, that is the process ID, thread ID, command, function name, and then the requested string. Piping that into ripgrep revealed (within minutes) that my text editor had a background thread which was scanning the project files for changes, as part of its git integration. That is what was updating the access times and causing the erratic test failures.

What a difference it makes to be able to directly look inside a system and see what it is doing, instead of blindly groping using trial and error! This was the first time I harnessed the formidable power of trace, but it won’t be the last. It has a permanent home in my debugging toolbox now.

Eric Raymond’s “Rule of Transparency” sagely advises programmers: “Design for visibility to make inspection and debugging easier”. You said it, Eric, you said it.

⸻But how did you know the function to trace was touch_atime?

Just poking around in the kernel source a bit. I knew there should be a function somewhere in the fs subfolder, and grepped for functions with atime in their name. There are just a few, and touch_atime almost jumped out. Reading the code confirmed that it was the right one.

⸻OK. So how does trace work under the hood?

First, it parses the “probe specifications” which you provide, converts them to a little C program, and uses BCC to convert that C program into eBPF bytecode. (The VM which runs this bytecode is built-in to the Linux kernel.) A special system call is used to load the bytecode into the kernel.

Next, it registers a kprobe with the kernel. The “kprobe” mechanism allows arbitrary callbacks to be associated with almost any function (actually, any machine instruction) in the kernel binary, which will fire whenever that instruction is executed. When a kprobe is registered, the kernel stores the original instruction somewhere and overwrites it with a breakpoint instruction (such as an INT3 instruction on x86). Then it sets things up so that when the breakpoint fires, all the callbacks will be executed. Of course, the instruction which was overwritten will also be executed, so as not to break the function which is being traced.

There are a couple different APIs which user programs can use to create kprobes; one of them is by writing some specially formatted data to a “magic” file called /sys/kernel/debug/tracing/kprobe_events.

Then trace uses another API to tell the kernel to use the previously loaded eBPF bytecode as a callback for the new kprobe. Then it uses another API to get a file descriptor from the kernel, from which it can read the output generated by the BPF program.

It’s an intricate mechanism, but very, very flexible. Just thinking of the possibilities boggles the mind…

Alex Dowad Computes

Stuff I Have Learned: Don’t use a coverage-guided fuzzer on an uninstrumented binary

Visualizing the Complex Sine and Cosine

1. Are they periodic, with period 2π?

2. Do their values range from -1 up to 1?

3. Does the Pythagorean Identity still work?

4. Is cos(z) even, and sin(z) odd?

5. Are they identical after a π/2 phase shift?

6. Do their derivatives still work in the same way?

Digression: Most Complex Functions Don't Have a Derivative

Outro

A Toy Runge-Kutta Differential Equation Solver

Expression Syntax Help for this Tool

Visualizing Nelder-Mead Optimization

JPEG Series, Part II: Huffman Coding

Bottom-up

Top-down

Modifying the Basic Algorithm for JPEG

Huffman Coding in Practice

JPEG Series, Part I: Visualizing the Inverse Discrete Cosine Transform

Peering into the Linux Kernel with trace

4. Is $cos (z)$ even, and $sin (z)$ odd?