I actually still have this convergence slowdown problem, but to a lesser extent. Within a few minutes it reaches over 1000 polygons, but if I leave it overnight, it’s at less than 2500 polygons (and looks awesome). It’s desirable from an optimization perspective that it doesn’t use too many polygons (really, that’s something I should fold into the loss function), and desirable from an artistic perspective that it doesn’t get too close to the original image; but from the perspective of using hill climbing to solve optimization problems in general, this slowdown is kind of a serious problem.
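As a rough sketch of what I mean by folding it into the loss function (the penalty weight and the names here are made up for illustration, not anything from the actual program):

```python
import numpy as np

POLYGON_PENALTY = 0.001  # arbitrary weight, purely illustrative

def loss(rendered, target, num_polygons):
    """Per-pixel squared error plus a penalty on the polygon count,
    so the climber prefers approximations that use fewer polygons."""
    pixel_error = np.mean((rendered.astype(float) - target.astype(float)) ** 2)
    return pixel_error + POLYGON_PENALTY * num_polygons
```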
Probably using a distribution of polygon sizes that is closer to exponential (or maybe power-law) would work a lot better: if 10% of the polygons are about half the size of the canvas, 10% are ¼ the size, 10% are ⅛ the size, 10% are 1/16 the size, and so on down to 1/2048 the size (133 pixels), then it would proceed even more rapidly at first, but continue to improve even once the needed changes were very small. I should try out this approach, and I suspect it can be generalized.
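A sketch of that size distribution in Python, with each halving of the area equally likely; turning the chosen area into actual polygon vertices is a separate step I’m handwaving here:

```python
import math
import random

def random_polygon_area(canvas_area, min_fraction=1 / 2048):
    """Pick a target polygon area so that each size class (1/2, 1/4, 1/8,
    ... of the canvas area, down to min_fraction) is equally likely,
    giving a roughly exponential distribution of sizes."""
    n_classes = int(round(math.log2(1 / min_fraction)))  # 11 classes for 1/2048
    k = random.randint(1, n_classes)                      # uniform over size classes
    return canvas_area / 2 ** k
```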
I guess simulated annealing is another approach to solving that problem with hill climbing.
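That would basically mean replacing the greedy accept-if-better rule with the Metropolis acceptance rule, something like the following; the cooling schedule is the part that would actually need tuning:

```python
import math
import random

def accept(delta_loss, temperature):
    """Metropolis acceptance: always take improvements, and take a change
    that worsens the loss by delta_loss with probability exp(-delta_loss/T)."""
    if delta_loss <= 0:
        return True
    return random.random() < math.exp(-delta_loss / temperature)

# Each candidate mutation computes delta_loss = new_loss - old_loss and is
# kept if accept(delta_loss, temperature) is true; the temperature then
# decays slowly, e.g. temperature *= 0.9999 per move.
```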
If I compute the loss function over a downsampled version of the image, at least until the downsampled version is improving very slowly, that should dramatically speed up the search and also make it possible to use only black polygons.
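A minimal sketch of what that downsampled loss might look like (box-filter averaging, assuming the image dimensions are multiples of the factor, and the factor of 8 is arbitrary; in practice it might be better to render the candidate at the lower resolution in the first place):

```python
import numpy as np

def downsample(image, factor):
    """Box-filter downsampling of a 2-D grayscale array by an integer factor."""
    h, w = image.shape
    return image.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))

def coarse_loss(rendered, target, factor=8):
    """Loss over downsampled images: far fewer pixels to compare, and
    clusters of small black polygons average out to shades of gray."""
    return np.mean((downsample(rendered, factor) - downsample(target, factor)) ** 2)
```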