|
The plugins are free, work with PSP and Photoshop, and are available with source code (side note: by strange coincidence, Alex Chirokov graduated from the same place as I did, only 14 years later).
The original 2400dpi scan proved to be too large for processing, so I scaled it down 5 times and cropped the margins. The resulting image of the text block is 3MB on disk and 9.5MB in memory (32 bits per pixel). Here are the exact dimensions:
Averaging the measurements above, we get a 18.95±0.05 dpmm image resolution. Thus, the real size of the image is 70.7±0.2mm × 131.4±0.3mm. TransformationFourier transformation converts a greyscale image into an equivalent form showing the phases and amplitudes of sine waves of various frequencies; the position of a pixel defines the angle and frequency of the wave, while the color information for the pixel stores the phase and amplitude/intensity of the wave (if you want a more detailed introduction, go here). In the original image space, each individual wave looks like a regular set of parallel bars with intensity alternating between darker and lighter shades of grey. The wave covers the whole picture and its average intensity is ½ (1=white, 0=black). The phase of a wave determines positions of its maxima; the interference of the waves of many different frequencies, amplitudes and phases recreates the original image. In theory, the transformation does not lose any information and is reversible. When applied to discrete images, rounding errors do occur, but in most cases, including ours, the result of direct transformation (FFT) followed by inverse transformation (IFFT) is visually indistinguishable from the original.
The intensity picture of the original page shows small-to-medium variations of the tone between adjacent pixels (representing waves of close frequency and direction). These changes, when blurred by the eye, give very smooth transitions with very little contrast except for a few places around the middle. The contrast can be artificially enhanced for analysis by averaging and posterizing the picture into small number of fixed intensities. I chose 3-bit posterization (8 intensity levels) and colored the result using map-like pseudocolors (blue to white through green). Also, since the averaging loses important intensity peaks, I added those peaks using intensity threshholding on the non-averaged channel. The peaks look like small white dots located in the central area. AnalysisWhen discrete Fourier transformation is applied to a finite image, there are natural limits for the waves frequencies: the shortest identifiable wavelength is two pixels, while the longest wavelength is equal to the size of the picture (if the picture is not a square, horizontal and vertical components of the frequency can have different maxima). The FFT image is laid out so that the information concerning the long waves is in the center of the image, while the shortest waves are represented by pixels close to the edges. Since amplitude channels of FFT images are symmetrical around the center, only one half of the image needs to be analyzed.
First, lets take a look at the peaks close to the center (I marked them with red dots). The center pixel and pixels around the center encode amplitudes of very long waves, responsible for background color and very smooth transitions of the tone in the original image; they aren't very interesting. There are half a dozen peaks right above and below the center point, spaced at equal intervals. The first three peaks above the center are marked vf, vf*2, and vf*3; they represent the main vertical wave and its harmonics (shorter waves with frequencies which are exact multiples of the main wave). As it is the case with any musical instrument, the main tone is accompanied by its harmonics with higher frequences; when superimposed, they shape the actual waveform, turning the pure sinusoidal shape into something more sophisticated. In our case, harmonics allow for more abrupt transitions between black and white, which will be presented below. Positions of the peaks can be measured and compared to the central pixel at 670,1245:
Given the vertical offset of the peak in pixels, the actual wavelength in pixels can be calculated as the quotient of the height of the image and the offset; for the main vertical wave we have 2490/33 = 75.5±2.5pix between maximums. In real terms, dividing by 18.95±0.05 dpmm resolution, we get a 4.0±0.1mm wavelength (distance between maxima). As you will see below, this wave corresponds to the vertical line period and its length is equal to the aldine classics line increment measured by Peter Burnhill (figure 35 in Type Spaces). The remaining points of interest are not peaks of high intensity, but rather centers of hills, responsible for many diagonal waves of close frequencies and orientation. These centers are marked df1, df2, and df3 (since the FFT image is symmetrical, the same points are present below the center). Here are the positions of the centers and their offsets from the center point:
With these numbers, we can easily calculate the orientation and wavelengths of the diagonal waves. It can be done componentwise, e.g. the horizontal component of df3's wavelength (x) is 1340/89 = 15pix between maximums along horizontal axis; the vertical component (y) is 2490/219 = 11.4pix. The actual wavelength is x×y/sqrt(x²+y²), which in our case will be 9.0±0.3pix or 0.48±0.02mm in real units. The angle is atan(x/y), or 53 degrees clockwise from vertical. Making these calculations for all centers, we get the following results (vf is added for completeness):
VisualizationNice thing about Fourier transform is that you can inverse it and get the original image. However, it is more interesting to tweak the amplitude channel before making the inverse transformationthis way one can filter out some unneeded frequencies or emphasize the important ones. Basic filters for low and high frequencies just sharpen or soften the image; we will try something more fancy here.
The most obvious match is the 9° wave (df1); it corresponds to the main repeated stem rate and is angled at the average ascender / descender angle. Its average horizontal period is 0.84mm, which corresponds to 2½ Burnhill's units (0.333mm × 2.5 = 0.833mm, ½ of the x-height). The next wave, df2, can be attributed to repeated connection strokes and frequent ligatures which have inclined shapes, different from those of standalone letters. At 37°, it is close to the prototype pen angle. The last one, df3, is less pronounced than the first two. Its angle is steeper (53°) and its maxima are closer together (although the horizontal spacing is the same as for df2). Closer inspection allows us to identify it with features of a, e, æ, and possibly with high connection strokes of standalone m and n. The details can be viewed by clicking on the pictures below. |
|
AcknowledgementsI am grateful to Alex Chirokov for making this work possible and to Andrew Pochinsky for his thoughtful comments. MiscellaneaTo see the role of phase information, observe what happens with the original picture when the FFT phase channel is zeroed. The amplitudes are the same, but all the waves are shifted to a meaningless common origin, which results in an unrecognizeable interference picture. |