The Artifacts of Motion

One thing that television and the movies have in common is that both portray motion by capturing and displaying a series of still pictures, albeit in very different ways. The capture and display of moving images in these ways can generate artifacts, which are visual effects created by the capture/display process that were not in the original scene.

Some of these artifacts are reversible and may be removed by artful processing, while some cannot be removed at all. Some artifacts are less obtrusive than the effects of their removal would be. It is possible, for example, to remove the visible scan lines from a television screen by placing a ground glass filter over it, but this will also remove enough detail from the picture that we would rather live with the scan lines.

FREQUENCY RECONSTRUCTION

Temporal sampling artifacts may be generated in both film and video capture. We know that sample theory dictates that we need at least two samples per second of any frequency that we capture, in order to perfectly reconstruct that frequency. In digital audio, we sample at a rate faster than twice the highest frequency of interest, and filter the input signal so that no frequency above the highest frequency of interest enters the sampler.

For example, we sample at 48 kHz, while limiting the highest input frequency to 20 kHz. We know that filters have finite slopes, and we have therefore chosen a sampling frequency sufficiently high that we may ensure flat response to 20 kHz; the frequencies within the filter's attenuation slope are well-below half the sampling frequency. Sampling at exactly half the sampling rate, in this case 24 kHz, is called critical sampling and is avoided in digital audio.

Sampling frequencies greater than half the sampling rate result in the replacement of these frequencies with aliases, which are frequencies that were not actually present in the original sampled material. Once aliases have been created, they may not be removed from the reconstructed material, as they are within the band of frequencies of interest.

THE SAMPLE THEORY

Any kind of sampling is subject to the sample theory, and this includes moving pictures. Motion picture photography may be used to illustrate the temporal aliasing phenomenon. A film camera that shoots 24 frames per second has a temporal sampling rate of 24 Hz. This means that any temporal frequency this camera photographs that is above 12 Hz will generate a temporal alias.

Because Westerns, America's own epic dramas, have been so prevalent in our cinematic and television-viewing experience, backward-turning wagon wheels are one of the most-frequently seen temporal aliases. The spokes in a wagon wheel form a repetitive pattern when sampled - that is, each snapshot of the wheel turning "freezes" the spokes at some given position. The frequency at which spokes pass a given point depends on how many spokes the wheel has and how fast it is turning.

WAGONS, HO!

If a wheel has 24 spokes, for example, the spokes are spaced at intervals of 360/24 = 15 degrees. This wheel may turn at a speed such that at the instant each spoke in turn reaches "zero degrees" (or straight up), a photograph is taken. In this case, the wheel is turning 15 degrees each 1/24 second, making one complete revolution each second.

For the moment, we will ignore the aperture effect and pretend that each snapshot is taken instantaneously, knowing that in reality the camera's shutter must remain open for a finite period of time, causing higher-speed motion to blur. Because all the spokes look alike, we cannot tell which particular spoke is passing zero degrees, or any of the other 23 points, in any given snapshot.

In fact, we cannot tell that anything is happening at all, because the wheel appears to be stationary. In this case, only one sample is being taken each second, and sample theory tells us that we cannot accurately reconstruct this frequency. We find this to be true, because the 24 Hz spoke travel frequency has been replaced by the alias frequency of zero Hz.

If the wheel is sped up to two revolutions per second (with spokes still miraculously in-sync with the camera shutter), a snapshot will be made as every other spoke passes zero degrees, and the wheel will again appear to be stationary. Another way to say this is that every integer multiple of 24 Hz will generate a zero Hz alias.

TALKIN' ÎBOUT A REVOLUTION

If the wagon wheel travels at one-half revolution per second, the frequency of spokes passing a given point is 12 Hz, or exactly half the sample frequency. As mentioned earlier, this is called critical sampling. The visual result of this will be that a given spoke will be at zero degrees for a given snapshot, while the spoke immediately ahead of it in the direction of travel will be at 15 degrees and the spoke immediately following it will be at 345 degrees.

At the next snapshot, the first (or "zero") spoke will be at 7.5 degrees; the spoke immediately ahead of it will be at 22.5 degrees; and the spoke immediately following it will be at 352.5 degrees. At one-quarter revolution per second, the spoke repetition rate will be 6 Hz, and the spokes in a given snapshot will be advanced by 3.75 degrees in the next snapshot.

The visual effect for spoke frequencies between zero and 12 Hz (the wheel is turning between zero and 1/2 revolution per second) is that the wagon wheel will appear to turn normally. If the wheel turns at three-quarter revolution per second, the spoke frequency becomes 18 Hz, and the "zero spoke" in our reference snapshot will appear advanced by 11.25 degrees in the next snapshot, while the spoke immediately behind it will be at 360 ö [15-11.25] = 356.25 degrees, or 3.75 degrees before the zero degree point.

The net result of this is that when the 18 Hz spoke frequency is sampled, it is replaced by an alias of 6 Hz. Further, the zero spoke will have advanced by more than half the spacing between two spokes, while the spoke behind it will be 3.75 degrees behind zero degrees.

To the eye, it will appear that the zero spoke has moved not forward but backward by an amount equal to a spoke travel frequency of 6 Hz. Visually, the wagon wheel appears to turn normally as its speed is increased between 0 Hz and 12 Hz - and as its speed is increased above 12 Hz, the real spoke frequencies are replaced by aliases, and the wheel appears to turn backward rapidly at just above 12 Hz. And as the wheel speeds up (progressively more slowly until, as a spoke frequency of 24 Hz is reached), the wheel appears to have stopped.

SPINNIN' WHEEL

Thus, the pattern between 12 Hz and 24 Hz is visually a mirror image of the pattern between 0 and 12 Hz. As the spoke frequency is increased above 24 Hz, the pattern repeats itself, but the wagon wheel will appear to be traveling at half its real speed, until at 48 Hz, it will appear stationary again. This pattern will repeat ad infinitum (or until the bearings burn out), with the wheel appearing stationary at every integer multiple of 24 Hz.

All the visual representations above a spoke frequency of 12 Hz are invalid, as the real frequencies have all been replaced by aliases, just as the sample theory tells us they will be. Further, none of these aliases may be removed, because they all fall within the 0-12 Hz frequency band.

The only way to prevent this aliasing is to filter out all the frequencies above 12 Hz before sampling, analogously to the way digital audio is sampled. However, this cannot be accomplished in film photography, as the requisite sharp-cutoff optical filters cannot be realized. We must, therefore, live with the aliases generated by optical sampling, and we do live with them on a daily basis - both in cinema and television.

When 24-frame film is transferred to 60 Hz video, another form of artifact is generated that we know as 3/2 judder. This column has discussed 3/2 pulldown in the past, but the essence of the process is that in order to transfer 24-frame material to 60-field interlaced video, 60 fields must be made from 48 discrete fields; that is, for every four film frames, not eight but 10 fields must be generated.

This is done by repeating a field after each four fields. For 60-field interlaced video (30 frames per second), one pair of fields is made from each film frame, and one of the fields generated by every second film frame is repeated, so that four film frames comprise five interlaced video frames. An analogous process is used to generate 60 Hz progressive video, except that full video frames replace fields.

FRAME REPITION

When 3/2 material is displayed, the repetition of film frames (as fields or progressive video frames) produces a distortion of what is sometimes called the optical flow axis, which interferes with eye's tracking of motion. As an object moves through the field of vision, the eye attempts to track it. Repetition of frames means that as the object moves, instead of each picture appearing at a unique place on the eye's retina, identical versions of each picture appear at different places on the retina - three identical pictures followed by two identical pictures, followed by three identical pictures, and so on.

This interferes with the eye's ability track the object, causing it to appear to "stutter" as it moves through the field of vision instead of moving smoothly as the eye knows it should. As one in five fields of progressive frames is repeated, the stutter frequency is 60 x 0.2 = 12 Hz.

When the camera pans to track a moving object, the background is not tracked, and the background judders. Judder is readily visible during pans, making cinematographers keenly aware of panning speeds. Unlike aliasing, this artifact of 3/2 pulldown is reversible.

Next month, we will explore these temporal artifacts in greater depth and look at some other artifacts that may be seen when film and video are displayed.

Randy Hoffner