History of Lip Sync & Industry Examples In the process of lip synchronization, the animator must have an understanding of how phonetics work, and how our mouths are positioned in order to produce certain sounds which are represented by letters and letter combinations. In this post, I want to briefly delve into the history of the technique and the basics of lip syncing for animation.
What is lip sync? Lip sync, which is short for lip synchronization, is a technical term for matching a speaking or singing person’s lip movements with prerecorded sung or spoken vocals that listeners hear through an audio output system. The term can refer to any number of different techniques and processes from film production, concerts, and dubbing, but for the purposes of this assignment, we are exploring the use of lip sync in animation.
This is the art of making an animated character appear to speak, to a pre-recorded track of dialogue or sound. This technique of making an animated character appear to talk involves a range of key ideas, but mainly figuring out the timings of the speech and the actual animation of the lips and mouth to match the dialogue track.
A Brief History Historically, the earliest examples of lip synchronization in animation were attempted by Max Fleischer in the 1926 short My Old Kentucky Home, but for obvious reasons (racial discrimination and insensitivity) I will not show it here. The technique, however, has since been refined and honed since this early example and is a staple of the animation industry to this day. The technique is used by all types of animated works, from feature films such as Cloudy with a Chance of Meatballs to American animated sitcoms like The Simpsons and Family Guy.
Lip synchronization typically begins by analyzing sound frame by frame, a method which dates back to the late 1920s with the origin of sound cartoons. Traditionally, this would involve transferring the dialogue tracks for animated films ‘onto sprocketed optical sound film’. The holes on this film material would exactly match ‘with those of the motion picture film’, which enabled sound and image to be mechanically locked together on editing and sound mixing machines.
This process is called 'track reading'. The dialogue track is analyzed and the information is charted up onto camera exposure sheets, sometimes called 'dope sheets' or 'camera charts', as a guide for the animator. With contemporary animation, we have the ability to analyze a sound using digital sound tools such as Audacity or the sound workspace within TVPaint itself. These allow us to scrub back and forth over ‘a graphical depiction’ of a sound wave, and see where the sound begins and finishes.
Industry Practices Approaches to the lip synchronization process can be just as varied as the character designs themselves: simple, elaborate, restrained or exaggerated. Every individual animator’s approach to the process is different, so in larger studios where more than one animator is in charge of the same character, extensive notes, drawings and sketches will be created to instruct the rest of the team how to animate the mouth consistently throughout a series. This can be seen in Adventure Time, where creator Pendleton Ward and lead designers for the show instruct animators working overseas how to keep protagonist Finn’s mouth shapes on-model at all times. For a large production like Adventure Time where the animation is done externally, this is a key industry practice.
Often, the way a mouth works is determined by the character’s head design: animators use tricks and techniques to cut down the workload by subtly separating the head or just the mouth, in the form of beards, necklaces or neckties that allow animators to work solely on the mouth and faces during the lip sync process. Sometimes, the mouth is hidden behind a wiggling mustache or replaced altogether with a mask in the example of The Spectacular Spider-Man animated series.
The Importance of Gesture Something that contemporary animation, in particular, has been found guilty of doing is the use of talking heads: this is commonly seen in shows such as Family Guy and similar digital 2D animated sitcoms where the characters don’t move and simply talk to each other. This can quickly get boring and is something I am going to try and consciously avoid during this project. A solution to this problem would be the use of gestures and body movement. Despite the fact we are encouraged to solely focus on the lip sync itself, gestures and body movements can communicate something about the personality of a character and the way it is feeling to an audience. This also makes for something much more interesting and engaging to watch, rather than simply watching the mouth. Small movements and gestures such as shrugging the shoulders or pointing can communicate ideas of emotion and what the character is thinking, helping sell the performance to the viewer.
Economical Animation When a character’s head remains static during a passage of dialogue, animators will take this opportunity to simply cycle the mouths onto a different layer, and place these onto of a character without a mouth. This is a key industry practice and saves lots of time and energy on the animator’s part. It’s important to maintain the same sort of perspective view in the mouth, otherwise, it will look separated from the face.
A more effective and engaging technique is treating the whole head as the lip-sync component. This allows movement of the bottom jaw that can actually open and drops lower, and also allows animators to work stretch and squash into the entire face. It is ‘rare for one mouth position to be onscreen for less than two frames’, as animating on ones tends to look far too quick. This is a lesson that I had to learn myself in my initial response to the assignment, animating the sequence frame by frame in TVPaint.
The Hold One important principle of modern lip sync animation is ‘the hold’. This is a technique which describes the animation of the body, which is held for numerous frames whilst the lip sync takes place. As a rule, animators work on gestures and body language first, and then go back and add the mouth in afterward. This will inform my own process in TVPaint, as I will begin the animation by developing the body movements first before moving onto the actual mouth shapes.
Whilst classic Disney animated films featured characters that rarely actually stopped moving, it is far more commonplace to have still illustrations be held for multiple frames whilst the lip sync animation takes place. This saves time and effort but also places the focus on a handful of expressive gestural poses that work to convey the emotion of the scene. This approach can be seen in almost any animated television production, we can see this in the example from Regular Show, in which only select parts of the two main characters move. The face and arms are animated, but the body stays still as to not detract from the lip sync.
As a rule, lip synchronization is ‘best underplayed’, with the exception of important accents and vowel sounds. Emphasis on the size and mechanics on the mouth can detract from a good lip-sync animation - this is an idea that I will be considering with my own animation.
It’s in the Eyes A final interesting idea that I found is the focus on the eyes of the characters: ‘the audience often watch the eyes, so emphasis and accents can be initiated here’ before the rest of the face and mouth is considered. It’s often said that the eyes are the window to the soul: it’s a powerful way of getting a character to communicate their inner feelings without actually saying anything, and even the act of thinking of words can speak can be expressed in the eyes. This is something that I found to be true in my own video reference for this assignment, as I observed how my eyes would move around as I was delivering the dialogue, in character.
Summary Through my own independent research into the history of lip synchronization in animation and other online sources presented below, I have been able to really develop my understanding of this crucial animation process. Having looked at a few industry examples here, next I want to continue this line of inquiry further and explore some actual tutorials by animators on some more basic principles of lip synchronization for animation.
References:
The Little Whirlwind | A Classic Mickey Short. (2017). Mickey Mouse. https://www.youtube.com/watch?v=REHJ1lz_HLQ
Damo and Darren 3 - Centrelink. (2015). Michael Cusack. https://www.youtube.com/watch?v=s96drQkfpPI
Regular Show. (2014). JG Quintel. https://www.youtube.com/watch?v=KOSdKV2z7_Q&t=45s
The Simpsons. (1987). Matt Groening. https://www.youtube.com/watch?v=nekvVuoiTyA&t=1s
The Spectacular Spider-Man. (2012). Disney Animation. https://www.youtube.com/watch?v=8v4PL2-EC1Q&t=5s
Lip Sync. (2015). AIM Animation. http://minyos.its.rmit.edu.au/aim/a_notes/anim_lipsync.html











