Facial motion capture

Facial motion capture is the process of electronically converting the movements of a person's face into a digital database using cameras or laser scanners. This database may then be used to produce computer graphics (CG), computer animation for movies, games, or real-time avatars. Because the motion of CG characters is derived from the movements of real people, it results in a more realistic and nuanced computer character animation than if the animation were created manually.

A facial motion capture database describes the coordinates or relative positions of reference points on the actor's face. The capture may be in two dimensions, in which case the capture process is sometimes called "expression tracking", or in three dimensions. Two-dimensional capture can be achieved using a single camera and capture software. This produces less sophisticated tracking, and is unable to fully capture three-dimensional motions such as head rotation. Three-dimensional capture is accomplished using multi-camera rigs or laser marker system. Such systems are typically far more expensive, complicated, and time-consuming to use. Two predominate technologies exist: marker and marker-less tracking systems.

Facial motion capture is related to body motion capture, but is more challenging due to the higher resolution requirements to detect and track subtle expressions possible from small movements of the eyes and lips. These movements are often less than a few millimeters, requiring even greater resolution and fidelity and different filtering techniques than usually used in full body capture. The additional constraints of the face also allow more opportunities for using models and rules.

Facial expression capture is similar to facial motion capture. It is a process of using visual or mechanical means to manipulate computer generated characters with input from human faces, or to recognize emotions from a user.

History

One of the first papers discussing performance-driven animation was published by Lance Williams in 1990. There, he describes 'a means of acquiring the expressions of realfaces, and applying them to computer-generated faces'.^[1]

Technologies

Marker-based

Traditional marker based systems apply up to 350 markers to the actors

cameras. This has been used on movies such as The Polar Express and Beowulf to allow an actor such as Tom Hanks to drive the facial expressions of several different characters. Unfortunately this is relatively cumbersome and makes the actors expressions overly driven once the smoothing and filtering have taken place. Next generation systems such as CaptiveMotion

utilize offshoots of the traditional marker based system with higher levels of details.

Active LED Marker technology is currently being used to drive facial animation in real-time to provide user feedback.

Markerless

Markerless technologies use the features of the face such as nostrils, the corners of the lips and eyes, and wrinkles and then track them. This technology is discussed and demonstrated at CMU,^[2] IBM,^[3] University of Manchester (where much of this started with Tim Cootes,^[4] Gareth Edwards and Chris Taylor) and other locations, using active appearance models, principal component analysis, eigen tracking, deformable surface models and other techniques to track the desired facial features from frame to frame. This technology is much less cumbersome, and allows greater expression for the actor.

These vision based approaches also have the ability to track pupil movement, eyelids, teeth occlusion by the lips and tongue, which are obvious problems in most computer-animated features. Typical limitations of vision based approaches are resolution and frame rate, both of which are decreasing as issues as high speed, high resolution

CMOS cameras

become available from multiple sources.

The technology for markerless face tracking is related to that in a Facial recognition system, since a facial recognition system can potentially be applied sequentially to each frame of video, resulting in face tracking. For example, the Neven Vision system^[5] (formerly Eyematics, now acquired by Google) allowed real-time 2D face tracking with no person-specific training; their system was also amongst the best-performing facial recognition systems in the U.S. Government's 2002 Facial Recognition Vendor Test (FRVT). On the other hand, some recognition systems do not explicitly track expressions or even fail on non-neutral expressions, and so are not suitable for tracking. Conversely, systems such as deformable surface models pool temporal information to disambiguate and obtain more robust results, and thus could not be applied from a single photograph.

Markerless face tracking has progressed to commercial systems such as Image Metrics, which has been applied in movies such as The Matrix sequels^[6] and The Curious Case of Benjamin Button. The latter used the Mova system to capture a deformable facial model, which was then animated with a combination of manual and vision tracking.^[7] Avatar was another prominent performance capture movie however it used painted markers rather than being markerless. Dynamixyz^{[permanent dead link]} is another commercial system currently in use.

Markerless systems can be classified according to several distinguishing criteria:

2D versus 3D tracking
whether person-specific training or other human assistance is required
real-time performance (which is only possible if no training or supervision is required)
whether they need an additional source of information such as projected patterns or invisible paint such as used in the Mova system.

To date, no system is ideal with respect to all these criteria. For example, the Neven Vision system was fully automatic and required no hidden patterns or per-person training, but was 2D. The Face/Off system^[8] is 3D, automatic, and real-time but requires projected patterns.

Facial expression capture

Technology

Digital video-based methods are becoming increasingly preferred, as mechanical systems tend to be cumbersome and difficult to use.

Using digital cameras, the input user's expressions are processed to provide the head pose, which allows the software to then find the eyes, nose and mouth. The face is initially calibrated using a neutral expression. Then depending on the architecture, the eyebrows, eyelids, cheeks, and mouth can be processed as differences from the neutral expression. This is done by looking for the edges of the lips for instance and recognizing it as a unique object. Often contrast enhancing makeup or markers are worn, or some other method to make the processing faster. Like voice recognition, the best techniques are only good 90 percent of the time, requiring a great deal of tweaking by hand, or tolerance for errors.

Since computer generated characters don't actually have muscles, different techniques are used to achieve the same results. Some animators create bones or objects that are controlled by the capture software, and move them accordingly, which when the character is rigged correctly gives a good approximation. Since faces are very elastic this technique is often mixed with others, adjusting the weights differently for the skin elasticity and other factors depending on the desired expressions.

Usage

Several commercial companies are developing products that have been used, but are rather expensive.

It is expected that this will become a major input device for computer games once the software is available in an affordable format, but the hardware and software do not yet exist, despite the research for the last 15 years producing results that are almost usable.

References

^ Performance-Driven Facial Animation, Lance Williams, Computer Graphics, Volume 24, Number 4, August 1990
^ AAM Fitting Algorithms Archived 2017-02-22 at the Wayback Machine from the Carnegie Mellon Robotics Institute
^ "Real World Real-time Automatic Recognition of Facial Expressions" (PDF). Archived from the original (PDF) on 2015-11-19. Retrieved 2015-11-17.
^ Modelling and Search Software Archived 2009-02-23 at the Wayback Machine ("This document describes how to build, display and use statistical appearance models.")
ISBN 978-3-540-63460-7

^ Borshukov, George; D. Piponi; O. Larsen; J. Lewis; C. Templelaar-Lietz (2003), "Universal Capture - Image-based Facial Animation for "The Matrix Reloaded"", ACM SIGGRAPH

^ Barba, Eric; Steve Preeg (18 March 2009), "The Curious Face of Benjamin Button", Presentation at Vancouver ACM SIGGRAPH Chapter, 18 March 2009.

^ Weise, Thibaut; H. Li; L. Van Gool; M. Pauly (2009), "Face/off: Live Facial Puppetry", ACM Symposium on Computer Animation

External links

Carnegie Mellon University

Delft University of Technology

Intel

Sheffield and Otago

v
t
e
Animation topics
By country

Bangladesh

Bhutan

China

Czechia

Estonia

India

Japan

Malaysia

Mexico

North Korea

Philippines

Portugal

Romania

South Africa

South Korea

Spain

Taiwan

Thailand

United States

Vietnam

History

Azerbaijan

Bangladesh

Brazil

Canada

China

France

Hungary

India

Iran

Japan

Korea

Russia

Ukraine

United Kingdom

United States
Silent Era

The Golden Age

World War II

Early TV broadcast era

Modern TV cable and streaming era

Industry

Animator
List

Animation department

Animation director

Story artist

Animation studios
List

Animation database

Art pipeline

Biologist simulators

Animation film festivals
international

regional

Highest-grossing films (Opening weekends)

Outsourcing

International Animation Day

Works

Films
Computer-animated

Feature-length

Lost or unfinished

Package

Short

Short series

Stop-motion

Adult animated films

Series
Adult animated

Computer-animated

Direct-to-video

Flash

Internet

Television

Techniques
Traditional

Barrier-grid and stereography

Flip book

Limited animation

Masking

Rotoscoping

Exposure sheet

Stop motion

Claymation
clay painting, strata-cut

Cutout (silhouette)

Graphic

Model
go motion

Object
Brickfilm

Pixilation

Puppetoons

Computer
(history, timeline)
2D

Flash

PowerPoint

SVG

CSS

Multi-sketch

Onion skinning

3D

T-pose

Cel shading

CGI

Crowd

Facial animation

Morph target

Motion capture
facial

hand tracking

eye tracking

Non-photorealistic rendering

Physically based animation

Procedural

Skeletal

Virtual cinematography

Puppetry

Traditional puppetry

Digital puppetry
Machinima

Aniforms

Virtual human

Live2D

Supermarionation

Mechanical

Animatronics
Audio-Animatronics

Linear Animation Generator

Direct manipulation animation

Humanoid animation

Idle animation

Ink-wash animation

Magic Lantern

Scanimate

Shadowmation

Squigglevision

Whiteboard animation

Other methods

Blocking

Chuckimation

Drawn-on-film

Erasure animation

Hydrotechnics

Inbetweening

Morphing

Paint-on-glass

Pinscreen

Pixel art

Pose to pose

Straight ahead

Rubber hose

Sand

Syncro-Vox

Zoetrope

Variants

Abstract animation (visual music
)

Adult animation

Animated cartoon

Animated sitcom

Animated documentary

Anime

Educational animation

Erotic animation

Independent animation

Instructional animation

Virtual newscaster

Related topics

Animation music
Bouncing ball

Mickey Mousing

Key frame

Cel

Character animation
model sheet

walk cycle

lip sync

off-model

Creature animation

Twelve basic principles

Motion comic

Films with live action and animation
highest grossing

Cartoon physics

Cartoon violence

Most expensive animated films

List of animated films by box office admissions

List of animated television series by episode count
anime series
anime franchises

Category

Portal

Retrieved from "https://en.wikipedia.org/w/index.php?title=Facial_motion_capture&oldid=1209445109"

[LW1990-1] Performance-Driven Facial Animation, Lance Williams, Computer Graphics, Volume 24, Number 4, August 1990

[2] AAM Fitting Algorithms Archived 2017-02-22 at the Wayback Machine from the Carnegie Mellon Robotics Institute

[3] "Real World Real-time Automatic Recognition of Facial Expressions" (PDF). Archived from the original (PDF) on 2015-11-19. Retrieved 2015-11-17.

[4] Modelling and Search Software Archived 2009-02-23 at the Wayback Machine ("This document describes how to build, display and use statistical appearance models.")

[5] ISBN 978-3-540-63460-7

[6] Borshukov, George; D. Piponi; O. Larsen; J. Lewis; C. Templelaar-Lietz (2003), "Universal Capture - Image-based Facial Animation for "The Matrix Reloaded"", ACM SIGGRAPH

[7] Barba, Eric; Steve Preeg (18 March 2009), "The Curious Face of Benjamin Button", Presentation at Vancouver ACM SIGGRAPH Chapter, 18 March 2009.

[8] Weise, Thibaut; H. Li; L. Van Gool; M. Pauly (2009), "Face/off: Live Facial Puppetry", ACM Symposium on Computer Animation

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]