Next: List of Participants Up: FINAL REPORT TO NSF Previous: Final Remarks

Goals Acheived and Recommendations

Some of the achieved goals of the workshop are

In comparing the list of phenomena to be modeled to the list of modeling techniques, it is clear that researchers have been working effectively to address many of the fundamental modeling problems of facial animation. However, In spite of the array of advanced modeling technologies that have been introduced into the field, it is still a far from trivial task for an end user to create a model of a specific person's face and to make it speak or perform other complex behaviors. In order to enable practical applications of facial animation, five basic research tasks should be undertaken: understand application needs, develop a data description language, collect an extensive database, formalize and validate modeling techniques, and perform basic research into modeling techniques. As improved computing technology becomes available, new applications of facial modeling are becoming feasible and cost-effective. These research tasks are designed to encourage efficient development of these applications.

FACS is the premier notation scheme being using for facial animation. However, FACS was designed as a recognitive scheme - defined actions were created as cognitively/visually distinct units rather than minimally generable units. In particular, (1) actions are imprecisely defined, and (2) actions combine in an unpredictable manner.

The imprecise definition is necessary with respect to the initial intent of FACS to notate general expression usage on human faces. However, describing precise animations can require the manipulation of the face at a sub-FACS level. The inability of FACS to exactly predict the result of an AU sequence is irrelevant to notators. A notator must be aware of what actions may not be currently visible; however, an animation control system must be able to explicitly state whether an action is visible. (Alternatively, facial region changes must be controllable at a sub-AU level, implying multiple levels of control even within the realm of FACS.) That FACS continues to function so well despite these weaknesses, in a manner almost completely converse to its initial design, testifies to its flexibility and accuracy.

A general notation scheme for describing changes to the human face needs to be designed. The general philosophy of FACS (i.e., representable actions are described in parallel with a structural model of matching complexity) will be maintained. We will refer to this scheme as FACS+.

Control systems can be defined at several levels. The face suggests at least four: (1) geometric, (2) structural, (3) expressive, and (4) conversational. Geometric facial models manipulate the object at a purely physical level. Structural models represent the face in terms of active regions, taking a simplistic view of the underlying geometry. Expressive models work at a grosser feature level, animating faces based on its most obvious features. Conversational representation and control operate at an even higher level, dealing with emotional intent and general facial actions.

FACS operates at a structural level. Its ability to operate at a geometric level is limited due to its lack of definition at that low a level. Likewise, its ability to operate at higher levels has been well-researched, but exact mappings from intent to FACS AUs are specified in an ad hoc manner. FACS+ will also operate at a structural level. We assume other schemes will be used to operate on geometric and feature-based models; FACS+ will act as an intermediary between the two.

Issues involved in the replacement of FACS center on its relationship to lower and higher level schemes as well as extensions needed to more fully represent the face. Based on this, we have identified the following areas of concern when extending FACS to produce FACS+:

  1. Downward links to physical model controls.

  2. Upward links to feature-based model controls.

  3. Static definition of expression changes.

  4. Expression dynamics.

  5. Muscle intensity.

  6. Extensibility.

  7. Fine/Coarse definition.

  8. Tongue action.
  9. Interactions.
  10. External interactions.

Attempts are already underway aimed at extending FACS by defining methods to automate FACS-coding, and to extend and improve the modeling, especially within the context of simulations, animations and human-machine interaction. Ekman and Sejnowski [39] are at present developing a neural net appraoch for recognizing FACS AUs. Yacoob and Davis [152] have developed a facial expression recognition system based on FACS. Essa and Pentland [44][42] are concentrating on extending the FACS model to FACS+ by observing real people making expressions and extracting spatial and temporal information from video to describe facial motion.



Next: List of Participants Up: FINAL REPORT TO NSF Previous: Final Remarks


pkitchin@graphics
Thu Nov 17 10:12:34 EST 1994