Könyv Multi-modal Scene Understanding Using Probabilistic Models Sven Wachsmuth

Multi-modal Scene Understanding Using Probabilistic Models

Szerző: Sven Wachsmuth
Nyelv: Angol
Kötés: Puha kötésű
Kiadó: ibidem
Elérhetőség: 50 % esély
Keressük az egész világon
10 559 Ft
How do we explain a picture to another person? We talk about the picture, describe the colors, shape...

Információk a könyvről

Szerző
Nyelv
Angol
Kötés
Könyv - Puha kötésű
Kiadva
2005
oldal
204
EAN
9783898212717
Enbook ID
01915514
Kiadó
Súly
280

Teljes leírás

How do we explain a picture to another person? We talk about the picture, describe the colors, shapes, and objects in it, mention how different objects are related to each other. How do we explain a verbal statement? We show a picture which visualizes the content of the utterance, the objects mentioned in it and how they are related. In everyday communication people use various ways in parallel in order to transmit their intention. They point on something, put on a special face, gesticulate, or refer to the common environment of the communication partners.§They use different modalities in order to communicate. It seems to be just natural to use the same way of interaction in humancomputer-interfaces. The consequence is a paradigm shift from passive interfaces, such as mouse clicks or text typing, to an active communication partner that interprets the auditive and visual environment, draws inferences using background knowledge, and requests missing information. Subsequently, such an active human-computer-interface will be called artificial communicator. However, the automatic interpretation of signals of a separate input modality, such as speech understanding, gesture recognition, or visual object recognition are only one part of the total. In order to build systems which communicate with people in a natural way, the integration of modalities is an essential task that is not trivial. Each modality has its own vocabulary and expressiveness. Pointing defines a region or direction of interest, a special face may represent an emotional feeling, speech understanding provides qualitative facts about the world, and vision perceives and interprets analogous shapes in the world. I think it is not questionable that different formalisms are needed for processing different modalities, and, indeed, this is the the fact in the current state of the art (see Sec. 2.2,2.3). The question is, and this thesis will be an experimental study in this topic, what is the most promising formalism to integrate the results of the specialized processing components of such a multi-modal system or artificial communicator? How should the individual components of the system be connected, and how should the processing be organized? This thesis will give an innovative answer to these questions and present a realization in a particular domain.

Érdekelheti

5 394 Ft
22 496 Ft
5 761 Ft

Miracle on Ice

Michael Burgan
3 621 Ft

Meal in a Mug

Denise Smart
6 786 Ft

Broken Angels Can't Fly

Dmin Dr Robert McElroy
8 742 Ft

Innocent Blood

Michael Lister
11 329 Ft
5 537 Ft

Love and Vandalism

Laurie Boyle Crompton
3 621 Ft
4 942 Ft

Backyard Volcano

Kathryn Lane
5 819 Ft
3 831 Ft

BLEEDING ARMENIA;

A[UGUSTUS] WILLIAMS
14 507 Ft

Cinders

Mette Bach
3 263 Ft

Azok a vásárlók, akik ezt a könyvet megvásárolták, a következőket is megvásárolták

8 433 Ft
6 427 Ft

Sokratische Pädagogik

Roland Mugerauer
11 382 Ft

RAPSODIE

DEBUSSY CLAUDE
22 666 Ft

Shadowscent

P.M. Freestone
9 068 Ft
4 158 Ft
3 478 Ft
7 663 Ft

Angel heart

William Hjortsberg
6 871 Ft