Computational Methods for Integrating Vision and Language - Brossura

Kanatani, Kenichi ; Sugaya, Yasuyuki

9783031006869: Computational Methods for Integrating Vision and Language

Brossura

ISBN 10: 3031006860 ISBN 13: 9783031006869

Casa editrice: Springer-Nature New York Inc, 2016

Vedi tutte le copie di questa edizione con ISBN

2 Usato

Da: EUR 61,29

14 Nuovo

Da: EUR 46,22

Modeling data from visual and linguistic modalities together creates opportunities for better understanding of both, and supports many useful applications. Examples of dual visual-linguistic data includes images with keywords, video with narrative, and figures in documents. We consider two key task-driven themes: translating from one modality to another (e.g., inferring annotations for images) and understanding the data using all modalities, where one modality can help disambiguate information in another. The multiple modalities can either be essentially semantically redundant (e.g., keywords provided by a person looking at the image), or largely complementary (e.g., meta data such as the camera used). Redundancy and complementarity are two endpoints of a scale, and we observe that good performance on translation requires some redundancy, and that joint inference is most useful where some information is complementary.Computational methods discussed are broadly organized into ones for simple keywords, ones going beyond keywords toward natural language, and ones considering sequential aspects of natural language. Methods for keywords are further organized based on localization of semantics, going from words about the scene taken as whole, to words that apply to specific parts of the scene, to relationships between parts. Methods going beyond keywords are organized by the linguistic roles that are learned, exploited, or generated. These include proper nouns, adjectives, spatial and comparative prepositions, and verbs. More recent developments in dealing with sequential structure include automated captioning of scenes and video, alignment of video and text, and automated answering of questions about scenes depicted in images.

Le informazioni nella sezione "Riassunto" possono far riferimento a edizioni diverse di questo titolo.

Informazioni sull'autore

Kobus Barnard is a professor of computer science at the University of Arizona. He also has appointments in the School of Information: Science, Technology, and Arts (SISTA), Statistics, Cognitive Science, Electrical and Computer Engineering (ECE), and the BIO5 Institute. He leads the Interdisciplinary Visual Intelligence Laboratory (IVILAB.org). Professor Barnard received his Ph.D. in computer science in 2000 from Simon Fraser University (SFU) in the area of computational color constancy, where his dissertation received the Governor General gold medal awarded across all disciplines. He then spent two years at the University of California at Berkeley as a postdoctoral researcher working on modeling the joint statistics of images and associated text, followed by moving to the University of Arizona. His current research addresses problems in interdisciplinary computational intelligence by developing top-down statistical models that are predictive, semantic, and explanatory. Application domains include computer vision, multimedia data, biological structure and processes, astronomy, and human social interaction. His work has been funded by multiple grants from NSF including a CAREER award, DARPA, ONR, ARBC (Arizona Biomedical Commission), and the University of Arizona BIO5 Institute.

Le informazioni nella sezione "Su questo libro" possono far riferimento a edizioni diverse di questo titolo.

Editore: Springer-Nature New York Inc
Data di pubblicazione: 2016
Lingua: Inglese
ISBN 10: 3031006860
ISBN 13: 9783031006869
Rilegatura: Copertina flessibile
Numero edizione: 1
Numero di pagine: 228
Contatto del produttore: ProductSafety@springernature.com
ProductSafety@springernature.com

Poststr. 9
Darmstadt
64293
Germania

Compra usato

Condizioni: come nuovo

Unread book in perfect condition...

Visualizza questo articolo

EUR 61,29

Spedizione EUR 2,32
Spedito in U.S.A.

Aggiungi al carrello

Compra nuovo

Visualizza questo articolo

EUR 46,22

Spedizione EUR 5,50
Spedito da Italia a U.S.A.

Aggiungi al carrello

Risultati della ricerca per Computational Methods for Integrating Vision and Language

Foto dell'editore

Computational Methods for Integrating Vision and Language (eng)

Kanatani, Kenichi

Editore: Springer, 2016

ISBN 10: 3031006860 ISBN 13: 9783031006869

Nuovo Brossura

Print on Demand

Da: Brook Bookstore On Demand, Napoli, NA, Italia

Valutazione del venditore 5 su 5 stelle

Condizione: new. Questo � un articolo print on demand. Codice articolo 3SBXQLLOIL

Contatta il venditore

Compra nuovo

EUR 46,22

Spedizione EUR 5,50
Spedito da Italia a U.S.A.

Quantit�: Pi� di 20 disponibili

Aggiungi al carrello

Foto dell'editore

Computational Methods for Integrating Vision and Language

Kanatani, Kenichi; Sugaya, Yasuyuki

Editore: Springer, 2016

ISBN 10: 3031006860 ISBN 13: 9783031006869

Nuovo Brossura

Da: GreatBookPrices, Columbia, MD, U.S.A.

Valutazione del venditore 5 su 5 stelle

Condizione: New. Codice articolo 44570989-n

Contatta il venditore

Compra nuovo

EUR 52,32

Spedizione EUR 2,32
Spedito in U.S.A.

Quantit�: Pi� di 20 disponibili

Aggiungi al carrello

Foto dell'editore

Computational Methods for Integrating Vision and Language (Synthesis Lectures on Computer Vision)

Kanatani, Kenichi; Sugaya, Yasuyuki

Editore: Springer, 2016

ISBN 10: 3031006860 ISBN 13: 9783031006869

Nuovo Brossura

Da: California Books, Miami, FL, U.S.A.

Valutazione del venditore 4 su 5 stelle

Condizione: New. Codice articolo I-9783031006869

Contatta il venditore

Compra nuovo

EUR 61,48

Spedizione gratuita
Spedito in U.S.A.

Quantit�: Pi� di 20 disponibili

Aggiungi al carrello

Immagini fornite dal venditore

Computational Methods for Integrating Vision and Language

Kanatani, Kenichi; Sugaya, Yasuyuki

Editore: Springer, 2016

ISBN 10: 3031006860 ISBN 13: 9783031006869

Antico o usato Brossura

Da: GreatBookPrices, Columbia, MD, U.S.A.

Valutazione del venditore 5 su 5 stelle

Condizione: As New. Unread book in perfect condition. Codice articolo 44570989

Contatta il venditore

Compra usato

EUR 61,29

Spedizione EUR 2,32
Spedito in U.S.A.

Quantit�: Pi� di 20 disponibili

Aggiungi al carrello

Foto dell'editore

Computational Methods for Integrating Vision and Language (Synthesis Lectures on Computer Vision)

Kanatani, Kenichi; Sugaya, Yasuyuki

Editore: Springer, 2016

ISBN 10: 3031006860 ISBN 13: 9783031006869

Nuovo Brossura

Da: Ria Christie Collections, Uxbridge, Regno Unito

Valutazione del venditore 5 su 5 stelle

Condizione: New. In English. Codice articolo ria9783031006869_new

Contatta il venditore

Compra nuovo

EUR 60,62

Spedizione EUR 13,88
Spedito da Regno Unito a U.S.A.

Quantit�: Pi� di 20 disponibili

Aggiungi al carrello

Foto dell'editore

Computational Methods for Integrating Vision and Language

Kanatani, Kenichi

Editore: Springer 2016-04, 2016

ISBN 10: 3031006860 ISBN 13: 9783031006869

Nuovo PF

Da: Chiron Media, Wallingford, Regno Unito

Valutazione del venditore 5 su 5 stelle

PF. Condizione: New. Codice articolo 6666-IUK-9783031006869

Contatta il venditore

Compra nuovo

EUR 56,91

Spedizione EUR 17,95
Spedito da Regno Unito a U.S.A.

Quantit�: 10 disponibili

Aggiungi al carrello

Immagini fornite dal venditore

Computational Methods for Integrating Vision and Language

Yasuyuki Sugaya

Editore: Springer International Publishing Apr 2016, 2016

ISBN 10: 3031006860 ISBN 13: 9783031006869

Nuovo Taschenbuch

Print on Demand

Da: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germania

Valutazione del venditore 5 su 5 stelle

Taschenbuch. Condizione: Neu. This item is printed on demand - it takes 3-4 days longer - Neuware -Modeling data from visual and linguistic modalities together creates opportunities for better understanding of both, and supports many useful applications. Examples of dual visual-linguistic data includes images with keywords, video with narrative, and figures in documents. We consider two key task-driven themes: translating from one modality to another (e.g., inferring annotations for images) and understanding the data using all modalities, where one modality can help disambiguate information in another. The multiple modalities can either be essentially semantically redundant (e.g., keywords provided by a person looking at the image), or largely complementary (e.g., meta data such as the camera used). Redundancy and complementarity are two endpoints of a scale, and we observe that good performance on translation requires some redundancy, and that joint inference is most useful where some information is complementary. Computational methods discussed are broadly organized into ones for simple keywords, ones going beyond keywords toward natural language, and ones considering sequential aspects of natural language. Methods for keywords are further organized based on localization of semantics, going from words about the scene taken as whole, to words that apply to specific parts of the scene, to relationships between parts. Methods going beyond keywords are organized by the linguistic roles that are learned, exploited, or generated. These include proper nouns, adjectives, spatial and comparative prepositions, and verbs. More recent developments in dealing with sequential structure include automated captioning of scenes and video, alignment of video and text, and automated answering of questions about scenes depicted in images. 228 pp. Englisch. Codice articolo 9783031006869

Contatta il venditore