Structured Topic Models: Jointly Modeling Text with Its Accompanying Modalities - Brossura

Wang, Xuerui

 
9783639205572: Structured Topic Models: Jointly Modeling Text with Its Accompanying Modalities

Sinossi

The abundance of textual data in the information age poses an immense challenge for us: how to perform large-scale inference to understand and utilize this overwhelming amount of information. We develop effective and efficient statistical topic models for massive text collections by taking care of extra information from other modalities in addition to the text itself. Text documents are not just text, for example, research papers have author information, email messages contain social sender-recipient links, legislative resolutions are recorded with votes, and so on. These kinds of additional information are naturally interleaved with text. Most of the previous work, however, pay attention to only one modality at a time, and ignore the others. We present a series of probabilistic topic models to show how we can bridge multiple modalities of information, in a united fashion. Interestingly, joint inference over multiple modalities leads to many findings that can not be discovered from just one modality alone, which are clear evidence that we can better understand and utilize massive text collections when additional modalities are modeled jointly with text.

Le informazioni nella sezione "Riassunto" possono far riferimento a edizioni diverse di questo titolo.

L'autore

Dr. Xuerui Wang is a Scientist at Yahoo! Labs, working on machine learning, topic models, social network analysis and online advertising. Dr. Andrew McCallum is an Associate Professor at University of Massachusetts, working on machine learning, natural language processing and information extraction.

Le informazioni nella sezione "Su questo libro" possono far riferimento a edizioni diverse di questo titolo.