Visualizing Uncertain Labels

untangleData with multiple uncertain labels are common in many situations. For examples, a movie may be associated with multiple genres with different levels of confidence, and a protein sequence may be probabilistically assigned to several structural subcategories. Despite their ubiquity, the problem of visualizing uncertain labels has not been adequately addressed. Existing approaches often either discard the uncertainty information, or map the data to a low-dimensional subspace where their associations with original labels are obscured. In this paper, we propose a novel visual technique, UnTangle, for visualizing uncertain multi-labels. In our proposed visualization, data items are placed inside a web of connected triangles, with labels assigned to the triangle vertices such that nearby labels are more relevant to each other. The positions of the data items are determined based on the probabilistic associations between items and labels. UnTangle provides both (a) an automatic label placement algorithm, and (b) adaptive interaction mechanisms that allow users to control the label positioning for different information needs. Our work makes a unique contribution by providing an effective way to investigate the relationship between data items and their uncertain labels, as well as the relationships among labels. Our user study suggests that the visualization effectively helps users discover emergent patterns and compare the nuances of uncertainty information in the data labels.

Publications

Nan Cao, Yu-Ru Lin, and David Gotz. UnTangle Map: Visual Analysis of Probabilistic Multi-Label Data. IEEE Transactions on Visualization and Computer Graphics (Published online first at IEEE Xplore on 21 April 2015; to appear in print).
[This article is available through the journal’s website.  View the article by clicking here.]

Yu-Ru Lin, Nan Cao, David Gotz and Lu Lu. UnTangle: Visual Mining for Data with Uncertain Multi-Labels Via Triangle Map. To Appear in the IEEE International Conference on Data Mining (ICDM) (2014).
[PDF, 1.2M]

Sch of Inform and Libr Science