TY - GEN
T1 - Associating the visual representation of user interfaces with their Internal structures and metadata
AU - Chang, Tsung Hsiang
AU - Yeh, Tom
AU - Miller, Rob
PY - 2011
Y1 - 2011
N2 - Pixel-based methods are emerging as a new and promising way to develop new interaction techniques on top of existing user interfaces. However, in order to maintain platform independence, other available low-level information about GUI widgets, such as accessibility metadata, was neglected intentionally. In this paper, we present a hybrid framework, PAX, which associates the visual representation of user interfaces (i.e. the pixels) and their internal hierarchical metadata (i.e. the content, role, and value). We identify challenges to building such a framework. We also develop and evaluate two new algorithms for detecting text at arbitrary places on the screen, and for segmenting a text image into individual word blobs. Finally, we validate our framework in implementations of three applications. We enhance an existing pixel-based system, Sikuli Script, and preserve the readability of its script code at the same time. Further, we create two novel applications, Screen Search and Screen Copy, to demonstrate how PAX can be applied to development of desktop-level interactive systems.
AB - Pixel-based methods are emerging as a new and promising way to develop new interaction techniques on top of existing user interfaces. However, in order to maintain platform independence, other available low-level information about GUI widgets, such as accessibility metadata, was neglected intentionally. In this paper, we present a hybrid framework, PAX, which associates the visual representation of user interfaces (i.e. the pixels) and their internal hierarchical metadata (i.e. the content, role, and value). We identify challenges to building such a framework. We also develop and evaluate two new algorithms for detecting text at arbitrary places on the screen, and for segmenting a text image into individual word blobs. Finally, we validate our framework in implementations of three applications. We enhance an existing pixel-based system, Sikuli Script, and preserve the readability of its script code at the same time. Further, we create two novel applications, Screen Search and Screen Copy, to demonstrate how PAX can be applied to development of desktop-level interactive systems.
KW - Accessibility API
KW - Graphical user interfaces
KW - Pixel
KW - Text detection
KW - Text segmentation
UR - http://www.scopus.com/inward/record.url?scp=80755169504&partnerID=8YFLogxK
U2 - 10.1145/2047196.2047228
DO - 10.1145/2047196.2047228
M3 - Conference contribution
AN - SCOPUS:80755169504
SN - 9781450307161
T3 - UIST'11 - Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology
SP - 245
EP - 255
BT - UIST'11 - Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology
T2 - 24th Annual ACM Symposium on User Interface Software and Technology, UIST'11
Y2 - 16 October 2011 through 19 October 2011
ER -