Research

Visit my Google Scholar for the full publication list.

Other undergraduate research projects under supervision.

Some projects I'm involved (in progress):

Generative scavenging hunts for foreign vocabulary acquisition
Language learning support for low-resource languages using machine translation
Culturally relevant and appropriate usage of LLMs for learning low-resource languages

Selected Projects

AnnotateGPT

AnnotateGPT is an AI-assisted document annotation system that combines the familiarity of handwritten feedback with the power of large language models. Instead of replacing human reviewers, AnnotateGPT works alongside them: reviewers simply circle, highlight, or mark text with a digital pen, and the AI interprets the intent behind each annotation to generate detailed, context-aware feedback. This approach preserves the natural workflow of handwritten grading while reducing the effort required to provide meaningful, personalized comments.

A study with novice teachers found that AnnotateGPT helped reviewers deliver more comprehensive and constructive feedback with less effort. Beyond education, the project demonstrates how pen-based interactions can be a powerful way to communicate with AI, opening opportunities for applications in design, creative work, and human-AI collaboration.

See more: https://vialab.github.io/AnnotateGPT/

Related publications

B. Leung, M. Shimabukuro, & C. Collins. (2026). AnnotateGPT: Designing human–AI collaboration in pen-based document annotation. In Proc. ACM CHI Conference on Human Factors in Computing Systems (CHI). 2026. https://doi.org/10.1145/3772318.3790867

B. Leung. Implicit Pen Annotation Assisted by Large Language Models. M.Sc. Thesis, Ontario Tech University, 2025. https://hdl.handle.net/10155/1990

"Don’t Wanna Miss A Thing"

"Don't Wanna Miss a Thing" presents a gaze-aware video player designed to make watching subtitled foreign-language videos less stressful and more engaging. Using eye tracking, the system detects when viewers become distracted and automatically helps them catch up without requiring manual rewinding. Depending on the situation, it can pause the video until the viewer looks back, keep missed subtitles visible on the screen, or temporarily switch the dialogue to an English text-to-speech dub so important information isn't lost while the viewer is looking elsewhere.

A user study found that all three gaze-aware interventions were preferred over a standard video player, reducing frustration from missed dialogue and making the viewing experience feel more seamless, even when distractions occurred. This work highlights how eye tracking can enable more adaptive and user-centered language learning experiences.

See more: https://vialab.ca/research/dont-wanna-miss-a-thing

Related publications

M. Ahmed, B. Leung, M. Shimabukuro, & C. Collins. Don't Wanna Miss a Thing: Exploring Gaze-Aware Interventions for Language Learning with Subtitled Videos. In Proc. ACM Symposium on Eye Tracking Research and Applications (ETRA) 2026.

AnchoR

AnchoR is a mobile augmented reality system that helps people explore complex 3D data and models together, even when they are not working simultaneously. Using an everyday tablet, users can bookmark interesting viewpoints, attach text or voice notes, and share their discoveries with others. AnchoR also records which perspectives have already been explored and suggests informative viewpoints, making it easier to understand complex visualizations and continue someone else's analysis without losing context. Early expert feedback suggests that these features can improve communication, navigation, and knowledge sharing in collaborative data exploration, engineering, and training applications.

See more: https://vialab.ca/research/anchor

Related publications

N. Shah, M. Chan, M. Shimabukuro, & C. Collins. AnchoR: Toward Asynchronous Collaboration and Guided Exploration for Mobile AR Visualizations. In Proc. ACM Spatial User Interfaces (SUI) 2025.

SwipeSense

SwipeSense explores a new way of interacting with smartphones by turning the back of the device into an input surface. Instead of touching the screen, users can perform swipe gestures on the back of their phone to scroll, navigate, or answer calls, reducing screen occlusion and making one-handed interaction more comfortable, especially when multitasking or holding something in the other hand. Unlike previous approaches, SwipeSense relies only on the phone's built-in motion sensors, requiring no additional hardware.

Using machine learning, SwipeSense accurately recognizes swipe gestures in eight directions and runs efficiently in real time on commercial smartphones. A user study found that back-of-device swipes felt intuitive and natural for everyday tasks, demonstrating the potential for more accessible and ergonomic mobile interactions while expanding how we think about smartphone input.

See more: <url>

Related publications

N. Shah, B. Leung, M. Shimabukuro, & A. Neshati. SwipeSense: Exploring the Feasibility of Back-of-Device Swipe Interaction Using Built-In IMU Sensors. Proceedings of the ACM on Human-Computer Interaction, Vol. 9, No. 5, Article MHCI030, 2025. https://doi.org/10.1145/3743734

GazeQ-GPT

GazeQ-GPT is an interactive learning system that personalizes comprehension questions for short educational videos using eye-tracking and large language models. By modeling learners’ visual attention on video subtitles, GazeQ-GPT identifies words and concepts that spark interest or difficulty, then automatically generates tailored questions and in-context glosses to support understanding. A user study shows that this gaze-driven approach produces more diverse, high-quality questions than generic LLM prompts, helping learners focus on what matters most to them and promoting more effective, personalized video-based learning.

See more: https://vialab.ca/research/gazeqgpt

Related publications

B. Leung, M. Shimabukuro, M. Chan, & C. Collins. GazeQ-GPT: Gaze-Driven Question Generation for Personalized Learning from Short Educational Videos. In Proc. Graphics Interface (GI) 2025.

LangEye

This project presents LangEye, an application that enables in-situ language learning. Learners can take a picture of real objects of their daily lives, save them as memories, and when ready review those memories. LangEye is a web mobile application for vocabulary learning and training in a foreign language.

LangEye applies the current machine translation, computer vision, large language models, and generative images technology to contextual vocabulary learning. LangEye draws from a theoretical framework for augmented reality and presents a practical application of those concepts using an ubiquitous and accessible platform, smartphones.

See more: https://vialab.ca/research/langeye

Related publications

M. Shimabukuro, D. Panchal, & C. Collins. LangEye: Toward ‘Anytime’ Learner-Driven Vocabulary Learning From Real-World Objects. In Proceedings of the ACL 2025 BEA Workshop on Innovative Use of NLP for Educational Applications. 2025.

M. Shimabukuro. Promoting Autonomy for Language Learning Powered by Artificial Intelligence and Eye Tracking. Ph.D. Dissertation, Ontario Tech University, 2024. https://hdl.handle.net/10155/1905

Card-it

Card-it is a web application for learning Italian verb morphology, in other words, Italian verb conjugations. Unlike other flashcard applications (i.e., Anki), Card-it’s offers (1) the semi-automatic creation of cards using a Finite-State Morphological (FSM) analyzer, reducing repetitive labour and human error inputting the morphological data, and (2) the possibility of classroom integration with student analytics supporting students, teachers and autonomous learners of Italian as a second language.

See more: https://vialab.ca/research/card-it

Related publications

M. Shimabukuro. Promoting Autonomy for Language Learning Powered by Artificial Intelligence and Eye Tracking. Ph.D. Dissertation, Ontario Tech University, 2024. https://hdl.handle.net/10155/1905

M. Shimabukuro, J. Zipf, S. Yama, and C. Collins. 2023. “Evaluating Classroom Potential for Card-it: Digital Flashcards for Studying and Learning Italian Morphology,” in Proc. of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023), pages 130–136, Toronto, Canada. ACL.

S. Yama, “Card-IT Versus: A Competitive Multiplayer Game for Testing Italian Verb Morphology,” Bachelors Thesis, 2022.

H-Matrix

This project presents a visualization technique for cross-linguistic error analysis in large learner corpora. H-Matrix combines a matrix, which is commonly used by linguists to investigate cross-linguistic patterns, with a tree diagram to aggregate and interactively re-weight the importance of matrix rows to create custom investigative views. Our technique can help experts to perform data operations, such as feature aggregation, filtering, ordering and language comparison interactively without having to reprocess the data. H-Matrix dynamically links the high-level multi-language overview to the extracted textual examples, and a reading view where linguists can see the detected features in context, confirm and generate hypotheses.

See more: https://vialab.ca/research/h-matrix

Related publications

M. Shimabukuro. Promoting Autonomy for Language Learning Powered by Artificial Intelligence and Eye Tracking. Ph.D. Dissertation, Ontario Tech University, 2024. https://hdl.handle.net/10155/1905

M. Shimabukuro, J. Zipf, M. El-Assady, and C. Collins, “H-Matrix: Hierarchical Matrix for Visual Analysis of Cross-Linguistic Features in Large Learner Corpora,” in Proceedings of the IEEE Conference on Information Visualization (short papers), 2019.

Abbreviation on demand

A known problem in information visualization labelling is when the text is too long to fit in the label space. There are some commonly known techniques used in order to solve this problem like setting a very small font size. On the other hand, sometimes the font size is so small that the text can be difficult to read. Wrapping sentences, dropping letters and text truncation can also be used. However, there is no research on how these techniques affect the legibility and readability of the visualization. In other words, we don’t know whether or not applying these techniques is the best way to tackle this issue. This thesis describes the design and implementation of a crowdsourced study that uses a recommendation system to narrow down abbreviations created by participants allowing us to efficiently collect and test the data in the same session. The study design also aims to investigate the effect of semantic context on the abbreviation that the participants create and the ability to decode them. Finally, based on the study data analysis we present a new technique to automatically make words as short as they need to be to maintain text legibility and readability.

See more: https://vialab.ca/research/abbreviating-text-labels-on-demand

M. Shimabukuro, “An Adaptive Crowdsourced Investigation of Word Abbreviation Techniques for Text Visualizations,” Master Thesis, 2017.

M. Shimabukuro and C. Collins, “Abbreviating Text Labels on Demand,” Proc. of IEEE Conf. on Information Visualization (InfoVis), 2017.

Word Cloud & Font Size Perception

Many visualizations, including word clouds, cartographic labels, and word trees, encode data within the sizes of fonts. While font size can be an intuitive dimension for the viewer, using it as an encoding can introduce factors that may bias the perception of the underlying values. Viewers might conflate the size of a word’s font with a word’s length, the number of letters it contains, or with the larger or smaller heights of particular characters (‘o’ vs. ‘p’ vs. ‘b’). We present a collection of empirical studies showing that such factors-which are irrelevant to the encoded values-can indeed influence comparative judgements of font size, though less than conventional wisdom might suggest. We highlight the largest potential biases and describe a strategy to mitigate them.

See more:https://vialab.ca/research/perceptual-biases-in-font-size-as-a-data-encoding

E. Alexander, C. Chang, M. Shimabukuro, S. Franconeri, C. Collins, and M. Gleicher, “Perceptual Biases in Font Size as a Data Encoding,” IEEE Transactions on Visualization and Computer Graphics, vol. 24, iss. 8, pp. 2397-2410, 2017.

E. Alexander, C. Chang, M. Shimabukuro, S. Franconeri, C. Collins, and M. Gleicher, “The Biasing Effect of Word Length in Font Size Encodings,” Proc IEEE Information Visualization (InfoVis), Posters, 2016.

Page updated

Google Sites

Report abuse