Three panels: user speaking to The Pen worn behind the ear; user whiteboarding while the device observes visual context; user removes the device and places it in a pocket before resuming social interaction.
The Pen is worn behind the ear during an assistance episode (left); the device observes visual context while the user whiteboards (middle); the user disengages by removing the device before resuming social interaction (right).

Wearable AI is often designed as always-available, yet continuous availability can conflict with how people work and socialize, creating discomfort around privacy, disruption, and unclear system boundaries. This paper explores episodic use of wearable AI, where assistance is intentionally invoked for short periods of focused activity and set aside when no longer needed, with a form factor that reflects this paradigm of wearing and taking off a device between sessions. We present The Pen, an ear-worn device resembling a pen, for episodic, situated cognitive assistance. The device supports short, on-demand assistance sessions using voice and visual context, with clear start/end boundaries and local processing. We report findings from an exploratory study showing how layered activation boundaries shape users' sense of agency, cognitive flow, and social comfort.

Wearable Computing Earables Voice User Interfaces Interaction Design Situated Assistance User Agency Intelligibility

The Pen

An ear-worn wearable that resembles a pen or pencil, designed for hands-free interaction during focused work episodes. Users put the device behind their ear at the start of a task and remove it when done — using the act of wearing to clearly mark the beginning and end of an assistance episode.

The Pen device poster figure showing the ear-worn wearable design and components.

The device consists of a microphone, camera, speaker, Force Sensitive Resistor (FSR), haptic motor, and UFL antenna. A single press captures a photo; press-and-hold initiates a voice query, with haptic feedback communicating system state.

The Pen interaction flow diagram.
1 · Physical
Wear / Remove

Wearing the device marks the beginning of an assistance episode; removing it marks the end. The form factor itself supports the episodic model; put it on when you need it and put it away when you don't.

2 · Interactional
Press / Hold

A deliberate trigger separates passive wearing from active assistance. A single press captures visual context; press-and-hold initiates a voice query, reducing ambiguity about when the system is listening.

3 · Perceptual
Audio / Haptic Feedback

Haptic and audio cues communicate system state, making the boundary between active and inactive assistance legible without requiring visual attention from the user.

Study

We conducted an exploratory study in two tasks — reading/studying and whiteboarding/brainstorming — to understand how episodic boundaries shape user experience in practice.

6
participants
(4M, 2F; ages 18-53)
2
task contexts
(Studying & Brainstorming)
12-30
minute sessions
(Full wear/remove flow)

Participants were instructed to treat wearing as the start of an episode and removal as its end, and were encouraged to invoke assistance whenever they wanted clarification or support. Post-session surveys and open-ended reflections were collected to capture perceptions of agency, flow, and social comfort.

Findings

01
Start boundaries require explicit feedback

Participants did not consistently interpret wearing the device as the beginning of an episode. Users sought haptic or audio confirmation to clarify activation. In contrast, removing the device was perceived as a clear and definitive end to interaction.

02
Episodic use enhances sense of agency

Users valued being in control of when assistance was invoked, with the majority preferring to engage the device only at selected moments rather than continuously. Voice interaction felt natural, without requiring a shift in attention.

03
Activation overhead can disrupt cognitive flow

Despite showing conceptual promise, the device was experienced as practically fragile. Sensing failures and the effort required to invoke assistance disrupted flow, revealing that episodic interaction introduces new overhead when sensing reliability is imperfect. This overhead was partially mitigated by opting for a physical boundary (pressing the side of the device) rather than using wake word activation, though this is still not without friction.

04
Privacy vs. utility tradeoff

Episodic use reframes the challenges of wearable AI devices, trading the risks of always-on systems for the comfort of explicit boundaries. However, participants expressed a desire for proactive assistance as well. Balancing the tradeoffs between proactive and intentionally invoked assistance remains a challenge.

BibTeX

@misc{tussa2026penepisodiccognitiveassistance, title = {The Pen: Episodic Cognitive Assistance via an Ear-Worn Interface}, author = {Yonatan Tussa and Andy Heredia}, year = {2026}, eprint = {2603.06564}, archivePrefix = {arXiv}, primaryClass = {cs.HC}, url = {https://arxiv.org/abs/2603.06564}, }
Home