Transformers Human Technology

Self-trained vision transformers mimic human gaze with surprising precision

Video clips from N2010 (Nakano et al., 2010) and CW2019 (Costela and Woods, 2019) were presented to ViTs. The gaze positions of each self-attention head in the class token ([CLS]) — identified as peak ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results

Feedback

Self-trained vision transformers mimic human gaze with surprising precision

Trending now