Writing in the Air: Unconstrained Text Recognition from Finger Movement Using Spatio-Temporal Convolution

In this article, we introduce a new benchmark dataset for the challenging writing in the air (WiTA) task, an elaborate task bridging vision and natural language processing (NLP). WiTA implements an intuitive and natural writing method with finger movement for human-computer interaction (HCI). Our WiTA dataset will facilitate the development of data-driven WiTA systems, which thus far have displayed unsatisfactory performance, owing both to the lack of datasets and to the traditional statistical models they have adopted. Our dataset consists of five subdatasets in two languages (Korean and English) and amounts to 209,926 video instances from 122 participants. We capture finger movement for WiTA with red-green-blue (RGB) cameras to ensure wide accessibility and cost-efficiency. Next, we propose spatio-temporal residual network architectures inspired by 3-D ResNet. These models perform unconstrained text recognition from finger movement, guarantee real-time operation [>100 frames per second (FPS)], and will serve as an evaluation standard.
Publisher
Institute of Electrical and Electronics Engineers Inc.
Issue Date
2023-12
Article Type
Article
Citation

IEEE Transactions on Artificial Intelligence, v.4, no.6, pp.1386 - 1398

ISSN
2691-4581
DOI
10.1109/tai.2022.3212981
URI
http://hdl.handle.net/10203/320039
Appears in Collection
EE-Journal Papers (Journal Papers)