Two-Stream Retentive Long Short-Term Memory Network for Dense Action Anticipation

Fengda Zhao; Jiuhan Zhao; Xianshan Li; Yinghui Zhang; Dingding Guo; Wenbai Chen

doi:10.1155/2022/4260247

Corresponding author: Xianshan Li

Research output: Contribution to Journal › Article › peer-review

Abstract

Analyzing and understanding human actions in long-range videos has promising applications, such as video surveillance, autonomous driving, and efficient human-computer interaction. Most research focuses on short-range videos, either predicting a single action in an ongoing video or forecasting an action several seconds before it occurs. In this work, a novel method is proposed to forecast a series of actions and their durations after observing a partial video. This method extracts features from both frame sequences and label sequences. A retentive memory module is introduced to extract rich features at salient time steps and pivotal channels. Extensive experiments are conducted on the Breakfast and 50 Salads data sets. The method achieves performance comparable to state-of-the-art methods in most cases.

Original language	English
Article number	4260247
Pages (from-to)	1-9
Number of pages	9
Journal	Computational Intelligence and Neuroscience
Volume	2022
Early online date	16 May 2022
DOIs	https://doi.org/10.1155/2022/4260247
Publication status	Published - 16 May 2022

Bibliographical note

Copyright © 2022 Fengda Zhao et al.

Keywords

General Mathematics
General Medicine
General Neuroscience
General Computer Science
Neural Networks, Computer
Memory, Long-Term
Humans
Human Activities
Rivers
Memory, Short-Term

Access to Document

10.1155/2022/4260247 (Licence: CC BY)

Zhao_etal_CIN_2022_Two-Stream_Retentive_Long_Short-Term_Memory_Network_for_Dense_Action_Anticipation (Final published version, 1.09 MB, Licence: CC BY)

Cite this

@article{2f9c742de24145e882b7502ac5586486,

title = "Two-Stream Retentive Long Short-Term Memory Network for Dense Action Anticipation",

abstract = "Analyzing and understanding human actions in long-range videos has promising applications, such as video surveillance, autonomous driving, and efficient human-computer interaction. Most research focuses on short-range videos, either predicting a single action in an ongoing video or forecasting an action several seconds before it occurs. In this work, a novel method is proposed to forecast a series of actions and their durations after observing a partial video. This method extracts features from both frame sequences and label sequences. A retentive memory module is introduced to extract rich features at salient time steps and pivotal channels. Extensive experiments are conducted on the Breakfast and 50 Salads data sets. The method achieves performance comparable to state-of-the-art methods in most cases.",

keywords = "General Mathematics; General Medicine; General Neuroscience; General Computer Science; Neural Networks, Computer; Memory, Long-Term; Humans; Human Activities; Rivers; Memory, Short-Term",

author = "Fengda Zhao and Jiuhan Zhao and Xianshan Li and Yinghui Zhang and Dingding Guo and Wenbai Chen",

note = "Copyright {\textcopyright} 2022 Fengda Zhao et al.",

year = "2022",

month = may,

day = "16",

doi = "10.1155/2022/4260247",

language = "English",

volume = "2022",

pages = "1--9",

journal = "Computational Intelligence and Neuroscience",

issn = "1687-5265",

publisher = "Hindawi Publishing Corporation",

}

TY - JOUR

T1 - Two-Stream Retentive Long Short-Term Memory Network for Dense Action Anticipation

AU - Zhao, Fengda

AU - Zhao, Jiuhan

AU - Li, Xianshan

AU - Zhang, Yinghui

AU - Guo, Dingding

AU - Chen, Wenbai

PY - 2022/5/16

Y1 - 2022/5/16

N2 - Analyzing and understanding human actions in long-range videos has promising applications, such as video surveillance, autonomous driving, and efficient human-computer interaction. Most research focuses on short-range videos, either predicting a single action in an ongoing video or forecasting an action several seconds before it occurs. In this work, a novel method is proposed to forecast a series of actions and their durations after observing a partial video. This method extracts features from both frame sequences and label sequences. A retentive memory module is introduced to extract rich features at salient time steps and pivotal channels. Extensive experiments are conducted on the Breakfast and 50 Salads data sets. The method achieves performance comparable to state-of-the-art methods in most cases.

AB - Analyzing and understanding human actions in long-range videos has promising applications, such as video surveillance, autonomous driving, and efficient human-computer interaction. Most research focuses on short-range videos, either predicting a single action in an ongoing video or forecasting an action several seconds before it occurs. In this work, a novel method is proposed to forecast a series of actions and their durations after observing a partial video. This method extracts features from both frame sequences and label sequences. A retentive memory module is introduced to extract rich features at salient time steps and pivotal channels. Extensive experiments are conducted on the Breakfast and 50 Salads data sets. The method achieves performance comparable to state-of-the-art methods in most cases.

KW - General Mathematics

KW - General Medicine

KW - General Neuroscience

KW - General Computer Science

KW - Neural Networks, Computer

KW - Memory, Long-Term

KW - Humans

KW - Human Activities

KW - Rivers

KW - Memory, Short-Term

U2 - 10.1155/2022/4260247

DO - 10.1155/2022/4260247

M3 - Article

C2 - 35615551

SN - 1687-5265

VL - 2022

SP - 1

EP - 9

JO - Computational Intelligence and Neuroscience

JF - Computational Intelligence and Neuroscience

M1 - 4260247

ER -
