ST-SIGMA: Spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting

Cited 37 time in webofscience Cited 0 time in scopus
  • Hit : 276
  • Download : 64
DC FieldValueLanguage
dc.contributor.authorFang, Yangko
dc.contributor.authorLuo, Beiko
dc.contributor.authorZhao, Tingko
dc.contributor.authorHe, Dongko
dc.contributor.authorJiang, Bingbingko
dc.contributor.authorLiu, Qilieko
dc.date.accessioned2022-12-11T01:00:10Z-
dc.date.available2022-12-11T01:00:10Z-
dc.date.created2022-11-14-
dc.date.created2022-11-14-
dc.date.issued2022-12-
dc.identifier.citationCAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, v.7, no.4, pp.744 - 757-
dc.identifier.issn2468-6557-
dc.identifier.urihttp://hdl.handle.net/10203/302604-
dc.description.abstractScene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving (AD) system. However, most proposed methods aim at addressing one of the two challenges mentioned above with a single model. To tackle this dilemma, this paper proposes spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting (ST-SIGMA), an efficient end-to-end method to jointly and accurately perceive the AD environment and forecast the trajectories of the surrounding traffic agents within a unified framework. ST-SIGMA adopts a trident encoder-decoder architecture to learn scene semantics and agent interaction information on bird's-eye view (BEV) maps simultaneously. Specifically, an iterative aggregation network is first employed as the scene semantic encoder (SSE) to learn diverse scene information. To preserve dynamic interactions of traffic agents, ST-SIGMA further exploits a spatio-temporal graph network as the graph interaction encoder. Meanwhile, a simple yet efficient feature fusion method to fuse semantic and interaction features into a unified feature space as the input to a novel hierarchical aggregation decoder for downstream prediction tasks is designed. Extensive experiments on the nuScenes data set have demonstrated that the proposed ST-SIGMA achieves significant improvements compared to the state-of-the-art (SOTA) methods in terms of scene perception and trajectory forecasting, respectively. Therefore, the proposed approach outperforms SOTA in terms of model generalisation and robustness and is therefore more feasible for deployment in real-world AD scenarios.-
dc.languageEnglish-
dc.publisherWILEY-
dc.titleST-SIGMA: Spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting-
dc.typeArticle-
dc.identifier.wosid000877526400001-
dc.identifier.scopusid2-s2.0-85141371596-
dc.type.rimsART-
dc.citation.volume7-
dc.citation.issue4-
dc.citation.beginningpage744-
dc.citation.endingpage757-
dc.citation.publicationnameCAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY-
dc.identifier.doi10.1049/cit2.12145-
dc.contributor.nonIdAuthorFang, Yang-
dc.contributor.nonIdAuthorLuo, Bei-
dc.contributor.nonIdAuthorZhao, Ting-
dc.contributor.nonIdAuthorJiang, Bingbing-
dc.contributor.nonIdAuthorLiu, Qilie-
dc.description.isOpenAccessY-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorfeature fusion-
dc.subject.keywordAuthorgraph interaction-
dc.subject.keywordAuthorhierarchical aggregation-
dc.subject.keywordAuthorscene perception-
dc.subject.keywordAuthorscene semantics-
dc.subject.keywordAuthortrajectory forecasting-
dc.subject.keywordPlusTRACKING-
Appears in Collection
Files in This Item
126681.pdf(2.64 MB)Download
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 37 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0