loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Reham Abobeah 1 ; Marwan Torki 2 ; Amin Shoukry 3 and Jiro Katto 4

Affiliations: 1 CSE Department, Egypt-Japan University of Science and Technology, Alexandria, Egypt, CSE Department, Al-Azhar University, Cairo and Egypt ; 2 CSE Department, Alexandria University, Alexandria and Egypt ; 3 CSE Department, Egypt-Japan University of Science and Technology, Alexandria, Egypt, CSE Department, Alexandria University, Alexandria and Egypt ; 4 Computer Science and Communication Engineering Department, Waseda University, Tokyo 169-8555 and Japan

Keyword(s): Temporal Alignment, Synchronization, Attention Mechanisms, Bi-directional Attention.

Abstract: In this paper, a novel technique is introduced to address the video alignment task which is one of the hot topics in computer vision. Specifically, we aim at finding the best possible correspondences between two overlapping videos without the restrictions imposed by previous techniques. The novelty of this work is that the video alignment problem is solved by drawing an analogy between it and the machine comprehension (MC) task in natural language processing (NLP). Simply, MC seeks to give the best answer to a question about a given paragraph. In our work, one of the two videos is considered as a query, while the other as a context. First, a pre-trained CNN is used to obtain high-level features from the frames of both the query and context videos. Then, the bidirectional attention flow mechanism; that has achieved considerable success in MC; is used to compute the query-context interactions in order to find the best mapping between the two input videos. The proposed model has been tr ained using 10k of collected video pairs from ”YouTube”. The initial experimental results show that it is a promising solution for the video alignment task when compared to the state of the art techniques. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 34.234.83.135

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Abobeah, R.; Torki, M.; Shoukry, A. and Katto, J. (2019). Bi-Directional Attention Flow for Video Alignment. In Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP; ISBN 978-989-758-354-4; ISSN 2184-4321, SciTePress, pages 583-589. DOI: 10.5220/0007524505830589

@conference{visapp19,
author={Reham Abobeah. and Marwan Torki. and Amin Shoukry. and Jiro Katto.},
title={Bi-Directional Attention Flow for Video Alignment},
booktitle={Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP},
year={2019},
pages={583-589},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007524505830589},
isbn={978-989-758-354-4},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP
TI - Bi-Directional Attention Flow for Video Alignment
SN - 978-989-758-354-4
IS - 2184-4321
AU - Abobeah, R.
AU - Torki, M.
AU - Shoukry, A.
AU - Katto, J.
PY - 2019
SP - 583
EP - 589
DO - 10.5220/0007524505830589
PB - SciTePress