Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation: ICCV 2021 Paper Reading Notes


Shared by 超帅期待, a blogger on 靠谱客 (kaopuke), as a reference.

Contents

Overview

Video-Sentence Alignment

Matching Score.

Multi-Instance Learning.

Cross-Sentence Relations Mining

Temporal Consistency.

Semantic Consistency.

Model Training

Overview

The task addressed in this paper is video activity localisation.

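Only the video-text correspondence is available, with no ground-truth temporal boundaries, so the setting is weakly supervised.

Existing weakly supervised solutions first localise the different moments of interest (MoIs) one by one. This is suboptimal, because it ignores the cross-sentence relations within a paragraph, which play an important role in temporal localisation.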

Video-Sentence Alignment

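The video V and a query Qj are fed into a Modalities Matching Network (MMN), which outputs a matching score between every proposal and the query. Specifically:

The MMN is a stack of attention units.

Each attention unit is built from attention operations: both self-attention and cross-attention are computed here.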
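
As a rough illustration of what one such unit could look like, here is a minimal NumPy sketch. It is not the paper's implementation: the function names (`attention`, `attention_unit`), the feature sizes, and the order "self-attention first, cross-attention second" are all assumptions, and the learned projection matrices of a real attention layer are left out to keep it short.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the maximum along the axis for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(queries, keys, values):
    """Scaled dot-product attention: each query row attends over all key rows."""
    d = queries.shape[-1]
    weights = softmax(queries @ keys.T / np.sqrt(d))
    return weights @ values

def attention_unit(video, text):
    """One hypothetical unit: self-attention within each modality,
    then cross-attention from each modality to the other (assumed order)."""
    video_sa = attention(video, video, video)            # self-attention over clips
    text_sa = attention(text, text, text)                # self-attention over text rows
    video_out = attention(video_sa, text_sa, text_sa)    # video attends to text
    text_out = attention(text_sa, video_sa, video_sa)    # text attends to video
    return video_out, text_out

rng = np.random.default_rng(0)
video = rng.normal(size=(8, 16))  # 8 clip features of dimension 16 (toy values)
text = rng.normal(size=(5, 16))   # 5 text features of dimension 16 (toy values)
for _ in range(2):                # "stacked" units: apply the unit twice
    video, text = attention_unit(video, text)
```

Both outputs keep the shape of their inputs, which is what allows several units to be stacked.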

Matching Score.

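The video V is divided into Ls proposals with a sliding-window strategy.

The text features are max-pooled along the sentence-number dimension to obtain the text vector Qj.

The text feature and the proposal features are fused together.

The joint features are fed into a linear classifier.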
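
The four steps above can be sketched roughly as follows. Several details are assumptions made only for illustration and are not given in these notes: the window sizes and stride, max-pooling clips inside a window to get a proposal feature, fusion by element-wise product, and the sigmoid on top of the linear classifier. The text features are simply pooled along their first axis in this toy layout.

```python
import numpy as np

def sliding_window_proposals(num_clips, window_sizes, stride_ratio=0.5):
    """Enumerate (start, end) clip-index windows; end is exclusive."""
    proposals = []
    for w in window_sizes:
        stride = max(1, int(w * stride_ratio))
        for start in range(0, num_clips - w + 1, stride):
            proposals.append((start, start + w))
    return proposals

def matching_scores(video, text, proposals, weight, bias):
    """Score every proposal against one query."""
    query = text.max(axis=0)  # max-pool the text features into one vector Qj
    # Proposal feature: max-pool the clip features in each window (assumption).
    props = np.stack([video[s:e].max(axis=0) for s, e in proposals])
    joint = props * query                   # fusion by element-wise product (assumption)
    logits = joint @ weight + bias          # linear classifier
    return 1.0 / (1.0 + np.exp(-logits))    # sigmoid, so scores lie in (0, 1)

rng = np.random.default_rng(0)
video = rng.normal(size=(16, 32))   # 16 clips with toy features
text = rng.normal(size=(6, 32))     # 6 text feature rows with toy features
proposals = sliding_window_proposals(16, window_sizes=(4, 8))
weight, bias = rng.normal(size=32) * 0.1, 0.0   # untrained toy classifier
scores = matching_scores(video, text, proposals, weight, bias)
```

Here `len(proposals)` plays the role of Ls, and `scores` holds one matching score per proposal for the query.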

Multi-Instance Learning.

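Because there are no annotations of explicit temporal boundaries, the authors optimise at the video level: the proposal scores are max-pooled, and the result is used as the matching score between the video and the query.

Two other videos are randomly sampled as negatives, and their matching scores are computed in the same way.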
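
A small sketch of this video-level objective, under a loud assumption: the notes do not say which loss is applied to the positive and negative scores, so a binary cross-entropy form is used here purely as a stand-in. The names `video_level_score` and `mil_loss` are hypothetical.

```python
import numpy as np

def video_level_score(proposal_scores):
    """Max-pool the proposal scores into one video-query matching score."""
    return float(np.max(proposal_scores))

def mil_loss(pos_scores, neg_scores_list, eps=1e-8):
    """Binary cross-entropy on video-level scores (assumed loss form)."""
    pos = video_level_score(pos_scores)
    loss = -np.log(pos + eps)                 # the paired video should score high
    for neg_scores in neg_scores_list:
        neg = video_level_score(neg_scores)
        loss += -np.log(1.0 - neg + eps)      # sampled negatives should score low
    return float(loss)

pos_scores = np.array([0.10, 0.85, 0.40])  # proposal scores, paired video (toy values)
neg_a = np.array([0.20, 0.05, 0.30])       # proposal scores, first negative video
neg_b = np.array([0.15, 0.25, 0.10])       # proposal scores, second negative video
loss = mil_loss(pos_scores, [neg_a, neg_b])
```

Only the best-scoring proposal of each video contributes, which is the multi-instance idea: the video is a bag of proposals and supervision exists only at the bag level.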

Cross-Sentence Relations Mining

Temporal Consistency.

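Video frames are naturally shown to the viewer in temporal order, so the temporal relations between different MoIs should in essence follow the order in which they are described in the paragraph.

The temporal order in which the segments occur should agree with the order of their corresponding sentences.

That is, joint probability scores that are temporally consistent are treated as positive, and the others as negative.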
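
One way to picture this constraint is sketched below. The joint probability of a proposal pair is taken as the product of the two per-sentence proposal probabilities, and a pair counts as temporally consistent when the proposal for the earlier sentence starts no later than the proposal for the later sentence. Both choices, and the function names, are assumptions for illustration; the notes do not spell out the exact criterion.

```python
import numpy as np

def temporal_consistency_mask(proposals):
    """mask[a, b] is True when proposal a starts no later than proposal b."""
    starts = np.array([s for s, _ in proposals])
    return starts[:, None] <= starts[None, :]

def ordered_probability(p_first, p_second, proposals):
    """Share of joint probability mass on pairs that respect the sentence order."""
    joint = np.outer(p_first, p_second)   # joint[a, b] = p_first[a] * p_second[b]
    mask = temporal_consistency_mask(proposals)
    return float(joint[mask].sum() / joint.sum())

proposals = [(0, 4), (4, 8), (8, 12)]
p_first = np.array([0.7, 0.2, 0.1])    # earlier sentence prefers the first window
p_second = np.array([0.1, 0.2, 0.7])   # later sentence prefers the last window
consistent = ordered_probability(p_first, p_second, proposals)
swapped = ordered_probability(p_second, p_first, proposals)
```

With these toy values `consistent` is much larger than `swapped`, so a training signal that rewards the masked (positive) entries and penalises the rest would push the predictions towards the order given in the paragraph.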

Semantic Consistency.

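The intersection of the temporal boundaries of the segments matched to two individual sentences should be consistent with the temporal boundary of the segment matched to their joint sentence.

That is, cases whose overlap probability is below a certain threshold are treated as positive, and the others as negative.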
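
To make the comparison concrete, the sketch below computes the intersection of two single-sentence segments and measures how well it agrees with the segment of the joint sentence, using temporal IoU as the agreement measure. Using IoU here is an assumption, as are the segment values; how the result is thresholded into positives and negatives is not modelled.

```python
def intersection(seg_a, seg_b):
    """Overlap of two (start, end) segments, or None if they are disjoint."""
    start, end = max(seg_a[0], seg_b[0]), min(seg_a[1], seg_b[1])
    return (start, end) if start < end else None

def temporal_iou(seg_a, seg_b):
    """Intersection-over-union of two (start, end) segments."""
    inter = intersection(seg_a, seg_b)
    inter_len = inter[1] - inter[0] if inter else 0.0
    union_len = (seg_a[1] - seg_a[0]) + (seg_b[1] - seg_b[0]) - inter_len
    return inter_len / union_len if union_len > 0 else 0.0

seg_1 = (2.0, 10.0)      # segment for sentence 1 (toy values, in seconds)
seg_2 = (6.0, 14.0)      # segment for sentence 2
seg_joint = (5.0, 11.0)  # segment for the joint sentence
reference = intersection(seg_1, seg_2)
# Agreement is zero when the two single-sentence segments do not overlap at all.
agreement = temporal_iou(reference, seg_joint) if reference else 0.0
```

In this toy case the two single-sentence segments overlap on (6.0, 10.0), and the joint-sentence segment covers that overlap with some slack on both sides.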

Model Training


Category: unsupervised / semi-supervised
Published: 2024-05-24 01:55:02
Link: https://www.kaopuke.com/article/k-p-k_13_u_7_o_26_f3_14__23__18_y.html
