A novel cross-modal hashing algorithm based on multimodal deep learning

来源 :Science China(Information Sciences) | 被引量 : 0次 | 上传用户:jzy0403
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
With the growing popularity of multimodal data on the Web, cross-modal retrieval on large-scale multimedia databases has become an important research topic. Cross-modal retrieval methods based on hashing assume that there is a latent space shared by multimodal features. To model the relationship among heterogeneous data, most existing methods embed the data into a joint abstraction space by linear projections. However,these approaches are sensitive to noise in the data and are unable to make use of unlabeled data and multimodal data with missing values in real-world applications. To address these challenges, we proposed a novel multimodal deep-learning-based hash(MDLH) algorithm. In particular, MDLH uses a deep neural network to encode heterogeneous features into a compact common representation and learns the hash functions based on the common representation. The parameters of the whole model are fine-tuned in a supervised training stage.Experiments on two standard datasets show that the method achieves more effective results than other methods in cross-modal retrieval. With the growing popularity of multimodal data on the Web, cross-modal retrieval on large-scale multimedia databases has become an important research topic. Cross-modal retrieval methods based on hashing that there is a latent space shared by multimodal features. To model the relationship among heterogeneous data, most existing methods embed the data into a joint abstraction space by linear projections. However, these approaches are sensitive to noise in the data and are unable to make use of unlabeled data and multimodal data with missing values ​​in real- To address these challenges, we propose a novel multimodal deep-learning-based hash (MDLH) algorithm. In particular, MDLH uses a deep neural network to encode heterogeneous features into a compact common representation and learns the hash functions based on the common parameters. The parameters of the whole model are fine-tuned in a supervised training stage. Experiments on two standard datasets show that t he method achieves more effective results than other methods in cross-modal retrieval.
其他文献
作为中国画画科之一,山水画一直占据着十分重要的地位。明代山水画创作处于稳定发展的时期,涉足山水画的画家群体庞大,仅《明画录》中记载的山水画家就达400多人,其中不乏名
幼儿期是人一生发展中的关键时期,更是为其今后教育与发展奠定基础的时期。学前教育的迅速发展,也影响着整个教育体系中各个阶段和环节的发展。学前阶段的发展是为个体一生的
车型:配置3.0L发动机(CJTA)、NXR变速器。VIN:WVGAB97P0ED××××××。行驶里程:53521km。故障现象:客户反映,此车停一晚上就没电,造成无法启动。通过泵电启动后,行驶一整
期刊
巴斯夫:创新中心将灵感转化为解决方案rn日前,在广州召开的CHINAPLAS 2019媒体会上巴斯夫正式宣布,将在亚太区设立3个创新中心,分别位于中国上海、日本东京和印度孟买.据了解
期刊
2003年教育部颁布了《普通高中数学课程标准(实验)》,首次提出将数学文化融入高中数学课程中,并指出数学文化是贯穿高中数学课程的重要内容,进一步的明确了数学文化在高中数
本文从分析建筑表意的艺术属性入手,指出建筑表意性与建筑艺术的他律性是同一的,提出了从"塑造典型"、"创设象征"和"营构意境"三种艺术表意方式着手,系统的研究中国建筑表意
本研究主要从理论层面采取个案研究的方式,对城市回族社区幼儿园课程资源开发与利用进行了初级的探讨.研究紧扣幼儿园课程改革的基本理念,以甘肃省平凉市老东寺穆斯林幼儿园
写作是学生认知能力和语言文字表达能力的综合体现,是衡量学生语文水平高低的主要标志。小学阶段,是培养学生写作能力的关键时期,因此,小学作文教学显得尤为重要。然而,由于
对于天逸来说,讨喜的当然不仅仅只是名字,它的外观更加讨喜.与4008同平台的天逸并不像前者那样拥有复杂多变的车身线条,取而代之的是饱满圆润的平滑曲面,整体轮廓看起来简单
期刊
"解放军艺术大厦"建筑设计中,以解读城市母体为出发点,从建筑尺度、单元构成和空间序列的形成等方面,实现对城市空间的契合与归纳,整体形成连续、开放的空司体系.采用系统设